Derek L
Research Program Mentor
MPH at University of California Los Angeles (UCLA)
Expertise
Statistics, Data Visualization, Data Analytics, Machine Learning, Artificial Intelligence, Natural Language Processing, Public Health
Bio
I am an applied statistician by training, currently working on analytics related to enhancing EHR data at a health tech startup called Verana Health. Previously, I worked on analytics in the biotechnology/pharmaceutical sector at Genentech to improve the efficiency of our clinical trial operations. I completed an applied master's program in biostatistics at UCLA, focusing on statistical analysis and statistical learning (commonly known as machine learning). My undergraduate training at Stanford University focused on interdisciplinary connections found in the field of biology. I completed internships at the National Institutes of Health (NIH), the Santa Clara County Public Health Department, and the Stanford Social Innovation Review. Besides my technical knowledge in statistics and machine learning, I am highly experienced in perhaps one of the most important aspects of determining project success: gathering business requirements and presenting domain knowledge in an easy-to-understand fashion.
Project ideas
Data Analytics Report
In this project, the student will formulate a question they'd like to address, based on their interests, in an area such as sports, healthcare, or fashion. Together we will walk through the data analysis process: producing a quality dataset, determining what the data look like, and presenting a story of what the data describe. (I refer to data in the plural because "datum" is the singular form.) The project deliverable could be, for example, a report documenting the landscape of the data being analyzed and an assessment of what the data tell us, or a guide to when certain data analysis methods are appropriate.
Statistical Modeling/Machine Learning Models
This project is for the student who is proficient in Python or R, is familiar with the tenets of good data analysis, and would like to take the next step: predictive modeling. The student will identify a research question, perform rigorous data analysis, and research and build appropriate models to help answer that question. The final deliverable will likely be a report in Word, Markdown, or PowerPoint format.