- Drafted study protocol for testing impact of different settings of time-zero (start of follow-up) in target trial emulation longitudinal study focusing on chronic diseases without active comparator.
- Developed validation study to justify or disprove proposed time-zero settings by adapting previous active-user comparator RWE studies into non-user comparator design and compared results with well-studied treatment effects.
- Performed data extraction from large RWE database (Optum Clinformatics Data Mart) using SQL and R, dplyr, and data harmonization for creating cohorts using different time-zero settings.
- Assessed performance of different settings of time-zero using survival analysis models (Cox model).
- Automated cohort creation pipeline and created Shiny app/user-friendly R package for selection and visualization.
Chunhui Gu
After five years as a medical student, I found myself more interested in and good at quantitative sciences and programming. Still love medicine but have felt tired about cramming style education in medicine, I wish I can boost the development of medicine from another aspect with my multi-disciplinary background. Considered myself as "full-stack" biomedical data scientist with 5 years of experience in biostatistics, computer science, and bioinformatics. I am currently a PhD candidate in Biostatistics at UTHealth School of Public Health, and working jointly as a GRA in MD Anderson biostatistics and Clinical Cancer Prevention Department. Looking for a full-time position in biomedical data science and machine learning.