Projects
ML Claims Processing Pipeline
- Led a team to build an end-to-end SageMaker ML pipeline to process U.S. Medicare and Medicaid claims using features engineered from ICD-9, ICD-10, CPT, HCPCS, RBCS, and other healthcare code sets, along with Part A and Part B claim characteristics.
Member Risk and Opportunity Pipeline
- Created a machine learning model to predict risk and opportunity for health insurance members in ACO populations with 98% accuracy, saving $1 million a year and bringing the predictions in-house.
Special Reports on Member Trends
- Produced special reports on trends in our member population, such as the effect of birth month on vaccination status in children. Presented findings to C-suite executives and VPs.
Skills
Machine Learning
Regression, Cross Validation, Classification, Recommendation, and NLP
Python Libraries
SciPy, pandas, seaborn, scikit-learn, NumPy, PySpark, MLflow, SHAP
Programming Languages
R, Python, HTML, CSS, JavaScript, Bash
Data Management
ETL, Data Governance, Relational Databases
Database Language
SQL
Data Reporting Tools
Tableau, Power BI
Other Tools
Microsoft Office, Git, Visual Studio Code, PyCharm, AWS SageMaker
Experience
GDIT, Remote
Senior Data Scientist
September 2022 - PRESENT
- Engineered features from all aspects of Medicare claims data for predictive modeling in SageMaker using PySpark and AWS Glue.
- Led a team to tune models, improving performance by more than 20% over existing human-led processes using a tuned, boosted random forest model.
- Implemented models in an AWS-hosted ML pipeline that processes over 100 million records daily.
- Advised on the creation of a QuickSight dashboard to monitor performance drift.
Innovista Health Solutions, Remote
Market Analyst
July 2021 - September 2022
- Developed a risk assessment and opportunity ML algorithm for Medicare and Commercial members using data collected by nursing staff. The model was applied to palliative, hospice, dialysis, primary care, and in-network patients.
- Analyzed population data and identified statistically significant opportunities for quality improvement.
- Shared data visualizations and whitepapers with C-suite executives and VPs in my department to demonstrate that we could bring AI/ML in-house.
- Advised on the use of sentiment analysis of incoming documents to increase the speed of ETL from SFTP servers into a local SQL Server.
Innovative Genomics Laboratories, San Antonio, TX
Molecular Technologist
February 2021 - July 2021
- Led validation testing, compiled raw data files, and statistically analyzed data for a PGx panel.
- Managed a team of 3 scientists working on 6 different projects for molecular validations and ensured data integrity and consistency across tests.
- Presented findings to the Medical Director to launch the panel.
- Authored detailed test procedures and taught lab personnel to perform the tests.
Texas State University, San Marcos
Graduate Research Assistant
January 2020 - December 2021
- Extracted, sequenced, and analyzed over 300 genes and 20 genomes from bacteria associated with an endangered beetle for the USFWS, resulting in a publication in an academic journal.
- Leveraged the STAR cluster supercomputer to align and annotate over 100Gb of metagenomic data from insects and their environments using Bash scripts and R packages such as vegan and phyloseq.
- Performed multiple analyses on genetic, genomic, and metagenomic data using the DADA2 pipeline, nBLAST, and MicrobiomeAnalyst.
Texas State University, San Marcos
Graduate Instructional Assistant
August 2018 - May 2021
- Prepared and led undergraduate classes including Introductory Biology, Medical Microbiology, Applied Biotechnology, and Bacterial Genetics.
GoScribes, New Braunfels
ER Medical Scribe
January 2017 - January 2018
- Followed medical doctors in the ER, produced an accurate and detailed record of each patient's visit, and entered that information into an Electronic Medical Record through dictation.
Seton Medical Center Austin, Austin
Clinical Assistant
May 2016 - October 2016
- Cared for patients on the pulmonary and renal floor.
- Took vitals and assisted nurses with regular care.
Education
University of Texas, Austin, TX
Post Graduate Program in Artificial Intelligence and Machine Learning
Google Analytics Professional Certificate, Austin, TX
Google Data Analytics
Texas State University, San Marcos, TX
Master of Science - Bacterial Genetics (2020)
Texas State University, San Marcos, TX
Bachelor of Science - Biology (2018)