Zachary Mays

Senior Data Scientist

Projects

ML Claims Processing Pipeline

  • Led a team to build an end-to-end SageMaker ML pipeline to process U.S. Medicare and Medicaid claims using features engineered from ICD-9, ICD-10, CPT, HCPCS, RBCS, and other healthcare code sets, along with Part A and Part B claim characteristics.

Member Risk and Opportunity Pipeline

  • Created a machine learning model to predict the risk and opportunity of health insurance members in ACO populations with 98% accuracy, saving $1 million a year and bringing the predictions in-house.

Special Reports on Member Trends

  • Produced special reports on trends in our member population, such as the effect of birth month on vaccination status in children. Presented findings to C-suite and VP-level leadership.

Skills

Machine Learning: Regression, Cross-Validation, Classification, Recommendation, and NLP
Python Libraries: SciPy, pandas, seaborn, scikit-learn, NumPy, PySpark, MLflow, SHAP
Programming Languages: R, Python, HTML, CSS, JavaScript, Bash
Data Management: ETL, Data Governance, Relational Databases
Database Language: SQL
Data Reporting Tools: Tableau, Power BI
Other Tools: Microsoft Office, Git, Visual Studio Code, PyCharm, AWS SageMaker

Experience

GDIT, Remote

Senior Data Scientist

September 2022 - PRESENT

  • Engineered features from all aspects of Medicare claims data for predictive modeling in SageMaker using PySpark and AWS Glue.
  • Led a team that tuned models, improving performance by more than 20% over existing human-led processes using a tuned and boosted random forest model.
  • Implemented models in an AWS-hosted ML pipeline that processes over 100 million records daily.
  • Advised on the creation of a QuickSight dashboard to monitor model performance drift.

Innovista Health Solutions, Remote

Market Analyst

July 2021 - September 2022

  • Developed a risk assessment and opportunity ML algorithm for Medicare and Commercial members using data collected by nursing staff. The model was applied to palliative, hospice, dialysis, primary care, and in-network patients.
  • Analyzed population data and identified statistically significant opportunities for quality improvement.
  • Shared data visualizations and whitepapers with C-suite executives and VPs in my department to demonstrate that we could bring AI/ML in-house.
  • Advised on the use of sentiment analysis of incoming documents to increase the speed of ETL from SFTP servers into a local SQL Server.

Innovative Genomics Laboratories, San Antonio, TX

Molecular Technologist

February 2021 - July 2021

  • Led validation testing, compiled raw data files, and statistically analyzed data for a PGx panel.
  • Managed a team of 3 scientists working on 6 molecular validation projects and ensured data integrity and consistency across tests.
  • Presented findings to the Medical Director to launch the panel.
  • Authored detailed test procedures and taught lab personnel to perform the tests.

Texas State University, San Marcos

Graduate Research Assistant

January 2020 - December 2021

  • Extracted, sequenced, and analyzed over 300 genes and 20 genomes from bacteria associated with an endangered beetle for the USFWS, resulting in a publication in an academic journal.
  • Leveraged the STAR cluster supercomputer to align and annotate over 100Gb of metagenomic data from insects and their environments using Bash scripts and R packages such as vegan and phyloseq.
  • Performed multiple analyses on genetic, genomic, and metagenomic data using the DADA2 pipeline, BLASTn, and MicrobiomeAnalyst.

Texas State University, San Marcos

Graduate Instructional Assistant

August 2018 - May 2021

  • Prepared and led undergraduate classes, including Introductory Biology, Medical Microbiology, Applied Biotechnology, and Bacterial Genetics.

GoScribes, New Braunfels

ER Medical Scribe

January 2017 - January 2018

  • Shadowed physicians in the ER, producing an accurate, detailed record of each patient's visit and entering it into the electronic medical record through dictation.

Seton Medical Center Austin, Austin

Clinical Assistant

May 2016 - October 2016

  • Cared for patients on the pulmonary and renal floor.
  • Took vitals and assisted nurses with regular care.

Education

University of Texas, Austin, TX
Post Graduate Program in Artificial Intelligence and Machine Learning
Google Analytics Professional Certificate, Austin, TX
Google Data Analytics
Texas State University, San Marcos, TX
Master of Science - Bacterial Genetics (2020)
Texas State University, San Marcos, TX
Bachelor of Science - Biology (2018)