Skip to content
View Amorfati123's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Amorfati123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Amorfati123/README.md

About Me:

I’m a data scientist who builds end-to-end ML and data engineering systems that ship in real healthcare and public health environments. I focus on making models usable and trustworthy by pairing strong modeling with reproducible pipelines, validation, and workflow integration.

At the CDC, I worked on surveillance informatics in Palantir Foundry (1CDP). I designed modular ingestion and transformation pipelines, implemented schema validation and data quality checks, used lineage to debug upstream issues, and built tools that reduced manual burden for epidemiologists and state partners. I also developed structured, auditable workflows for semi-automated tasks like schema mapping, with human review, versioned configurations, and clear traceability.

What I work on most:

ML engineering: training and inference pipelines, distributed processing, monitoring, reproducibility

Data engineering: schema management, validation gates, lineage-driven debugging, scalable transforms

Applied healthcare AI: interpretable models, uncertainty-aware decisions, clinical workflow fit

Tools: Python, SQL, PyTorch, Spark/PySpark, Git, containers, Palantir Foundry

I like practical problems where correctness, traceability, and maintainability matter as much as model performance.

Tech Stack:

Python R PowerShell Bash Script TypeScript Windows Terminal Markdown Azure AWS Apache Spark Apache Kafka Chart.js Apache Apache Tomcat MicrosoftSQLServer MongoDB MySQL Postgres Adobe Adobe Acrobat Reader Adobe Lightroom Adobe Photoshop Keras Matplotlib mlflow NumPy Pandas Plotly PyTorch scikit-learn Scipy TensorFlow GitLab CI GitHub Actions Git GitHub GitLab

GitHub Stats:



GitHub Trophies


Pinned Loading

  1. wound-forecast wound-forecast Public

    Forecasting wound trajectory using ML

    Jupyter Notebook 1

  2. CheXNet CheXNet Public

    Forked from arnoweng/CheXNet

    A pytorch reimplementation of CheXNet

    Python 1

  3. NEDSS-DataReporting NEDSS-DataReporting Public

    Forked from CDCgov/NEDSS-DataReporting

    Data Near Real Time Reporting micro services for Modernized NBS System

    TSQL

  4. SpecKV SpecKV Public

    Jupyter Notebook 2

  5. bridge2ai-voice-parkinsons-ast bridge2ai-voice-parkinsons-ast Public

    Jupyter Notebook 1

  6. periop-prediction-framework periop-prediction-framework Public

    Machine learning models for predicting postoperative delirium from perioperative EHR data, including baseline models, domain-structured ensembles, interpretability, calibration, and decision curve …

    Jupyter Notebook 1