Building end-to-end NGS pipelines that transform raw sequencing reads into actionable genomic insights. Python · AWS · PostgreSQL · GATK.
From raw sequencing reads to annotated variants — every step automated, validated, and reproducible at scale.
5+ years of production bioinformatics — from raw reads to clinical insights.
SQL-driven genomic data management system: 8-table PostgreSQL schema, batch variant annotation via the Ensembl REST API, and priority-ranked variant reports in HTML and JSON. Covered by 13 pytest tests.
Bioinformatics data scientist roles where I can build scalable genomic pipelines and drive data-driven research decisions.