Available for new roles · Indianapolis, IN

Bioinformatics

Building end-to-end NGS pipelines that transform raw sequencing reads into actionable genomic insights. Python · AWS · PostgreSQL · GATK.

View Projects GitHub Profile
0
Years Experience
0+
Datasets / Year
0%
Pipeline Time Reduced
0
GitHub Projects
WORKFLOW

The NGS Pipeline.

From raw sequencing reads to annotated variants — every step automated, validated, and reproducible at scale.

FASTQ
Raw reads
Quality Control
Phred scores
Alignment
BWA-MEM
REF
BAM Process
Sort & index
SNP
Variant Calling
GATK
VEP
Annotation
Ensembl VEP
Report
JSON + HTML
CAPABILITIES

Built for Scale.

5+ years of production bioinformatics — from raw reads to clinical insights.

NGS Analysis
DNA-seq · RNA-seq · WGS · WES
FASTQBAMVCFVariant Calling
Python Engineering
OOP · Modular · Tested
Python 3.10PytestREST APIBash
Cloud & HPC
AWS · GCP · HPC Clusters
AWS S3EC2BatchGCP
Data Engineering
SQL · PostgreSQL · MySQL
PostgreSQLMySQLpsycopg2
Bioinformatics Tools
GATK · STAR · BWA · VEP
GATKSTARBWA-MEMVEP
Transcriptomics
RNA-seq · featureCounts · DESeq2
featureCountsTrimmomaticMultiQC
OPEN SOURCE

Four Projects.
Real Impact.

01 — NGS PIPELINE
End-to-End NGS Pipeline
Modular Python + Bash: FASTQ → BWA-MEM → GATK HaplotypeCaller → Ensembl VEP → PostgreSQL. Full test suite.
35% less manual intervention
30% faster variant retrieval
500+ datasets/year
PythonGATKBWAPostgreSQLAWS
View on GitHub →
02 — RNA-SEQ
Scalable RNA-seq Framework
Trimmomatic → STAR 2-pass → featureCounts → count matrix. Auto HTML QC reports. AWS Batch parallel processing.
28% faster turnaround
15 Pytest unit tests
Auto HTML QC report
STARfeatureCountsTrimmomaticAWS Batch
View on GitHub →
03 — DATA MGMT
Genomic Data Management

SQL-driven genomic data management system — 8-table PostgreSQL schema, Ensembl REST API batch annotation, priority-ranked variant HTML/JSON reports. 13 Pytest tests.

  • 32% faster cohort retrieval
  • 22% shorter report time
  • 13 Pytest unit tests
PostgreSQLEnsembl APIREST APIPytest
View on GitHub →
04 — METABIOME
MetaBiome Platform — Full Stack Microbiome
Full-stack microbiome platform — shotgun metagenomic pipeline, FastAPI REST API, PostgreSQL longitudinal schema, React dashboard, AWS ECS/Batch deployment.
32% faster cohort retrieval
22% shorter report time
13 Pytest unit tests
PostgreSQLEnsembl APIREST APIPytest
View on GitHub →
CAREER

Work Experience.

CurrentMay 2025 — Present
Indianapolis, IN
Bioinformatics Analyst
Translation Commons
  • NGS pipelines (Python + Bash, AWS HPC) — reduced processing time 30% across 500+ datasets/year
  • Scalable RNA-seq workflows with automated Pytest QC — lowered reanalysis requests by 18%
  • Optimized PostgreSQL BAM/VCF tracking — increased variant retrieval speed by 35%
GraduateAug 2023 — May 2025
Indianapolis, IN
Graduate Research Assistant
Indiana University · Bioinformatics
  • DNA-seq and RNA-seq across 12+ concurrent academic studies using OOP Python workflows
  • Refactored Bash scripts to modular Python — reduced pipeline runtime by 30%
  • Transcriptome analysis scripts improving gene expression consistency by 22%
IndustryJun 2020 — Jul 2023
Remote
Bioinformatics Analyst
Cipla Ltd
  • Automated genomic reporting pipelines integrating SQL — improved efficiency by 28%
  • In silico PCR validation scripts — increased primer specificity screening by 26%
  • SQL data storage optimization — decreased cohort retrieval latency by 32%
CONNECT

Let's Work Together.

Open to opportunities

Bioinformatics data scientist roles where I can build scalable genomic pipelines and drive data-driven research decisions.

EDUCATION
M.S. Bioinformatics · Indiana University · 2025
B.S. Bioinformatics · Saveetha Engineering College · 2022
Umesh's AI Assistant
Online · Powered by Claude
Hi! I'm Umesh's AI assistant. I can tell you about his bioinformatics work, skills, and projects — or help you schedule a meeting or call with him! 🧬
just now