A reproducible, containerized Nextflow pipeline for clinical microbiology labs — automating QC, assembly, AMR typing, MLST, and phylogenetic surveillance into a single audit-ready HTML report.
Whole-genome sequencing is now standard in clinical microbiology. But the bioinformatics tooling hasn't kept up — labs stitch together ad-hoc scripts that break, can't be audited, and aren't reproducible between runs.
Each module is a self-contained Nextflow process with its own Docker container, test data, and documentation. Mix and match with --skip_* flags.
Simulated run on clinical-grade Illumina reads (2×150bp, ~100× coverage). All outputs generated by the pipeline with no manual intervention.
| Sample ID | Gene | Drug Class | % Identity | Coverage | Phenotype |
|---|---|---|---|---|---|
| KP-001 | blaCTX-M-15 | Beta-lactam (ESBL) | 99.8% | 100% | R |
| KP-001 | aac(6')-Ib-cr | Aminoglycoside | 98.2% | 100% | R |
| KP-003 | oqxAB | Fluoroquinolone | 97.5% | 98% | I |
| KP-005 | blaTEM-1B | Beta-lactam | 100% | 100% | R |
| KP-007 | mcr-1.1 | Colistin | 99.1% | 99% | R |
| KP-009 | — (no AMR genes) | — | — | — | S |
| KP-011 | blaKPC-2 | Carbapenem | 100% | 100% | R |
I build reproducible bioinformatics tools at the intersection of clinical microbiology and computational genomics. BacTrack-NF grew out of frustration with fragmented tooling in a clinical WGS setting — I needed something I could hand to a lab technician with one command.
Open to remote bioinformatics positions — analyst, pipeline engineer, or computational researcher roles. Comfortable across the full stack: bash to Python to Nextflow to Docker.