🧬 N. gonorrhoeae Comprehensive Analysis Pipeline

Comprehensive, Flexible Bioinformatics Pipeline for Genomic Surveillance and AMR Detection

Pipeline Statistics

14 Workflows
50+ Modules/Processes
9 Analysis Types
3 QC Stages

Pipeline Overview

This Nextflow pipeline provides comprehensive genomic analysis for Neisseria gonorrhoeae outbreak investigation and surveillance. It processes raw sequencing reads through quality control, assembly, variant calling, phylogenetic analysis, and antimicrobial resistance (AMR) profiling to generate actionable clinical insights.

Key Capabilities

Main Workflows

Downsample Reads: Downsample reads to a target coverage to reduce run time
Reads QC: FastP-based quality filtering and trimming
Assembly: SPAdes assembly with statistics generation
Assembly QC: Coverage and assembly quality checks
MASH Species Check: Proceed only with reads identified as N. gonorrhoeae
Variant Calling: Snippy with caching, core alignment, and Gubbins
Phylogeny: RAxML-NG phylogenetic tree construction
Outbreak Detection: SNP distance-based cluster identification
Recombination: Functional annotation of recombinant regions
MLST: Multi-locus sequence typing and clustering
AMR Profiler: Chromosomal and HGT resistance detection
AMR Typing: NG-MAST/NG-STAR strain typing
Clinical: Treatment recommendations and priority classification
Downsampling: Optional read depth normalization
Final QC: Post-assembly comprehensive quality filtering
Reports: Comprehensive manifest generation
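
As an illustration of SNP distance-based cluster identification, the following is a minimal Python sketch rather than the pipeline's actual implementation. It assumes single-linkage grouping: two samples fall in the same cluster if they are connected by a chain of pairwise SNP distances at or below a threshold. The function name snp_clusters, the sample names, the distances, and the threshold of 10 SNPs are all hypothetical and chosen only for the example.

```python
from itertools import combinations


def snp_clusters(samples, distances, threshold=10):
    """Single-linkage clustering on pairwise SNP distances.

    samples   -- list of sample IDs
    distances -- dict mapping frozenset({a, b}) to the SNP distance between a and b
    threshold -- hypothetical cutoff; pairs at or below it are linked
    """
    # Union-find: every sample starts as its own cluster root.
    parent = {s: s for s in samples}

    def find(x):
        # Walk up to the root, compressing the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the clusters of every pair within the threshold.
    for a, b in combinations(samples, 2):
        if distances[frozenset((a, b))] <= threshold:
            parent[find(a)] = find(b)

    # Collect members under their final root.
    groups = {}
    for s in samples:
        groups.setdefault(find(s), []).append(s)
    return sorted(sorted(g) for g in groups.values())


# Made-up distances for four hypothetical isolates.
example_samples = ["A", "B", "C", "D"]
example_distances = {
    frozenset(("A", "B")): 3,
    frozenset(("B", "C")): 8,
    frozenset(("A", "C")): 12,
    frozenset(("A", "D")): 40,
    frozenset(("B", "D")): 42,
    frozenset(("C", "D")): 45,
}

if __name__ == "__main__":
    print(snp_clusters(example_samples, example_distances, threshold=10))
```

Under single linkage, A and C end up in one cluster even though they are 12 SNPs apart, because both are within 10 SNPs of B. A stricter cutoff or a different linkage rule would split them, so the threshold should be chosen to match the surveillance question.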

Pipeline Architecture

The pipeline is organized into 14 main workflows and 2 subworkflows that orchestrate more than 50 specialized processes. Each workflow is modular and can be enabled or disabled via command-line parameters, so a run can include only the analyses it needs.
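
The idea of toggling workflows from the command line can be sketched as follows. This is a Python illustration of the pattern only, not the pipeline's Nextflow code. The workflow identifiers and the --skip_<name> flag names are hypothetical and do not correspond to the pipeline's real parameters.

```python
import argparse

# Hypothetical subset of workflow identifiers, for illustration only.
WORKFLOWS = ["reads_qc", "assembly", "variant_calling", "phylogeny", "amr_profiler"]


def build_parser():
    """Create a parser with one boolean --skip_<workflow> flag per workflow."""
    parser = argparse.ArgumentParser(description="Toggle workflows (illustrative sketch)")
    for name in WORKFLOWS:
        parser.add_argument(
            f"--skip_{name}",
            action="store_true",
            help=f"disable the {name} workflow",
        )
    return parser


def enabled_workflows(argv):
    """Return the workflows that remain enabled for the given argument list."""
    args = build_parser().parse_args(argv)
    return [w for w in WORKFLOWS if not getattr(args, f"skip_{w}")]


if __name__ == "__main__":
    # Example: run everything except phylogeny.
    print(enabled_workflows(["--skip_phylogeny"]))
```

In this sketch everything runs by default and each flag opts one workflow out. A real configuration would also need to handle dependencies between workflows, for example phylogeny requiring variant calling.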

Quality Control Strategy

The pipeline implements a three-stage QC approach, corresponding to the Reads QC, Assembly QC, and Final QC workflows listed above:

Caching and Performance

The pipeline implements intelligent caching for computationally expensive operations, particularly Snippy variant calling. It supports five separate cache directories:

Smart cache filtering ensures downsampled samples are re-analyzed while leveraging existing results for unchanged samples.
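
The cache-filtering rule described above can be sketched as a simple partition. This is a minimal Python illustration under stated assumptions, not the pipeline's code. It assumes a sample's cached result is reusable only if a result exists and the sample was not downsampled in the current run. The function name split_by_cache and the sample IDs are hypothetical.

```python
def split_by_cache(samples, cached_ids, downsampled_ids):
    """Partition samples into (reuse, rerun).

    reuse -- samples with a cached result whose reads were not downsampled
    rerun -- samples with no cached result, or whose reads changed through downsampling
    """
    cached = set(cached_ids)
    downsampled = set(downsampled_ids)
    reuse, rerun = [], []
    for sample in samples:
        if sample in cached and sample not in downsampled:
            reuse.append(sample)
        else:
            # Either never analyzed, or its input reads differ from the cached run.
            rerun.append(sample)
    return reuse, rerun


if __name__ == "__main__":
    reuse, rerun = split_by_cache(
        samples=["S1", "S2", "S3", "S4"],
        cached_ids=["S1", "S2", "S3"],
        downsampled_ids=["S2"],
    )
    print("reuse:", reuse)
    print("rerun:", rerun)
```

Here S2 is re-analyzed despite having a cached result, because downsampling changed its input reads, and S4 is analyzed because it has no cached result.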

Technology Stack

Built with: