Nextflow Parameters
Complete reference for all pipeline parameters.
Required Parameters
| Parameter |
Description |
--input |
Path to samplesheet CSV |
--variants |
Path to VCF/MAF variants file |
--fasta |
Reference FASTA (with .fai index) |
Mode
| Parameter |
Default |
Description |
--mode |
dna |
Analysis mode: dna (cfDNA/somatic) or rna (RNA-seq with transcriptome-aware filtering) |
Output Options
| Parameter |
Default |
Description |
--outdir |
results |
Output directory |
--format |
vcf |
Output format (vcf or maf) |
--suffix |
'' |
Suffix for output filenames |
--column_prefix |
'' |
Prefix for gbcms count columns in MAF output |
--preserve_barcode |
false |
Keep original Tumor_Sample_Barcode from input MAF |
mFSD Options (DNA only)
| Parameter |
Default |
Description |
--mfsd |
false |
Enable mFSD analysis — adds 34 mFSD columns to MAF and 7 MFSD_* INFO fields to VCF. See mFSD metrics. |
--mfsd_parquet |
false |
Write a companion <sample>.fsd.parquet with raw per-variant fragment size arrays. Requires --mfsd. |
Quality & Filtering Options
| Parameter |
Default |
Description |
--min_mapq |
20 |
Minimum mapping quality |
--min_baseq |
20 |
Minimum base quality |
--fragment_qual_threshold |
10 |
Quality margin for fragment consensus — when R1/R2 disagree, the higher-quality allele wins only if the difference exceeds this |
--context_padding |
5 |
Minimum flanking bases for Phase 3 alignment (auto-increased in repeats) |
--adaptive_context |
true |
Dynamically increase context padding in tandem repeat regions |
--filter_duplicates |
true |
Filter duplicate reads |
--filter_secondary |
true |
Filter secondary alignments |
--filter_supplementary |
true |
Filter supplementary alignments |
--filter_qc_failed |
true |
Filter QC failed reads |
--filter_improper_pair |
false |
Filter improperly paired reads |
--filter_indel |
false |
Filter reads with indels |
--filter_by_sample |
false |
Filter multi-sample MAF by Tumor_Sample_Barcode (details) |
--show_normalization |
false |
Add norm_* columns showing left-aligned coordinates in output |
UMI & BAQ Options
| Parameter |
Default |
Description |
--umi_tag |
'' |
UMI BAM tag for deduplication (e.g., XM, RX). When set, reads sharing the same UMI are grouped as a single observation. |
--apply_baq |
false |
Apply Base Alignment Quality recalibration (DNA mode only). Reduces false positives near indels. |
RNA-Specific Options
These parameters are only used when --mode rna is specified.
| Parameter |
Default |
Description |
--rna_editing_db |
'' |
Path to REDIportal editing database file (e.g., TABLE1_hg38_v3.txt). Flags ALT sites that overlap known A→I RNA editing positions. |
--enforce_strandedness |
true |
Enforce dUTP strand-specific library prep. Disable with false for unstranded RNA-seq libraries (--no-strandedness equivalent). |
RNA mode defaults
RNA mode uses different PairHMM gap penalties by default (gap_open=5e-3, gap_extend=0.25) to tolerate RT-induced stutter at homopolymers. These can be overridden via the alignment backend parameters below.
Alignment Backend (Advanced)
| Parameter |
Default |
Description |
--alignment_backend |
pairhmm |
Phase 3 alignment backend: pairhmm (WFA2 + PairHMM, default) or sw (Smith-Waterman). See CLI Reference. |
--llr_threshold |
2.3 |
PairHMM log-likelihood ratio threshold for confident calls |
--gap_open_prob |
1e-4 |
PairHMM gap-open probability for non-repeat regions |
--gap_extend_prob |
0.1 |
PairHMM gap-extend probability for non-repeat regions |
--gap_open_prob_repeat |
1e-2 |
PairHMM gap-open probability for tandem repeat regions |
--gap_extend_prob_repeat |
0.5 |
PairHMM gap-extend probability for tandem repeat regions |
Resource Limits
| Parameter |
Default |
Description |
--max_cpus |
16 |
Maximum CPUs per job |
--max_memory |
128.GB |
Maximum memory per job |
--max_time |
240.h |
Maximum runtime per job |
Representative metrics from cfDNA duplex BAM samples:
| Sample Type |
BAM Size |
Variants |
Runtime |
CPUs |
| ctDNA (plasma) |
1.3 GB |
608 |
~25s |
4 |
| Plasma control |
776 MB |
608 |
~20s |
4 |
Execution Profiles
| Profile |
Description |
docker |
Local with Docker containers |
singularity |
HPC with Singularity |
slurm |
SLURM cluster with Singularity |
local |
No container (requires local install) |
Advanced
task.ext.args — Arbitrary CLI Arguments
Any CLI option not exposed as a Nextflow parameter can be passed via task.ext.args in your config:
process {
withName: GBCMS_DNA {
ext.args = '--verbose'
}
}
See DNA CLI Reference or RNA CLI Reference for all available options.
abbreviations