Quickstart Guide

This guide will help you get the pipeline up and running as quickly as possible.

Step 1: Clone the Repository

git clone https://github.com/bibymaths/nf-illumina2lineage.git
cd nf-illumina2lineage

Step 2: Install Mamba (if not already installed)

wget "https://github.com/conda-forge/miniforge/releases/latest/download/Mambaforge-Linux-x86_64.sh"
bash Mambaforge-Linux-x86_64.sh

Step 3: Create and Activate the Environment

conda update -y conda
mamba env create -p --file environment.yaml
mamba activate sars_genome_assembly

Set up Java and Nexflow

curl -s https://get.sdkman.io | bash 

In new terminal

sdk install java 17.0.10-tem  
java -version
curl -s https://get.nextflow.io | bash  
chmod +x nextflow 
mkdir -p $HOME/.local/bin/
mv nextflow $HOME/.local/bin/ 
nextflow info

Before you launch Nextflow, unset the Conda‐JDK variables and point to SDKMAN’s inside the conda environment.

unset JAVA_CMD JAVA_HOME
export JAVA_HOME="$HOME/.sdkman/candidates/java/current"
export JAVA_CMD="$JAVA_HOME/bin/java" 

Nextflow requires Bash 3.2 (or later) and Java 17 (or later, up to 24) to be installed. More information on Nextflow installation.

Step 4: Run the Pipeline with Nextflow

nextflow run main.nf

To use a Docker profile:

nextflow run main.nf -profile docker

Input Data Requirements

  • Illumina paired-end FASTQ files
  • SARS-CoV-2 reference genome (automatically downloaded by the pipeline)

Output Overview

  • Cleaned FASTQ files and QC reports
  • Sorted, indexed BAM files
  • VCF variant files
  • Consensus sequences (FASTA)
  • Pangolin lineage assignments
  • Phylogenetic tree (Newick format)

For advanced configuration, see the parameters section.

💡 Tip: Use multiqc to summarize all QC results in one place.

You're now ready to use the pipeline!