Skip to content

Quickstart

This guide will help you get the pipeline up and running as quickly as possible. To use your own data, you must override the input parameters or modify the downloadData process, as the current script hardcodes a download from OSF.

Step 1: Clone the Repository

git clone https://github.com/bibymaths/nf-illumina2lineage.git
cd nf-illumina2lineage

Step 2: Install Pixi

Install Pixi using the official installer:

curl -fsSL https://pixi.sh/install.sh | bash

Restart your shell, or reload your shell configuration:

source ~/.bashrc

Verify the installation:

pixi --version

Step 3: Install and Use the Environment

From the root directory of this pipeline, install the Pixi environment:

pixi install

Run commands inside the environment using:

pixi run <command>

For example:

pixi run python --version
pixi run nextflow -version

To enter the environment shell:

pixi shell

If your environment is named illumina2lineage instead of default, use this version:

Step 3: Install and Use the Environment

From the root directory of this pipeline, install the Pixi environment:

pixi install -e illumina2lineage

Run commands inside the environment using:

pixi run -e illumina2lineage <command>

For example:

pixi run -e illumina2lineage python --version
pixi run -e illumina2lineage nextflow -version

To enter the environment shell:

pixi shell -e illumina2lineage

Set up Java and Nexflow if not already installed

curl -s https://get.sdkman.io | bash 

In a new terminal:

sdk install java 17.0.10-tem  
java -version
curl -s https://get.nextflow.io | bash  
chmod +x nextflow 
mkdir -p $HOME/.local/bin/
mv nextflow $HOME/.local/bin/ 
nextflow info
unset JAVA_CMD JAVA_HOME
export JAVA_HOME="$HOME/.sdkman/candidates/java/current"
export JAVA_CMD="$JAVA_HOME/bin/java" 

Nextflow requires Bash 3.2 (or later) and Java 17 (or later, up to 24) to be installed. More information on Nextflow installation.

Step 4: Run the Pipeline with Nextflow

nextflow run main.nf

Input Data Requirements

  • Illumina paired-end FASTQ files
  • SARS-CoV-2 reference genome (automatically downloaded by the pipeline)

Output Overview

  • Cleaned FASTQ files and QC reports
  • Sorted, indexed BAM files
  • VCF variant files
  • Consensus sequences (FASTA)
  • Pangolin lineage assignments
  • Phylogenetic tree (Newick format)

For advanced configuration, see the parameters section.

💡 Tip: Use multiqc to summarize all QC results in one place.

You're now ready to use the pipeline!