Workflow Managers

Workflow managers stitch together multiple steps to create a more seamless and reproducible pipeline for data manipulation. If you find yourself regularly running certain programs or files in tandem, or you have a large number of files you want to process in a similar manner, you should consider finding or developing a pipeline for your efforts.

Common workflow managers include Nextflow, Snakemake, and Cromwell (an execution engine for workflows written in WDL).

Nextflow

Nextflow is available on Rhino as a module. To list the installed versions:

module spider Nextflow

If the pipeline uses containerization, you should also load an Apptainer module.
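
A session setup along these lines may be a useful starting point. This is a minimal sketch, not a tested Rhino recipe: the unversioned module names Nextflow and Apptainer are assumptions, and load_pipeline_modules is a hypothetical helper name. Use module spider to confirm the exact names and versions available.

```shell
# Load the tools a containerized Nextflow pipeline needs.
# ASSUMPTION: the unversioned names "Nextflow" and "Apptainer" resolve to a
# default version; confirm with `module spider Nextflow` and
# `module spider Apptainer` before relying on them.
load_pipeline_modules() {
    # The module command only exists where environment modules are set up
    if ! command -v module >/dev/null 2>&1; then
        echo "module command not found; is this a Rhino session?" >&2
        return 1
    fi
    for m in Nextflow Apptainer; do
        module load "$m" || return 1
    done
}

if load_pipeline_modules; then
    nextflow -version   # confirm Nextflow is now on PATH
else
    echo "modules were not loaded" >&2
fi
```

For reproducibility, consider loading a specific version rather than the default once you know which versions are installed.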

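Execute the pipeline with the command below, replacing [local,slurm] with the name of one profile defined in the config file. Nextflow's own options, such as -c and -profile, take a single dash; double-dash arguments are passed to the pipeline as parameters.

nextflow run ./main.nf -c nextflow.config -profile [local,slurm]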
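
The local and slurm profile names only work if the config file defines them. The sketch below writes a small example config to show what such definitions might look like. The file name example.nextflow.config and the profile contents are assumptions for illustration; a real pipeline's config will usually set more options (queue, resource limits, container settings).

```shell
# Write an example config that defines a "local" and a "slurm" profile.
# ASSUMPTION: file name and profile contents are illustrative only.
cat > example.nextflow.config <<'EOF'
profiles {
    local {
        // run every process on the current machine
        process.executor = 'local'
    }
    slurm {
        // submit every process as a Slurm job
        process.executor = 'slurm'
    }
}
EOF

# A profile is then selected at run time, for example:
#   nextflow run ./main.nf -c example.nextflow.config -profile slurm
```

Keeping executor settings in profiles lets the same pipeline code run on a single node for testing and on the cluster for production without edits.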

nf-core

nf-core is an open-source community repository of bioinformatics workflows written in Nextflow. It provides modules, subworkflows, and full workflows for many types of bioinformatics analyses. Before creating a Nextflow workflow from scratch, check whether you can reuse existing code or find a workflow that already does what you need. There is also an institutional profile available for Fred Hutch to run nf-core workflows on Rhino. See here for more details.
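
As a rough illustration of running an existing nf-core workflow, the sketch below launches a pipeline with its bundled test data. The choice of nf-core/rnaseq and the output directory name results are assumptions made for the example, and the Fred Hutch institutional profile is deliberately not named here; consult the nf-core documentation for the profile to add on Rhino.

```shell
# ASSUMPTION: nf-core/rnaseq is only an example pipeline and "results" is an
# arbitrary output directory; substitute the workflow you actually need.
pipeline="nf-core/rnaseq"
outdir="results"

if command -v nextflow >/dev/null 2>&1; then
    # "test" uses the pipeline's small bundled dataset;
    # "apptainer" runs each step inside an Apptainer container.
    nextflow run "$pipeline" -profile test,apptainer --outdir "$outdir"
else
    echo "nextflow not found; load the Nextflow module first" >&2
fi
```

Running the test profile first is a quick way to check that Nextflow and Apptainer are working before pointing the pipeline at your own data.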

Resources