LoFreq

Overview

LoFreq*(i.e. LoFreq version 2) is a fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data.

Version 2.1.5 is installed on the CSF. This has been compiled from source against HTSLib 1.20.

Restrictions on use

There are no restrictions on accessing the software on the CSF.

You must cite your usage of LoFreq – please see the citation page for details.

Set up procedure

We now recommend loading modulefiles within your jobscript so that you have a full record of how the job was run. See the example jobscript below for how to do this. Alternatively, you may load modulefiles on the login node and let the job inherit these settings.

Load one of the following modulefiles:

module load apps/gcc/lofreq/2.1.5

Running the application

Please do not run LoFreq on the login node. Jobs should be submitted to the compute nodes via batch.

You may view the help text by running the following on the login node, after loading the modulefile:

lofreq                 # Will show the list of commands supported
lofreq call            # Help about the "call" command
lofreq call-parallel   # Help about the "call-parallel" command
lofreq command         # Help about command

Serial batch job submission

Create a batch submission script (which will load the modulefile in the jobscript), for example:

#!/bin/bash --login
#$ -cwd             # Job will run from the current directory
                    # NO -V line - we load modulefiles in the jobscript

module load apps/gcc/lofreq/2.1.5

lofreq call options in.bam

# For example:
lofreq call -f ref.fa -o vars.vcf aln.bam

Submit the jobscript using:

qsub scriptname

where scriptname is the name of your jobscript.

Parallel batch job submission

Create a batch submission script (which will load the modulefile in the jobscript), for example:

#!/bin/bash --login
#$ -cwd
#### Now choose one of the following parallel environments
#$ -pe smp.pe  8      # Intel hardware, 2--32 cores. Use this for high-memory jobs too.
## OR
#$ -pe amd.pe  8      # AMD Genoa hardware, 2-168 cores.

# The smp.pe environment can run high-memory jobs. Add extra flags as needed (check the docs.)

module load apps/gcc/lofreq/2.1.5

lofreq call-parallel --pp-threads $NSLOTS options in.bam

# For example
lofreq call-parallel --pp-threads $NSLOTS -f ref.fa -o vars.vcf aln.bam
                                    #
                                    # Will be replaced by the number of cores requested above

# The somatic command
lofreq somatic --threads $NSLOTS -n normal.bam -t tumor.bam -f hg19.fa  -o out_ [-d dbsnp.vcf.gz]
                  #
                  # The somatic command uses --threads (not --pp-threads)

Submit the jobscript using:

qsub scriptname

where scriptname is the name of your jobscript.

Further info

Updates

None.

Last modified on October 31, 2024 at 3:09 pm by George Leaver