{"id":6033,"date":"2022-04-29T11:59:46","date_gmt":"2022-04-29T10:59:46","guid":{"rendered":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/?page_id=6033"},"modified":"2022-04-29T15:07:25","modified_gmt":"2022-04-29T14:07:25","slug":"bbtools","status":"publish","type":"page","link":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/software\/applications\/bbtools\/","title":{"rendered":"BBTools"},"content":{"rendered":"<h2>Overview<\/h2>\n<p><a href=\"https:\/\/jgi.doe.gov\/data-and-tools\/software-tools\/bbtools\/\">BBTools<\/a><\/p>\n<p>BBTools is a suite of fast, multithreaded bioinformatics tools designed for analysis of DNA and RNA sequence data. BBTools can handle common sequencing file formats such as fastq, fasta, sam, scarf, fasta+qual, compressed or raw, with autodetection of quality encoding and interleaving. Program descriptions and options are shown when running the shell scripts with no parameters.<\/p>\n<p>The BBTools suite includes programs such as:<\/p>\n<ul>\n<li>bbduk \u2013 filters or trims reads for adapters and contaminants using k-mers<\/li>\n<li>bbmap \u2013 short-read aligner for DNA and RNA-seq data<\/li>\n<li>bbmerge \u2013 merges overlapping or nonoverlapping pairs into a single reads<\/li>\n<li>reformat \u2013 converts sequence files between different formats such as fastq and fasta<\/li>\n<\/ul>\n<p>Version 38.96 is installed on the CSF.<\/p>\n<h2>Restrictions on use<\/h2>\n<p>BBTools is open source and free for unlimited use, and is used regularly by DOE JGI and other institutions around the world.<br \/>\nBBtools may be cited using the primary website: BBMap \u2013 Bushnell B. \u2013 <a href=\"http:\/\/sourceforge.net\/projects\/bbmap\/\">http:\/\/sourceforge.net\/projects\/bbmap\/<\/a><\/p>\n<h2>Set up procedure<\/h2>\n<p>We now recommend loading modulefiles within your jobscript so that you have a full record of how the job was run. See the example jobscript below for how to do this. Alternatively, you may load modulefiles on the login node and let the job <abbr title=\"add '#$ -V' to your jobscript\">inherit these settings<\/abbr>.<\/p>\n<p>Load one of the following modulefiles:<\/p>\n<pre>module load apps\/binapps\/bbtools\/38.96\r\n<\/pre>\n<h2>Running the application<\/h2>\n<p>Please do not run bbtools on the login node. Jobs should be submitted to the compute nodes via batch.<\/p>\n<p>Program descriptions and options are shown when running the shell scripts with no parameters, e.g.<\/p>\n<pre>module load apps\/binapps\/bbtools\/38.96\r\n.\/bbmap.sh\r\n\r\n##### omitted for clarity #####\r\n\r\nTo index:     bbmap.sh ref=\r\nTo map:       bbmap.sh in= out=<output>\r\nTo map without writing an index:\r\n    bbmap.sh ref=&lt;reference fasta&gt; in=&lt;reads&gt; out=&lt;output sam&gt; nodisk<output>\r\n\r\n##### omitted for clarity #####\r\n\r\n<\/output><\/output><\/pre>\n<h3>Serial batch job submission<\/h3>\n<p>Create a batch submission script (which will load the modulefile in the jobscript), for example:<\/p>\n<pre>#!\/bin\/bash --login\r\n#$ -cwd             # Job will run from the current directory\r\n### You may need to add a high memory flag here\r\n\r\n# Choose your required version\r\nmodule load apps\/binapps\/bbtools\/38.96\r\n\r\n# Choose the program you would like to run, there are various programs available but for illustrative purposes have used bbmap\r\n# The below will map without writing an index\r\n.\/bbmap  ref=&lt;reference fasta&gt; in=&lt;reads&gt; out=&lt;output sam&gt; nodisk<output>\r\n\r\n\r\n<\/output><\/pre>\n<p>Submit the jobscript using:<\/p>\n<pre>qsub <em>scriptname<\/em><\/pre>\n<p>where <em>scriptname<\/em> is the name of your jobscript.<\/p>\n<p>If you need more RAM (memory) to complete the analysis successfully, and you may well do!, please add the flags mentioned at the <a href=\"\/csf3\/batch\/high-memory-jobs\/\">high-memory jobs<\/a> page for more information.<\/p>\n<h3>Parallel batch job submission<\/h3>\n<p><strong>PLEASE NOTE: BBMap, like many other BBTool programs are multithreaded for both indexing and mapping. It will use all available threads unless capped with the <code>\u201ct=\u201d flag<\/code>, but it scales near-linearly with processor cores<\/strong><br \/>\nCreate a batch submission script (which will load the modulefile in the jobscript), for example:<\/p>\n<pre>#!\/bin\/bash --login\r\n#$ -cwd             # Job will run from the current directory\r\n#$ -pe smp.pe 16    # Number of cores, can be 2--32\r\n### You may need to add a high memory flag here\r\n\r\n# Choose your required version\r\nmodule load apps\/binapps\/bbtools\/38.96\r\n\r\n# $NSLOTS is automatically set to the number of cores requested above\r\n.\/bbmap  t=$NSLOTS ref=&lt;reference fasta&gt; in=&lt;reads&gt; out=&lt;output sam&gt; nodisk<output>\r\n\r\n<\/output><\/pre>\n<p>Submit the jobscript using:<\/p>\n<pre>qsub <em>scriptname<\/em><\/pre>\n<p>where <em>scriptname<\/em> is the name of your jobscript.<\/p>\n<p>If you need more RAM (memory) to complete the analysis successfully, and you may well do!, please add the flags mentioned at the <a href=\"\/csf3\/batch\/high-memory-jobs\/\">high-memory jobs<\/a> page for more information.<\/p>\n<h2>Further info<\/h2>\n<ul>\n<li><a href=\"https:\/\/jgi.doe.gov\/data-and-tools\/software-tools\/bbtools\/bb-tools-user-guide\/\">BBTools User Guide<\/a><\/li>\n<\/ul>\n<h2>Updates<\/h2>\n<p>None.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Overview BBTools BBTools is a suite of fast, multithreaded bioinformatics tools designed for analysis of DNA and RNA sequence data. BBTools can handle common sequencing file formats such as fastq, fasta, sam, scarf, fasta+qual, compressed or raw, with autodetection of quality encoding and interleaving. Program descriptions and options are shown when running the shell scripts with no parameters. The BBTools suite includes programs such as: bbduk \u2013 filters or trims reads for adapters and contaminants.. <a href=\"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/software\/applications\/bbtools\/\">Read more &raquo;<\/a><\/p>\n","protected":false},"author":12,"featured_media":0,"parent":86,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-6033","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/pages\/6033","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/comments?post=6033"}],"version-history":[{"count":11,"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/pages\/6033\/revisions"}],"predecessor-version":[{"id":6046,"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/pages\/6033\/revisions\/6046"}],"up":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/pages\/86"}],"wp:attachment":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/csf3\/wp-json\/wp\/v2\/media?parent=6033"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}