{"id":559,"date":"2015-06-02T17:25:23","date_gmt":"2015-06-02T17:25:23","guid":{"rendered":"http:\/\/ri.itservices.manchester.ac.uk\/icsf\/?page_id=559"},"modified":"2015-06-02T17:42:48","modified_gmt":"2015-06-02T17:42:48","slug":"ensembl","status":"publish","type":"page","link":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/software\/applications\/ensembl\/","title":{"rendered":"Ensembl Tools"},"content":{"rendered":"<h2>Overview<\/h2>\n<p><a href=\"http:\/\/www.ensembl.org\/info\/docs\/tools\/vep\/index.html\">Ensembl<\/a> provides a Variant Effect Predictor script (written in perl) to determine the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions.<\/p>\n<p>Versions 79 and 80 of Ensembl Tools are installed on the iCSF.<\/p>\n<h2>Restrictions on use<\/h2>\n<p>There are no restrictions on access this software on the iCSF. The software is released under the <a href=\"http:\/\/www.ensembl.org\/info\/about\/legal\/code_licence.html\">Apache License<\/a> and all users are required to read and adhere to the license terms.<\/p>\n<h2>Set up procedure<\/h2>\n<p>To access the software you must first load <em>one<\/em> of the following modulefile:<\/p>\n<pre>\r\nmodule load apps\/binapps\/ensembl\/80\/vep\r\nmodule load apps\/binapps\/ensembl\/79\/vep\r\n<\/pre>\n<p>This will automatically load the perl 5.20.2 modulefile.<\/p>\n<h2>Running the application<\/h2>\n<p>Please note that the VEP script will not be able to connect to an external MySQL database. Hence you should use the script in its <a href=\"http:\/\/www.ensembl.org\/info\/docs\/tools\/vep\/script\/vep_cache.html#offline\">offline<\/a> mode.<\/p>\n<p>You will need to download the cache files for offline use. Please see <a href=\"http:\/\/www.ensembl.org\/info\/docs\/tools\/vep\/script\/vep_cache.html\">Ensembl Caches and Databases Download<\/a> website for full details. A brief summary is given here:<\/p>\n<p>First create a directory for the cache files. By default the VEP script assumes the cache files will be in your home directory in a hidden folder named <code>.vep<\/code>. However due to the size of the cache files (approx 4GB each when compressed, more when uncompressed) it is advised you put them in an <a href=\"http:\/\/ri.itservices.manchester.ac.uk\/rds\/\">RDS area<\/a> &#8211; e.g., if your group has a shared directory on the Isilon storage or if you have a shortcut in your home directory named <code>data<\/code> pointing at additional Isilon storage for your group. In the example below we assume you have a shortcut named <code>data<\/code> in your home directory. But you can specify any directory name using the flag as indicated below.<\/p>\n<pre>\r\n# Create a directory for your VEP cache files\r\ncd ~\/data\r\nmkdir vep\r\ncd vep\r\n\r\n# Download from the VEP website. For example for version 80, a 4GB (compressed) file:\r\nwget ftp:\/\/ftp.ensembl.org\/pub\/release-80\/variation\/VEP\/homo_sapiens_vep_80_GRCh37.tar.gz\r\n\r\n# Uncompress it (expands to 6.1GB)\r\ntar xzf homo_sapiens_vep_80_GRCh37.tar.gz\r\n\r\n# Optional - remove the original compressed archive\r\nrm homo_sapiens_vep_80_GRCh37.tar.gz\r\n<\/pre>\n<p>To run the VEP, ensure you have loaded the modulefile above then:<\/p>\n<pre>\r\n# Notice we use the ~\/data\/vep directory created earlier\r\nvariant_effect_predictor.pl --cache --dir_cache ~\/data\/vep --offline  -i <em>input.vcf<\/em> -o <em>output.txt<\/em>\r\n<\/pre>\n<p>An example input <code>vcf<\/code> file is available by using the filename: <\/p>\n<pre>$ENSEMBL_VEP_DIR\/example_GRCh38.vcf<\/pre>\n<p>This requires the ensembl modulefile to be loaded before use.<\/p>\n<h2>Further info<\/h2>\n<ul>\n<li><a href=\"http:\/\/www.ensembl.org\/info\/docs\/tools\/index.html\">Ensembl Tools website<\/a><\/li>\n<\/ul>\n<h2>Updates<\/h2>\n<p>None.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Overview Ensembl provides a Variant Effect Predictor script (written in perl) to determine the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions. Versions 79 and 80 of Ensembl Tools are installed on the iCSF. Restrictions on use There are no restrictions on access this software on the iCSF. The software is released under the Apache License and all users are required.. <a href=\"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/software\/applications\/ensembl\/\">Read more &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":43,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-559","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/pages\/559","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/comments?post=559"}],"version-history":[{"count":8,"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/pages\/559\/revisions"}],"predecessor-version":[{"id":570,"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/pages\/559\/revisions\/570"}],"up":[{"embeddable":true,"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/pages\/43"}],"wp:attachment":[{"href":"https:\/\/ri.itservices.manchester.ac.uk\/icsf\/wp-json\/wp\/v2\/media?parent=559"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}