Difference between revisions of "Applications/Ncbi-blast"

From HPC
Jump to: navigation , search
(Created page with "==Application Details== * Description: NCBI-Blast finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence d...")
 
m (Batch Script)
 
(11 intermediate revisions by 2 users not shown)
Line 2: Line 2:
  
 
* Description: NCBI-Blast finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.  
 
* Description: NCBI-Blast finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.  
* Version: 2.4.0
+
* Version: 2.4.0 , 2.6.0
* Modules: ncbi-blast/2.4.0
+
* Modules: ncbi-blast/2.4.0, ncbi-blast/2.6.0
 
* Licence: Open-source (BLAST is a registered trademark of the National Library of Medicine)
 
* Licence: Open-source (BLAST is a registered trademark of the National Library of Medicine)
  
 
==Usage Examples==
 
==Usage Examples==
 +
  
 
===Command Set===
 
===Command Set===
  
{| class="wikitable"
+
ncbi-blast provides the following commands:
| style="width:50%" | <Strong></Strong>
 
| style="width:50%" | <Strong></Strong>
 
|-
 
| [[Applications|Applications]]
 
| How to use the application modules that are available on Viper.
 
|-
 
| [[General|General]]
 
| General information about using Viper.
 
|-
 
| [[Training|Training]]
 
| Training information about how to extend your knowledge of using Viper.
 
|-
 
| [[Programming|Programming]]
 
| Programming support on writing and compiling programs in a range of different languages available.
 
|-
 
|}
 
blastdb_aliastool 
 
blastdbcmd     
 
blastn 
 
blastx           
 
deltablast 
 
legacy_blast.pl 
 
makembindex   
 
psiblast 
 
rpstblastn 
 
tblastn 
 
update_blastdb.pl
 
blastdbcheck     
 
blast_formatter 
 
blastp 
 
convert2blastmask 
 
dustmasker 
 
makeblastdb     
 
makeprofiledb 
 
rpsblast 
 
segmasker 
 
tblastx 
 
windowmasker
 
  
 +
* blast_formatter 
 +
* blastdb_aliastool 
 +
* blastdbcheck     
 +
* blastdbcmd     
 +
* blastn 
 +
* blastp 
 +
* blastx           
 +
* convert2blastmask 
 +
* deltablast 
 +
* dustmasker 
 +
* legacy_blast.pl 
 +
* makeblastdb     
 +
* makembindex   
 +
* makeprofiledb 
 +
* psiblast 
 +
* rpsblast 
 +
* rpstblastn 
 +
* segmasker 
 +
* tblastn 
 +
* tblastx 
 +
* update_blastdb.pl
 +
* windowmasker
  
<pre style="background-color: #f5f5dc; color: black; font-family: monospace, sans-serif;">
 
  
[username@login] module load abyss/1.5.2/gcc-4.9.3
+
===Interactive Session===
[username@login] wget http://www.bcgsc.ca/platform/bioinfo/software/abyss/releases/1.3.4/test-data.tar.gz
 
[username@login] tar xzvf test-data.tar.gz
 
[username@login} abyss-pe k=25 name=test \
 
    in='test-data/reads1.fastq test-data/reads2.fastq'
 
  
</pre>
+
<pre style="background-color: #000; color: white; font-family: monospace, sans-serif;">
  
===Calculate assembly contiguity statistics===
+
[username@login01 ~]$ interactive
 
+
salloc: Granted job allocation 3619736
<pre style="background-color: #f5f5dc; color: black; font-family: monospace, sans-serif;">
+
Job ID 3619736 connecting to c081, please wait...
 
+
c081.vc-main
[username@login] module load abyss/1.5.2/gcc-4.9.3
+
[username@c081 ~]$ module add ncbi-blast/2.
[username@login] abyss-fac test-unitigs.fa
+
ncbi-blast/2.11.0  ncbi-blast/2.12.0  ncbi-blast/2.4.0  ncbi-blast/2.6.0
 +
[username@c081 ~]$ module add ncbi-blast/2.6.0
 +
[username@c081 ~] blastp <user options>
  
 
</pre>
 
</pre>
  
===Parallel processing===
+
===Batch Script===
  
The np option of abyss-pe specifies the number of processes to use for the parallel MPI job. Without any MPI configuration, this will allow you to use multiple cores on a single machine. To use multiple machines for assembly, you must create a hostfile for mpirun, which is described in the mpirun man page.
+
<pre style="background-color: #f5f5dc; color: black; font-family: monospace, sans-serif;">
  
Do not run '''mpirun -np 8 abyss-pe'''. To run ABySS with 8 threads, use '''abyss-pe np=8'''. The abyss-pe driver script will start the MPI process, like so: '''mpirun -np 8 ABYSS-P.'''
+
#!/bin/bash
  
The paired-end assembly stage is multithreaded, but must run on a single machine. The number of threads to use may be specified with the parameter j. The default value for j is the value of np.
+
#SBATCH -J blast
 +
#SBATCH -N 1
 +
#SBATCH --ntasks-per-node 1
 +
#SBATCH -o job-%j.out
 +
#SBATCH -e job-%j.out
 +
#SBATCH -p compute
  
'''Note''': this example is done on a high memory node, usually access would be achieved with the scheduler
+
# perfoms NR blast (blastx)
 
+
infile="$1"
<pre style="background-color: black; color: white; border: 2px solid black; font-family: monospace, sans-serif;">
+
outfile="$(basename "${infile%.*}").xml"
 
+
database="/swissprot/uniprot_sprot"
[username@c230 ~]$ module add abyss/1.5.2/gcc-4.9.3
 
[username@c230 ~]$ mpirun abyss-pe np=40
 
 
 
</pre>
 
 
 
Through '''SLURM''' this would become the script:
 
 
 
<pre style="background-color: black; color: white; border: 2px solid black; font-family: monospace, sans-serif;">
 
 
 
#!/bin/bash
 
 
 
#SBATCH -J abyss
 
#SBATCH -p highmem
 
#SBATCH -N 2
 
#SBATCH --ntasks-per-node=40
 
#SBATCH -o %N.%j.%a.out
 
#SBATCH -e %N.%j.%a.err
 
#SBATCH --exclusive
 
#SBATCH -t 00:30:00
 
  
 
module purge
 
module purge
module load abyss/1.5.2/gcc-4.9.3
+
module load ncbi-blast/2.6.0
 
 
#Run your ABySS commands
 
  
abyss-pe name=test k=48 n=8 in='test-1.fa test-3.fa'
+
blastp \
 +
-query "${infile}" \
 +
-db "${database}" \
 +
-out "${outfile}".xml \
 +
-evalue 1e-5 \
 +
-num_threads 8 \
 +
-max_target_seqs 50 \
 +
-outfmt 5
  
 
</pre>
 
</pre>
Line 113: Line 86:
 
==Further Information==
 
==Further Information==
  
* [http://computing.bio.cam.ac.uk/local/doc/abyss.html http://computing.bio.cam.ac.uk/local/doc/abyss.html]
+
* [https://blast.ncbi.nlm.nih.gov/Blast.cgi https://blast.ncbi.nlm.nih.gov/Blast.cgi]
  
{|
+
{{Modulepagenav}}
|style="width:5%; border-width: 0" | [[File:icon_home.png]]
 
|style="width:95%; border-width: 0" |
 
* [[Main_Page|Home]]
 
* [[Applications|Application support]]
 
* [[General|General]]
 
* [[Training|Training]]
 
* [[Programming|Programming support]]
 
|-
 
|}
 

Latest revision as of 12:11, 17 November 2022

Application Details

  • Description: NCBI-Blast finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.
  • Version: 2.4.0 , 2.6.0
  • Modules: ncbi-blast/2.4.0, ncbi-blast/2.6.0
  • Licence: Open-source (BLAST is a registered trademark of the National Library of Medicine)

Usage Examples

Command Set

ncbi-blast provides the following commands:

  • blast_formatter
  • blastdb_aliastool
  • blastdbcheck
  • blastdbcmd
  • blastn
  • blastp
  • blastx
  • convert2blastmask
  • deltablast
  • dustmasker
  • legacy_blast.pl
  • makeblastdb
  • makembindex
  • makeprofiledb
  • psiblast
  • rpsblast
  • rpstblastn
  • segmasker
  • tblastn
  • tblastx
  • update_blastdb.pl
  • windowmasker


Interactive Session


[username@login01 ~]$ interactive
salloc: Granted job allocation 3619736
Job ID 3619736 connecting to c081, please wait...
c081.vc-main
[username@c081 ~]$ module add ncbi-blast/2.
ncbi-blast/2.11.0  ncbi-blast/2.12.0  ncbi-blast/2.4.0   ncbi-blast/2.6.0
[username@c081 ~]$ module add ncbi-blast/2.6.0
[username@c081 ~] blastp <user options>

Batch Script


#!/bin/bash

#SBATCH -J blast
#SBATCH -N 1
#SBATCH --ntasks-per-node 1
#SBATCH -o job-%j.out
#SBATCH -e job-%j.out
#SBATCH -p compute

# perfoms NR blast (blastx)
infile="$1"
outfile="$(basename "${infile%.*}").xml"
database="/swissprot/uniprot_sprot"

module purge
module load ncbi-blast/2.6.0

blastp \
 -query "${infile}" \
 -db "${database}" \
 -out "${outfile}".xml \
 -evalue 1e-5 \
 -num_threads 8 \
 -max_target_seqs 50 \
 -outfmt 5

Further Information





Modules | Main Page | Further Topics