BLAST
- For other uses, see (disambiguation).
Other questions that researchers use BLAST to answer are
- Which bacterial species have a protein that is related in lineage to a certain protein whose amino-acid sequence I know?
- Where does the DNA that I've just sequenced come from?
- What other genes encode proteins that exhibit structures or motifs such as the one I've just determined?
| Table of contents |
|
2 Program
2.1 Nucleotide-nucleotide BLAST (blastn)
3 External links2.2 Protein-protein BLAST (blastp) 2.3 Position-Specific Iterative BLAST (PSI-BLAST) |
BLAST is designed to take a query sequence (called the target sequence) and pairwise comparing it to all the sequences in a large (multi-gigabyte) library, finding the most similar sequences. Because it is comparing the target sequence to so many other sequences, the BLAST algorthm must be extremely fast. The algorithm works by searching for small regions that are exactly the same in the two sequences and then attempting to extend the alignment to either side until the comparison score reaches a certain threshold.
The BLAST web server, hosted by the NCBI, allows anyone with a web browser to perform similarity searches against constantly updated databases of proteins and DNA that include most of the newly sequenced organisms. The server includes many programs, but the most important are the following ones:
This program, given a DNA query, returns the most similar DNA sequences from the DNA database that the user specifies.
This program, given a protein query, returns the most similar DNA sequences from the protein database that the user specifies.
One of the more recent BLAST programs, this program is used for finding distant relatives of a protein. First, a list of all closely related proteins is created. Then these proteins are combined into a "profile" that is a sort of average sequence. A query against the protein database is then run using this profile, and a larger group of proteins found. This larger group is used to construct another profile, and the process is repeated.
By including related proteins in the search, PSI-BLAST is much more sensitive in picking up distant evolutionary relationships than the standard protein-protein BLAST.Algorithm
Program
Nucleotide-nucleotide BLAST (blastn)
Protein-protein BLAST (blastp)
Position-Specific Iterative BLAST (PSI-BLAST)