BLAST

The BLAST plugin enables you to BLAST a sequence against any of the genomes in the database, displaying a table of matches and extracting matching sequences.

The function can be accessed by selecting the ‘Analysis’ section on the main contents page.

../_images/analysis.png

Jump to the ‘Analysis’ category, follow the link to BLAST, then click ‘Launch BLAST’.

../_images/blast.png

Alternatively,it can be accessed following a query by clicking the ‘BLAST’ button in the Analysis list at the bottom of the results table. Please note that the list of functions here may vary depending on the setup of the database.

../_images/blast2.png

Select the isolate records to analyse (on large databases you will need to enter a list of ids). These will be pre-selected if you accessed the plugin following a query. Paste in a sequence to query - this be either a DNA or peptide sequence.

../_images/blast3.png

Click submit. If you are querying against 10 or fewer genomes then the results are run immediately, otherwise the job is sent to the job queue.

A table of BLAST results will be displayed.

../_images/blast4.png

Clicking any of the ‘extract’ buttons to display the matched sequence.

../_images/blast5.png

The extracted sequence is shown along with a translated sequence and flanking sequences.

../_images/blast6.png

At the bottom of the results table are links to export the matching sequences in FASTA format, (optionall) including flanking sequnces. You can also export the table in tab-delimited text or Excel formats.

../_images/blast11.png

Include in results table fieldset

This selection box allows you to choose which isolate provenance fields will be included in the results table.

../_images/blast7.png

Multiple values can be selected by clicking while holding down Ctrl.

Parameters fieldset

This section allows you to modify BLAST parameters. This affects sensitivity and speed.

../_images/blast8.png
  • BLASTN word size - This is the length of the initial identical match that BLAST requires before extending a match (default: 11). Increasing this value improves speed at the expense of sensitivity.

  • BLASTN scoring - This is a dropdown box of combinations of identical base rewards; mismatch penalties; and gap open and extension penalties. BLASTN has a constrained list of allowed values which reflects the available options in the list.

  • Hits per isolate - By default, only the best match is shown. Increase this value to the number of hits you’d like to see per isolate.

  • Flanking length - Set the size of the upstream and downstream flanking sequences that you’d like to include.

  • Use TBLASTX - This compares the six-frame translation of your nucleotide query sequence against the six-frame translation of the contig sequences. This is significantly slower than using BLASTN.

No matches

../_images/blast9.png

Click this option to create a row in the table indicating that a match was not found. This can be useful when screening a large number of isolates.

Filter fieldset

This section allows you to further filter your collection of isolates and the contig sequences to include.

../_images/blast10.png

Available options are:

  • Sequence method - Choose to only analyse contigs that have been generated using a particular method. This depends on the method being set when the contigs were uploaded.

  • Project - Only include isolates belonging to the chosen project. This enables you to select all isolates and filter to a project.