FASTA on the Sun Grid
By gkrish on Mar 12, 2007
What is FASTA?
FASTA is a set of biological sequence comparison programs for searching protein and DNA sequence databases. More information on FASTA can be found here
FASTA on SUN Grid
Protein sequences and databases can be extremely huge. FASTA uses MPI to make the sequence search and comparison algorithms parallel in nature, meaning they can run simultaneously across multiple nodes. This makes FASTA an ideal candidate for the SUN Grid which is a network of high compute power nodes with MPI support.
If you are expecting a FASTA download link and instructions to compile the source code, you have not realised the power of SUN Grid yet FASTA binaries are already available in the SUN Grid's Application Catalog. All you need to do is check out FASTA from the catalog, prepare your set of inputs and feed it to the application! After checking out the job, see the details of the job for instructions on how to prepare the data and the corresponding fasta command with / without mpi support.
A sample data to test run the FASTA application can be found here.
This data resource contains many sequence / database files to test run both the MPI and non-MPI versions of FASTA.