FASTA format

In bioinformatics and biochemistry, the FASTA format is a text-based format for representing either nucleotide sequences or amino acid (protein) sequences, in which nucleotides or amino acids are represented using single-letter codes.

FASTA format
Filename extensions: .fasta, .fas, .fa, .fna, .ffn, .faa, .mpfa, .frn
Internet media type: text/x-fasta
Uniform Type Identifier (UTI): no
Developed by: David J. Lipman, William R. Pearson
Initial release: 1985
Type of format: Bioinformatics
Extended from: ASCII for FASTA
Extended to: FASTQ format
Website: www.ncbi.nlm.nih.gov/BLAST/fasta.shtml

The format allows sequence names and comments to precede the sequences. It originated from the FASTA software package and has since become a near-universal standard in bioinformatics.
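As an illustration of a name and comment preceding a sequence, the following minimal Python sketch builds one record. It assumes the widely used convention that the description line starts with ">" and that sequence text is wrapped onto lines of fixed width. The function name format_fasta_record and the default width of 60 are hypothetical choices made for this example, not part of any specification.

```python
def format_fasta_record(name, sequence, description="", width=60):
    """Return one FASTA record as text (hypothetical helper for illustration).

    Assumes the common convention: a header line starting with ">",
    followed by the sequence wrapped to `width` characters per line.
    """
    # Header: ">" + name, optionally followed by a free-text comment.
    header = f">{name} {description}".rstrip()
    # Split the sequence into fixed-width lines.
    body = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([header] + body) + "\n"


# A short width is used here only to make the wrapping visible.
record = format_fasta_record(
    "seq1", "ACGTACGTAC", "example nucleotide sequence", width=4
)
print(record, end="")
```

With the inputs above, the header line reads ">seq1 example nucleotide sequence" and the ten-letter sequence is split across three lines.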

The simplicity of FASTA format makes it easy to manipulate and parse sequences using text-processing tools and scripting languages.
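To show how little code such parsing can take, here is a minimal sketch of a reader in Python. It assumes the common layout in which each record starts with a header line beginning with ">" and the sequence may span several following lines. The name parse_fasta is a hypothetical helper, and the sketch does not validate the sequence alphabet or handle less common conventions.

```python
def parse_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of text lines.

    Hypothetical helper for illustration; assumes headers start with ">"
    and that all other non-blank lines belong to the current sequence.
    """
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:].strip()
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequence data, possibly wrapped
    if header is not None:
        yield header, "".join(chunks)


example = """>seq1 first example
ACGT
ACGT
>seq2
MKV
"""

records = list(parse_fasta(example.splitlines()))
for name, seq in records:
    print(name, len(seq))
```

Because the function accepts any iterable of lines, an open file object can be passed in place of the example string, so a large file is read one line at a time rather than loaded whole.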

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.