Protein Data

A FASTA file can be uploaded to provide protein sequence data.


If the FASTA file is omitted then protein sequences are retrieved by looking up accession numbers via the UniprotKB web service. This assumes that the sequences used in the search correspond exactly with those of valid, current UniprotKB accession numbers.


Protein Identifiers

If a FASTA file is provided:

then the protein identifiers (columns 'Protein1' and 'Protein2') must match identifiers in the FASTA file. In a FASTA file, the word following the ">" symbol is the identifier of the sequence, and the rest of the line is the description.

If a FASTA file is not provided:

protein identifiers are assumed to be six character UniprotKB accession numbers. SwissProt style identifiers of the format: sp|accession|name are also accepted and in this case 'name' will be used for the protein labels.