Download datasets (right-click and choose 'Save target as'):

Notes: The protein models are peptide sequences predicted from the genomic assembly, supported by evidence from EST alignments. The EST dataset consists of de novo assembled next-generation RNA-seq data.

 1.) P. purpureum genomic assembly (nucleotide)
 2.) Updated gene models (protein)
 3.) Assembled ESTs (nucleotide)
 4.) CDS (coding) sequences corresponding to the gene models (nucleotide)
 5.) Gene model BLASTP output (Excel .xlsx, UPDATED 03-29-2012 for new gene models)
 6.) Protein alignments (PHYLIP .aln format) and RAxML trees (NEWICK .tre format) for the NEW protein models
 7.) P. purpureum putative transporters (.xls format, via TCDB)
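
The nucleotide and protein datasets above are sequence files, so a short reader can be handy for checking a download before using it as a BLAST query. The following is a minimal sketch in Python, assuming the files are plain FASTA text (a '>' header line followed by one or more sequence lines). The function name read_fasta and the example records are made up for illustration and are not taken from the actual datasets.

```python
from typing import Dict, Iterable, Optional


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into a {record_id: sequence} dictionary."""
    records: Dict[str, str] = {}
    current_id: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Header line: use the first whitespace-delimited token as the ID
            parts = line[1:].split()
            current_id = parts[0] if parts else ""
            records[current_id] = ""
        elif current_id is not None:
            # Sequence line: append to the record currently being read
            records[current_id] += line
    return records


# Made-up example records (hypothetical, not from the real gene models)
example = """>model_1 hypothetical protein
MKTAYIAK
QRQISFVK
>model_2
MSLLE
""".splitlines()

sequences = read_fasta(example)
for name, seq in sequences.items():
    print(name, len(seq))
```

To read a downloaded file instead of the inline example, pass an open file handle to read_fasta, since a text file iterates line by line.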

Enter query sequences here in FASTA format

Or upload sequence file in FASTA format:

Program database(s)
And/or upload sequence FASTA file


The query sequence is NOT filtered for low complexity regions by default.

Filter   Low complexity Mask for lookup table only

Expect    Matrix Perform ungapped alignment

Query Genetic Codes (blastx only)

Database Genetic Codes (tblast[nx] only)

Frame shift penalty for blastx

Other advanced options:     

Alignment view

Descriptions    Alignments