Download datasets (right-click and 'save target as'):

Notes: The protein models are peptide sequences predicted from the genomic assembly, supported by evidence from EST alignments. The EST dataset consists of de novo assembled next-generation RNA-seq data.

 1.) P. purpureum genomic assembly (nucleotide)
 2.) Updated gene models (protein)
 3.) Assembled ESTs (nucleotide)
 4.) CDS (coding) sequences corresponding to the gene models (nucleotide)
 5.) Gene model BLASTP output (Excel .xlsx, UPDATED 03-29-2012 for new gene models)
 6.) Protein alignments (PHYLIP .aln format) and RAxML trees (NEWICK .tre format) for the NEW protein models
 7.) P. purpureum putative transporters (.xls format, via TCDB)
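
Since the CDS sequences (item 4) correspond to the gene models (item 2), one quick sanity check after downloading is to translate each CDS and compare it with the matching protein model. The following is only a minimal sketch under several assumptions that this page does not confirm: that both files are in FASTA format, that matching records share the same identifier (the first word of the header line), and that the standard genetic code applies. The function names are hypothetical and the data in the usage lines are toy records, not taken from the actual datasets.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# Assumption: the nuclear gene models use this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_fasta(lines):
    """Return {identifier: sequence} from an iterable of FASTA lines."""
    records = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else ""  # identifier = first header word
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(chunks) for key, chunks in records.items()}


def translate(cds):
    """Translate a nucleotide CDS in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


def mismatched_models(cds_records, protein_records):
    """List CDS identifiers whose translation differs from the protein model."""
    bad = []
    for name, cds in cds_records.items():
        expected = protein_records.get(name)
        # Compare without a trailing stop symbol, which files may or may not keep.
        if expected is None or translate(cds).rstrip("*") != expected.rstrip("*"):
            bad.append(name)
    return bad


# Toy usage with in-memory records (hypothetical identifiers).
toy_cds = parse_fasta([">model1", "ATGGCC", "TAA"])
toy_proteins = parse_fasta([">model1", "MA"])
print(mismatched_models(toy_cds, toy_proteins))
```

To run this against the downloaded files, pass open file handles to parse_fasta in place of the toy lists. A mismatch does not necessarily mean a file is wrong; it may only show that one of the assumptions above does not hold for these datasets.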