|Organism||Source and release||Number of sequences|
Remark about sequence identifiers
Sequences are usually identifed by a the locus tags defined by the consortia responsible of the annotation (e.g. At5g20240.1). For some draft genomes, we have modified id using scaffold id_ species code (e.g. Phypa_96903)
Correspondance between UniProtKB ( last update: 22 may 2009) was made on the ordered locus when available (in 'Gene names' section).
Otherwise, mapping was done using the first blast hit of blast having anidentity score > 90%.
Kegg  data were download from the KEGG Orthology (KO) Database when available (last update: 02/02/2009)
GO terms, and particularly Plant GOslim, were obtained from the interpro and UniProt.
Clusters are tag by selected pubmed id referenced in UniProt entries. Annotator can also add their own publications.