Statistics
Reannotation typology | Rel. 1 (03-JUL-09) | Rel. 8 (16-JUN-11) | Rel. 9.1 (04-JUL-11) | Rel. 10 (14-DEC-11) |
---|---|---|---|---|
Error in molecule topology | 173 | 374 | 402 | 438 |
Modification of the Organism Species field | 6 | 8 | 8 | 20 |
Modification of the DEscription field | 120 | 221 | 222 | 230 |
Elimination of erroneous FTKey | 8 | 11 | 11 | 13 |
Unannotated gene | 67 | 86 | 86 | 90 |
Error in rRNA gene name | 7 | 4 | 4 | 4 |
Error in tRNA gene name | 116 | 143 | 143 | 147 |
Error in CDS gene name | 0 | 2 | 2 | 2 |
Error in strand of rRNA | 12 | 34 | 34 | 37 |
Error in strand of tRNA | 383 | 546 | 547 | 544 |
Modification of tRNA anticodon specificity a | 994 | 2231 | 2280 | 2540 |
Error in tRNA gene boundaries | 139 b | 2235 c | 2266 c | 2302 c |
Error in rRNA gene boundaries | 10 | 13 | 15 | 19 |
Error in CDS gene boundaries | 8 | 96 | 205 | 207 |
tRNAs not validated d | 402 | 286 | 286 | 294 |
a: for tRNA genes decoding the same amino acid, i.e. trnG, trnL, trnM and trnS
b: only error in tRNA limits affecting gene order
c: any kind of erroneous tRNA limits, including those not affecting gene order
d: this number includes tRNAs not validated by the reannotation pipeline and short tRNAs (< 45 bp)
not checked by the pipeline due to their inability to form a complete secondary structure.
Taxonomical Statistics
Entry number of the main Metazoa taxa (Rel. 10, 14-DEC-11).Taxon | Total | Partial | Linear | Congeneric | Mini-circle |
---|---|---|---|---|---|
Acanthocephala | 1 | 0 | 0 | 0 | 0 |
Annelida | 18 | 9 | 0 | 0 | 0 |
Arthropoda | 527 | 112 | 1 | 156 | 2 |
Brachiopoda | 5 | 1 | 0 | 0 | 0 |
Bryozoa | 3 | 0 | 0 | 0 | 0 |
Cephalochordata | 9 | 0 | 0 | 8 | 0 |
Chaetognatha | 5 | 0 | 0 | 0 | 0 |
Cnidaria | 70 | 22 | 25 | 21 | 0 |
Ctenophora | 1 | 0 | 0 | 0 | 0 |
Echinodermata | 29 | 3 | 0 | 9 | 0 |
Echiura | 2 | 0 | 0 | 2 | 0 |
Entoprocta | 2 | 0 | 0 | 0 | 0 |
Hemichordata | 3 | 0 | 0 | 2 | 0 |
Hyperotreti | 2 | 0 | 0 | 0 | 0 |
Mollusca | 148 | 14 | 0 | 74 | 0 |
Myzostomida | 2 | 2 | 0 | 0 | 0 |
Nematoda | 75 | 9 | 0 | 44 | 2 |
Nemertea | 5 | 1 | 0 | 3 | 0 |
Onychophora | 6 | 2 | 0 | 2 | 0 |
Platyhelminthes | 45 | 5 | 0 | 34 | 0 |
Porifera | 31 | 2 | 0 | 4 | 0 |
Priapulida | 2 | 1 | 0 | 0 | 0 |
Rotifera | 3 | 0 | 0 | 0 | 0 |
Sipuncula | 3 | 1 | 0 | 0 | 0 |
Tardigrada | 2 | 0 | 0 | 0 | 0 |
Tunicata | 13 | 0 | 0 | 5 | 0 |
Vertebrata | 2062 | 290 | 0 | 990 | 0 |
Xenoturbellida | 1 | 0 | 0 | 0 | 0 |
Total | 3075 | 474 | 26 | 1354 | 4 |
BLAST Database Statistics
Databases referred to Rel. 10 (14-DEC-2011).Database | Sequences |
---|---|
mtDNA | 3075 |
CDS_nt | 39341 |
tRNA | 64528 |
rRNA | 6058 |
NCR >=25nt | 9377 |
Protein | 39335 |
History
Warning on mt entries not included in MitoZoa: four entries have not been included in MitoZoa due to the lack of annotations of all genes and/or low sequence quality (AAPE02072785, FJ403244, HQ330989, HQ415764).
Novelties of Rel. 10: deletion of two duplicated entries (AP002931, NC_004449). Deletion of six entries retired by the Authors or by RefSeq (GU936203, GU936204, JF793665, NC_003170, NC_004383, NC_008943). Substitution of the entry NC_009082 with HM600781, due to uncertain gene annotation. Reannotation of entries belonging to Onychophora. Inclusion of three cnidarian entries shorter than 7 kb, because of their taxonomic importance.
Novelties of Rel. 9: optimization of reannotation of the protein-coding genes; annotation of "loss of highly conserved regions" in protein-coding genes, possibly due to frameshift or nucleotide substitutions.
Novelties of Rel. 8: implementation of a BLAST service against MitoZoa and against five distinct datasets of functionally homogeneous mitochondrial sequences; reannotation of protein-coding genes; identification and standardization of the annotation of protein-coding genes showing frameshift sites post-transcriptionally corrected by RNA editing or programmed translational frameshifting; creation of the new FTkey "prec_ORF" (precursor ORF) including a precursor ORF with frameshift(s) recovered by RNA editing or programmed translational frameshifting; updating of the "Organism Species" (OS) and "Organism Classification" (OC) fields of all existing MitoZoa entries; automated identification of gene order differences in congeneric species, and validation/correction through literature check; deletion of a duplicated entry (EF035448).
Correction in Rel. 7.1: modification of the anticodon specificity of five tRNA genes in entries EU345430, EU747728, EU682403, HM189212 and HM753535.
Novelties of Rel. 6.2: correction of a few bugs in the annotation of some NCR; annotation of all genes in three new entries (D83491, L07095, L07096), and completion of the annotation in two existing entries (AJ426041, AY250707).
Correction in Rel. 6: deletion of 24 duplicated entries (AB016274, AB079597, AB238966, AB242163, AF106038, AJ001562, AM181033, AM181034, AM181036, AP002929, AP004101, AP004355, AP004414, AP004417, AY158677, AY525783, DQ068951, DQ080041, DQ333813, DQ355300, DQ355301, EF071948, FJ590427, FJ752428).
Novelties of Rel. 5: modification of the identification criteria of the "longest non-coding region".
Corrections in Rel. 4.8: modification of annotations and comment lines in 10 entries, mostly belonging to congeneric species (AB360979, AB361005, DQ52643, DQ526431, DQ665851, EU266073, FJ529186, GQ265897, GQ888714, NC_009687), and deletion of the duplicated entry NC_006899.
Novelties of Rel. 4.7: annotation and name standardization of pseudogenes; inclusion of mtDNA entries > 7kb corresponding to subgenomic circles; update of the "Genetic Code" list; inclusion of additional information on gender-specific mitotypes; inclusion of additional information on RNA editing; improvement of the tRNA reannotation pipeline (with correction of small differences in tRNA limits); correction of bugs in rRNA reannotation; correction of bugs related to additional information on rRNAs.