
Nucleotide database

Last update on the 21st of October, 2017

In this task we study several NCBI nucleotide databases and the capabilities they provide.

List of downloads
File                         Link
Mitochondrion protein genes  protein_genes.ods

Brief description of tasks

The following tasks were done and reported via Google Forms:

  • quality assessment of the Zea mays genome assembly;
  • description of 7 keys used in feature tables;
  • description of the Global Ocean Sampling Expedition project;
  • search for and count of the mitochondrial genomes of Paramecium caudatum.

Some tasks required additional information, which is stored on this page.

Mitochondrial proteins of P. caudatum

protein_genes.ods

P. caudatum (fig. 1) is one of the most recognizable species in the Ciliophora taxon. To investigate its mitochondrial genome, the query (paramecium caudatum[Organism]) AND gene_in_mitochondrion[PROP] and "complete genome"[KYWD] was submitted to the NCBI databases. Two entries were found; the RefSeq entry under accession NC_014262 was taken for further investigation. The table with the genomic organisation of mitochondrial proteins is available in the protein_genes.ods file.
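
The search above was presumably run through the NCBI web interface. As a rough sketch, the same query could be turned into an E-utilities ESearch request. The target database (nuccore), the retmax value and the helper name build_esearch_url are assumptions made for illustration; the code only builds the URL and does not send it.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities search endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

# Query text copied verbatim from the paragraph above.
QUERY = (
    '(paramecium caudatum[Organism]) AND gene_in_mitochondrion[PROP] '
    'and "complete genome"[KYWD]'
)


def build_esearch_url(term, db="nuccore", retmax=20):
    """Return an ESearch URL for `term`.

    `db` and `retmax` are assumed defaults, not values stated in the report.
    """
    params = {"db": db, "term": term, "retmax": retmax}
    # urlencode percent-escapes brackets, quotes and spaces in the query.
    return ESEARCH_URL + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_esearch_url(QUERY))
```

Opening the printed URL in a browser should return an XML list of matching record identifiers, which can then be compared with the two entries mentioned above.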

Paramecium caudatum Ehrenberg, 1833.jpg
Fig. 1. Paramecium caudatum under a microscope.

Whole mitochondrial genomes of several species were studied. The search results and processed data are shown in table 1. Mitochondrial genome (MTgenome) size and the number of mitochondrial genes are correlated, as are MTgenome size and genome size. Interestingly, genome size grows faster than MTgenome size, which points to the different functions of mitochondria and nuclei in cell life.

Table 1. Mitochondrial genome properties of several kingdoms.
Group           Organism                          MTgenome size, bp  Genes  Genome size, Mb  MT/genome ratio, 10^-6
Opisthokonta    Homo sapiens                                  16569     37             2996                    5.53
Chromalveolata  Paramecium caudatum                           43660     49               30                 1455.33
Opisthokonta    Saccharomyces cerevisiae YJM1242              95658     46               12                 7971.50
Excavata        Jakoba libera strain ATCC 50422              100252    115               NA                      NA
Viridiplantae   Arabidopsis thaliana                         366924    131              116                 3163.14
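
The last column of table 1 can be recomputed from the two size columns, and the relation between MTgenome size and gene count can be checked with a Pearson coefficient. A minimal sketch in Python follows; the helper names mt_ratio and pearson are illustrative, and five data points are too few for the coefficient to be more than a rough indication.

```python
from math import sqrt

# Rows copied from table 1:
# (organism, MTgenome size in bp, genes, genome size in Mb or None for NA).
TABLE_1 = [
    ("Homo sapiens", 16569, 37, 2996),
    ("Paramecium caudatum", 43660, 49, 30),
    ("Saccharomyces cerevisiae YJM1242", 95658, 46, 12),
    ("Jakoba libera strain ATCC 50422", 100252, 115, None),
    ("Arabidopsis thaliana", 366924, 131, 116),
]


def mt_ratio(mt_bp, genome_mb):
    """MT/genome ratio in units of 10^-6, or None if genome size is missing."""
    if genome_mb is None:
        return None
    # bp divided by Mb is already expressed in units of 10^-6.
    return mt_bp / genome_mb


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


if __name__ == "__main__":
    for organism, mt_bp, genes, genome_mb in TABLE_1:
        ratio = mt_ratio(mt_bp, genome_mb)
        print(organism, "NA" if ratio is None else round(ratio, 2))
    sizes = [row[1] for row in TABLE_1]
    gene_counts = [row[2] for row in TABLE_1]
    print("r(MTgenome size, genes) =", round(pearson(sizes, gene_counts), 2))
```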

Genome sizes

To estimate typical genome sizes for the kingdoms of life, the NCBI Genome database was used. The obtained data were processed in LibreOffice Calc, and the results were merged into table 2. The median size is proposed as typical for the corresponding group of organisms.

Table 2. Genome size of kingdoms, bp.
Group      Minimal    Median      Maximal
Viroids        246       348          434
Viruses        220     10044      2473870
Archaea     137797   1580350      6451200
Bacteria    104827   2752850     68003500
Eukaryota   245805  53011000  27602700000
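
The processing itself was done in LibreOffice Calc; an equivalent summary could be sketched in Python as shown here. The function name summarise and the toy input are hypothetical and are not the NCBI Genome data behind table 2.

```python
from statistics import median


def summarise(sizes_by_group):
    """Map each group name to (minimal, median, maximal) genome size."""
    summary = {}
    for group, sizes in sizes_by_group.items():
        if not sizes:
            continue  # skip groups without any records
        summary[group] = (min(sizes), median(sizes), max(sizes))
    return summary


# Hypothetical toy input for illustration only, sizes in bp.
EXAMPLE = {
    "GroupA": [300, 250, 400],
    "GroupB": [10, 40, 20, 30],
}

if __name__ == "__main__":
    for group, (smallest, typical, largest) in summarise(EXAMPLE).items():
        print(group, smallest, typical, largest)
```

The median is less sensitive than the mean to a few very large genomes, which is one reason it suits groups with a wide size range such as Eukaryota.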