Home page Term 1 Term 2 Term 3 About me Faculty website |
Nucleic Acid Sequence DatabasesAssembly quality analysisSpecies: Homo sapiens (russian name: человек разумный)![]() There are 206 assemblies overall. For the following analysis the assembly called GRCh38.p12 was chosen.
Feature keysBelow there is a list of feature keys with links to sequence annotations containing them and coordinates of the corresponding features in the corresponding sequences.1)Centromere Link Position:1..305 Description: region of biological interest identified as a centromere and which has been experimentally characterized. 2)Exon Link Position:101..1036 Description: region of genome that codes for portion of spliced mRNA, rRNA and tRNA; may contain 5'UTR, all CDSs and 3' UTR. 3)Mat_peptide Link Position:join(1598..1694,2244..2386) Description: mature peptide or protein coding sequence; coding sequence for the mature or final peptide or protein product following post-translational modification; the location does not include the stop codon (unlike the corresponding CDS). 4)Intron Link Position:332..1589 Description: a segment of DNA that is transcribed, but removed from within the transcript by splicing together the sequences (exons) on either side of it. 5)Sig_peptide Link Position:join(277..331,1590..1597) Description: signal peptide coding sequence; coding sequence for an N-terminal domain of a secreted protein; this domain is involved in attaching nascent polypeptide to the membrane leader sequence. 6)Regulatory Link Position:215..223 Description: any region of sequence that functions in the regulation of transcription, translation, replication or chromatin structure. 7)mRNA Link Position:join(248..331,1590..1694,2244..2386) Description: messenger RNA; includes 5'untranslated region (5'UTR), coding sequences (CDS, exon) and 3'untranslated region (3'UTR). Genome projectName: The 100,000 Genomes ProjectAims: Better understanding of rare genetic diseases and cancer, paving the way for future therapeutic methods. Launch year: 2012 Link to the webpage: Link Organisation: Genomics England (in collaboration with NHS) Country: United Kingdom Total number of sequenced genomes (as planned): 100 000 Number of currently sequenced genomes: 87 231 (October 1, 2018) Last publication (link): The latest publication has not been placed on PubMed. Here is the link Mitochondrial Genes of a cryptomonadThe search was conducted on the ENA website.Search query text: tax_tree(3027) AND mol_type="genomic DNA" AND topology="CIRCULAR" AND organelle="mitochondrion" AND dataclass="STD" There was 8 results in Release and 0 in Update. The species chosen was Rhodomonas salina, Russian name is "Родомонас солевой". ![]() AC: AF288090 The following table (link below) was obtained by parsing the ENA entry with a Python script . Table of mitochondrial CDSes of Rhodomonas salina (.xlsx) |