UniProt

Tasks

There is some data about proteoms of Escherichia coli (strain K12) and Lentibacillus amyloliquefaciens.

  1. Table 1. The common information about proteomes.

    OrganismProteome IDThe number of sequencesThe number of residues
    Escherichia coli (strain K12)UP00000062543061356195
    Lentibacillus amyloliquefaciensUP00005033135531007125

  2. Table 2. The frequency of letters' occurrence in the proteomes.

    The residue% in E. coli% L. amyloliquefaciensThe difference of %
    L10.6724%9.2923%1.3801%
    E5.7622%7.5462%-1.7840%
    I6.0102%7.5356%-1.5254%
    A9.5140%7.2473%2.2667%
    G7.3746%7.0370%0.3376%
    V7.0729%6.8575%0.2154%
    K4.4062%6.2792%-1.8730%
    S5.8023%6.0143%-0.2120%
    D5.1513%5.8024%-0.6511%
    T5.3986%5.5296%-0.1310%
    N3.9453%4.6495%-0.7042%
    F3.8915%4.4527%-0.5612%
    R5.5110%4.0400%1.4710%
    Q4.4398%3.8491%0.5907%
    P4.4264%3.6161%0.8103%
    Y2.8452%3.4682%-0.6230%
    M2.8200%2.9683%-0.1483%
    H2.2667%2.1316%0.1351%
    W1.5311%1.0599%0.4712%
    C1.1581%0.6233%0.5348%
    U0.0002%0.0000%0.0002%

    *Calculated by python: program (if you want to download it: program). Command line: "python wordcount.py Escherichia_coli_K12.txt Lentibacillus_amyloliquefaciens.txt wordcount.txt".

    The most common letter in E.coli's and L. amyloliquefaciens' proteoms is Leu. Another common letters: Ala (more in E.coli), Gly (more in E.coli), Val (more in E.coli), Glu (more in L. amyloliquefaciens), Ile (more in L. amyloliquefaciens).
    The most rare residue Cys (more in E.coli) and Trp (more in E.coli).
    The fact that there is much more Glu, Ile, Lys in L. amyloliquefaciens' proteome then in E.coli's.

  3. Compare wordcount vs compseq.

    In the process ...

    Wordcount-help
    Compseq-help


Term II
© Potanina Darya, 2017