Tasks
Create a file for the report (<your_name>.docx).
- GENOME
- Chose a virus and download its genome. Socially important viruses are preferable.
- Describe the genome: size, RNA or DNA, number of proteins encoded etc. Chose one protein for further work
- COMPARE TWO GENOMES
Choose genome of another virus from the same family or genus.
- Create dotplot of the two genomes and describe rearrangements.
- Is it correct that borders of the rearrangements are not inside genes?
- FAMILY OF THE PROTEIN
- Choose a protein (viral one or any other)
- Find and download sequences of all homologous proteins within the certain taxon (Family? For example).
- Align the sequences and revise the alignment
- Reconstruct a phylogenetic tree.
- 3D PROTEIN STRUCTURE
- Find homologous proteins in the PDB.
Open the 3D stucture of one of found proteins using JSmol applet on its web page at the PDB site ( http://www.rcsb.org/ )
- Create and incude into your report a nice image (-s) demonstrating the protein and its contacts with a ligand (if any)
- Find about 10 sufficiently distant homologs of the protein whose 3D structure you have studied. Use Blast of this potein sequence. "Sufficiently distant homologs" means that the percent of identity between your protein and each homolog is between 30% and 60%. Align the sequences of your protein and of selected homologs.
- Select a conserved block in the alignment
- Describe the role of few conserved positions in the 3D structure
- (-) Create a 3D model of your protein based on homology to a protein with known 3D structure.
- (-) Superpose the model and the template
- PROTEIN DOMAINS
- Describe the domain architecture of your protein
- Select one of domains of your protein and describe several domain architectures including this domain
- Extra task. FIND and ANNOTATE ONE GENE in a CONTIG of the UNANNOTATED GENOME
Choose any contig of an appropriate length (10 – 30 Kb) from the genome of Amoeboaphelidium protococcarum. File with the genome is "X5.fasta". File with the lengths of contigs is "X5.info".
- Predict genes by similarity with known proteins (using the appropriate version of BLAST against a protein database).
- For one found gene predict its function by homology.
Hints
4g Use https://swissmodel.expasy.org/ server
4h For superimposition of two or more 3D structures use any of two services
http://www.ebi.ac.uk/msd-srv/ssm/cgi-bin/ssmserver
http://yanglab.nankai.edu.cn/mTM-align/
There are also more such services.
5a In Pfam database https://pfam.xfam.org/ use sequence search service. Include the table with domains and graphical representation of the domain architecture into your report.
5b Go (by clicking) to the page of the domain. See the link architecture.