Orthologs and paralogs

File containing genomes of all bacteria was created:

cat /P/y16/term4/Proteomes/PSEMY.fasta >> genomes 
cat /P/y16/term4/Proteomes/PSEAE.fasta >> genomes
cat /P/y16/term4/Proteomes/POLAQ.fasta >> genomes
cat /P/y16/term4/Proteomes/RHIME.fasta >> genomes
cat /P/y16/term4/Proteomes/PASMU.fasta >> genomes
cat /P/y16/term4/Proteomes/PROMH.fasta >> genomes
cat /P/y16/term4/Proteomes/ECOLI.fasta >> genomes
cat /P/y16/term4/Proteomes/ROSDO.fasta >> genomes
               

Blast database was created:

makeblastdb -in genomes.fasta  -dbtype prot
               

Blast search was performed:

blastp -query CLPX_ECOLI.fasta -db genomes.fasta -evalue 0.001 -outfmt 7
               

File for seqret was created. Fasta file containing sequences was generated:

seqret -seq @seq.fasta -out seqs.fasta
               

Muscle alignment:

muscle -in seqs.fasta -out align.fasta
               

MEGA

Tree of found homologous proteins was created using UPGMA method by MEGA7 program. Tree is presented in Fig. 1. You can also observe some examples of orthologs and paralogs in this figure.

Figure 1. Tree image. Paralogs are marked red, orthologs are green.(within groups)

Contacts: vorobiovarita@kodomo.fbb.msu.ru

© vorobiovarita 2018