BLAST | JalView

Search by similarity | BLAST, E-value

Task 1

According the task of pr 12 I used BLAST to search proteins which are potentionally homologous to my protein (0A0U3W9X5|A0A0U3W9X5_9BACI Fructose-1,6-bisphosphatase OS=Lentibacillus amyloliquefaciens). To search by similarity I chose database Swiss-Prot, algorothm protein-protein BLAST. When search settings were on default (E-value = 10, WordSize = 6, Max target sequences = 100) 36 proteins were found and only one with large e-value, so then I set E-value = 20 to find two proteins with large e-value (37 proteins were found).

Table 1. Characteristics 9 selected findings of the BLAST.
*Line with "my protein" is in bold.
#ID/ACName of proteinCoverageIdentity %E-valueHomologous
1A0A0U3W9X5_9BACI/A0A0U3W9X5Fructose-1,6-bisphosphatase
2GLPX_BACSU/Q03224.1Fructose-1,6-bisphosphatase class 2100%74%3e-174yes
3GLPX_CORGL/Q6M6E7.1Fructose-1,6-bisphosphatase class 297%55%1e-115yes
4GLPX_MYCBO/Q7U0N5.2Fructose-1,6-bisphosphatase class 297%51%2e-111yes
5FBSB_CYAA5/B1WQ07.1D-fructose 1,6-bisphosphatase class 299%48%2e-109yes
6FBSB_MICAN/B0JKN5.1D-fructose 1,6-bisphosphatase class 299%47%3e-109yes
7GLPX_MYCLB/B8ZSG6.2Fructose-1,6-bisphosphatase class 297%50%1e-107yes
8FBSB2_ACAM1/B0BZF8.1D-fructose 1,6-bisphosphatase class 298%48%1e-106yes
9RDRP_PVMR/P95469.1RNA replication polyprotein15%39%2.8not
10RECA_PARDE/P95469.1Protein RecA25%30%19not

Picture 1. The representation of alignment. (You can also see it here)

To see the whole alignment press: Jalview progect.

Analysis of the homology of finding with small e-value

There are another smaller blocks and areas of overlap length 2-3 amino acid residue. Sum up, I think that these 8 proteins (my and 7 finding with small e-value) are homologous.

Analysis of the homology of finding with large e-value.

So there are only points of coincidence and no real blocks which include all 10 proteins. Sum up, I think that these two proteins (with large e-value) are nonhomologous with my protein. They even called other than the first 8 (homologous) proteins.
They are also nonhomologous among themselves because they haven't any common conserved positions.

Task 2

For this task I used Pfam where I searched proteins which are included a family ZZ. I chose two proteins with the following architectures whose differences seemed interesting but not very drastically to me. These protein's ID you can see in a table 2.

Table 2. Proteins having homologous sections (domains).
ArchitectureID/ACName of the proteinOrganism
ZZ, N_BRCA1_IGA0A0D2CWB4_9EURO/A0A0D2CWB4Uncharacterized proteinExophiala xenobiotica
PB1, ZZ, N_BRCA1_IGF4PSP5_DICFS/F4PSP5ZZ-type zinc finger-containing proteinDictyostelium fasciculatum (strain SH3)

For searching I set E-value = 1.0E-5, WordSize = 2 and I didn't change other settings so they were on default.

Picture 2. Map of the local similarity.

Analysis of the map of the local similarity.
Let's look at vertical axis which shows the query sequence (A0A0D2CWB4). We can see 4 parts of the homologous units: A, B, C, D. Then look at horizontal axis which shows the database sequence (F4PSP5). It compared of the same parts with with some modifications.
There is duplication of parts A and B in the database sequence. And there are some deletions (or nonhomologous inserts - ?) in the second duplicated unit. The deletions cut the second unit A to A' and A'', the second unit B to B' and B'' and the unit C to C' and C''. Units B'' and C' aren't divided, but I called them in different ways because they are visual divided if you look at query sequence.

Sum up, A0A0D2CWB4 has sequence of homologous parts ABABB'C'CD and F4PSP5 has sequence ABCD.


Term II
© Potanina Darya, 2017