RID: B1XC8VK101R
Job Title:Protein Sequence
Program: BLASTP
Query: None  ID: lcl|Query_723842(amino acid)  Length: 1896
Database: swissprot Non-redundant UniProtKB/SwissProt sequences

Sequences producing significant alignments:

Description  Scientific Name  Common Name  Taxid  Max Score  Total Score  Query cover  E Value  Per. Ident  Acc. Len  Accession
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sagiyama virus  NA  59303  3832  3832  100%  0.0  100.00  2467  Q9JGL0.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Getah virus  NA  59300  3813  3813  100%  0.0  99.31  2467  Q5Y389.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Ross river v...  NA  11031  3326  3326  100%  0.0  84.06  2480  P13887.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Semliki Fore...  NA  11033  2759  2759  99%  0.0  70.71  2432  P08411.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Mayaro virus...  NA  374990  2655  2655  99%  0.0  68.43  2437  Q8QZ73.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Chikungunya ...  NA  371095  2578  2578  99%  0.0  66.47  2474  Q5XXP4.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  O'nyong-nyon...  NA  374989  2530  2593  91%  0.0  71.89  2513  O90368.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  O'nyong-nyon...  NA  11028  2530  2593  92%  0.0  71.37  2514  P13886.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Igbo Ora virus  NA  79899  2529  2592  92%  0.0  71.13  2513  O90370.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Chikungunya ...  NA  371094  2525  2594  92%  0.0  71.68  2474  Q8JUX6.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Barmah Fores...  NA  11020  2224  2279  94%  0.0  64.95  2411  P87515.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36385  2115  2115  99%  0.0  55.40  2493  P36328.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36384  2114  2162  95%  0.0  56.89  2499  Q9WJC7.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  36382  2114  2114  99%  0.0  55.68  2485  P36327.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  11038  2109  2109  99%  0.0  55.34  2493  P27282.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Ockelbo virus  NA  31699  2108  2160  89%  0.0  60.60  2515  P27283.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sindbis virus  NA  11034  2107  2156  95%  0.0  57.66  2513  P03317.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Venezuelan e...  NA  376610  2100  2148  91%  0.0  59.44  2497  Q8V294.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374596  2073  2120  89%  0.0  60.76  2471  Q306W6.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374597  2065  2112  89%  0.0  60.88  2474  Q306W8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Eastern equi...  NA  374598  2065  2112  91%  0.0  58.25  2494  Q4QXJ8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Western equi...  NA  11039  2041  2088  89%  0.0  59.95  2467  P13896.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Aura virus  NA  44158  1970  2015  91%  0.0  57.94  2499  Q86924.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Salmon pancr...  NA  84589  1301  1301  86%  0.0  43.19  2601  Q8JJX1.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...  Sleeping dis...  NA  78540  1299  1299  86%  0.0  43.30  2593  Q8QL53.1
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...  Ross river v...  NA  11032  812  812  29%  0.0  72.62  1149  P13888.2
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...  Middelburg v...  NA  11023  413  413  25%  4e-123  49.69  995  P03318.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...  Odontoglossu...  NA  138662  76.3  76.3  13%  1e-13  27.50  1612  Q84133.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...  Odontoglossu...  NA  138661  75.5  75.5  14%  2e-13  27.27  1612  P89659.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...  Tobacco mild...  NA  12241  64.7  64.7  13%  3e-10  25.89  1609  P18339.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...  Turnip vein-...  NA  29272  60.8  60.8  14%  5e-09  25.18  1601  Q88920.1
RecName: Full=Replicase large subunit; AltName: Full=182 kDa...  Youcai mosai...  NA  228578  58.2  58.2  14%  3e-08  25.52  1597  Q66220.2
RecName: Full=Replicase polyprotein 1ab; Contains: RecName:...  Beet yellows...  NA  478555  54.7  54.7  12%  4e-07  25.91  3094  Q08534.2
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...  NA  31767  41.6  41.6  5%  0.003  29.81  1693  P29324.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...  NA  33774  41.6  41.6  5%  0.003  29.81  1693  P33424.2
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...  NA  652674  41.2  41.2  5%  0.005  29.81  1693  Q81862.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  11045  40.0  40.0  5%  0.011  31.53  2116  P13889.5
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376264  40.0  40.0  5%  0.011  31.53  2116  Q99IE5.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376267  40.0  40.0  5%  0.011  31.53  2116  Q8BCR0.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376265  40.0  40.0  5%  0.012  31.53  2116  Q99IE7.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...  NA  512346  39.7  39.7  5%  0.016  28.85  1693  Q9WC28.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376263  39.7  39.7  5%  0.016  29.75  2116  Q6X2U2.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...  Rubella viru...  NA  376266  39.3  39.3  5%  0.017  31.53  2116  Q9J6K9.2
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...  NA  31768  39.3  39.3  5%  0.018  28.85  1691  Q03495.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...  Hepatitis E ...
NA 509627 39.3 39.3 5% 0.018 29.81 1707 Q9IVZ9.1 RecName: Full=Non-structural polyprotein p200; Short=p200;... Rubella viru... NA 376262 38.9 38.9 5% 0.024 29.75 2116 Q6X2U4.1 RecName: Full=Non-structural polyprotein pORF1; Includes:... Hepatitis E ... NA 31769 38.5 38.5 5% 0.035 28.85 1693 Q04610.1 Alignments: >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sagiyama virus] Sequence ID: Q9JGL0.3 Length: 2467 Range 1: 1 to 1896 Score:3832 bits(9937), Expect:0.0, Method:Compositional matrix adjust., Identities:1896/1896(100%), Positives:1896/1896(100%), Gaps:0/1896(0%) Query 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI 60 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI Sbjct 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI 60 Query 61 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD 120 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD Sbjct 61 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD 120 Query 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT 180 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT Sbjct 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT 180 Query 181 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF 240 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF Sbjct 181 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF 240 Query 241 
SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV 300 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV Sbjct 241 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV 300 Query 301 DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 360 DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG Sbjct 301 DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 360 Query 361 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW 420 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW Sbjct 361 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW 420 Query 421 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP 480 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP Sbjct 421 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP 480 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 YSGDRTEARAAEEEEKEAQEAELTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP Sbjct 481 YSGDRTEARAAEEEEKEAQEAELTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD Sbjct 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE Sbjct 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG Sbjct 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV Sbjct 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 
ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH Sbjct 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ Sbjct 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL Sbjct 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV Sbjct 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN Sbjct 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR Sbjct 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD Sbjct 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 Query 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY Sbjct 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 Query 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS Sbjct 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 Query 1321 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA Sbjct 1321 
MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 Query 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL Sbjct 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 Query 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED Sbjct 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 Query 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE Sbjct 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 Query 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS Sbjct 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 Query 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS 1680 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS Sbjct 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS 1680 Query 1681 LPSETTYETMEVVAEVHTEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQAEIMVMER 1740 LPSETTYETMEVVAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYVIQQAEIMVMER Sbjct 1681 LPSETTYETMEVVAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYVIQQAEIMVMER 1740 Query 1741 VATTDIRAIPVPARRAITMPVPAPRVRKVateppsepeapipaprkrrttsttppHNPGD 1800 VATTDIRAIPVPARRAITMPVPAPRVRKVATEPPSEPEAPIPAPRKRRTTSTTPPHNPGD Sbjct 1741 VATTDIRAIPVPARRAITMPVPAPRVRKVATEPPSEPEAPIPAPRKRRTTSTTPPHNPGD 1800 Query 1801 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFS 1860 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFS Sbjct 1801 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFS 1860 Query 1861 SDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 SDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF Sbjct 1861 SDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: 
Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Getah virus] Sequence ID: Q5Y389.3 Length: 2467 Range 1: 1 to 1896 Score:3813 bits(9887), Expect:0.0, Method:Compositional matrix adjust., Identities:1883/1896(99%), Positives:1887/1896(99%), Gaps:0/1896(0%) Query 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI 60 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI Sbjct 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI 60 Query 61 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD 120 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD Sbjct 61 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD 120 Query 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT 180 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT Sbjct 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT 180 Query 181 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF 240 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF Sbjct 181 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF 240 Query 241 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV 300 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV Sbjct 241 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV 300 Query 301 DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 360 DYAVTHHAEGFL+CKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG Sbjct 301 
DYAVTHHAEGFLMCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 360 Query 361 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW 420 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW Sbjct 361 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW 420 Query 421 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP 480 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP Sbjct 421 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP 480 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 YSGDRTEARAAEEEEKE QEAELTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP Sbjct 481 YSGDRTEARAAEEEEKEVQEAELTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD Sbjct 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE Sbjct 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG Sbjct 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCK+GV Sbjct 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKKGV 780 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH Sbjct 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ Sbjct 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 Query 901 
LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL Sbjct 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIKVLTNVPRGDFSATLEEW EEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV Sbjct 961 SGDPWIKVLTNVPRGDFSATLEEWHEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN Sbjct 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR Sbjct 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNL IPHKRVFWIAPPRVSGADRTYD Sbjct 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLVIPHKRVFWIAPPRVSGADRTYD 1200 Query 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY Sbjct 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 Query 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS Sbjct 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 Query 1321 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 MYACNGLHTAGCAPSYRVRRADISGH EEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA Sbjct 1321 MYACNGLHTAGCAPSYRVRRADISGHSEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 Query 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL Sbjct 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 Query 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 
LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED Sbjct 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 Query 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE Sbjct 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 Query 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS Sbjct 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 Query 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS 1680 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS Sbjct 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS 1680 Query 1681 LPSETTYETMEVVAEVHTEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQAEIMVMER 1740 PSETTYETMEVVAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYV QQAEIMVMER Sbjct 1681 FPSETTYETMEVVAEVHTEPPIPPPRRRRAAVAQLRQDLEVTEEIEPYVTQQAEIMVMER 1740 Query 1741 VATTDIRAIPVPARRAITMPVPAPRVRKVateppsepeapipaprkrrttsttppHNPGD 1800 VATTDIRAIPVPARRAITMPVPAPRVRKVATEPP EPEAPIPAPRKRRTTST+PPHNP D Sbjct 1741 VATTDIRAIPVPARRAITMPVPAPRVRKVATEPPLEPEAPIPAPRKRRTTSTSPPHNPED 1800 Query 1801 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFS 1860 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRD DVSTGIQFGDIDFNQSXLGRAGAYIFS Sbjct 1801 FVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDRDVSTGIQFGDIDFNQSXLGRAGAYIFS 1860 Query 1861 SDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 SDTGPGHLQQ+SVRQHELPCETLYAHEDERIYPPAF Sbjct 1861 SDTGPGHLQQKSVRQHELPCETLYAHEDERIYPPAF 1896 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 
3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN NB5092)] Sequence ID: P13887.2 Length: 2480 Range 1: 1 to 1909 Score:3326 bits(8623), Expect:0.0, Method:Compositional matrix adjust., Identities:1614/1920(84%), Positives:1729/1920(90%), Gaps:35/1920(1%) Query 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTI 60 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVP +TI Sbjct 1 MKVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPANITI 60 Query 61 LDVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQD 120 LDVGSAPARRLMSDH+YHCICPMKSAEDPERLANYARKLAK +G VLDKNVSGKITDLQD Sbjct 61 LDVGSAPARRLMSDHSYHCICPMKSAEDPERLANYARKLAKTAGEVLDKNVSGKITDLQD 120 Query 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTT 180 VMATPDLESPTFCLHTDETCRTRAEVAVYQDV HAPTSLYHQA+KGVRT YWIGFDTT Sbjct 121 VMATPDLESPTFCLHTDETCRTRAEVAVYQDV-XXHAPTSLYHQAMKGVRTVYWIGFDTT 179 Query 181 PFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMF 240 PFMFE +AGAYP YSTNWADEQVLQARNIGLCAT LSEG RGK+SIMRKK LRPSDR MF Sbjct 180 PFMFEVVAGAYPTYSTNWADEQVLQARNIGLCATSLSEGHRGKISIMRKKRLRPSDRXMF 239 Query 241 SVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTV 300 SVG TLY ESR+LL+SWHLPSVFHLKGKNSFTCRCDT+VSCEGYVVKKIT+SPG YGKTV Sbjct 240 SVGXTLYIESRRLLKSWHLPSVFHLKGKNSFTCRCDTIVSCEGYVVKKITMSPGTYGKTV 299 Query 301 DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 360 YAVTHHAEGFL+CK+TDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG Sbjct 300 GYAVTHHAEGFLMCKVTDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVG 359 Query 361 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLW 420 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREA+ADMEDEKPLGTRERTLTCCCLW Sbjct 360 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREAKADMEDEKPLGTRERTLTCCCLW 419 Query 421 AFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLP 480 AFK+HK 
HTMYKRP+TQTIVKVPSTFDSFVIPSLWSSSLS+GIRQRIKLLL ++++ LP Sbjct 420 AFKNHKTHTMYKRPDTQTIVKVPSTFDSFVIPSLWSSSLSIGIRQRIKLLLGPKLSRDLP 479 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 YSGDR EAR AE+E +E +EAELTR ALPPLV +CADD+ QVDVEELT+RAGAGVVETP Sbjct 480 YSGDRNEAREAEKEAEETKEAELTREALPPLVGSNCADDVDQVDVEELTYRAGAGVVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 RNALKVTPQ D LIG+YLILSPQTVLKSEKL PIHPLAEQVT+MTHSGRSGRYPVD+YD Sbjct 540 RNALKVTPQERDQLIGAYLILSPQTVLKSEKLTPIHPLAEQVTIMTHSGRSGRYPVDRYD 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+PTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEE+YEKVRAE Sbjct 600 GRVLVPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEENYEKVRAE 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 RAE EYVFDVDK+ C+K+E+ASGLVL GDLINPPFHEFAYEGLKIRPA P+ TT+IGVFG Sbjct 660 RAEAEYVFDVDKRTCVKREDASGLVLVGDLINPPFHEFAYEGLKIRPATPFQTTVIGVFG 719 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIK++VTTRDLVASGKKENCQEI+NDVK+QRGLDVTARTVDSILLNGC+RGV Sbjct 720 VPGSGKSAIIKSVVTTRDLVASGKKENCQEIVNDVKKQRGLDVTARTVDSILLNGCRRGV 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 ENLYVDEAFACHSGTLLALIA+V+P+GKV+LCGDPKQCGFFNLMQLKV++NH+ICT+VLH Sbjct 780 ENLYVDEAFACHSGTLLALIAMVKPTGKVILCGDPKQCGFFNLMQLKVNFNHDICTQVLH 839 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLP+TAIVSTLHYQGKMRTTN C+ PIQIDTTG++KPA GDIVLTCFR WVKQ Sbjct 840 KSISRRCTLPITAIVSTLHYQGKMRTTNLCSAPIQIDTTGTTKPAKGDIVLTCFRXWVKQ 899 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+P SEHVNVLLTRTENRLVWKTL Sbjct 900 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAPSSEHVNVLLTRTENRLVWKTL 959 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIKVLTN+P+GDFSATLEEWQEEHD IM L ER VDPFQNKAKVCWAKCLVQV Sbjct 960 
SGDPWIKVLTNIPKGDFSATLEEWQEEHDNIMNALRERSTAVDPFQNKAKVCWAKCLVQV 1019 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 LETAGIRMTA+EW+T+LAFREDRAYSPEVALNEICT+YYGVDLDSGLFSAQSVSL+YENN Sbjct 1020 LETAGIRMTAEEWDTVLAFREDRAYSPEVALNEICTKYYGVDLDSGLFSAQSVSLYYENN 1079 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDNRPGGRMYGFN EVARK+ R+PFLRG M+SGLQ+NVPERK+QPF+AECNI+ NRR Sbjct 1080 HWDNRPGGRMYGFNREVARKFEQRYPFLRGKMDSGLQVNVPERKVQPFNAECNILLLNRR 1139 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 LPHALVTSYQQCRGERVEWLLKK+PG+ +LLVSEYNLA+PHKRVFWIAPP VSGADR YD Sbjct 1140 LPHALVTSYQQCRGERVEWLLKKLPGYHLLLVSEYNLALPHKRVFWIAPPHVSGADRIYD 1199 Query 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 LDLGLP++AGRYDLVFVNIHTEYR HHYQQCVDHSM+LQMLGGDSLHLL PGGSLL+RAY Sbjct 1200 LDLGLPLNAGRYDLVFVNIHTEYRTHHYQQCVDHSMKLQMLGGDSLHLLXPGGSLLIRAY 1259 Query 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLF+NFDNGRRAVTLHQANQ+LSS Sbjct 1260 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFTNFDNGRRAVTLHQANQRLSS 1319 Query 1321 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 M+ACNGLHTAGCAPSYRVRR DISGH EEAVVNAANAKGTV GVCRAVA+KWP SFKGA Sbjct 1320 MFACNGLHTAGCAPSYRVRRTDISGHAEEAVVNAANAKGTVGVGVCRAVARKWPDSFKGA 1379 Query 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 ATPVGTAK+++A+GM VIHAVGPNFSTVTEAEGDRELAAAYRAVA II+ +NIKSVA+PL Sbjct 1380 ATPVGTAKLVQANGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPL 1439 Query 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 LSTG FSGGKDRVMQSLNHLFTA+D TDADVVIYCRDK WEKKIQEAIDRRTA+ELVSED Sbjct 1440 LSTGVFSGGKDRVMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSED 1499 Query 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 ++LE+DL+RVHPDSCLVGR GYS TDGKL+SYLEGTRFHQTAVDMAEISTLWP+LQDANE Sbjct 1500 ISLESDLIRVHPDSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDANE 1559 
Query 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 QICLYALGE+MDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTK IIVCS Sbjct 1560 QICLYALGESMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVCS 1619 Query 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPE-QLDNVSLTSTTSTGSAW 1679 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYI D VSL ST STGSAW Sbjct 1620 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIPAAASMHADTVSLDSTVSTGSAW 1679 Query 1680 SLPSETTYETMEVVAEVH-TEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQAEI--- 1735 S PSE TYETMEVVAEVH +EPP+PPPRRRRA V Q+L ++ + + EI Sbjct 1680 SFPSEATYETMEVVAEVHHSEPPVPPPRRRRAQVTMHHQELLEVSDMHTPIAARVEIPVY 1739 Query 1736 ---MVMERVAT--TDIRAIPVPARRAI-TMPVPAPRVRK-------------Vateppse 1776 +V ERVA T A P+P RA+ +PVPAPR+++ V Sbjct 1740 DTAVVAERVAIPCTSEYATPIPTPRAVRVVPVPAPRIQRASTYRVSPTPTPRVLRASVCS 1799 Query 1777 peapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVS 1836 P R PVELPWEPED+DIQFGD E Sbjct 1800 VTTSAGVEFPWAPEDLEVLTEPVHCEMREPVELPWEPEDVDIQFGDFE----------TP 1849 Query 1837 TGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 IQFGDIDF+Q XL RAGAYIFSSDTGPGHLQQ+SVRQH LPCE LYAHE+ER YPPA Sbjct 1850 DKIQFGDIDFDQFXLSRAGAYIFSSDTGPGHLQQKSVRQHALPCEMLYAHEEERTYPPAL 1909 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Semliki Forest virus] Sequence ID: P08411.2 Length: 2432 Range 1: 4 to 1858 Score:2759 bits(7151), Expect:0.0, Method:Compositional matrix adjust., Identities:1359/1922(71%), 
Positives:1565/1922(81%), Gaps:94/1922(4%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+EADSPF+K+LQKAFP+FEVES QVTPNDHANARAFSHLATKLIEQE IL Sbjct 4 KVHVDIEADSPFIKSLQKAFPSFEVESLQVTPNDHANARAFSHLATKLIEQETDKDTLIL 63 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAP+RR+MS H YHC+CPM+SAEDPERL YA+KLA ASG VLD+ ++GKITDLQ V Sbjct 64 DIGSAPSRRMMSTHKYHCVCPMRSAEDPERLVCYAKKLAAASGKVLDREIAGKITDLQTV 123 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 MATPD ESPTFCLHTD TCRT AEVAVYQDVYAVHAPTSLYHQA+KGVRTAYWIGFDTTP Sbjct 124 MATPDAESPTFCLHTDVTCRTAAEVAVYQDVYAVHAPTSLYHQAMKGVRTAYWIGFDTTP 183 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ALAGAYP Y+TNWADEQVLQARNIGLCA L+EGR GKLSI+RKK L+P D VMFS Sbjct 184 FMFDALAGAYPTYATNWADEQVLQARNIGLCAASLTEGRLGKLSILRKKQLKPCDTVMFS 243 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGSTLYTESRKLLRSWHLPSVFHLKGK SFTCRCDT+VSCEGYVVKKIT+ PG+YGKTV Sbjct 244 VGSTLYTESRKLLRSWHLPSVFHLKGKQSFTCRCDTIVSCEGYVVKKITMCPGLYGKTVG 303 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YAVT+HAEGFLVCK TDTV+GERVSFPVCTYVP+TICDQMTGILATDVTPEDAQKLLVGL Sbjct 304 YAVTYHAEGFLVCKTTDTVKGERVSFPVCTYVPSTICDQMTGILATDVTPEDAQKLLVGL 363 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLP+VA AFSKWARE +AD++DEKPLG RER+LTCCCLWA Sbjct 364 NQRIVVNGRTQRNTNTMKNYLLPIVAVAFSKWAREYKADLDDEKPLGVRERSLTCCCLWA 423 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPY 481 FK+ K+HTMYK+P+TQTIVKVPS F+SFVIPSLWS+ L++ +R RIK+LL+ + + L Sbjct 424 FKTRKMHTMYKKPDTQTIVKVPSEFNSFVIPSLWSTGLAIPVRSRIKMLLAKKTKRELIP 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCAD-DIAQVDVEELTFRAGAGVVETP 540 D + AR AE+EEKE EAELTR ALPPLV + A+ + VDVEEL + AGAGVVETP Sbjct 484 VLDASSARDAEQEEKERLEAELTREALPPLVPIAPAETGVVDVDVEELEYHAGAGVVETP 543 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R+ALKVT 
Q +D L+G+Y++LSPQTVLKS KLAP+HPLAEQV ++TH+GR+GRY VD YD Sbjct 544 RSALKVTAQPNDVLLGNYVVLSPQTVLKSSKLAPVHPLAEQVKIITHNGRAGRYQVDGYD 603 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+P G+AIPV EFQALSESATMVYNEREF+NRKL+HIA++GP+LNTDEE+YEKVRAE Sbjct 604 GRVLLPCGSAIPVPEFQALSESATMVYNEREFVNRKLYHIAVHGPSLNTDEENYEKVRAE 663 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 R + EYVFDVDKK C+K+EEASGLVL G+L NPPFHEFAYEGLKIRP+APY TT++GVFG Sbjct 664 RTDAEYVFDVDKKCCVKREEASGLVLVGELTNPPFHEFAYEGLKIRPSAPYKTTVVGVFG 723 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIK++VT DLV SGKKENCQEI+NDVK+ RGLD+ A+TVDSILLNGC+R V Sbjct 724 VPGSGKSAIIKSLVTKHDLVTSGKKENCQEIVNDVKKHRGLDIQAKTVDSILLNGCRRAV 783 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LYVDEAFACHSGTLLALIALV+P KVVLCGDPKQCGFFN+MQLKV++NHNICT V H Sbjct 784 DILYVDEAFACHSGTLLALIALVKPRSKVVLCGDPKQCGFFNMMQLKVNFNHNICTEVCH 843 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT PVTAIVSTLHY GKMRTTN CN PI IDTTG +KP GDIVLTCFRGWVKQ Sbjct 844 KSISRRCTRPVTAIVSTLHYGGKMRTTNPCNKPIIIDTTGQTKPKPGDIVLTCFRGWVKQ 903 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQ+DYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+P SEHVNVLLTRTE+RLVWKTL Sbjct 904 LQLDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAPASEHVNVLLTRTEDRLVWKTL 963 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIKVL+N+P+G+F+ATLEEWQEEHD IM+V+ A VD FQNKA VCWAK LV V Sbjct 964 AGDPWIKVLSNIPQGNFTATLEEWQEEHDKIMKVIEGPAAPVDAFQNKANVCWAKSLVPV 1023 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 L+TAGIR+TA+EW+TI+ AF+EDRAYSP VALNEICT+YYGVDLDSGLFSA VSL+YEN Sbjct 1024 LDTAGIRLTAEEWSTIITAFKEDRAYSPVVALNEICTKYYGVDLDSGLFSAPKVSLYYEN 1083 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDNRPGGRMYGFN A + AR FL+G ++G Q + ERK+QP S N++P NR Sbjct 1084 
NHWDNRPGGRMYGFNAATAARLEARHTFLKGQWHTGKQAVIAERKIQPLSVLDNVIPINR 1143 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPHALV Y+ +G RVEWL+ K+ G+ +LLVSEYNLA+P +RV W++P V+GADR Y Sbjct 1144 RLPHALVAEYKTVKGSRVEWLVNKVRGYHVLLVSEYNLALPRRRVTWLSPLNVTGADRCY 1203 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 DL LGLP DAGR+DLVFVNIHTE+R HHYQQCVDH+M+LQMLGGD+L LL+PGGSLLMRA Sbjct 1204 DLSLGLPADAGRFDLVFVNIHTEFRIHHYQQCVDHAMKLQMLGGDALRLLKPGGSLLMRA 1263 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYAD++SE VV++L+RKFS+ RVLRP CVTSNTEVFLLFSNFDNG+R TLHQ N KLS Sbjct 1264 YGYADKISEAVVSSLSRKFSSARVLRPDCVTSNTEVFLLFSNFDNGKRPSTLHQMNTKLS 1323 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 ++YA +HTAGCAPSYRV+RADI+ E AVVNAANA+GTV DGVCRAVAKKWPS+FKG Sbjct 1324 AVYAGEAMHTAGCAPSYRVKRADIATCTEAAVVNAANARGTVGDGVCRAVAKKWPSAFKG 1383 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 AATPVGT K + VIHAV PNFS TEAEGDRELAA YRAVA+ ++ ++ SVA+P Sbjct 1384 AATPVGTIKTVMCGSYPVIHAVAPNFSATTEAEGDRELAAVYRAVAAEVNRLSLSSVAIP 1443 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 LLSTG FSGG+DR+ QSLNHLFTA+DATDADV IYCRDK+WEKKIQEAID RTA+EL+++ Sbjct 1444 LLSTGVFSGGRDRLQQSLNHLFTAMDATDADVTIYCRDKSWEKKIQEAIDMRTAVELLND 1503 Query 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 DV L TDLVRVHPDS LVGR GYS TDG LYSY EGT+F+Q A+DMAEI TLWPRLQ+AN Sbjct 1504 DVELTTDLVRVHPDSSLVGRKGYSTTDGSLYSYFEGTKFNQAAIDMAEILTLWPRLQEAN 1563 Query 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 EQICLYALGETMD+IR+KCPV D+DSSTPP+TVPCLCRYAMTAER+ARLR + K+++VC Sbjct 1564 EQICLYALGETMDNIRSKCPVNDSDSSTPPRTVPCLCRYAMTAERIARLRSHQVKSMVVC 1623 Query 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLD--------NVSLTS 1671 SSFPLPKY ++GVQKVKC++ L+FD TVPS+VSPRKY + D + + S Sbjct 1624 SSFPLPKYHVDGVQKVKCEKGLLFDPTVPSVVSPRKYAASTTDHSDRSLRGFDLDWTTDS 1683 Query 1672 
TTSTGSAWSLPS------ETTYETME---VVAEVHTEppippprrrrAAVAQLRQDLEVT 1722 +++ SLPS ++ YE M V A+VH E A +A L D Sbjct 1684 SSTASDTMSLPSLQSCDIDSIYEPMAPIVVTADVHPE---------PAGIADLAAD---- 1730 Query 1723 EEIEPYVIQQAEIMVMERVATTDIRAIPVP--------ARRAITMPVPAPRVRKVatepp 1774 + P + A+ + +E IP P A RA PVPAPR Sbjct 1731 --VHP---EPADHVDLE-------NPIPPPRPKRAAYLASRAAERPVPAPR--------- 1769 Query 1775 sepeapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWD 1834 P PR L + FGD + + Sbjct 1770 --------------------KPTPA---PRTAFR-----NKLPLTFGDFDEHEVDA---- 1797 Query 1835 VSTGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 +++GI FGD D + LGRAGAYIFSSDTG GHLQQ+SVRQH L C L A E+E++YPP Sbjct 1798 LASGITFGDFD-DVLRLGRAGAYIFSSDTGSGHLQQKSVRQHNLQCAQLDAVEEEKMYPP 1856 Query 1895 AF 1896 Sbjct 1857 KL 1858 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Mayaro virus (strain Brazil)] Sequence ID: Q8QZ73.3 Length: 2437 Range 1: 3 to 1866 Score:2655 bits(6882), Expect:0.0, Method:Compositional matrix adjust., Identities:1309/1913(68%), Positives:1518/1913(79%), Gaps:67/1913(3%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+EA+SPFLK+LQ+AFPAFEVE+QQVTPNDHANARAFSHLATKLIEQE IL Sbjct 3 KVFVDIEAESPFLKSLQRAFPAFEVEAQQVTPNDHANARAFSHLATKLIEQETEKDTLIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+MS+HTYHC+CPM+SAEDPERL YARKLAKASG V+D+N++ KI DLQ V Sbjct 63 
DIGSAPARRMMSEHTYHCVCPMRSAEDPERLLYYARKLAKASGEVVDRNIAAKIDDLQSV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 MATPD ES TFCLHTD+TCRT+AEVAVYQDVYAVHAPTSLY QA+KGVRTAYWIGFDTTP Sbjct 123 MATPDNESRTFCLHTDQTCRTQAEVAVYQDVYAVHAPTSLYFQAMKGVRTAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ +AGAYP Y+TNWADEQVL+ARNIGLC+ L+EG GKLSIMRKK + PSD++MFS Sbjct 183 FMFDTMAGAYPTYATNWADEQVLKARNIGLCSASLTEGHLGKLSIMRKKKMTPSDQIMFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGSTLY ESR+LL+SWHLPSVFHLKG+ S+TCRCDT+VSCEGYVVKKIT+SPG++GKT Sbjct 243 VGSTLYIESRRLLKSWHLPSVFHLKGRQSYTCRCDTIVSCEGYVVKKITMSPGVFGKTSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YAVTHHAEGFLVCK TDT+ GERVSFP+CTYVP+TICDQMTGILAT+VTPEDAQKLLVGL Sbjct 303 YAVTHHAEGFLVCKTTDTIAGERVSFPICTYVPSTICDQMTGILATEVTPEDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVV+QAFSKWA+E R D EDEK +G RERTLTCCCLWA Sbjct 363 NQRIVVNGRTQRNTNTMKNYLLPVVSQAFSKWAKEYRLDQEDEKNMGMRERTLTCCCLWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPY 481 FK+HK HTMYK+P+TQTIVKVPS F+SFVIPSLWS+ LS+GIR RI+LLL +R + L Sbjct 423 FKTHKNHTMYKKPDTQTIVKVPSEFNSFVIPSLWSAGLSIGIRHRIRLLLQSRRVEPLVP 482 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLV-SGSCADDIAQVDVEELTFRAGAGVVETP 540 S D EARAAE E EA+EAE T AALPPL+ + DDI +VDVEEL FRAGAGVVETP Sbjct 483 SMDVGEARAAEREAAEAKEAEDTLAALPPLIPTAPVLDDIPEVDVEELEFRAGAGVVETP 542 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 RNALKVTPQ D ++GSYL+LSPQTVLKS KL +HPLAE V ++TH GR+GRY VD YD Sbjct 543 RNALKVTPQDRDTMVGSYLVLSPQTVLKSVKLQALHPLAESVKIITHKGRAGRYQVDAYD 602 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+PTGAAIPV +FQALSESATMVYNEREFINRKL+HIA++G ALNTDEE YEKVRAE Sbjct 603 GRVLLPTGAAIPVPDFQALSESATMVYNEREFINRKLYHIAVHGAALNTDEEGYEKVRAE 662 Query 661 
RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + EYV+DVD+K C+K+EEA GLV+ GDLINPPFHEFAYEGLK RPAAPY TT++GVFG Sbjct 663 STDAEYVYDVDRKQCVKREEAEGLVMIGDLINPPFHEFAYEGLKRRPAAPYKTTVVGVFG 722 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK++VT DLVASGKKENCQEIM DVKR R LD+TA+TVDS+LLNG K+ V Sbjct 723 VPGSGKSGIIKSLVTRGDLVASGKKENCQEIMLDVKRYRDLDMTAKTVDSVLLNGVKQTV 782 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LYVDEAFACH+GTLLALIA VRP KVVLCGDPKQCGFFNLMQL+V++NHNICT V H Sbjct 783 DVLYVDEAFACHAGTLLALIATVRPRKKVVLCGDPKQCGFFNLMQLQVNFNHNICTEVDH 842 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLP+TAIVSTLHY+G+MRTTN N P+ IDTTG +KP DIVLTCFRGWVKQ Sbjct 843 KSISRRCTLPITAIVSTLHYEGRMRTTNPYNKPVIIDTTGQTKPNREDIVLTCFRGWVKQ 902 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQ+DYRGHEVMTAAASQGLTRKGVYAVR KVNENPLY+ SEHVNVLLTRTE RLVWKTL Sbjct 903 LQLDYRGHEVMTAAASQGLTRKGVYAVRMKVNENPLYAQSSEHVNVLLTRTEGRLVWKTL 962 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIK L+N+P+G+F+ATLE+WQ EHD IMR + + A +D FQNKAKVCWAKCLV V Sbjct 963 SGDPWIKTLSNIPKGNFTATLEDWQREHDTIMRAITQEAAPLDVFQNKAKVCWAKCLVPV 1022 Query 1021 LETAGIRMTADEWNT-ILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 LETAGI+++A +W+ ILAF+EDRAYSPEVALNEICT+ YGVDLDSGLFSA VSL Y Sbjct 1023 LETAGIKLSATDWSAIILAFKEDRAYSPEVALNEICTKIYGVDLDSGLFSAPRVSLHYTT 1082 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDN PGGRMYGF+ E A + + PF RG SG Q+ V ERK QP CN++P NR Sbjct 1083 NHWDNSPGGRMYGFSVEAANRLEQQHPFYRGRWASG-QVLVAERKTQPIDVTCNLIPFNR 1141 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH LVT Y +GERVEWL+ KIPG+ +LLVSEYNL +P ++V WIAPP V+GAD TY Sbjct 1142 RLPHTLVTEYHPIKGERVEWLVNKIPGYHVLLVSEYNLILPRRKVTWIAPPTVTGADLTY 1201 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 DLDLGLP +AGRYDLVFVN+HT YR 
HHYQQCVDH+M+LQMLGGD+L+LL+PGGSLL+ Sbjct 1202 DLDLGLPPNAGRYDLVFVNMHTPYRLHHYQQCVDHAMKLQMLGGDALYLLKPGGSLLLST 1261 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 Y YADR SE VVTALAR+FS+FR + CVTSNTEVFLLF+NFDNGRR VTLHQ N KLS Sbjct 1262 YAYADRTSEAVVTALARRFSSFRAVTVRCVTSNTEVFLLFTNFDNGRRTVTLHQTNGKLS 1321 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 S+YA L AGCAP+Y V+RADI+ E+AVVNAAN +G V DGVCRAVA+KWP +F+ Sbjct 1322 SIYAGTVLQAAGCAPAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQAFRN 1381 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 AATPVGTAK ++ D +IHAVGPNF+ +EAEGDR+LAAAYRAVA+ I+ +I SVA+P Sbjct 1382 AATPVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISSVAIP 1441 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 LLSTG FS GKDRV QSL+HL A+D T+A V IYCRDK WE+KI+ + R+A ELVS+ Sbjct 1442 LLSTGIFSAGKDRVHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATELVSD 1501 Query 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 ++ E +L RVHPDS LVGR GYS TDG LYSY+EGT+FHQ A+DMAEI+TLWPR+QDAN Sbjct 1502 ELQFEVNLTRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRVQDAN 1561 Query 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 E ICLYALGETMD+IR +CPVED+DSSTPPKTVPCLCRYAMT ERV RLRM++TK+ +VC Sbjct 1562 EHICLYALGETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKDFVVC 1621 Query 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAW 1679 SSF LPKYRI GVQ+VKC++V++FD P+ VSP +Y+ E + S +S Sbjct 1622 SSFQLPKYRIPGVQRVKCEKVMLFDAAPPASVSPVQYLTNQSE-----TTISLSSFSITS 1676 Query 1680 SLPSETTYETMEVVAEVHTEppippprrr---------rAAVAQLRQDLEVTEEIEPYVI 1730 S +T+ +E E+ + P A +A Sbjct 1677 DSSSLSTFPDLESAEELDHDSQSVRPALNEPDDHQPTPTAELATHPVPPPRPNRARRLAA 1736 Query 1731 QQAEIMVMERVATTDIRAIPVPARRAITMPVPAPRVRKVateppsepeapipaprkrrtt 1790 + ++ V ++ P+PA R PVPAPR Sbjct 1737 ARVQVQVEVHQPPSNQPTKPIPAPRTSLRPVPAPR------------------------- 1771 Query 1791 
sttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDF---- 1846 +VPR VELPW E +D++FG P + I FGD Sbjct 1772 ---------RYVPRPVVELPWPLETIDVEFG--APTEEE-------SDITFGDFSASEWE 1813 Query 1847 ---NQSXLGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 N SXLGRAGAYIFSSD GPGHLQQ+SVRQH+L + +E++YPP Sbjct 1814 TISNSSXLGRAGAYIFSSDVGPGHLQQKSVRQHDLEVPIMDRVIEEKVYPPKL 1866 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain Senegal 37997] Sequence ID: Q5XXP4.1 Length: 2474 Range 1: 4 to 1903 Score:2578 bits(6682), Expect:0.0, Method:Compositional matrix adjust., Identities:1271/1912(66%), Positives:1499/1912(78%), Gaps:30/1912(1%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G VLD+N+S KI DLQ VM Sbjct 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 A PD E+PTFCLHTD +CR RA+VA+YQDVYAVHAPTSLYHQAIKGVR AYWIGFDTTPF Sbjct 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T L+EGRRGKLSIMR K ++P DRV+FSV Sbjct 184 
MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDTVVSCEGYVVK+ITISPG+YGKT Y Sbjct 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHHA+GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGL-PY 481 K K HT+YKRP+TQ+I KVP+ FDSFV+PSLWSS LS+ +R RIK LLS L PY Sbjct 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 SGD EAR AE+E +E +EAELTR ALPPL + DD+ ++DVE+L RAGAG++ETP Sbjct 484 SGDAKEARDAEKEAEEEREAELTREALPPLQAAQ--DDVQVEIDVEQLEDRAGAGIIETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R A+KVT Q DH++G YL+LSPQTVL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 542 RGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GR+L+P+G AI +FQ+LSESATMVYNEREF+NRKLHHIAL+GPALNTDEESYE VRAE Sbjct 602 GRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRAE 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 R E EYV+DVD++ C KKEEA+GLVL GDL NPP+HEFAYEGL+IRPA PY T +IGVFG Sbjct 662 RTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVFG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKN+VT +DLV SGKKENCQEI DV RQR L+++ARTVDS+LLNGC R V Sbjct 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRPV 781 Query 781 
ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LYVDEAFACHSGTLLALIALVRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 782 DVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVS+LHY+GKMRTTN N PI +DTTGS+KP GD+VLTCFRGWVKQ Sbjct 842 KSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQ 901 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE +LVWKTL Sbjct 902 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTL 961 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIK L N P+G+F AT++EW+ EH IM + D FQNKA VCWAK LV + Sbjct 962 SGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVPI 1021 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 LETAGI++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS VS+ Y + Sbjct 1022 LETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYAD 1081 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDNRPGG+M+GFN E A ++PF +G N+ Q+ V R+++ F+ NI+P+NR Sbjct 1082 NHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPANR 1141 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH+LV ++ +GER+EWL+ KI GH +LLVS YNL +P KRV W+AP + GAD TY Sbjct 1142 RLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYTY 1201 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1202 NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE VV L RKF + R L+P CVTSNTE+F LFSNFDNGRR T H N +L+ Sbjct 1262 YGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLN 1321 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 + + AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV 
KKWP SFK Sbjct 1322 AAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 +ATPVGTAK + VIHAVGPNFS +E+EGDRELAAAYR VA ++ + SVA+P Sbjct 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 LLSTG +SGGKDR+ QSLNHLFTALD+TDADVVIYCRDK WEKKI EAI RT +EL+ E Sbjct 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 Query 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 ++++ D++RVHPDS L GR GYS T+G LYSYLEGTRFHQTAVDMAE+ T+WP+ +AN Sbjct 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 Query 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 EQ+CLYALGE+++SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVC Sbjct 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 Query 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAW 1679 SSFPLPKY+IEGVQKVKC +V++FD VPS VSPR+Y + P E VS ++T+ T S + Sbjct 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY-KSPQETAQEVS-STTSLTHSQF 1678 Query 1680 SLPSETTYETMEVVAEVHTEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQAEIMVME 1739 L + E + +++ + PIP P AV L ++ + +V+ A + Sbjct 1679 DLSVDG--EELPAPSDLEADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPR 1736 Query 1740 RVATTDIRAIPVPARRAITMPVPAPRVRKVateppsepeapipaprkrrttsttppHNPG 1799 R ++ + R +P+ + R + + A I + P Sbjct 1737 RRRGKNLN-VTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPN 1795 Query 1800 DFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQF--GDID------------ 1845 ++P+ E I FGD + + ++ T F G++D Sbjct 1796 ----QLPISFGAPNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSD 1851 Query 1846 -FNQSXLGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 ++ XL RAG YIFSSDTGPGHLQQRSVRQ LP TL ++E+ YPP Sbjct 1852 TDDELXLDRAGGYIFSSDTGPGHLQQRSVRQTVLPVNTLEEVQEEKCYPPKL 1903 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: 
Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain SG650] Sequence ID: O90368.1 Length: 2513 Range 1: 4 to 1657 Score:2530 bits(6558), Expect:0.0, Method:Compositional matrix adjust., Identities:1192/1658(72%), Positives:1392/1658(83%), Gaps:8/1658(0%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 4 VYVDIDADSAFLKALQRAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +G APARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G V DKN+SGKI DLQ VM Sbjct 64 IGPAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQAVM 123 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 A P++E+ TFCLHTD TC+ R +VA+YQDVYAVHAPTSLYHQAIKGVR AYWIGFDTTPF Sbjct 124 AVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T LSEGRRGKLSIMR K L+P DRV+FSV Sbjct 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKLKPCDRVLFSV 243 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDT+VSCEGYVVK++T+SPGIYGKT Y Sbjct 244 GSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKTSGY 303 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHHA+GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 Query 363 
QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 364 QRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCLWAF 423 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLS-ARMAQGLPY 481 + HK HT+YKRP+TQ+I KVP+ FDSFVIPSLWSS LS+ +R RIK LLS A + LP+ Sbjct 424 RKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKHEQLPH 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 SG+ EA AE + E +EAELTR A+PPL + DD+ ++DVE+L RAGAG+VETP Sbjct 484 SGNAEEAAQAEMDAAEEREAELTREAMPPL--QATQDDVQVEIDVEQLEDRAGAGIVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R A+KVT Q D ++G YL+L+PQ VL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 542 RGAIKVTAQPSDRVVGEYLVLTPQAVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+P+G AIP +FQ+LSESATMV+NEREF+NRKLHHIA++GPALNTDEESYE VR E Sbjct 602 GRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRVE 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + E EYV+DVD+K C K+EEA+GLVL GDL +PP+HEFAYEGLKIRPA PY T +IGVFG Sbjct 662 KTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVFG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKN+VT +DLV SGKKENCQEI NDV RQR L+++ARTVDS+LLNGC + V Sbjct 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LYVDEAFACHSGTLLALIA+VRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 782 EVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVS+LHY+ KMRTTN N PI +DTTG +KP GD+VLTCFRGWVKQ Sbjct 842 KSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGITKPEPGDLVLTCFRGWVKQ 901 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE 
+L+WKTL Sbjct 902 LQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLIWKTL 961 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIK+L N P+G+F AT++EW+ EH IM + D FQNKA VCWAKCLV + Sbjct 962 SGDPWIKILQNPPKGNFKATIKEWEAEHASIMAGICNHQMAFDTFQNKANVCWAKCLVPI 1021 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 L+TAGI+++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS +S++Y + Sbjct 1022 LDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYAD 1081 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDNRPGG+M+GFN EVA ++PF +G N Q+ + RK+ F+ E NI+P+NR Sbjct 1082 NHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPANR 1141 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH+LV + RGER+EWL+ KI GH MLLVS YNL +P KRV W+AP GAD TY Sbjct 1142 RLPHSLVAEHHTVRGERMEWLVNKINGHHMLLVSGYNLILPTKRVTWVAPLGTRGADYTY 1201 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1202 NLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE V++ L RKF + R L+P C+TSNTE+F LFS FDNGRR T H N +L+ Sbjct 1262 YGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQLN 1321 Query 1320 SMYACNGLHT-AGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 ++YA GL T AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV +KWP SF+ Sbjct 1322 AVYA--GLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFR 1379 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 +ATPVGTAK I VIHAVGPNFS +EAEGDRELA+ YR VA +S + SVA+ Sbjct 1380 NSATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAI 1439 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVS 1498 PLLSTG +SGGKDR++QSLNHLFTA+D+TDADVVIYCRDK WEKKI EAI R+ +EL+ Sbjct 1440 PLLSTGVYSGGKDRLLQSLNHLFTAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLD 1499 Query 1499 
EDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDA 1558 + ++++ D+VRVHPDS L GR GYS +G LYSYLEGTRFHQTAVDMAEI T+WP+ +A Sbjct 1500 DHISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEA 1559 Query 1559 NEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIV 1618 NEQ+CLYALGE+++S+R KCPV+DAD+S PPKTVPCLCRYAMT ERVARLRMN+T +IIV Sbjct 1560 NEQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIV 1619 Query 1619 CSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKY 1656 CSSFPLPKY+IEGVQKVKC + L+FD VPS VSPR Y Sbjct 1620 CSSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTY 1657 Range 2: 1867 to 1942 Score:63.2 bits(152), Expect:1e-09, Method:Compositional matrix adjust., Identities:36/82(44%), Positives:44/82(53%), Gaps:9/82(10%) Query 1818 IQFGDLEP---RRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVR 1874 + FGD EP +W + D D + XL RAG YIFSSDTG GHLQQ+SVR Sbjct 1867 LTFGDFEPGEVEELTDSEWSTCS-----DTD-EELXLDRAGGYIFSSDTGQGHLQQKSVR 1920 Query 1875 QHELPCETLYAHEDERIYPPAF 1896 Q LP + +E+ YPP Sbjct 1921 QTTLPVNIVEEVHEEKCYPPKL 1942 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain Gulu] Sequence ID: P13886.2 Length: 2514 Range 1: 4 to 1677 Score:2530 bits(6558), Expect:0.0, Method:Compositional matrix adjust., Identities:1199/1680(71%), Positives:1400/1680(83%), Gaps:10/1680(0%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 4 VYVDIDADSAFLKALQQAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 Query 63 
VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G V DKN+SGKI DLQ VM Sbjct 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQAVM 123 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 A P++E+ TFCLHTD TC+ R +VA+YQDVYAVHAPTSLYHQAIKGVR AYWIGFDTTPF Sbjct 124 AVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T LSEGRRGKLSIMR K L+P DRV+FSV Sbjct 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKLKPCDRVLFSV 243 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDT+VSCEGYVVK++T+SPGIYGKT Y Sbjct 244 GSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKTSGY 303 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHHA GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 304 AVTHHAGGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 364 QRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCLWAF 423 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLS-ARMAQGLPY 481 + HK HT+YKRP+TQ+I KVP+ FDSFVIPSLWSS LS+ +R RIK LLS A + LP+ Sbjct 424 RKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKYEQLPH 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 SG+ EA AE + E QEAELTR A+PPL + DDI ++DVE+L RAGAG+VETP Sbjct 484 SGNAEEAAQAETDAVEEQEAELTREAMPPL--QATQDDIQVEIDVEQLEDRAGAGIVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R A+KVT Q D ++G YL+L+PQ VL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 542 RGAIKVTAQPSDLVVGEYLVLTPQAVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+P+G AIP 
+FQ+LSESATMV+NEREF+NRKLHHIA++GPALNTDEESYE VR E Sbjct 602 GRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRVE 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + E EYV+DVD+K C K+EEA+GLVL GDL +PP+HEFAYEGLKIRPA PY T +IGVFG Sbjct 662 KTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVFG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKN+VT +DLV SGKKENCQEI NDV RQR L+++ARTVDS+LLNGC + V Sbjct 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LYVDEAFACHSGTLLALIA+VRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 782 EVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVS+LHY+ KMRTTN N PI +DTTG +KP GD+VLTCFRGWVKQ Sbjct 842 KSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGITKPEPGDLVLTCFRGWVKQ 901 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLY+P SEHVNVLLTRTE +L WKTL Sbjct 902 LQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYAPTSEHVNVLLTRTEGKLTWKTL 961 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIK+L N P+GDF AT++EW+ EH IM + D FQNKA VCWAKCLV + Sbjct 962 SGDPWIKILQNPPKGDFKATIKEWEAEHASIMAGICNHQMAFDTFQNKANVCWAKCLVPI 1021 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 L+TAGI+++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS +S++Y + Sbjct 1022 LDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYAD 1081 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDNRPGG+M+GFN EVA ++PF +G N Q+ + RK+ F+ E NI+P+NR Sbjct 1082 NHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPANR 1141 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH+LV + RGER+EWL+ KI GH MLLVS +NL +P KRV W+AP GAD TY Sbjct 1142 
RLPHSLVAEHHSVRGERMEWLVNKISGHHMLLVSGHNLILPTKRVTWVAPLGTRGADYTY 1201 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1202 NLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE V++ L RKF + R L+P C+TSNTE+F LFS FDNGRR T H N +L+ Sbjct 1262 YGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQLN 1321 Query 1320 SMYACNGLHT-AGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 ++YA GL T AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV +KWP SF+ Sbjct 1322 AVYA--GLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFR 1379 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 +ATPVGTAK I VIHAVGPNFS +EAEGDRELA+ YR VA +S + SVA+ Sbjct 1380 NSATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAI 1439 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVS 1498 PLLSTG +SGGKDR++QSLNHLF A+D+TDADVVIYCRDK WEKKI EAI R+ +EL+ Sbjct 1440 PLLSTGVYSGGKDRLLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLD 1499 Query 1499 EDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDA 1558 + ++++ D+VRVHPDS L GR GYS +G LYSYLEGTRFHQTAVDMAEI T+WP+ +A Sbjct 1500 DHISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEA 1559 Query 1559 NEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIV 1618 NEQ+CLYALGE+++S+R KCPV+DAD+S PPKTVPCLCRYAMT ERVARLRMN+T +IIV Sbjct 1560 NEQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIV 1619 Query 1619 CSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSA 1678 CSSFPLPKY+IEGVQKVKC + L+FD VPS VSPR Y +P +++ T T + A Sbjct 1620 CSSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTY--RPADEIIQTPQTPTEACQDA 1677 Range 2: 1868 to 1943 Score:63.2 bits(152), Expect:1e-09, Method:Compositional matrix adjust., Identities:35/82(43%), Positives:43/82(52%), Gaps:9/82(10%) Query 1818 IQFGDLEP---RRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVR 1874 + FGD EP 
+W + D D + L RAG YIFSSDTG GHLQQ+SVR Sbjct 1868 LTFGDFEPGEVEELTDSEWSTCS-----DTD-EELRLDRAGGYIFSSDTGQGHLQQKSVR 1921 Query 1875 QHELPCETLYAHEDERIYPPAF 1896 Q LP + +E+ YPP Sbjct 1922 QTTLPVNIVEEVHEEKCYPPKL 1943 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Igbo Ora virus] Sequence ID: O90370.1 Length: 2513 Range 1: 4 to 1677 Score:2529 bits(6555), Expect:0.0, Method:Compositional matrix adjust., Identities:1195/1680(71%), Positives:1400/1680(83%), Gaps:10/1680(0%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ G TIL Sbjct 4 VYVDIDADSAFLKALQRAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPGSTILG 63 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G V DKN+SGKI DLQ VM Sbjct 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQAVM 123 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 A P++E+ TFCLHTD TC+ R +VA+YQDVYAVHAPTSLYHQAIKGV AYWIGFDTTPF Sbjct 124 AVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVHVAYWIGFDTTPF 183 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T LSEGRRGKLSIMR K +P DRV+FSV Sbjct 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKFKPCDRVLFSV 243 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDT+VSCEGYVVK++T+SPGIYGKT Y Sbjct 244 GSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKTSGY 303 Query 303 
AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHHA+GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 364 QRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCLWAF 423 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLS-ARMAQGLPY 481 + HK HT+YKRP+TQ+I KVP+ FDSFVIPSLWSS LS+ +R RIK LLS A + LP+ Sbjct 424 RKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKHEQLPH 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 SG+ EA AE + E +EAELTR A+PPL + DD+ ++DVE+L RAGAG+VETP Sbjct 484 SGNAEEAAQAETDAVEEREAELTREAMPPL--QATQDDVQVEIDVEQLEDRAGAGIVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R A+KVT Q D ++G YL+L+PQ VL+S+KL IH LAEQV THSGR+GRY V+ YD Sbjct 542 RGAIKVTAQPSDLVVGEYLVLTPQAVLRSQKLGLIHALAEQVKTCTHSGRAGRYAVEAYD 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+P+G AIP +FQ+LSESATMV+NEREF+NRKLHHIA++GPALNTDEESYE VR E Sbjct 602 GRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRVE 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + E EYV+DVD+K C K+EEA+GLVL GDL +PP+HEFAYEGLKIRPA PY T +IGVFG Sbjct 662 KTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVFG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKN+VT +DLV SGKKENCQEI NDV RQR L+++ARTVDS+LLNGC + V Sbjct 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LYVDEAFACHSGTLLALIA+VRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 782 EVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCTLPVTAIVS+LHY+ KMRTTN N PI +DTTG++KP 
GD+VLTCFRGWVKQ Sbjct 842 KSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGTTKPEPGDLVLTCFRGWVKQ 901 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE +L+WKTL Sbjct 902 LQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLIWKTL 961 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 SGDPWIK+L N P+G+F AT++EW+ EH IM + D FQNKA VCWAKCLV + Sbjct 962 SGDPWIKILQNPPKGNFKATIKEWEAEHASIMAGICNYQMAFDTFQNKANVCWAKCLVPI 1021 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 L+TAGI+++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS +S++Y + Sbjct 1022 LDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYAD 1081 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 NHWDNRPGG+M+GFN EVA ++PF +G N Q+ + RK+ F+ E NI+P+NR Sbjct 1082 NHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPANR 1141 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH+LV + RGER+EWL+ KI GH MLLVS YNL +P KRV W+AP GAD TY Sbjct 1142 RLPHSLVAEHHSVRGERMEWLVNKINGHHMLLVSGYNLILPTKRVTWVAPLGTRGADYTY 1201 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1202 NLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE V++ L RKF + R L+P C+TSNTE+F LFS FDNGRR T H N +L+ Sbjct 1262 YGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQLN 1321 Query 1320 SMYACNGLHT-AGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 ++YA GL T AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV +KWP SF+ Sbjct 1322 AVYA--GLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFR 1379 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 +ATPVGTAK I VIHAVGPNFS +EAEGDRELA+AYR VA +S + SVA+ Sbjct 1380 NSATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASAYREVAKEVSRLGVSSVAI 1439 Query 1439 
PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVS 1498 PLLSTG +SGGKDR++QSLNHLF A+D+TDADVVIYCRDK WEKKI EAI R+ +EL+ Sbjct 1440 PLLSTGVYSGGKDRLLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLD 1499 Query 1499 EDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDA 1558 + ++++ D+VRVHPDS L GR GYS +G LYSYLEGTRFHQTAVDMAEI T+WP+ +A Sbjct 1500 DHISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEA 1559 Query 1559 NEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIV 1618 NEQ+CLYALGE+++S+R KCPV+DAD+S PPKTVPCLCRYAMT ERVARLRMN+T +IIV Sbjct 1560 NEQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIV 1619 Query 1619 CSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSA 1678 CSSFPLPKY+IEGVQKVKC + L+FD VPS VSPR Y +P +++ ST + A Sbjct 1620 CSSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTY--RPADEIIQTPQISTEACQDA 1677 Range 2: 1867 to 1942 Score:63.2 bits(152), Expect:1e-09, Method:Compositional matrix adjust., Identities:35/82(43%), Positives:43/82(52%), Gaps:9/82(10%) Query 1818 IQFGDLEP---RRRNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVR 1874 + FGD EP +W + D D + L RAG YIFSSDTG GHLQQ+SVR Sbjct 1867 LTFGDFEPGEVEELTDSEWSTCS-----DTD-EELRLDRAGGYIFSSDTGQGHLQQKSVR 1920 Query 1875 QHELPCETLYAHEDERIYPPAF 1896 Q LP + +E+ YPP Sbjct 1921 QTTLPVNIVEEVHEEKCYPPKL 1942 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain S27-African prototype] Sequence ID: Q8JUX6.1 Length: 2474 Range 1: 4 to 1680 Score:2525 bits(6545), Expect:0.0, Method:Compositional matrix adjust., Identities:1205/1681(72%), 
Positives:1401/1681(83%), Gaps:6/1681(0%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G VLD+N+SGKI DLQ VM Sbjct 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISGKIGDLQAVM 123 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 A PD E+PTFCLHTD +CR RA+VA+YQDVYAVHAPTSLYHQAIKGVR AYW+GFDTTPF Sbjct 124 AVPDTETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRLAYWVGFDTTPF 183 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T L+EGRRGKLSIMR K L P DRV+FSV Sbjct 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKLEPCDRVLFSV 243 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDTVVSCEGYVVK+IT+SPG+YGKT Y Sbjct 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITMSPGLYGKTTGY 303 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHHA+GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNY++PVVAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 364 QRIVVNGRTQRNTNTMKNYMIPVVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCLWAF 423 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGL-PY 481 K K HT+YKRP+TQ+I KV + FDSFV+PSLWSS LS+ +R RIK LLS L PY Sbjct 424 KKQKTHTVYKRPDTQSIQKVQAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLTPY 483 Query 482 SGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETPR 541 SGD EAR AE+E +E +EAELT ALPPL + D ++DVE+L RAGAG++ETPR Sbjct 484 SGDAQEARDAEKEAEEEREAELTLEALPPLQAAQ-EDVQVEIDVEQLEDRAGAGIIETPR 542 Query 542 NALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYDG 601 A+KVT Q 
DH++G YL+LSPQTVL+S+KL+ IH LAEQV THSGR+GRY V+ YDG Sbjct 543 GAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYDG 602 Query 602 RVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAER 661 RVL+P+G AI +FQ+LSESATMVYNEREF+NRKLHHIA++GPALNTDEESYE VRAER Sbjct 603 RVLVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIAMHGPALNTDEESYELVRAER 662 Query 662 AETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFGV 721 E EYV+DVD++ C KKEEA+GLVL GDL NPP+HEFAYEGLKIRPA PY +IGVFGV Sbjct 663 TEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLKIRPACPYKIAVIGVFGV 722 Query 722 PGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGVE 781 PGSGKSAIIKN+VT +DLV SGKKENCQEI DV RQRGL+++ARTVDS+LLNGC R V+ Sbjct 723 PGSGKSAIIKNLVTRQDLVTSGKKENCQEITTDVMRQRGLEISARTVDSLLLNGCNRPVD 782 Query 782 NLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLHK 841 LYVDEAFACHSGTLLALIALVRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V HK Sbjct 783 VLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYHK 842 Query 842 SISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQL 901 SISRRCTLPVTAIVS+LHY+GKMRTTN N PI +DTTGS+KP GD+VLTCFRGWVKQL Sbjct 843 SISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQL 902 Query 902 QIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTLS 961 QIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE +LVWKTLS Sbjct 903 QIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTLS 962 Query 962 GDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQVL 1021 GDPWIK L N P+G+F AT++EW+ EH IM + D FQNKA VCWAK LV +L Sbjct 963 GDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICSHQMTFDTFQNKANVCWAKSLVPIL 1022 Query 1022 ETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 ETAGI++ +W+ I+ AF+ED+AYSPEVALNEICTR YGVDLDSGLFS VS++Y +N Sbjct 1023 ETAGIKLNDRQWSQIIQAFKEDKAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVYYADN 1082 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDNRPGG+M+GFN E A ++PF +G N Q+ V R+++ F+ NI+P+NRR Sbjct 1083 HWDNRPGGKMFGFNPEAASILERKYPFTKGKWNINKQICVTTRRIEDFNPTTNIIPANRR 1142 
Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 LPH+LV ++ +GER+EWL+ KI GH +LLVS +LA+P KRV W+AP V GAD TY+ Sbjct 1143 LPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGCSLALPTKRVTWVAPLGVRGADYTYN 1202 Query 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RAY Sbjct 1203 LELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRAY 1262 Query 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 GYADR SE V+ L RKF + R L+P CVTSNTE+F LFSNFDNGRR T H N +L++ Sbjct 1263 GYADRTSERVICVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLNA 1322 Query 1321 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 + AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV KKWP SFK + Sbjct 1323 AFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNS 1381 Query 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 ATPVGTAK + VIHAVGPNFS +E+EGDRELAAAYR VA ++ + SVA+PL Sbjct 1382 ATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPL 1441 Query 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 LSTG +SGGKDR+ QSLNHLFTA+D+TDADVVIYCRDK WEKKI EAI RT +EL+ E Sbjct 1442 LSTGVYSGGKDRLTQSLNHLFTAMDSTDADVVIYCRDKEWEKKISEAIQMRTQVELLDEH 1501 Query 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 ++++ D+VRVHPDS L GR GYS T+G LYSYLEGTRFHQTAVDMAEI T+WP+ +ANE Sbjct 1502 ISIDCDVVRVHPDSSLAGRKGYSTTEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEANE 1561 Query 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 Q+CLYALGE+++SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVCS Sbjct 1562 QVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCS 1621 Query 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTSTTSTGSAWS 1680 SFPLPKY+IEGVQKVKC +V++FD VPS VSPR+Y +P ++ + T+T+ T S + Sbjct 1622 SFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY--RPSQESVQEASTTTSLTHSQFD 1679 Query 1681 L 1681 L Sbjct 1680 L 1680 Range 2: 1828 to 1903 Score:68.9 bits(167), Expect:2e-11, 
Method:Compositional matrix adjust., Identities:40/80(50%), Positives:47/80(58%), Gaps:5/80(6%) Query 1818 IQFGDLEPRR-RNTRDWDVSTGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVRQH 1876 + FGD P + D D ST D D ++ L RAG YIFSSDTGPGHLQQ+SVRQ Sbjct 1828 LTFGDFLPGEVDDLTDSDWST---CSDTD-DELRLDRAGGYIFSSDTGPGHLQQKSVRQS 1883 Query 1877 ELPCETLYAHEDERIYPPAF 1896 LP TL +E+ YPP Sbjct 1884 VLPVNTLEEVHEEKCYPPKL 1903 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Barmah Forest virus] Sequence ID: P87515.3 Length: 2411 Range 1: 6 to 1647 Score:2224 bits(5763), Expect:0.0, Method:Compositional matrix adjust., Identities:1071/1649(65%), Positives:1288/1649(78%), Gaps:10/1649(0%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V +DVE +S F K +Q FP FE+E+ Q TPNDHA+ARAFSHLATKLIE E ILD Sbjct 6 VKIDVEPESHFAKQVQSCFPQFEIEAVQTTPNDHAHARAFSHLATKLIEMETAKDQIILD 65 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARRL S+H YHC+CPMK EDPER+ YARKL S K + K+ DL+DV+ Sbjct 66 IGSAPARRLYSEHKYHCVCPMKCTEDPERMLGYARKLIAGSA----KGKAEKLRDLRDVL 121 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 ATPD+E+ + CLHTD +CR R +VAVYQDVYA+ APT+LYHQA+KGVRTAYWIGFDTTPF Sbjct 122 ATPDIETQSLCLHTDASCRYRGDVAVYQDVYAIDAPTTLYHQALKGVRTAYWIGFDTTPF 181 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 M++ALAGAYP YSTNWADEQVL++RNIGLC+ +SEG + SI+RKK L+ SDRVMFSV Sbjct 182 
MYDALAGAYPLYSTNWADEQVLESRNIGLCSDKVSEGGKKGRSILRKKFLKQSDRVMFSV 241 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLYTESRKLL+SWHLPS FHLKGK+SFTCRCDT+VSCEGYV+KKIT+ PG+ GK + Y Sbjct 242 GSTLYTESRKLLQSWHLPSTFHLKGKSSFTCRCDTIVSCEGYVLKKITMCPGVTGKPIGY 301 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTHH EGF+V K+TDT+RGERVSF VCTYVP T+CDQMTGILAT+VT +DAQKLLVGLN Sbjct 302 AVTHHKEGFVVGKVTDTIRGERVSFAVCTYVPTTLCDQMTGILATEVTADDAQKLLVGLN 361 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNGRTQRNTNTMKNYLLP+VAQA +KWA+EA+ DMEDE+PL R+RTLTC C WAF Sbjct 362 QRIVVNGRTQRNTNTMKNYLLPLVAQALAKWAKEAKQDMEDERPLNERQRTLTCLCCWAF 421 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPYS 482 K +K H +YKRP+TQ+IVKVP F SF + SLWS+ +S+ +RQ++K++L AR + Sbjct 422 KRNKRHAIYKRPDTQSIVKVPCEFTSFPLVSLWSAGMSISLRQKLKMMLQARQPTQIAAV 481 Query 483 GDRTearaaeeeeke--aqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + AA E++ AEL AA P +V + + +V+VEEL RAG GVVETP Sbjct 482 TEELIQEAAAVEQEAVDTANAELDHAAWPSIVDTT--ERHVEVEVEELDQRAGEGVVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 RN++KV+ Q D LIGSYLILSPQ VL+SEKLA IH LAEQV ++THSGRSGRY VDKY Sbjct 540 RNSIKVSTQIGDALIGSYLILSPQAVLRSEKLACIHDLAEQVKLVTHSGRSGRYAVDKYX 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRVL+PTG AI + FQALSESAT+VYNEREF+NRKL HIA+YG ALNTDEE YEKV E Sbjct 600 GRVLVPTGVAIDIQSFQALSESATLVYNEREFVNRKLWHIAVYGAALNTDEEGYEKVPVE 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 RAE++YVFDVD+K C+KKE+ASG VL G+L+NPPFHEFAYEGL+ RP+APY +GV+G Sbjct 660 RAESDYVFDVDQKMCLKKEQASGWVLCGELVNPPFHEFAYEGLRTRPSAPYKVHTVGVYG 719 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKSAIIKN VT DLV SGKKENC EIMNDV + R L +TA+TVDS+LLNG K Sbjct 720 VPGSGKSAIIKNTVTMSDLVLSGKKENCLEIMNDVLKHRALRITAKTVDSVLLNGVKHTP 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 
840 LY+DEAF+CH+GTLLA IA+VRP KVVLCGDPKQCGFFN+MQLKV+YNH+IC+ V H Sbjct 780 NILYIDEAFSCHAGTLLATIAIVRPKQKVVLCGDPKQCGFFNMMQLKVNYNHDICSEVFH 839 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT +TAIVS LHYQ +MRTTN I IDTTG++KPA D++LTCFRGWVKQ Sbjct 840 KSISRRCTQDITAIVSKLHYQDRMRTTNPRKGDIIIDTTGTTKPAKTDLILTCFRGWVKQ 899 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQ DYRG+EVMTAAASQGLTR VYAVR KVNENPLY+ SEHVNVLLTRTEN+LVWKTL Sbjct 900 LQQDYRGNEVMTAAASQGLTRASVYAVRTKVNENPLYAQTSEHVNVLLTRTENKLVWKTL 959 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 S DPWIK LTN PRG ++AT+ EW+ EH GIM+ + V+ F NK VCWAK L V Sbjct 960 STDPWIKTLTNPPRGHYTATIAEWEAEHQGIMKAIQGYAPPVNTFMNKVNVCWAKTLTPV 1019 Query 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 LETAGI ++A++W+ +L F +D AYSPEVALN ICT+ YG DLD+GLFS SV + Y Sbjct 1020 LETAGISLSAEDWSELLPPFAQDVAYSPEVALNIICTKMYGFDLDTGLFSRPSVPMTYTK 1079 Query 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 +HWDNR GG+MYGF+ + + A R P+LRG SG+Q+ V E ++Q ++ NI+P NR Sbjct 1080 DHWDNRVGGKMYGFSQQAYDQLARRHPYLRGREKSGMQIVVTEMRIQRPRSDANIIPINR 1139 Query 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 RLPH+LV +++ R R E G+ MLLVSEYN+ +P+K++ W+AP GA T Sbjct 1140 RLPHSLVATHEYRRAARAEEFFTTTRGYTMLLVSEYNMNLPNKKITWLAPIGTQGAHHTA 1199 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 +L+LG+P G +D V VN+ T +R HHYQQC DH+M+LQML GD+L ++PGGSL ++A Sbjct 1200 NLNLGIPPLLGSFDAVVVNMPTPFRNHHYQQCEDHAMKLQMLAGDALRHIKPGGSLWVKA 1259 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE VV ALARKF +FRV +P+CVTSNTEVFL FS FDNG+RA+ LH AN+K + Sbjct 1260 YGYADRHSEHVVLALARKFKSFRVTQPSCVTSNTEVFLHFSIFDNGKRAIALHSANRKAN 1319 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 S++ N AG AP+YRV+R DIS E+AVVNAAN +G GVC A+ +KWP +F Sbjct 1320 
SIFQ-NTFLPAGSAPAYRVKRGDISNAPEDAVVNAANQQGVKGAGVCGAIYRKWPDAFGD 1378 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 ATP GTA VIHAVGPNFS +E EGDR+LA+AYRA A I+ I +VAVP Sbjct 1379 VATPTGTAVSKSVQDKLVIHAVGPNFSKCSEEEGDRDLASAYRAAAEIVMDKKITTVAVP 1438 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 LLSTG ++GGK+RV QSLNHLFTA D TDADV IYC DK WEKKI+EAID RT++E+V + Sbjct 1439 LLSTGIYAGGKNRVEQSLNHLFTAFDNTDADVTIYCMDKTWEKKIKEAIDHRTSVEMVQD 1498 Query 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 DV LE +LVRVHP S L GR GYS G+++SYLEGT+FHQTAVD+AE+ LWP L+++N Sbjct 1499 DVQLEEELVRVHPLSSLAGRKGYSTDSGRVFSYLEGTKFHQTAVDIAEMQVLWPALKESN 1558 Query 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 EQI Y LGE+MD IR KCP ED D+STPP+TVPCLCRYAMT ERV RL+ NT VC Sbjct 1559 EQIVAYTLGESMDQIRGKCPTEDTDASTPPRTVPCLCRYAMTPERVYRLKCTNTTQFTVC 1618 Query 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVP 1648 SSF LPKY I+GVQ+VKC+R++I D TVP Sbjct 1619 SSFELPKYHIQGVQRVKCERIIILDPTVP 1647 Range 2: 1694 to 1841 Score:55.1 bits(131), Expect:3e-07, Method:Compositional matrix adjust., Identities:60/166(36%), Positives:71/166(42%), Gaps:37/166(22%) Query 1750 PVPARRAITMPVPAPRV--------------RKVateppsepeapipaprkrrttsttpp 1795 P+PA R I PVPAPR V E P P+P PR +R Sbjct 1694 PIPAPRTIFRPVPAPRAPVLRTTPPPKPPRTFTVRAEVHQAPPTPVPPPRPKRAAKLARE 1753 Query 1796 HNPG----DFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXL 1851 +PG DF EL P + FGD IQ ++F L Sbjct 1754 MHPGFTFGDFGEHEVEELTASP----LTFGDF-----------AEGEIQGMGVEFEX--L 1796 Query 1852 GRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHED-ERIYPPAF 1896 GRAG YIFSSDTGPGHLQQRSV Q+ E +Y E+I+ P Sbjct 1797 GRAGGYIFSSDTGPGHLQQRSVLQN-CTAECIYEPAKLEKIHAPKL 1841 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural 
protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain P676)] Sequence ID: P36328.2 Length: 2493 Range 1: 3 to 1924 Score:2115 bits(5481), Expect:0.0, Method:Compositional matrix adjust., Identities:1078/1946(55%), Positives:1350/1946(69%), Gaps:77/1946(3%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+E DSPFL+ALQ++FP FEVE++QVT NDHANARAFSHLA+KLIE EV TIL Sbjct 3 KVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDTIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+ S H YHCICPM+ AEDP+RL YA KL K + DK + K+ +L V Sbjct 63 DIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELAAV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+ PDLE+ T CLH DE+CR +VAVYQDVYAV PTSLYHQA KGVR AYWIGFDTTP Sbjct 123 MSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ LAGAYP+YSTNWADE VL ARNIGLC++ + E R +SI+RKK L+PS+ V+FS Sbjct 183 FMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVLFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+Y E R LLRSWHLPSVFHL+GK ++TCRC+T+VSC+GYVVK+I ISPG+YGK Sbjct 243 VGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YA T H EGFL CK+TDT+ GERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 303 YAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WA+E + D EDE+PLG R+R L C WA Sbjct 363 
NQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRI-KLLLSARMAQGLP 480 F+ HKI ++YKRP+TQTI+KV S F SFV+P + S++L +G+R RI K+L + L Sbjct 423 FRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSPLI 482 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D EA+ A +E KE +EAE RAALPPL + + + DV+ + AGAG VETP Sbjct 483 TAEDIQEAKCAADEAKEVREAEELRAALPPL-AADFEEPTLEADVDLMLQEAGAGSVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT A + IGSY +LSPQ VLKSEKL+ IHPLAEQV V+THSGR GRY V+ Y Sbjct 542 RGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPYH 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G AIPV +FQALSESAT+VYNEREF+NR LHHIA +G ALNTDEE Y+ V+ Sbjct 602 GKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKPS 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + EY++D+D+K C+KKE +GL LTG+L++PPFHEFAYE L+ RPAAPY IGV+G Sbjct 662 EHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DVK+ +GLDV ARTVDS+LLNGCK V Sbjct 722 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LY+DEAFACH+GTL ALIA++RP K VLCGDPKQCGFFN+M LKVH+NH ICT+V H Sbjct 782 ETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT++VSTL Y +MRTTN T I IDTTGS+KP D++LTCFRGWVKQ Sbjct 841 KSISRRCTKSVTSVVSTLFYDKRMRTTNPKETKIVIDTTGSTKPKQDDLILTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLY+P SEHVNVLLTRTE+R+VWKTL Sbjct 901 LQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 
1020 +GDPWIK+LT G+F+AT+EEWQ EHD IMR + ERP D FQNKA VCWAK LV V Sbjct 961 AGDPWIKILTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVPV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L+TAGI MT ++WNT+ F D+A+S E+ LN++C R++G+DLDSGLFSA +V L NN Sbjct 1021 LKTAGIDMTTEQWNTVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIRNN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN P MYG N EV R+ + R+P L + +G ++ L+ + N+VP NRR Sbjct 1081 HWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNTGTLRNYDPRINLVPVNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIA-PPRVSGADRTY 1199 LPHALV + + + K+ G +L+V E L++P K+V W++ P + R Sbjct 1141 LPHALVLHHNEHPQSDFSSFVSKLKGRTVLVVGE-KLSVPGKKVDWLSDQPEATFRAR-- 1197 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 LDLG+P D +YD+VF+N+ T Y+ HHYQQC DH+++L ML + L PGG+ + Sbjct 1198 -LDLGIPGDVPKYDIVFINVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSIG 1256 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE ++ A+AR+F RV +P TEV +F +D R ++ + L+ Sbjct 1257 YGYADRASESIIGAIARQFKFSRVCKPKSSHEETEVLFVFIGYDRKARTHNPYKLSSTLT 1316 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 ++Y + LH AGCAPSY V R DI+ E ++NAAN+KG GVC A+ KK+P SF Sbjct 1317 NIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDL 1376 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 VG A++++ +IHAVGPNF+ V+E EGD++LA AY ++A I++ NN KSVA+P Sbjct 1377 QPIEVGKARLVKGAAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAIP 1436 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIE--LV 1497 LLSTG FSG KDR+ QSLNHL TALD TDADV IYCRDK WE ++EA+ RR A+E + Sbjct 1437 LLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICI 1496 Query 1498 SEDVTL---ETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPR 1554 S+D ++ + +LVRVHP S L GR GYS +DGK +SYLEGT+FHQ A D+AEI+ +WP Sbjct 1497 SDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPV 1556 Query 
1555 LQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTK 1614 +ANEQ+C+Y LGE+M SIR+KCPVE++++STPP T+PCLC +AMT ERV RL+ + + Sbjct 1557 ATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPE 1616 Query 1615 NIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYI----------------- 1657 I VCSSFPLPKYRI GVQK++C + ++F VP+ + PRKY+ Sbjct 1617 QITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPPVEETPESPAENQ 1676 Query 1658 --QQPPEQ--LDNVSLTST----------TSTGSAWSLPSETTYETMEVVAEVHTEppip 1703 + PEQ L NV T T S L T++ ++V A++H P + Sbjct 1677 STEGTPEQPALVNVDATRTRMPEPIIIEEEEEDSISLLSDGPTHQVLQVEADIHGSPSVS 1736 Query 1704 pprrrrAAVAQLRQD-LEVTEEIEPYVIQQAEIMVMER---VATTDIRAIPVPARRAI-- 1757 + D L + + ++ + + + + RA PVPA R + Sbjct 1737 SSSWSIPHASDFDVDSLSILDTLDGASVTSGAVSAETNSYFARSMEFRARPVPAPRTVFR 1796 Query 1758 TMPVPAPRVRKVateppsepeapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLD 1817 P PAPR R PG V RV E+L+ Sbjct 1797 NPPHPAPRTRTPPLAHSRASSRTSLVSTP-----------PG--VNRVITR-----EELE 1838 Query 1818 IQFGDLEPRRRNTRDWDVS---------TGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHL 1868 P R +R VS T +F Q X AGAYIFSSDTG GHL Sbjct 1839 ALTPSRAPSRSASRTSLVSNPPGVNRVITREEFEAFVAQQQXRFDAGAYIFSSDTGQGHL 1898 Query 1869 QQRSVRQHELPCETLYAHEDERIYPP 1894 QQ+SVRQ L L E E Y P Sbjct 1899 QQKSVRQTVLSEVVLERTELEISYAP 1924 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Mena II)] Sequence ID: Q9WJC7.3 Length: 2499 Range 1: 3 to 1846 Score:2114 bits(5478), 
Expect:0.0, Method:Compositional matrix adjust., Identities:1053/1851(57%), Positives:1313/1851(70%), Gaps:91/1851(4%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+E DSPFL+ALQ++FP FEVE++QVT NDHANARAFSHLA+KLIE EV TIL Sbjct 3 KVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVEPSDTIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+ S H YHCICPMK AEDP+RL YA KL K + DK + K+ +L +V Sbjct 63 DIGSAPARRMYSKHKYHCICPMKCAEDPDRLFKYAAKLKKNCKEITDKELDKKMKELAEV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M PDLE+ T CLH DETCR +VAVYQDVYAV PTSLYHQA KGVR AYWIGFDTTP Sbjct 123 MNDPDLETETICLHDDETCRFEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ LAGAYP+YSTNWADE VL ARNIGLC++ + E R +SI+RKK L+PS+ V+FS Sbjct 183 FMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKFLKPSNNVLFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+Y E R LLRSWHLPSVFHL+GK ++TCRC+T+VSC+GYVVK+I ISPG+YGK Sbjct 243 VGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YA T H EGFL CK+TDT+ GERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 303 YAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WA+E + D EDE+PLG R+R L C WA Sbjct 363 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQ-GLP 480 F+ HKI ++YKRP+TQTI+KV S F SFV+P + SS+L +G+R RIK LL + + L Sbjct 423 FRKHKITSVYKRPDTQTIIKVNSDFHSFVLPRIGSSTLEIGLRTRIKKLLEEPVDRPPLI 482 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D EA+ A +E KE +EAE RAALPPL S + + DV+ + AGAG VETP Sbjct 483 TADDIQEAKNAADEAKEVKEAEELRAALPPL-SADVEEPALEADVDLMLQEAGAGSVETP 541 Query 541 
RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT + IGSY +LSPQ VL+SEKL IHPLAEQV V+THSGR GRY V+ Y Sbjct 542 RGLIKVTSYGGEDKIGSYAVLSPQAVLRSEKLTCIHPLAEQVIVITHSGRKGRYAVEPYH 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G AIPV +FQALSESAT+VYNEREF+NR LHHIA +G ALNTDEE Y V+ Sbjct 602 GKVVVPEGQAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYRVVKPS 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 E EY++D+DKK C+KKE SGL LTG+L++PPFHEFAYE L+ RPAAPY IGV+G Sbjct 662 EHEGEYLYDIDKKQCVKKELVSGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DVK+ +GLDV ARTVDS+LLNGCK V Sbjct 722 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LY+DEAFACH+GTL ALIA++RP K VLCGDPKQCGFFN+M LKVH+NH ICT+V H Sbjct 782 ETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT++VSTL Y +MRTTN ++ I+IDTTGS+K D++LTCFRGWVKQ Sbjct 841 KSISRRCTKSVTSVVSTLFYDKRMRTTNPRDSKIEIDTTGSTKSKKEDLILTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+G+E+MTAAASQGLTRK VYAVR KVNENPLY+P SEHVNVLLTRTE+++VWKTL Sbjct 901 LQIDYKGNEIMTAAASQGLTRKSVYAVRYKVNENPLYAPTSEHVNVLLTRTEDKIVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT GDF+AT+EEWQ EHD IMR + E+P D FQNKA VCWAK LV V Sbjct 961 AGDPWIKTLTAKYPGDFTATMEEWQAEHDAIMRHILEKPDPTDVFQNKANVCWAKALVPV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L+TAGI +T ++WNT+ F+ED+A+S E+ LN++C RY+G+DLDSGLFSA +V L NN Sbjct 1021 LKTAGIDLTTEQWNTVDYFKEDKAHSAEIVLNQLCVRYFGLDLDSGLFSAPTVPLSIRNN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN P MYG NHEV R+ + R+P L + +G ++ L+ + N+VP NRR Sbjct 1081 
HWDNSPSPNMYGLNHEVVRQLSRRYPQLPRAVTTGRVYDMNTGTLRNYDPRINLVPVNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY- 1199 LPHALVT + + K+ G +L+V E + I K V W++ D T+ Sbjct 1141 LPHALVTQHADHPPSDFSAFVSKLKGRTVLVVGE-KMNISGKAVDWLS----ETPDATFR 1195 Query 1200 -DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMR 1258 LDLG+P + +YD+VFVN+ T+YR HHYQQC DH+++L ML + L PGG+ + Sbjct 1196 ARLDLGIPTELPKYDIVFVNVRTQYRYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSI 1255 Query 1259 AYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKL 1318 YGYADR SE ++ A+AR+F RV +P TEV +F FD R ++ + L Sbjct 1256 GYGYADRASESIIGAVARQFKFSRVCKPKVSKEETEVLFVFIGFDRKTRTHNPYKLSSTL 1315 Query 1319 SSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 +++Y +GLH AGCAPSY V R DI+ E +VNAAN+KG GVC A+ +K+P SF Sbjct 1316 TNIYTGSGLHEAGCAPSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPESFD 1375 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 VG A++++ +IHAVGPNFS V+E EGD++LA AY ++A II+ NN +SVA+ Sbjct 1376 LQPIEVGKARLVKGSSKHIIHAVGPNFSKVSEVEGDKQLAEAYESIAKIINDNNYRSVAI 1435 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIE--L 1496 PLLSTG F+G KDR+MQSLNHL TALD TDADV IYCRDK WE ++E + RR A+E Sbjct 1436 PLLSTGIFAGNKDRLMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVEEIC 1495 Query 1497 VSEDVTL---ETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWP 1553 +SED ++ + +LVRVHP S L GR GYS +DGK +SYLEGT+FHQ A DMAEI+ +WP Sbjct 1496 ISEDSSVAEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINAMWP 1555 Query 1554 RLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNT 1613 +ANEQ+CLY LGE+M SIR+KCPVE++++STPP T+PCLC +AMT ERV RL+ + Sbjct 1556 TATEANEQVCLYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRP 1615 Query 1614 KNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYI---------------- 1657 + I VCSSFPLPKYRI GVQK++C ++F VP + PRKY+ Sbjct 1616 EQITVCSSFPLPKYRITGVQKIQCSHPILFSPKVPEYIHPRKYLADATPADNEAAEPTME 1675 Query 1658 -------------QQPPEQLDNVSLTS-------------------TTSTGSAWSLPSET 1685 +QP E+ D++S+ S 
++ S+WS+P + Sbjct 1676 CVQPLQEERPANTEQPVEEDDSISVLSEDAPHQVHQVEAEVHRSLCASAQSSSWSIPRAS 1735 Query 1686 TYETMEVVAEVHTEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQA----EIMVMERV 1741 +E++ V+ + I LR +P V + I V+E + Sbjct 1736 DFESLSVLDSLGANDTISMGSSSNETALALRTIFRTPPIPKPRVRSTSTDVDSISVLESL 1795 Query 1742 -ATTDIRAI---------------------PVPARRAI--TMPVPAPRVRK 1768 +T+D+R+I PVPA R I T PVP PR R+ Sbjct 1796 GSTSDVRSIGSSSDETDVSVFDKGLEFMARPVPAPRTIFRTPPVPKPRARR 1846 Range 2: 1890 to 1930 Score:47.8 bits(112), Expect:6e-05, Method:Compositional matrix adjust., Identities:26/41(63%), Positives:27/41(65%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSSDTG GHLQQ+SVRQ L L E E Y P Sbjct 1890 AGAYIFSSDTGQGHLQQKSVRQTVLSEVVLERTELEISYAP 1930 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain 3880)] Sequence ID: P36327.3 Length: 2485 Range 1: 3 to 1916 Score:2114 bits(5478), Expect:0.0, Method:Compositional matrix adjust., Identities:1079/1938(56%), Positives:1351/1938(69%), Gaps:69/1938(3%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+E DSPFL+ALQ++FP FEVE++QVT NDHANARAFSHLA+KLIE EV TIL Sbjct 3 KVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDTIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+ S H YHCICPM+ AEDP+RL YA KL K + DK + K+ +L V Sbjct 63 
DIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELAAV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+ PDLE+ T CLH DE+CR +VAVYQDVYAV PTSLYHQA KGVR AYWIGFDTTP Sbjct 123 MSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ LAGAYP+YSTNWADE VL ARNIGLC++ + E R +SI+RKK L+PS+ V+FS Sbjct 183 FMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVLFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+Y E R LLRSWHLPSVFHL+GK ++TCRC+T+VSC+GYVVK+I ISPG+YGK Sbjct 243 VGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YA T H EGFL CK+TDT+ GERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 303 YAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WA+E + D EDE+PLG R+R L C WA Sbjct 363 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRI-KLLLSARMAQGLP 480 F+ HKI ++YKRP+TQTI+KV S F SFV+P + S++L +G+R RI K+L + L Sbjct 423 FRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSPLI 482 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D EA+ A +E KE +EAE RA LPPL + + + DV+ + AGAG VETP Sbjct 483 TAEDIQEAKCAADEAKEVREAEELRAVLPPL-AADVEEPTLEADVDLMLQEAGAGSVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT A + IGSY +LSPQ VLKSEKL+ IHPLAEQV V+THSGR GRY V+ Y Sbjct 542 RGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPYH 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G AIPV +FQALSESAT+VYNEREF+NR LHHIA +G ALNTDEE Y+ V+ Sbjct 602 GKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKPS 661 Query 661 
RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + EY++D+D+K C+KKE +GL LTG+L++PPFHEFAYE L+ RPAAPY IGV+G Sbjct 662 EHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DVKR +GLDV ARTVDS+LLNGCK V Sbjct 722 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKRIKGLDVNARTVDSVLLNGCKYPV 781 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LY+DEAFACH+GTL ALIA++RP K VLCGDPKQCGFFN+M LKVH+NH ICT+V H Sbjct 782 ETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT++VSTL Y +MRTTN T I+IDTTGS+KP D++LTCFRGWVKQ Sbjct 841 KSISRRCTKSVTSVVSTLFYDKRMRTTNPKETKIEIDTTGSTKPKQDDLILTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+G+EVMTAAASQGLTRKGVYAVR KVNENPLY+P SEHVNVLLTRTE+R+VWKTL Sbjct 901 LQIDYKGNEVMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT G+F+AT+EEWQ EHD IMR + ERP D FQNKA VCWAK LV V Sbjct 961 AGDPWIKTLTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVPV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L+TAGI MT ++WNT+ F D+A+S E+ LN++C R++G+DLDSGLFSA +V L NN Sbjct 1021 LKTAGIDMTTEQWNTVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIRNN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN P MYG N EV R+ + R+P L + +G ++ L+ + N+VP NRR Sbjct 1081 HWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVTTGRVYDMNTGTLRNYDPRINLVPVNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAP-PRVSGADRTY 1199 LPHALV + + + K+ G +L+V E L++P K V W++ P + R Sbjct 1141 LPHALVLHHNEHPQSDFSSFVSKLKGRTVLVVGE-KLSVPGKTVDWLSDRPEATFRAR-- 1197 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 LDLG+P D +YD++F+N+ T Y+ HHYQQC DH+++L ML + L PGG+ + Sbjct 1198 
-LDLGIPGDVPKYDIIFINVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSIG 1256 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE ++ A+AR+F RV +P TEV +F +D R ++ + L+ Sbjct 1257 YGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRKARTHNPYKLSSTLT 1316 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 ++Y + LH AGCAPSY V R DI+ E ++NAAN+KG GVC A+ KK+P SF Sbjct 1317 NIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDL 1376 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 VG A++++ +IHAVGPNF+ V+E EGD++LA AY ++A I++ NN KSVA+P Sbjct 1377 QPIEVGKARLVKGAAKHIIHAVGPNFNKVSEIEGDKQLAEAYESIAKIVNDNNYKSVAIP 1436 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIE--LV 1497 LLSTG FSG KDR+ QSLNHL TALD TDADV IYCRDK WE ++EA+ RR A+E + Sbjct 1437 LLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICI 1496 Query 1498 SEDVTL---ETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPR 1554 S+D ++ + +LVRVHP S L GR GYS +DGK +SYLEGT+FHQ A D+AEI+ +WP Sbjct 1497 SDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPV 1556 Query 1555 LQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTK 1614 +ANEQ+C+Y LGE+M SIR+KCPVE++++STPP T+PCLC +AMT ERV RL+ + + Sbjct 1557 ATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPE 1616 Query 1615 NIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNVSLTST-- 1672 I VCSSFPLPKYRI GVQK++C + ++F VP+ + PRKY+ + P +N S T Sbjct 1617 QITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPTVEENQSTEGTPE 1676 Query 1673 ---------------------TSTGSAWSLPSETTYETMEVVAEVHTEppippprrrrAA 1711 S L T++ ++V A++H P Sbjct 1677 QPTLITVGETRTRTPEPIIIEEEEDSISLLSDGPTHQVLQVEADIHGPPSASSSSWSIPH 1736 Query 1712 VAQLRQD-LEVTEEIEPYVIQQAEIMVMER---VATTDIRAIPVPARRAI--TMPVPAPR 1765 + D L + + +E + E V + + A PVPA R + P PAPR Sbjct 1737 ASDFDVDSLSILDTLEGASVTSEEASVETNSHFARSMEFLARPVPAPRTVFRNPPQPAPR 1796 Query 1766 VRKVateppsepeapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEP 1825 R P AP + + + + PG V RV E+L+ P 
Sbjct 1797 TR-----------TPSLAPSRASSRISLVSNPPG--VNRVITR-----EELEALTPSRAP 1838 Query 1826 RRRNTRDWDVS---------TGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHLQQRSVRQH 1876 R +R VS T +F Q X AGAYIFSSDTG GHLQQ+SVRQ Sbjct 1839 SRSVSRTSLVSNPPGVNRVITREEFEAFVAQQQXRFDAGAYIFSSDTGQGHLQQKSVRQT 1898 Query 1877 ELPCETLYAHEDERIYPP 1894 L L E E Y P Sbjct 1899 VLSEVVLERTELEISYAP 1916 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Trinidad donkey)] Sequence ID: P27282.3 Length: 2493 Range 1: 3 to 1924 Score:2109 bits(5465), Expect:0.0, Method:Compositional matrix adjust., Identities:1077/1946(55%), Positives:1348/1946(69%), Gaps:77/1946(3%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+E DSPFL+ALQ++FP FEVE++QVT NDHANARAFSHLA+KLIE EV TIL Sbjct 3 KVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDTIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+ S H YHCICPM+ AEDP+RL YA KL K + DK + K+ +L V Sbjct 63 DIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELAAV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+ PDLE+ T CLH DE+CR +VAVYQDVYAV PTSLYHQA KGVR AYWIGFDTTP Sbjct 123 MSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ LAGAYP+YSTNWADE VL ARNIGLC++ + E R +SI+RKK L+PS+ V+FS Sbjct 183 
FMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVLFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+Y E R LLRSWHLPSVFHL+GK ++TCRC+T+VSC+GYVVK+I ISPG+YGK Sbjct 243 VGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YA T H EGFL CK+TDT+ GERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 303 YAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WA+E + D EDE+PLG R+R L C WA Sbjct 363 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRI-KLLLSARMAQGLP 480 F+ HKI ++YKRP+TQTI+KV S F SFV+P + S++L +G+R RI K+L + L Sbjct 423 FRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSPLI 482 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D EA+ A +E KE +EAE RAALPPL + + + DV+ + AGAG VETP Sbjct 483 TAEDVQEAKCAADEAKEVREAEELRAALPPL-AADVEEPTLEADVDLMLQEAGAGSVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT A + IGSY +LSPQ VLKSEKL+ IHPLAEQV V+THSGR GRY V+ Y Sbjct 542 RGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPYH 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G AIPV +FQALSESAT+VYNEREF+NR LHHIA +G ALNTDEE Y+ V+ Sbjct 602 GKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKPS 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 + EY++D+D+K C+KKE +GL LTG+L++PPFHEFAYE L+ RPAAPY IGV+G Sbjct 662 EHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DVK+ +GLDV ARTVDS+LLNGCK V Sbjct 722 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHPV 781 Query 781 
ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LY+DEAFACH+GTL ALIA++RP K VLCGDPKQCGFFN+M LKVH+NH ICT+V H Sbjct 782 ETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT++VSTL Y KMRTTN T I IDTTGS+KP D++LTCFRGWVKQ Sbjct 841 KSISRRCTKSVTSVVSTLFYDKKMRTTNPKETKIVIDTTGSTKPKQDDLILTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLY+P SEHVNVLLTRTE+R+VWKTL Sbjct 901 LQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT G+F+AT+EEWQ EHD IMR + ERP D FQNKA VCWAK LV V Sbjct 961 AGDPWIKTLTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVPV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L+TAGI MT ++WNT+ F D+A+S E+ LN++C R++G+DLDSGLFSA +V L NN Sbjct 1021 LKTAGIDMTTEQWNTVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIRNN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN P MYG N EV R+ + R+P L + +G ++ L+ + N+VP NRR Sbjct 1081 HWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNTGTLRNYDPRINLVPVNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAP-PRVSGADRTY 1199 LPHALV + + + K+ G +L+V E L++P K V W++ P + R Sbjct 1141 LPHALVLHHNEHPQSDFSSFVSKLKGRTVLVVGE-KLSVPGKMVDWLSDRPEATFRAR-- 1197 Query 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 LDLG+P D +YD++FVN+ T Y+ HHYQQC DH+++L ML + L PGG+ + Sbjct 1198 -LDLGIPGDVPKYDIIFVNVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSIG 1256 Query 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 YGYADR SE ++ A+AR+F RV +P TEV +F +D R ++ + L+ Sbjct 1257 YGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRKARTHNPYKLSSTLT 1316 Query 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 ++Y + LH AGCAPSY V R DI+ E ++NAAN+KG GVC A+ KK+P SF Sbjct 1317 
NIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDL 1376 Query 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 VG A++++ +IHAVGPNF+ V+E EGD++LA AY ++A I++ NN KSVA+P Sbjct 1377 QPIEVGKARLVKGAAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAIP 1436 Query 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIE--LV 1497 LLSTG FSG KDR+ QSLNHL TALD TDADV IYCRDK WE ++EA+ RR A+E + Sbjct 1437 LLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICI 1496 Query 1498 SEDVTL---ETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPR 1554 S+D ++ + +LVRVHP S L GR GYS +DGK +SYLEGT+FHQ A D+AEI+ +WP Sbjct 1497 SDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPV 1556 Query 1555 LQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTK 1614 +ANEQ+C+Y LGE+M SIR+KCPVE++++STPP T+PCLC +AMT ERV RL+ + + Sbjct 1557 ATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPE 1616 Query 1615 NIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYI----------------- 1657 I VCSSFPLPKYRI GVQK++C + ++F VP+ + PRKY+ Sbjct 1617 QITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPPVDETPEPSAENQ 1676 Query 1658 ------QQPPEQLDNVSLTST--------TSTGSAWSLPSETTYETMEVVAEVHTEppip 1703 +QPP ++ + T T S L T++ ++V A++H P + Sbjct 1677 STEGTPEQPPLITEDETRTRTPEPIIIEEEEEDSISLLSDGPTHQVLQVEADIHGPPSVS 1736 Query 1704 pprrrrAAVAQLRQD-LEVTEEIEPYVIQQAEIMVMER---VATTDIRAIPVPARRAI-- 1757 + D L + + +E + + + A PVPA R + Sbjct 1737 SSSWSIPHASDFDVDSLSILDTLEGASVTSGATSAETNSYFAKSMEFLARPVPAPRTVFR 1796 Query 1758 TMPVPAPRVRKVateppsepeapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLD 1817 P PAPR R + P PG V RV E+L+ Sbjct 1797 NPPHPAPRTRTPSLAPSRACSRTSLVSTP-----------PG--VNRVITR-----EELE 1838 Query 1818 IQFGDLEPRRRNTRDWDVS---------TGIQFGDIDFNQSXLGRAGAYIFSSDTGPGHL 1868 P R +R VS T +F Q X AGAYIFSSDTG GHL Sbjct 1839 ALTPSRTPSRSVSRTSLVSNPPGVNRVITREEFEAFVAQQQXRFDAGAYIFSSDTGQGHL 1898 Query 1869 QQRSVRQHELPCETLYAHEDERIYPP 1894 QQ+SVRQ L L E E Y P Sbjct 1899 QQKSVRQTVLSEVVLERTELEISYAP 1924 >RecName: Full=Polyprotein P1234; Short=P1234; 
AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ockelbo virus] Sequence ID: P27283.2 Length: 2515 Range 1: 6 to 1676 Score:2108 bits(5463), Expect:0.0, Method:Compositional matrix adjust., Identities:1015/1675(61%), Positives:1266/1675(75%), Gaps:21/1675(1%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VDV+ SPF+ LQK+FP FEV +QQ TPNDHANARAFSHLA+KLIE EVPT TILD Sbjct 6 VNVDVDPQSPFVVQLQKSFPQFEVVAQQATPNDHANARAFSHLASKLIELEVPTTATILD 65 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+ S+H YHC+CPM+S EDP+R+ YA KLA+ + + +KN+ KI DL+ V+ Sbjct 66 IGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAEKACKITNKNLHEKIKDLRTVL 125 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 TPD E+P+ C H D TC TRAE +V QDVY ++AP ++YHQA+KGVRT YWIGFDTT F Sbjct 126 DTPDAETPSLCFHNDVTCNTRAEYSVMQDVY-INAPGTIYHQAMKGVRTLYWIGFDTTQF 184 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 MF A+AG+YPAY+TNWADE+VL+ARNIGLC+T LSEGR GKLSIMRKK L+P RV FSV Sbjct 185 MFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGRTGKLSIMRKKELKPGSRVYFSV 244 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY E R L+SWHLPSVFHLKGK S+TCRCDTVVSCEGYVVKKITISPGI G+TV Y Sbjct 245 GSTLYPEHRASLQSWHLPSVFHLKGKQSYTCRCDTVVSCEGYVVKKITISPGITGETVGY 304 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVT+++EGFL+CK+TDTV+GERVSFPVCTY+PATICDQMTGI+ATD++P+DAQKLLVGLN Sbjct 305 
AVTNNSEGFLLCKVTDTVKGERVSFPVCTYIPATICDQMTGIMATDISPDDAQKLLVGLN 364 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIV+NG+T RNTNTM+NYLLP +AQ FSKWA+E + D+++EK LGTRER LT CLWAF Sbjct 365 QRIVINGKTNRNTNTMQNYLLPTIAQGFSKWAKERKEDLDNEKMLGTRERKLTYGCLWAF 424 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPYS 482 ++ K+H+ Y+ P TQT VKVP++F +F + S+W++SL M +RQ++KL L + + L Sbjct 425 RTKKVHSFYRPPGTQTSVKVPASFSAFPMSSVWTTSLPMSLRQKMKLALQPKKEEKLLQV 484 Query 483 GDRTearaaeeeekeaqeaeLT--RAALPPLVSGSCADDIAQV--DVEELTFRAGAGVVE 538 + A E +EA R ALPPLV+ + A+V +VE L GA +VE Sbjct 485 PEELVMEAKAAFEDAQEEARAEKLREALPPLVADKDIEAAAEVVCEVEGLQADIGAALVE 544 Query 539 TPRNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDK 598 TPR +++ PQA+D +IG Y+++SP +VLK+ KLAP HPLA+QV ++THSGR+GRY V+ Sbjct 545 TPRGHVRIIPQANDRMIGQYIVVSPTSVLKNAKLAPAHPLADQVKIITHSGRAGRYAVEP 604 Query 599 YDGRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVR 658 YD +VL+P G+A+P EF ALSESAT+VYNEREF+NRKL+HIA++GPA NT+EE Y+ + Sbjct 605 YDAKVLMPAGSAVPWPEFLALSESATLVYNEREFVNRKLYHIAMHGPAKNTEEEQYKVTK 664 Query 659 AERAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGV 718 AE AETEYVFDVDKK C+KKEEASGLVL+G+L NPP+HE A EGLK RPA PY IGV Sbjct 665 AELAETEYVFDVDKKRCVKKEEASGLVLSGELTNPPYHELALEGLKTRPAVPYKVETIGV 724 Query 719 FGVPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKR 778 G PGSGKSAIIK+ VT RDLV SGKKENC+EI DV R RG+ +T++TVDS++LNGC + Sbjct 725 IGTPGSGKSAIIKSTVTARDLVTSGKKENCREIEADVLRLRGMQITSKTVDSVMLNGCHK 784 Query 779 GVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNH---NIC 835 VE LYVDEAFACH+G LLALIA+VRP KVVLCGDPKQCGFFN+MQLKVH+NH +IC Sbjct 785 AVEVLYVDEAFACHAGALLALIAIVRPRKKVVLCGDPKQCGFFNMMQLKVHFNHPERDIC 844 Query 836 TRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFR 895 T+ +K ISRRCT PVTAIVSTLHY GKM+TTN C I+ID TG++KP GDI+LTCFR Sbjct 845 TKTFYKFISRRCTQPVTAIVSTLHYDGKMKTTNPCKKNIEIDITGATKPKPGDIILTCFR 904 Query 896 GWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRL 955 
GWVKQLQIDY GHEVMTAAASQGLTRKGVYAVRQKVNEN LY+ SEHVNVLLTRTE+RL Sbjct 905 GWVKQLQIDYPGHEVMTAAASQGLTRKGVYAVRQKVNENALYAITSEHVNVLLTRTEDRL 964 Query 956 VWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAK 1015 VWKTL GDPWIK LTNVP+G+F AT+E+W+ EH GI+ +N +PF K VCWAK Sbjct 965 VWKTLQGDPWIKQLTNVPKGNFQATIEDWEAEHKGIIAAINSPAPRTNPFSCKTNVCWAK 1024 Query 1016 CLVQVLETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVS 1074 L +L TAGI +T +W+ + F +D+ +S AL+ IC +++G+DL SGLFS QS+ Sbjct 1025 ALEPILATAGIVLTGCQWSELFPQFADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIP 1084 Query 1075 LFYEN-------NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQP 1127 L Y HWDN PG R YG++H VA + + RFP + G QL++ + + Sbjct 1085 LTYHPADSARPVAHWDNSPGTRKYGYDHAVAAELSRRFPVFQL-AGKGTQLDLQTGRTRV 1143 Query 1128 FSAECNIVPSNRRLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWI 1187 SA+ N+VP NR LPHALV +++ + VE L + H +L+VSE + PHKR+ WI Sbjct 1144 ISAQHNLVPVNRNLPHALVPEHKEKQPGPVEKFLNQFKHHSVLVVSEEKIEAPHKRIEWI 1203 Query 1188 APPRVSGADRTYDLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLH 1247 AP ++GAD+ Y+L G P A RYDLVF+NI T+YR HH+QQC DH+ L+ L +L+ Sbjct 1204 APIGIAGADKNYNLAFGFPPQA-RYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSALN 1262 Query 1248 LLRPGGSLLMRAYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGR- 1306 L PGG+L++++YGYADR SE VVTALARKF RP CV+SNTE++L+F DN R Sbjct 1263 CLNPGGTLVVKSYGYADRNSEDVVTALARKFVRVSAARPECVSSNTEMYLIFRQLDNSRT 1322 Query 1307 RAVTLHQANQKLSSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVC 1366 R T H N +SS+Y G APSYR +R +I+ EEAVVNAAN G +GVC Sbjct 1323 RQFTPHHLNCVISSVYE-GTRDGVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGVC 1381 Query 1367 RAVAKKWPSSFKGAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVAS 1426 RA+ K+WP+SF +AT GTAK+ G VIHAVGP+F EAE + L AY AVA Sbjct 1382 RAIYKRWPNSFTDSATETGTAKLTVCHGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVAD 1441 Query 1427 IISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQE 1486 +++ +NIKSVA+PLLSTG ++ GKDR+ SLN L TALD TDADV IYC DK W+++I Sbjct 1442 LVNEHNIKSVAIPLLSTGIYAAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERIDA 1501 Query 1487 
AIDRRTAI-ELVSEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDM 1545 + + ++ EL ED+ ++ +LV +HPDSCL GR G+S T GKLYSY EGT+FHQ A DM Sbjct 1502 VLQLKESVTELKDEDMEIDDELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDM 1561 Query 1546 AEISTLWPRLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERV 1605 AEI L+P Q++NEQ+C Y LGETM++IR KCPV+ SS+PPKT+PCLC YAMT ERV Sbjct 1562 AEIKVLFPNDQESNEQLCAYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERV 1621 Query 1606 ARLRMNNTKNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQP 1660 RLR NN K + VCSS PLPKY+I+ VQKV+C +V++F+ P+ V RKYI+ P Sbjct 1622 HRLRSNNVKEVTVCSSTPLPKYKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVP 1676 Range 2: 1900 to 1945 Score:51.2 bits(121), Expect:5e-06, Method:Compositional matrix adjust., Identities:25/46(54%), Positives:30/46(65%), Gaps:0/46(0%) Query 1851 LGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 L G YIFS+DTGPGHLQ +SV Q++L TL + ERIY P Sbjct 1900 LTGVGGYIFSTDTGPGHLQMKSVLQNQLTEPTLERNVLERIYAPVL 1945 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; AltName: Full=p270 nonstructural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sindbis virus] Sequence ID: P03317.2 Length: 2513 Range 1: 6 to 1777 Score:2107 bits(5458), Expect:0.0, Method:Compositional matrix adjust., Identities:1035/1795(58%), Positives:1306/1795(72%), Gaps:51/1795(2%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VDV+ SPF+ LQK+FP FEV +QQVTPNDHANARAFSHLA+KLIE EVPT TILD Sbjct 6 VNVDVDPQSPFVVQLQKSFPQFEVVAQQVTPNDHANARAFSHLASKLIELEVPTTATILD 
65 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+ S+H YHC+CPM+S EDP+R+ YA KLA+ + + +KN+ KI DL+ V+ Sbjct 66 IGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAEKACKITNKNLHEKIKDLRTVL 125 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 TPD E+P+ C H D TC RAE +V QDVY ++AP ++YHQA+KGVRT YWIGFDTT F Sbjct 126 DTPDAETPSLCFHNDVTCNMRAEYSVMQDVY-INAPGTIYHQAMKGVRTLYWIGFDTTQF 184 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 MF A+AG+YPAY+TNWADE+VL+ARNIGLC+T LSEGR GKLSIMRKK L+P RV FSV Sbjct 185 MFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGRTGKLSIMRKKELKPGSRVYFSV 244 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY E R L+SWHLPSVFHL GK S+TCRCDTVVSCEGYVVKKITISPGI G+TV Y Sbjct 245 GSTLYPEHRASLQSWHLPSVFHLNGKQSYTCRCDTVVSCEGYVVKKITISPGITGETVGY 304 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 AVTH++EGFL+CK+TDTV+GERVSFPVCTY+PATICDQMTGI+ATD++P+DAQKLLVGLN Sbjct 305 AVTHNSEGFLLCKVTDTVKGERVSFPVCTYIPATICDQMTGIMATDISPDDAQKLLVGLN 364 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIV+NGRT RNTNTM+NYLLP++AQ FSKWA+E + D+++EK LGTRER LT CLWAF Sbjct 365 QRIVINGRTNRNTNTMQNYLLPIIAQGFSKWAKERKDDLDNEKMLGTRERKLTYGCLWAF 424 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPYS 482 ++ K+H+ Y+ P TQT VKVP++F +F + S+W++SL M +RQ++KL L + + L Sbjct 425 RTKKVHSFYRPPGTQTCVKVPASFSAFPMSSVWTTSLPMSLRQKLKLALQPKKEEKLLQV 484 Query 483 GDRTearaaeeeekeaqeaeLT--RAALPPLVSGSCADDIAQV--DVEELTFRAGAGVVE 538 + A E +EA R ALPPLV+ + A+V +VE L GA +VE Sbjct 485 SEELVMEAKAAFEDAQEEARAEKLREALPPLVADKGIEAAAEVVCEVEGLQADIGAALVE 544 Query 539 TPRNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDK 598 TPR +++ PQA+D +IG Y+++SP +VLK+ KLAP HPLA+QV ++THSGRSGRY V+ Sbjct 545 TPRGHVRIIPQANDRMIGQYIVVSPNSVLKNAKLAPAHPLADQVKIITHSGRSGRYAVEP 604 Query 599 YDGRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVR 658 YD +VL+P G A+P EF ALSESAT+VYNEREF+NRKL+HIA++GPA NT+EE Y+ + 
Sbjct 605 YDAKVLMPAGGAVPWPEFLALSESATLVYNEREFVNRKLYHIAMHGPAKNTEEEQYKVTK 664 Query 659 AERAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGV 718 AE AETEYVFDVDKK C+KKEEASGLVL+G+L NPP+HE A EGLK RPA PY IGV Sbjct 665 AELAETEYVFDVDKKRCVKKEEASGLVLSGELTNPPYHELALEGLKTRPAVPYKVETIGV 724 Query 719 FGVPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKR 778 G PGSGKSAIIK+ VT RDLV SGKKENC+EI DV R RG+ +T++TVDS++LNGC + Sbjct 725 IGTPGSGKSAIIKSTVTARDLVTSGKKENCREIEADVLRLRGMQITSKTVDSVMLNGCHK 784 Query 779 GVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNH---NIC 835 VE LYVDEAFACH+G LLALIA+VRP KVVLCGDP QCGFFN+MQLKVH+NH +IC Sbjct 785 AVEVLYVDEAFACHAGALLALIAIVRPRKKVVLCGDPMQCGFFNMMQLKVHFNHPEKDIC 844 Query 836 TRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFR 895 T+ +K ISRRCT PVTAIVSTLHY GKM+TTN C I+ID TG++KP GDI+LTCFR Sbjct 845 TKTFYKYISRRCTQPVTAIVSTLHYDGKMKTTNPCKKNIEIDITGATKPKPGDIILTCFR 904 Query 896 GWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRL 955 GWVKQLQIDY GHEVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE+RL Sbjct 905 GWVKQLQIDYPGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAITSEHVNVLLTRTEDRL 964 Query 956 VWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAK 1015 VWKTL GDPWIK TN+P+G+F AT+E+W+ EH GI+ +N +PF K VCWAK Sbjct 965 VWKTLQGDPWIKQPTNIPKGNFQATIEDWEAEHKGIIAAINSPTPRANPFSCKTNVCWAK 1024 Query 1016 CLVQVLETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVS 1074 L +L TAGI +T +W+ + F +D+ +S AL+ IC +++G+DL SGLFS QS+ Sbjct 1025 ALEPILATAGIVLTGCQWSELFPQFADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIP 1084 Query 1075 LFYEN-------NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQP 1127 L Y HWDN PG R YG++H +A + + RFP + G QL++ + + Sbjct 1085 LTYHPADSARPVAHWDNSPGTRKYGYDHAIAAELSRRFPVFQL-AGKGTQLDLQTGRTRV 1143 Query 1128 FSAECNIVPSNRRLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWI 1187 SA+ N+VP NR LPHALV Y++ + V+ L + H +L+VSE + P KR+ WI Sbjct 1144 ISAQHNLVPVNRNLPHALVPEYKEKQPGPVKKFLNQFKHHSVLVVSEEKIEAPRKRIEWI 1203 Query 1188 
APPRVSGADRTYDLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLH 1247 AP ++GAD+ Y+L G P A RYDLVF+NI T+YR HH+QQC DH+ L+ L +L+ Sbjct 1204 APIGIAGADKNYNLAFGFPPQA-RYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSALN 1262 Query 1248 LLRPGGSLLMRAYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGR- 1306 L PGG+L++++YGYADR SE VVTALARKF RP CV+SNTE++L+F DN R Sbjct 1263 CLNPGGTLVVKSYGYADRNSEDVVTALARKFVRVSAARPDCVSSNTEMYLIFRQLDNSRT 1322 Query 1307 RAVTLHQANQKLSSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVC 1366 R T H N +SS+Y G APSYR +R +I+ EEAVVNAAN G +GVC Sbjct 1323 RQFTPHHLNCVISSVYE-GTRDGVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGVC 1381 Query 1367 RAVAKKWPSSFKGAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVAS 1426 RA+ K+WP+SF +AT GTA+M G VIHAVGP+F EAE + L AY AVA Sbjct 1382 RAIYKRWPTSFTDSATETGTARMTVCLGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVAD 1441 Query 1427 IISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQE 1486 +++ +NIKSVA+PLLSTG ++ GKDR+ SLN L TALD TDADV IYC DK W+++I Sbjct 1442 LVNEHNIKSVAIPLLSTGIYAAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERIDA 1501 Query 1487 AIDRRTAI-ELVSEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDM 1545 A+ + ++ EL ED+ ++ +LV +HPDSCL GR G+S T GKLYSY EGT+FHQ A DM Sbjct 1502 ALQLKESVTELKDEDMEIDDELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDM 1561 Query 1546 AEISTLWPRLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERV 1605 AEI L+P Q++NEQ+C Y LGETM++IR KCPV+ SS+PPKT+PCLC YAMT ERV Sbjct 1562 AEIKVLFPNDQESNEQLCAYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERV 1621 Query 1606 ARLRMNNTKNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQP----- 1660 RLR NN K + VCSS PLPK++I+ VQKV+C +V++F+ P+ V RKYI+ P Sbjct 1622 HRLRSNNVKEVTVCSSTPLPKHKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPTA 1681 Query 1661 ----PEQLDNVSLTSTTSTGSAWSLPSETTYETMEVVAEVHTEppippprrrrAAVAQLR 1716 E+ V T + ST SL M+ +E + Sbjct 1682 PPAQAEEAPEVVATPSPSTADNTSLDVTDISLDMDDSSEG------------SLFSSFSG 1729 Query 1717 QDLEVTEEIEPYVIQQAEIMVMER--VATTDIRAIPVPARRAITMPVPAPRVRKV 1769 D +T ++ + + + +++R V D+ A+ PA P+P PR++K+ Sbjct 1730 
SDNSIT-SMDSWSSGPSSLEIVDRRQVVVADVHAVQEPA------PIPPPRLKKM 1777 Range 2: 1898 to 1943 Score:49.7 bits(117), Expect:1e-05, Method:Compositional matrix adjust., Identities:24/46(52%), Positives:31/46(67%), Gaps:0/46(0%) Query 1851 LGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 L G YIFS+DTGPGHLQ++SV Q++L TL + ERI+ P Sbjct 1898 LTGVGGYIFSTDTGPGHLQKKSVLQNQLTEPTLERNVLERIHAPVL 1943 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain CPA201)] Sequence ID: Q8V294.3 Length: 2497 Range 1: 3 to 1716 Score:2100 bits(5441), Expect:0.0, Method:Compositional matrix adjust., Identities:1023/1721(59%), Positives:1273/1721(73%), Gaps:32/1721(1%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD+E DSPFL+ALQ++FP FEVE++QVT NDHANARAFSHLA+KLIE EV TIL Sbjct 3 KVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVEPSDTIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAPARR+ S H YHCICPMK AEDP+RL YA KL K + DK + K+ +L +V Sbjct 63 DIGSAPARRMYSKHKYHCICPMKCAEDPDRLFKYAAKLKKNCKDITDKELDKKMKELAEV 122 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+ PDLE+ T CLH DETCR +VAVYQDVYAV PTSLYHQA KGVR AYWIGFDTTP Sbjct 123 MSDPDLETETICLHDDETCRFEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTP 182 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FMF+ LAGAYP+YSTNWADE VL ARNIGLC++ + E R +SI+RKK L+PS+ V+FS Sbjct 
183 FMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKFLKPSNNVLFS 242 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+Y E R LLRSWHLPSVFHL+GK ++TCRC+T+VSC+GYVVK+I ISPG+YGK Sbjct 243 VGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSG 302 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 YA T H EGFL CK+TDT+ GERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 303 YAATMHREGFLCCKVTDTLDGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGL 362 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WA+E + D EDE+PLG R+R L C WA Sbjct 363 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWA 422 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQ-GLP 480 F+ HKI ++YKRP+TQTI+KV S F SFV+P + S++L +G+R RI+ LL + + L Sbjct 423 FRKHKITSVYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKLLEEPVDRPPLI 482 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D EA+ A +E KE +EAE RAALPPL S + + DV+ + AGAG VETP Sbjct 483 TADDIQEAKNAADEAKEVKEAEELRAALPPL-SADVEEPALEADVDLMLQEAGAGSVETP 541 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT A + IGSY +LSPQ VL+SEKL IHPLAEQV V+THSGR GRY V+ Y Sbjct 542 RGLIKVTSYAGEDKIGSYAVLSPQAVLRSEKLTCIHPLAEQVIVITHSGRKGRYAVEPYH 601 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G AIPV +FQALSESAT+VYNEREF+NR LHHIA +G ALNTDEE Y V+ Sbjct 602 GKVVVPEGQAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYRVVKPS 661 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 E EY++D+DKK C+KKE SGL LTG+L++PPFHEFAYE L+ RPAAPY IGV+G Sbjct 662 EHEGEYLYDIDKKQCVKKELVSGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYG 721 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DVK+ +GLDV ARTVDS+LLNGCK V Sbjct 722 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHPV 781 Query 781 
ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 E LY+DEAFACH+GTL ALIA++RP K VLCGDPKQCGFFN+M LKVH+NH ICT+V H Sbjct 782 ETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFH 840 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT++VSTL Y +MRTTN ++ I+IDTTGS+KP D++LTCFRGWVKQ Sbjct 841 KSISRRCTKSVTSVVSTLFYDKRMRTTNPRDSKIEIDTTGSTKPKKDDLILTCFRGWVKQ 900 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLY+P SEHVNVLLTRTE+++VWKTL Sbjct 901 LQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDKIVWKTL 960 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT GDF+AT+EEWQ EHD IMR + E+P D FQNKA VCWAK LV V Sbjct 961 AGDPWIKTLTAKYPGDFTATMEEWQAEHDAIMRHILEKPDPTDVFQNKANVCWAKALVPV 1020 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L+TAGI +T ++WNT+ F+ED+A+S E+ LN++C R++G+DLDSGLFSA +V L NN Sbjct 1021 LKTAGIDLTTEQWNTVDYFKEDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIRNN 1080 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN P MYG N EV R+ + R+P L + +G ++ L+ + N+VP NRR Sbjct 1081 HWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVTTGRAYDMNTGTLRNYDPRINLVPVNRR 1140 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY- 1199 LPHALVT + + K+ G +L+V E ++I K V W++ D T+ Sbjct 1141 LPHALVTQHADYPPSDFSAFVSKLKGRTVLVVGE-KMSISGKTVDWLS----ETPDSTFR 1195 Query 1200 -DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMR 1258 LDLG+P + +YD+VFVN+ T+YR HHYQQC DH+++L ML + L PGG+ + Sbjct 1196 ARLDLGIPSELPKYDIVFVNVRTQYRYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSI 1255 Query 1259 AYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKL 1318 YGYADR SE ++ A+AR+F RV +P TEV +F FD R ++ + L Sbjct 1256 GYGYADRASESIIGAVARQFKFSRVCKPKVSKEETEVLFVFIGFDRKTRTHNPYKLSSTL 1315 Query 1319 SSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 +++Y + LH AGCAPSY V R DI+ E +VNAAN+KG GVC A+ +K+P SF Sbjct 1316 
TNIYTGSRLHEAGCAPSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPESFD 1375 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 VG A++++ + +IHAVGPNF+ V+E EGD++LA AY ++A II+ NN +SVA+ Sbjct 1376 LQPIEVGKARLVKGNSKHLIHAVGPNFNKVSEVEGDKQLAEAYESIARIINDNNYRSVAI 1435 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIE--L 1496 PLLSTG F+G KDR+MQSLNHL TALD TDADV IYCRDK WE ++E + RR A+E Sbjct 1436 PLLSTGIFAGNKDRLMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVEEIC 1495 Query 1497 VSEDVTL---ETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWP 1553 +SED ++ + +LVRVHP S L GR GYS +DGK +SYLEGT+FHQ A DMAEI+ +WP Sbjct 1496 ISEDSSVAEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINAMWP 1555 Query 1554 RLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNT 1613 +ANEQ+CLY LGE+M SIR+KCPVE++++STPP T+PCLC +AMT ERV RL+ + Sbjct 1556 AATEANEQVCLYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRP 1615 Query 1614 KNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQ--------------- 1658 + I VCSSFPLPKYRI GVQK++C ++F VP + PRKY+ Sbjct 1616 EQITVCSSFPLPKYRITGVQKIQCSHPILFSPKVPEYIHPRKYLADAASANNEAAELTSV 1675 Query 1659 --QPPEQLDNVSLTSTTSTGSAWSLPSETTYETMEVVAEVH 1697 QP + + + S+ SE ++ +V AEVH Sbjct 1676 DVQPQLEESPENTEQLVEEEDSISVLSEAPHQVHQVEAEVH 1716 Range 2: 1888 to 1928 Score:47.8 bits(112), Expect:5e-05, Method:Compositional matrix adjust., Identities:26/41(63%), Positives:27/41(65%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSSDTG GHLQQ+SVRQ L L E E Y P Sbjct 1888 AGAYIFSSDTGQGHLQQKSVRQTVLSEVVLERTELEISYAP 1928 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; 
Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-0.0155)] Sequence ID: Q306W6.3 Length: 2471 Range 1: 3 to 1651 Score:2073 bits(5371), Expect:0.0, Method:Compositional matrix adjust., Identities:1008/1659(61%), Positives:1241/1659(74%), Gaps:14/1659(0%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD++ADSP++K+LQK FP FE+E+ QVT NDHANARAFSHLATKLIE EV IL Sbjct 3 KVHVDLDADSPYVKSLQKCFPHFEIEATQVTDNDHANARAFSHLATKLIESEVDPDQVIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAP R S H YHCICPM SAEDP+RL YA KL K+ V D+ ++ K DL V Sbjct 63 DIGSAPVRHTHSKHKYHCICPMISAEDPDRLHRYADKLRKSD--VTDRFIASKAADLLTV 120 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+TPD+E+P+ C+HTD TCR VAVYQDVYAVHAPTS+YHQA+KGVRT YWIGFDTTP Sbjct 121 MSTPDVETPSLCMHTDSTCRYHGTVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDTTP 180 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FM++ +AGAYP Y+TNWADE VL+ARNIGLC++ L E R GK+SIMRKK L+P+++V+FS Sbjct 181 FMYKNMAGAYPTYNTNWADESVLEARNIGLCSSDLHEQRFGKISIMRKKKLQPTNKVVFS 240 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+YTE R LLRSWHLP+VFHLKGK SFT RC+T+VSCEGYVVKKITISPGIYGK + Sbjct 241 VGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCEGYVVKKITISPGIYGKVDN 300 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 A T H EGFL CK+TDT+RGERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 301 LASTMHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLVGL 360 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTM NYLLP+VAQAFS+WARE AD+EDEK LG RER+L C WA Sbjct 361 NQRIVVNGRTQRNTNTMPNYLLPIVAQAFSRWAREYHADLEDEKDLGVRERSLVMGCCWA 420 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQG-LP 480 FK+HKI ++YK+P TQT KVP+ F+SFV+P L S L + +R+RIK+LL + + 
Sbjct 421 FKTHKITSIYKKPGTQTTKKVPAVFNSFVVPQLTSYGLDIELRRRIKMLLEEKKKPAPII 480 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 D + +EE + EAE RAALPPL+ + + D++ + AGAG VETP Sbjct 481 TEADVAHLKGMQEEAEVVAEAEAIRAALPPLLP-EVERETVEADIDLIMQEAGAGSVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT + +IGSY +LSPQ VL SEKLA IHPLAEQV VMTH GR+GRY V+ Y Sbjct 540 RRHIKVTTYPGEEMIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPYH 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRV++P+G AIP+ +FQALSESAT+VYNEREF+NR LHHIA+ G ALNTDEE Y+ +R+ Sbjct 600 GRVIVPSGTAIPIPDFQALSESATIVYNEREFVNRYLHHIAINGGALNTDEEYYKVLRSG 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 AE+EYVFD+D K C+KK EA + L GDL++PPFHEFAYE LK RPAAP+ IGV+G Sbjct 660 EAESEYVFDIDAKKCVKKAEAGPMCLVGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVYG 719 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT RDLV S KKENC EI+ DVKR RG+DV ARTVDS+LLNG K V Sbjct 720 VPGSGKSGIIKSAVTKRDLVVSAKKENCTEIIKDVKRMRGMDVAARTVDSVLLNGVKHPV 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LY+DEAFACH+GTLLALIA+V+P KVVLCGDPKQCGFFN+M LKVH+NH ICT V H Sbjct 780 DTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVYH 838 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VTAIVSTL Y +MRT N C+ I IDTT ++KP DI+LTCFRGWVKQ Sbjct 839 KSISRRCTRTVTAIVSTLFYDKRMRTVNPCSDKIIIDTTSTTKPLKDDIILTCFRGWVKQ 898 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLY+ SEHVNVLLTRTE R+VWKTL Sbjct 899 LQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKTL 958 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT G+FSATLEEWQ EHD IM+ + E PA D +QNK VCWAK L V Sbjct 959 AGDPWIKTLTAHYPGEFSATLEEWQAEHDAIMKRVLETPANSDVYQNKVHVCWAKALEPV 1018 Query 1021 
LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L TA I +T +W TI AF++D+A+SPE+ALN +CTR++GVD+DSGLFSA +V L Y N Sbjct 1019 LATANITLTRSQWETIPAFKDDKAFSPEMALNFLCTRFFGVDIDSGLFSAPTVPLTYTNE 1078 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN PG YG A++ A R+P + +++G +V ++ ++ N+VP NRR Sbjct 1079 HWDNSPGPNRYGLCMRTAKELARRYPCILKAVDTGRVADVRTNTIRDYNPMINVVPLNRR 1138 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY- 1199 LPH+LV S++ LL K+ G +L++ + +P KRV + P G TY Sbjct 1139 LPHSLVVSHRYTGDGNYSQLLSKLTGKTILVIGT-PINVPGKRVETLGP----GPQCTYK 1193 Query 1200 -DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMR 1258 DLDLG+P G+YD++FVN+ T Y+ HHYQQC DH++ ML ++ L GG+ + Sbjct 1194 ADLDLGIPSMIGKYDIIFVNVRTPYKHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCVAL 1253 Query 1259 AYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKL 1318 YG ADR +E +++A+AR F RV +P C NTEV +F DNG Q + L Sbjct 1254 GYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLRDQDQLSVVL 1313 Query 1319 SSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 +++Y + + AG AP+YRV R DIS +E +VNAAN KG GVC A+ KKWP +F Sbjct 1314 NNIYQGSTQYEAGRAPAYRVIRGDISKSTDEVIVNAANNKGQPGAGVCGALYKKWPGAFD 1373 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 A GTA +++ +IHAVGPNFS ++E EG+++L+ Y +A II+ V++ Sbjct 1374 KAPIATGTAHLVKHTP-NIIHAVGPNFSRMSEVEGNQKLSEVYMDIAKIINKERYNKVSI 1432 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAI-ELV 1497 PLLSTG ++GGKDRVMQSLNHLFTA+D TDADV IYC DK WE +I++AI R+ ++ ELV Sbjct 1433 PLLSTGVYAGGKDRVMQSLNHLFTAMDTTDADVTIYCLDKQWETRIKDAIARKESVEELV 1492 Query 1498 SEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQD 1557 +D ++ +LVRVHP S LVGR GYS +GK++SYLEGTRFHQTA D+AEI +WP Q+ Sbjct 1493 EDDKPVDIELVRVHPQSSLVGRPGYSTNEGKVHSYLEGTRFHQTAKDIAEIYAMWPNKQE 1552 Query 1558 ANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNII 1617 ANEQICLY LGE+M SIR+KCPVE++++S+PP T+PCLC YAMTAERV RLRM + Sbjct 1553 
ANEQICLYVLGESMTSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFA 1612 Query 1618 VCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKY 1656 VCSSF LPKYRI GVQK++C++ +IF VP + PRK+ Sbjct 1613 VCSSFQLPKYRITGVQKIQCNKPVIFSGVVPPAIHPRKF 1651 Range 2: 1861 to 1901 Score:47.0 bits(110), Expect:9e-05, Method:Compositional matrix adjust., Identities:23/41(56%), Positives:27/41(65%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSS+TG GHLQQ+S RQ +L L E+ Y P Sbjct 1861 AGAYIFSSETGQGHLQQKSTRQCKLQHPILERSVHEKFYAP 1901 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-3.0815)] Sequence ID: Q306W8.3 Length: 2474 Range 1: 3 to 1651 Score:2065 bits(5351), Expect:0.0, Method:Compositional matrix adjust., Identities:1010/1659(61%), Positives:1244/1659(74%), Gaps:14/1659(0%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD++ADSP++K LQK FP FE+E+ QVT NDHANARAFSHLATKLIE EV IL Sbjct 3 KVHVDLDADSPYVKLLQKCFPHFEIEATQVTDNDHANARAFSHLATKLIESEVDPDQVIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAP R S H YHCICPM SAEDP+RL YA KL K+ V D+ ++ K DL V Sbjct 63 DIGSAPVRHTHSKHKYHCICPMISAEDPDRLQRYADKLRKSD--VTDRFIASKAADLLTV 120 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+TPD+E+P+ C+HTD TCR VAVYQDVYAVHAPTS+YHQA+KGVRT YWIGFDTTP Sbjct 121 MSTPDVETPSLCMHTDSTCRYHGTVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDTTP 180 
Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FM++ +AGAYP Y+TNWADE VL+ARNIGLC++ L E R GK+SIMRKK L+P+++V+FS Sbjct 181 FMYKNMAGAYPTYNTNWADESVLEARNIGLCSSDLHEKRLGKISIMRKKKLQPTNKVVFS 240 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+YTE R LLRSWHLP+VFHLKGK SFT RC+T+VSC+GYVVKKITISPGIYGK + Sbjct 241 VGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCDGYVVKKITISPGIYGKVDN 300 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 A T H EGFL CK+TDT+RGERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 301 LASTLHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLVGL 360 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WARE RAD+EDEK LG RER+L C WA Sbjct 361 NQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREYRADLEDEKDLGVRERSLVMGCCWA 420 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLL-SARMAQGLP 480 FK+HKI ++YK+P TQTI KVP+ F+SFVIP S L++G+R+RIK+LL R + Sbjct 421 FKTHKITSIYKKPGTQTIKKVPAVFNSFVIPQFNSYGLNIGLRRRIKMLLEEKRKPAPII 480 Query 481 YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 D + +EE + EAE RAALPPL+ + I + D++ + AGAG VETP Sbjct 481 TEADVAHLKGMQEEAEAVAEAEAVRAALPPLLPEVERETI-EADIDLIMQEAGAGSVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT + IGSY +LSPQ VL SEKLA IHPLAEQV VMTH GR+GRY V+ Y Sbjct 540 RRHIKVTTYPGEETIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPYH 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRV++P+G AIP+ +FQALSESAT+VYNEREF+NR LHHIA+ G A+NTDEE Y+ +R+ Sbjct 600 GRVVVPSGTAIPIPDFQALSESATIVYNEREFVNRYLHHIAINGGAINTDEEYYKVLRSS 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 A++EYVFD+D + C+KK +A + L G+L++PPFHEFAYE LK RPAAP+ IGV+G Sbjct 660 EADSEYVFDIDARKCVKKADAGPMCLVGELVDPPFHEFAYESLKTRPAAPHKVPTIGVYG 719 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT RDLV S KKENC EI+ DVKR RG+D+ ARTVDS+LLNG K V Sbjct 
720 VPGSGKSGIIKSAVTKRDLVVSAKKENCTEIIKDVKRMRGMDIAARTVDSVLLNGVKHPV 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LY+DEAFACH+GTLLALIA+V+P KVVLCGDPKQCGFFN+M LKVH+NH ICT V H Sbjct 780 DTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVYH 838 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VTAIVSTL Y +MRT N C+ I IDTT ++KP DI+LTCFRGWVKQ Sbjct 839 KSISRRCTKTVTAIVSTLFYDKRMRTVNPCSDKIIIDTTSTTKPQRDDIILTCFRGWVKQ 898 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLY+ SEHVNVLLTRTE R+VWKTL Sbjct 899 LQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKTL 958 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT G+FSATLEEWQ EHD IM + E PA D +QNK VCWAK L V Sbjct 959 AGDPWIKTLTAHYPGEFSATLEEWQAEHDAIMERILETPASSDVYQNKVHVCWAKALEPV 1018 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L TA I +T +W TI AF++D+A+SPE+ALN +CTR++GVD+DSGLFSA +V L Y N Sbjct 1019 LATANITLTRSQWETIPAFKDDKAFSPEMALNFLCTRFFGVDIDSGLFSAPTVPLTYTNE 1078 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN PG YG A++ A R+P + +++G +V ++ +S N+VP NRR Sbjct 1079 HWDNSPGPNRYGLCMRTAKELARRYPCILKAVDTGRLADVRTNTIKDYSPLINVVPLNRR 1138 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY- 1199 LPH+LV S++ LL K+ G +L++ +++P KRV + P G TY Sbjct 1139 LPHSLVVSHRYTGDGNYSQLLSKLIGKTVLVIGT-PISVPGKRVETLGP----GPQCTYK 1193 Query 1200 -DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMR 1258 DLDLG+P G+YD++FVN+ T Y+ HHYQQC DH++ ML ++ L GG+ + Sbjct 1194 ADLDLGIPSTIGKYDIIFVNVRTPYKHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCVAL 1253 Query 1259 AYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKL 1318 YG ADR +E +++A+AR F RV +P C NTEV +F DNG Q + L Sbjct 1254 GYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLRDQDQLSIVL 1313 Query 1319 SSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 +++Y + + 
AG AP+YRV R DIS +EA+VNAAN KG GVC A+ KKWP +F Sbjct 1314 NNIYQGSTQYEAGRAPAYRVIRGDISKSTDEAIVNAANNKGQPGAGVCGALYKKWPGAFD 1373 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 GTA +++ +IHAVGPNFS V+E EG+++L+ Y +A II+ V++ Sbjct 1374 KVPIATGTAHLVKHTP-NIIHAVGPNFSRVSEVEGNQKLSEVYMDIAKIINRERYNKVSI 1432 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAI-ELV 1497 PLLSTG ++GGKDRVMQSLNHLFTA+D TDADV IYC DK WE +I++AI R+ ++ ELV Sbjct 1433 PLLSTGIYAGGKDRVMQSLNHLFTAMDTTDADVTIYCLDKQWEARIKDAIARKESVEELV 1492 Query 1498 SEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQD 1557 +D ++ +LVRVHP S LVGR GYS +GK++SYLEGTRFHQTA D+AEI +WP Q+ Sbjct 1493 EDDKPVDIELVRVHPLSSLVGRPGYSTDEGKVHSYLEGTRFHQTAKDIAEIYAMWPNKQE 1552 Query 1558 ANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNII 1617 ANEQICLY LGE+M SIR+KCPVED+++S+PP T+PCLC YAMTAERV RLRM + Sbjct 1553 ANEQICLYVLGESMTSIRSKCPVEDSEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFA 1612 Query 1618 VCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKY 1656 VCSSF LPKYRI GVQK++C++ +IF VP + PRK+ Sbjct 1613 VCSSFQLPKYRITGVQKIQCNKPVIFSGVVPPAIHPRKF 1651 Range 2: 1864 to 1904 Score:47.0 bits(110), Expect:8e-05, Method:Compositional matrix adjust., Identities:23/41(56%), Positives:27/41(65%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSS+TG GHLQQ+S RQ +L L E+ Y P Sbjct 1864 AGAYIFSSETGQGHLQQKSTRQCKLQNPILERSVHEKFYAP 1904 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: 
Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain Florida 91-469)] Sequence ID: Q4QXJ8.3 Length: 2494 Range 1: 3 to 1755 Score:2065 bits(5350), Expect:0.0, Method:Compositional matrix adjust., Identities:1027/1763(58%), Positives:1264/1763(71%), Gaps:77/1763(4%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 KV VD++ADSPF+K+LQ+ FP FE+E+ QVT NDHANARAFSHLATKLIE EV T IL Sbjct 3 KVHVDLDADSPFVKSLQRCFPHFEIEATQVTDNDHANARAFSHLATKLIEGEVDTDQVIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAP R S H YHCICPMKSAEDP+RL YA KL K+ V DK ++ K DL V Sbjct 63 DIGSAPVRHTHSKHKYHCICPMKSAEDPDRLYRYADKLRKSD--VTDKCIASKAADLLTV 120 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 181 M+TPD E+P+ C+HTD TCR VAVYQDVYAVHAPTS+Y+QA+KGVRT YWIGFDTTP Sbjct 121 MSTPDAETPSLCMHTDSTCRYHGSVAVYQDVYAVHAPTSIYYQALKGVRTIYWIGFDTTP 180 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FM++ +AGAYP Y+TNWADE VL+ARNIGL ++ L E GK+SIMRKK L+P+++V+FS Sbjct 181 FMYKNMAGAYPTYNTNWADESVLEARNIGLGSSDLHEKSFGKVSIMRKKKLQPTNKVIFS 240 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+YTE R LLRSWHLP+VFHLKGK SFT RC+T+VSCEGYVVKKIT+SPGIYGK + Sbjct 241 VGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCEGYVVKKITLSPGIYGKVDN 300 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 A T H EGFL CK+TDT+RGERVSFPVCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 301 LASTMHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLVGL 360 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WARE RAD+EDEK LG RER+L C WA Sbjct 361 NQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREHRADLEDEKGLGVRERSLVMGCCWA 420 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQG-LP 480 FK+HKI ++YKRP TQTI KVP+ F+SFVIP S L +G+R+RIK+L A+ A + Sbjct 421 FKTHKITSIYKRPGTQTIKKVPAVFNSFVIPQPTSYGLDIGLRRRIKMLFDAKKAPAPII 480 Query 481 
YSGDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 D + ++E + EAE RAALPPL+ + + D++ + AGAG VETP Sbjct 481 TEADVAHLKGLQDEAEAVAEAEAVRAALPPLLP-EVDKETVEADIDLIMQEAGAGSVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R +KVT + +IGSY +LSPQ VL SEKLA IHPLAEQV VMTH GR+GRY V+ Y Sbjct 540 RRHIKVTTYPGEEMIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPYH 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 GRV++P+G AIP+ +FQALSESAT+V+NEREF+NR LHHIA+ G ALNTDEE Y+ V++ Sbjct 600 GRVIVPSGTAIPILDFQALSESATIVFNEREFVNRYLHHIAVNGGALNTDEEYYKVVKST 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 ++EYVFD+D K C+KK +A + L G+L++PPFHEFAYE LK RPAAP+ IGV+G Sbjct 660 ETDSEYVFDIDAKKCVKKGDAGPMCLVGELVDPPFHEFAYESLKTRPAAPHKVPTIGVYG 719 Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT RDLV S KKENC EI+ DVKR RG+D+ ARTVDS+LLNG K V Sbjct 720 VPGSGKSGIIKSAVTKRDLVVSAKKENCMEIIKDVKRMRGMDIAARTVDSVLLNGVKHSV 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 + LY+DEAFACH+GTLLALIA+V+P KVVLCGDPKQCGFFN+M LKVH+NH ICT V H Sbjct 780 DTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVYH 838 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VT+IVSTL Y +MRT N CN I IDTT ++KP DI+LTCFRGWVKQ Sbjct 839 KSISRRCTKTVTSIVSTLFYDKRMRTVNPCNDKIIIDTTSTTKPLKDDIILTCFRGWVKQ 898 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLY+ SEHVNVLLTRTE R+VWKTL Sbjct 899 LQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKTL 958 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT G+F+ATLEEWQ EHD IM + E PA D FQNK VCWAK L V Sbjct 959 AGDPWIKTLTASYPGNFTATLEEWQAEHDAIMAKILETPASSDVFQNKVNVCWAKALEPV 1018 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L TA I +T +W TI AF++D+AYSPE+ALN CTR++GVD+DSGLFSA +V L Y N Sbjct 1019 
LATANITLTRSQWETIPAFKDDKAYSPEMALNFFCTRFFGVDIDSGLFSAPTVPLTYTNE 1078 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN PG MYG A++ A R+P + +++G +V ++ ++ N+VP NRR Sbjct 1079 HWDNSPGPNMYGLCMRTAKELARRYPCILKAVDTGRVADVRTDTIKDYNPLINVVPLNRR 1138 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY- 1199 LPH+LV +++ L+ K+ G +L+V + IP KRV + P TY Sbjct 1139 LPHSLVVTHRYTGNGDYSQLVTKMTGKTVLVVGT-PMNIPGKRVETLGP----SPQCTYK 1193 Query 1200 -DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMR 1258 +LDLG+P G+YD++F+N+ T YR HHYQQC DH++ ML ++ L GG+ + Sbjct 1194 AELDLGIPAALGKYDIIFINVRTPYRHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCIAL 1253 Query 1259 AYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKL 1318 YG ADR +E +++A+AR F RV +P C NTEV +F DNG + + L Sbjct 1254 GYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLQDQDRLSVVL 1313 Query 1319 SSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFK 1378 +++Y + H AG AP+YRV R DI+ +E +VNAAN KG GVC A+ +KWP +F Sbjct 1314 NNIYQGSTQHEAGRAPAYRVVRGDITKSNDEVIVNAANNKGQPGSGVCGALYRKWPGAFD 1373 Query 1379 GAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAV 1438 G A +++ VIHAVGPNFS ++E EGD++L+ Y +A II+ V++ Sbjct 1374 KQPVATGKAHLVK-HSPNVIHAVGPNFSRLSENEGDQKLSEVYMDIARIINNERFTKVSI 1432 Query 1439 PLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAI-ELV 1497 PLLSTG ++GGKDRVMQSLNHLFTA+D TDAD+ IYC DK WE +I+EAI R+ ++ EL Sbjct 1433 PLLSTGIYAGGKDRVMQSLNHLFTAMDTTDADITIYCLDKQWESRIKEAITRKESVEELT 1492 Query 1498 SEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQD 1557 +D ++ +LVRVHP S L GR GYS T+GK+YSYLEGTRFHQTA D+AEI +WP Q+ Sbjct 1493 EDDRPVDIELVRVHPLSSLAGRPGYSTTEGKVYSYLEGTRFHQTAKDIAEIYAMWPNKQE 1552 Query 1558 ANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNII 1617 ANEQICLY LGE+M+SIR+KCPVE++++S+PP T+PCLC YAMTAERV RLRM + Sbjct 1553 ANEQICLYVLGESMNSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQFA 1612 Query 1618 VCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKY--------------------- 1656 VCSSF LPKYRI 
GVQK++C + +IF TVP + PRK+ Sbjct 1613 VCSSFQLPKYRITGVQKIQCSKPVIFSGTVPPAIHPRKFASVTVEDTPVVQPERLVPRRP 1672 Query 1657 ---------IQQPPEQLDNVSLTSTTSTG------------------SAWSLPSET---- 1685 I PP N S TS S G S WS+PS T Sbjct 1673 APPVPVPARIPSPPCTSTNGSTTSIQSLGEDQSASASSGAEISVDQVSLWSIPSATGFDV 1732 Query 1686 -----------TYETMEVVAEVH 1697 T+ TM V AE+H Sbjct 1733 RTSSSLSLEQPTFPTMVVEAEIH 1755 Range 2: 1884 to 1924 Score:47.0 bits(110), Expect:8e-05, Method:Compositional matrix adjust., Identities:23/41(56%), Positives:27/41(65%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSS+TG GHLQQ+S RQ +L L E+ Y P Sbjct 1884 AGAYIFSSETGQGHLQQKSTRQCKLQYPILERSVHEKFYAP 1924 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Western equine encephalitis virus] Sequence ID: P13896.3 Length: 2467 Range 1: 3 to 1652 Score:2041 bits(5288), Expect:0.0, Method:Compositional matrix adjust., Identities:994/1658(60%), Positives:1243/1658(74%), Gaps:9/1658(0%) Query 2 KVTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTIL 61 ++ VD++ADSP++K+LQ+ FP FE+E++QVT NDHANARAFSH+ATKLIE EV IL Sbjct 3 RIHVDLDADSPYVKSLQRTFPQFEIEARQVTDNDHANARAFSHVATKLIESEVDRDQVIL 62 Query 62 DVGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDV 121 D+GSAP R S+H YHCICPM SAEDP+RL YA +L K+ + DKN++ K DL +V Sbjct 63 DIGSAPVRHAHSNHRYHCICPMISAEDPDRLQRYAERLKKSD--ITDKNIASKAADLLEV 120 Query 122 MATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTP 
181 M+TPD E+P+ C+HTD TCR VAVYQDVYAVHAPTS+YHQA+KGVRT YWIGFDTTP Sbjct 121 MSTPDAETPSLCMHTDATCRYFGSVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDTTP 180 Query 182 FMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFS 241 FM++ +AG+YP Y+TNWADE+VL+ARNIGL + L E R GKLSI+RKK L+P+++++FS Sbjct 181 FMYKNMAGSYPTYNTNWADERVLEARNIGLGNSDLQESRLGKLSILRKKRLQPTNKIIFS 240 Query 242 VGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVD 301 VGST+YTE R LLRSWHLP+VFHLKGK++FT RC T+VSCEGYV+KKITISPG+YGK + Sbjct 241 VGSTIYTEDRSLLRSWHLPNVFHLKGKSNFTGRCGTIVSCEGYVIKKITISPGLYGKVEN 300 Query 302 YAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGL 361 A T H EGFL CK+TDT+RGERVSF VCTYVPAT+CDQMTGILATDV+ +DAQKLLVGL Sbjct 301 LASTMHREGFLSCKVTDTLRGERVSFAVCTYVPATLCDQMTGILATDVSVDDAQKLLVGL 360 Query 362 NQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWA 421 NQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WARE RAD++DEK LG RERTLT C WA Sbjct 361 NQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREHRADLDDEKELGVRERTLTMGCCWA 420 Query 422 FKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPY 481 FK+ KI ++YK+P TQTI KVP+ FDSFVIP L S L MG R+R+KLLL + Sbjct 421 FKTQKITSIYKKPGTQTIKKVPAVFDSFVIPRLTSHGLDMGFRRRLKLLLEPTVKPAPAI 480 Query 482 S-GDRTearaaeeeekeaqeaeLTRAALPPLVSGSCADDIAQVDVEELTFRAGAGVVETP 540 + D R ++E +E AE R ALPPL+ + + +V+ + AGAG VETP Sbjct 481 TMADVEHLRGLQQEAEEVAAAEEIREALPPLLP-EIEKETVEAEVDLIMQEAGAGSVETP 539 Query 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 R ++VT + IGSY ILSPQ VL SEKLA IHPLAEQV VMTH GR+GRY V+ Y Sbjct 540 RGHIRVTSYPGEEKIGSYAILSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPYH 599 Query 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 G+V++P G A+PV +FQALSESAT+V+NEREF+NR LHHIA+ G ALNTDEE Y+ V+ + Sbjct 600 GKVIVPEGTAVPVQDFQALSESATIVFNEREFVNRYLHHIAINGGALNTDEEYYKTVKTQ 659 Query 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 ++EYVFD+D + C+K+E+A L LTGDL++PPFHEFAYE LK RPAAP+ IGV+G Sbjct 660 DTDSEYVFDIDARKCVKREDAGPLCLTGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVYG 719 
Query 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 VPGSGKS IIK+ VT +DLV S KKENC EI+ DV+R R +DV ARTVDS+LLNG K V Sbjct 720 VPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVRRMRRMDVAARTVDSVLLNGVKHPV 779 Query 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 LY+DEAFACH+GTLLALIA+V+P KVVLCGDPKQCGFFN+M LKVH+NH+ICT V H Sbjct 780 NTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHDICTEVYH 838 Query 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 KSISRRCT VTAIVSTL Y +M+T N C I IDTTG++KP D++LTCFRGWVKQ Sbjct 839 KSISRRCTQTVTAIVSTLFYDKRMKTVNPCADKIIIDTTGTTKPHKDDLILTCFRGWVKQ 898 Query 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 LQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLYS SEHVNVLLTRTE R+VWKTL Sbjct 899 LQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYSQTSEHVNVLLTRTEKRIVWKTL 958 Query 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 +GDPWIK LT GDF+A+L++WQ EHD IM + ++P D FQNK VCWAK L V Sbjct 959 AGDPWIKTLTAKYPGDFTASLDDWQREHDAIMARVLDKPQTADVFQNKVNVCWAKALEPV 1018 Query 1021 LETAGIRMTADEWNTILAFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYENN 1080 L TA I +T +W T+ F+ DRAYSPE+ALN CTR++GVDLDSGLFSA +V+L Y + Sbjct 1019 LATANIVLTRQQWETLHPFKHDRAYSPEMALNFFCTRFFGVDLDSGLFSAPTVALTYRDQ 1078 Query 1081 HWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNRR 1140 HWDN PG MYG N EVA++ + R+P + +++G ++ ++ +S N+VP NRR Sbjct 1079 HWDNSPGKNMYGLNREVAKELSRRYPCITKAVDTGRVADIRNNTIKDYSPTINVVPLNRR 1138 Query 1141 LPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTYD 1200 LPH+L+ ++ L K+ G +L++ + ++IP K+V + P + D Sbjct 1139 LPHSLIVDHKGQGTTDHSGFLSKMKGKSVLVIGD-PISIPGKKVESMGP--LPTNTIRCD 1195 Query 1201 LDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAY 1260 LDLG+P G+YD++FVN+ T YR HHYQQC DH++ ML ++H L GG+ + Y Sbjct 1196 LDLGIPSHVGKYDIIFVNVRTPYRNHHYQQCEDHAIHHSMLTCKAVHHLNTGGTCVAIGY 1255 Query 1261 GYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLSS 1320 G ADR +E ++TA+AR F RV +P NTEV +F DNG + L + Sbjct 1256 
GLADRATENIITAVARSFRFTRVCQPKNTAENTEVLFVFFGKDNGNHTHDQDRLGVVLDN 1315 Query 1321 MYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGA 1380 +Y + + AG AP+YRV R DIS ++A+VNAAN+KG GVC A+ +KWP++F Sbjct 1316 IYQGSTRYEAGRAPAYRVIRGDISKSADQAIVNAANSKGQPGSGVCGALYRKWPAAFDRQ 1375 Query 1381 ATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPL 1440 VGTA++++ + + +IHAVGPNFS + E EGD +LAAAY ++ASI++ I ++VPL Sbjct 1376 PIAVGTARLVKHEPL-IIHAVGPNFSKMPEPEGDLKLAAAYMSIASIVNAERITKISVPL 1434 Query 1441 LSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSED 1500 LSTG +SGGKDRVMQSL+HLFTA D TDADV IYC DK WE +I EAI R+ ++E++ +D Sbjct 1435 LSTGIYSGGKDRVMQSLHHLFTAFDTTDADVTIYCLDKQWETRIIEAIHRKESVEILDDD 1494 Query 1501 VTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANE 1560 ++ DLVRVHP+S L GR GYS +GKLYSYLEGTRFHQTA D+AEI +WP +ANE Sbjct 1495 KPVDIDLVRVHPNSSLAGRPGYSVNEGKLYSYLEGTRFHQTAKDIAEIHAMWPNKSEANE 1554 Query 1561 QICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCS 1620 QICLY LGE+M SIR+KCPVE++++S PP T+PCLC YAMTAERV RLR + VCS Sbjct 1555 QICLYILGESMSSIRSKCPVEESEASAPPHTLPCLCNYAMTAERVYRLRSAKKEQFAVCS 1614 Query 1621 SFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQ 1658 SF LPKYRI GVQK++C + ++F VP V PRKY + Sbjct 1615 SFLLPKYRITGVQKLQCSKPVLFSGVVPPAVHPRKYAE 1652 Range 2: 1857 to 1897 Score:47.0 bits(110), Expect:1e-04, Method:Compositional matrix adjust., Identities:24/41(59%), Positives:28/41(68%), Gaps:0/41(0%) Query 1854 AGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPP 1894 AGAYIFSS+TG GHLQQ+SVRQ +L L E+ Y P Sbjct 1857 AGAYIFSSETGQGHLQQKSVRQCKLQEPILDRAVHEKYYAP 1897 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; 
Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Aura virus] Sequence ID: Q86924.3 Length: 2499 Range 1: 6 to 1677 Score:1970 bits(5103), Expect:0.0, Method:Compositional matrix adjust., Identities:974/1681(58%), Positives:1211/1681(72%), Gaps:31/1681(1%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 V VDV+ SPF+ LQK+FP FE+ +QQVTPNDHANARAFSHLA+KLIE E+PT VTILD Sbjct 6 VHVDVDPQSPFVLQLQKSFPQFEIVAQQVTPNDHANARAFSHLASKLIEHEIPTSVTILD 65 Query 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 +GSAPARR+ S+H YHC+CPM+S EDP+RL NYA +LA +G + +K + K+ DL+ V+ Sbjct 66 IGSAPARRMYSEHKYHCVCPMRSPEDPDRLMNYASRLADKAGEITNKRLHDKLADLKSVL 125 Query 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 +PD E+ T C H D CRT AEV+V Q+VY ++AP+++YHQA+KGVR YWIGFDTT F Sbjct 126 ESPDAETGTICFHNDVICRTTAEVSVMQNVY-INAPSTIYHQALKGVRKLYWIGFDTTQF 184 Query 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 MF ++AG+YP+Y+TNWADE+VL+ARNIGLC+T L EG GKLS RKK L+P V FSV Sbjct 185 MFSSMAGSYPSYNTNWADERVLEARNIGLCSTKLREGTMGKLSTFRKKALKPGTNVYFSV 244 Query 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 GSTLY E+R L+SWHLPSVFHLKGK SFTCRCDT V+CEGYVVKKITISPGI G+ Y Sbjct 245 GSTLYPENRADLQSWHLPSVFHLKGKQSFTCRCDTAVNCEGYVVKKITISPGITGRVNRY 304 Query 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 VT+++EGFL+CKITDTV+GERVSFPVCTY+P +ICDQMTGILATD+ PEDAQKLLVGLN Sbjct 305 TVTNNSEGFLLCKITDTVKGERVSFPVCTYIPPSICDQMTGILATDIQPEDAQKLLVGLN 364 Query 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 QRIVVNG+T RNTNTM+NYLLP VA SKWA+E +AD DEKPL RER L CLWAF Sbjct 365 QRIVVNGKTNRNTNTMQNYLLPAVATGLSKWAKERKADCSDEKPLNVRERKLAFGCLWAF 424 Query 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGLPYS 482 K+ KIH+ Y+ P TQTIVKV + F +F + S+W++SL M +RQ++KLLL + + + Sbjct 425 
KTKKIHSFYRPPGTQTIVKVAAEFSAFPMSSVWTTSLPMSLRQKVKLLLVKKTNKPVVTI 484 Query 483 GDRTearaaee--eekeaqeaeLTRAALPPL--VSGSCADDIAQVDVEELTFRAGAGVVE 538 D A E E E EAE ALPPL + A+D+ + +V +L AGA +VE Sbjct 485 TDTAVKNAQEAYNEAVETAEAEEKAKALPPLKPTAPPVAEDV-KCEVTDLVDDAGAALVE 543 Query 539 TPRNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDK 598 TPR +K+ PQ D IGSY ++SP VL++++L PIH LAEQV ++TH GR+GRY V+ Sbjct 544 TPRGKIKIIPQEGDVRIGSYTVISPAAVLRNQQLEPIHELAEQVKIITHGGRTGRYSVEP 603 Query 599 YDGRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVR 658 YD +VL+PTG + F ALSESAT+VYNEREF+NRKLHHIA G A NT+EE Y+ + Sbjct 604 YDAKVLLPTGCPMSWQHFAALSESATLVYNEREFLNRKLHHIATKGAAKNTEEEQYKVCK 663 Query 659 AERAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGV 718 A+ + EYV+DVD + C+K+E A GLVL G+L NPP+HE AYEGL+ RPAAPYH +GV Sbjct 664 AKDTDHEYVYDVDARKCVKREHAQGLVLVGELTNPPYHELAYEGLRTRPAAPYHIETLGV 723 Query 719 FGVPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKR 778 G PGSGKSAIIK+ VT +DLV SGKKENC+EI NDV++ RG+ + RTVDS+LLNG K+ Sbjct 724 IGTPGSGKSAIIKSTVTLKDLVTSGKKENCKEIENDVQKMRGMTIATRTVDSVLLNGWKK 783 Query 779 GVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNH---NIC 835 V+ LYVDEAFACH+GTL+ALIA+V+P KVVLCGDPKQ FFNLMQLKV++N+ ++C Sbjct 784 AVDVLYVDEAFACHAGTLMALIAIVKPRRKVVLCGDPKQWPFFNLMQLKVNFNNPERDLC 843 Query 836 TRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFR 895 T +K ISRRCT PVTAIVSTLHY GKMRTTN C I+ID GS+KP GDIVLTCFR Sbjct 844 TSTHYKYISRRCTQPVTAIVSTLHYDGKMRTTNPCKRAIEIDVNGSTKPKKGDIVLTCFR 903 Query 896 GWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRL 955 GWVKQ QIDY G AASQGLTR+GVYAVRQKVNENPLY+ SEHVNVLLTRTE+R+ Sbjct 904 GWVKQGQIDYPGPGGHDRAASQGLTRRGVYAVRQKVNENPLYAEKSEHVNVLLTRTEDRI 963 Query 956 VWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAK 1015 VWKTL GDPWIK LTNVP+G+F+ATLEEWQ EH+ IM+ +N DPF +K CWAK Sbjct 964 VWKTLQGDPWIKYLTNVPKGNFTATLEEWQAEHEDIMKAINSTSTVSDPFASKVNTCWAK 1023 Query 1016 CLVQVLETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVS 1074 ++ +L TAGI +T 
++W + FR D+ YS AL+ ICT+ +G+DL SG+FS + Sbjct 1024 AIIPILRTAGIELTFEQWEDLFPQFRNDQPYSVMYALDVICTKMFGMDLSSGIFSRPEIP 1083 Query 1075 LFYE-------NNHWDNRPGGRMYGFNHEV---ARKYAARFPFLRGNMNSGLQLNVPERK 1124 L + HWDN PGG+ +G+N V +KY +LR G Q+ + Sbjct 1084 LTFHPADVGRVRAHWDNSPGGQKFGYNKAVIPTCKKYPV---YLRA--GKGDQILPIYGR 1138 Query 1125 LQPFSAECNIVPSNRRLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRV 1184 + SA N+VP NR LPH+L S Q+ + L ++PGH MLLVS+ KR+ Sbjct 1139 VSVPSARNNLVPLNRNLPHSLTASLQKKEAAPLHKFLNQLPGHSMLLVSKETCYCVSKRI 1198 Query 1185 FWIAPPRVSGADRTYDLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGD 1244 W+AP V GAD +DL G P RYDLV VN+ YR HHYQQC +H+ ++ L Sbjct 1199 TWVAPLGVRGADHNHDLHFGFP-PLSRYDLVVVNMGQPYRFHHYQQCEEHAGLMRTLARS 1257 Query 1245 SLHLLRPGGSLLMRAYGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDN 1304 +L+ L+PGG+L ++AYG+AD SE VV +LARKF +RP+C NTE+F +F DN Sbjct 1258 ALNCLKPGGTLALKAYGFADSNSEDVVLSLARKFVRASAVRPSCTQFNTEMFFVFRQLDN 1317 Query 1305 GR-RAVTLHQANQKLSSMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSD 1363 R R T H N +S+++ N +G APSYRV+R +I+ EEAVVNAANA+G D Sbjct 1318 DRERQFTQHHLNLAVSNIFD-NYKDGSGAAPSYRVKRMNIADCTEEAVVNAANARGKPGD 1376 Query 1364 GVCRAVAKKWPSSFKGAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRA 1423 GVCRA+ KKWP SF+ A T V TA M VIHAVGP+F T E + L AY Sbjct 1377 GVCRAIFKKWPKSFENATTEVETAVMKPCHNKVVIHAVGPDFRKYTLEEATKLLQNAYHD 1436 Query 1424 VASIISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKK 1483 VA I++ I SVA+PLLSTG ++ G DR+ SL LFTALD TDADV IYC DK WE++ Sbjct 1437 VAKIVNEKGISSVAIPLLSTGIYAAGADRLDLSLRCLFTALDRTDADVTIYCLDKKWEQR 1496 Query 1484 IQEAIDRRTAI-ELVSEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTA 1542 I +AI R + EL D+ ++ L RVHPDSCL GYS GKLYSY EGT+FHQTA Sbjct 1497 IADAIRMREQVTELKDPDIEIDEGLTRVHPDSCLKDHIGYSTQYGKLYSYFEGTKFHQTA 1556 Query 1543 VDMAEISTLWPRLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTA 1602 D+AEI L+P +Q ANEQICLY LGE M+SIR KCPVED+ +S PPKT+PCLC YAMTA Sbjct 1557 KDIAEIRALFPDVQAANEQICLYTLGEPMESIREKCPVEDSPASAPPKTIPCLCMYAMTA 1616 Query 1603 
ERVARLRMNNTKNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQ--QP 1660 ER+ R+R N+ NI VCSSFPLPKYRI+ VQK++C +V++F+ VP + R YI +P Sbjct 1617 ERICRVRSNSVTNITVCSSFPLPKYRIKNVQKIQCTKVVLFNPDVPPYIPARVYINKDEP 1676 Query 1661 P 1661 P Sbjct 1677 P 1677 Range 2: 1841 to 1929 Score:45.4 bits(106), Expect:3e-04, Method:Compositional matrix adjust., Identities:33/92(36%), Positives:42/92(45%), Gaps:14/92(15%) Query 1816 LDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQSXLGR-----------AGAYIFSSDTG 1864 + I FGD + R S FGD F+Q + R G YIFSSDTG Sbjct 1841 MPITFGDFAEGELD-RLLTPSPTPTFGD--FSQEEMDRFFGNRQYXLTGVGGYIFSSDTG 1897 Query 1865 PGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 PGHLQQ+SV Q+ + E+I+ P Sbjct 1898 PGHLQQKSVIQNSTTEILIERSRLEKIHAPVL 1929 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Salmon pancreas disease virus] Sequence ID: Q8JJX1.1 Length: 2601 Range 1: 15 to 1745 Score:1301 bits(3368), Expect:0.0, Method:Compositional matrix adjust., Identities:751/1739(43%), Positives:1015/1739(58%), Gaps:112/1739(6%) Query 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPT-GVTIL 61 VTV++ AD P L + AFP FEV + + NDHA ARAFSHLATK IE+++ V + Sbjct 15 VTVNLPADHPALNQFKTAFPGFEVVASNRSSNDHAAARAFSHLATKWIERDIGGRQVIVA 74 Query 62 DVGSAPARRLMS--DHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQ 119 D+GSAPARR+ + + TYH +CP K AEDPERLA+YARKL +A V+ KITDL+ Sbjct 75 DIGSAPARRIGAPDNVTYHSVCPRKCAEDPERLASYARKLVRAVERGDGHLVNEKITDLK 134 Query 120 DVMATPD--LESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGF 177 DV+ PD LE+ + CL+ D +C+ +A++AVYQDVYAV AP+++Y QA KG R YWIGF Sbjct 135 
DVLENPDTSLETTSICLNDDVSCKVKADIAVYQDVYAVDAPSTIYAQADKGTRVVYWIGF 194 Query 178 DTTPFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDR 237 + F +A+AG++P Y NW+D VL A+N+ LC +GLSE R K L PS Sbjct 195 EPFVFHTDAMAGSFPLYDANWSDSAVLAAKNLPLCYSGLSEDSIKWRFRFRDKPLVPSGE 254 Query 238 VMFSVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYG 297 + +SVGST Y E R L+SWHLPS FH N +TCRCDTVVSC GYVVKKITI GI G Sbjct 255 IHYSVGSTHYVEDRDKLKSWHLPSTFHFVAPNKYTCRCDTVVSCGGYVVKKITICEGIVG 314 Query 298 --KTVDYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQ 355 + A ++H +G +V K +DT+ E+VSFPV TY+PA ICDQMT + A V DA Sbjct 315 IPAKEELATSYHRDGVVVTKFSDTINHEQVSFPVVTYIPAVICDQMTAMTANPVKYSDAV 374 Query 356 KLLVGLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPL-GTRERTL 414 KLLVGLNQRIVVNG T RN N+M N L+PV A+A WA E R DMEDE+ L G T Sbjct 375 KLLVGLNQRIVVNGTTVRNVNSMDNSLIPVFARALCSWADEVRRDMEDEQDLYGITSVTT 434 Query 415 TCCCLWAFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSAR 474 C A+ + HT Y+RP+ + + VP+ F + +L ++ L++ ++Q + L Sbjct 435 WICICRAYDKRQQHTFYRRPKQSSGIYVPAKFTGSLRAALSATYLNLPLKQLLLNTLKRA 494 Query 475 MAQGLPYSGDRTearaaeeeekeaqeaeLTR--AALPPLVSG---------SCADDIAQV 523 + D TEA A + E E R AA P ++ D ++ V Sbjct 495 IKPMDQAIADETEALAHDAAEVHELTEEERRQQAANPSYIADVLGQDDDEEEAGDGMSDV 554 Query 524 DVEELTFRAGAGVVETPRNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAP-IHPLAEQV 582 D+ E AGA +++ R +KV D+++G YL+LSP TVL++ KLA + PLAE+V Sbjct 555 DLGEED-GAGATIIDCQRGTVKVITAFGDNMMGEYLVLSPVTVLRTRKLAILLGPLAEEV 613 Query 583 TVMTHSGRSGRYPVDKYDGRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIAL 642 H GR+GRY ++K + +VLIPTG ++ FQAL+ESAT+ YN+ F R L +A Sbjct 614 MQYVHKGRTGRYAIEKNNLKVLIPTGVSLKTDHFQALAESATLTYNDYLFTCRTLDQLAT 673 Query 643 YGPALNTDEESYEKVRAERAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEG 702 G A NTDE Y+ V A +A EYV+++ K C+KKE+A+G VL GD+ NPP+H+FAYE Sbjct 674 RGSARNTDEVYYKLVDAAKARDEYVYELSSKQCVKKEDATGTVLQGDICNPPYHQFAYEA 733 Query 703 LKIRPAAPYHTTIIGVFGVPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLD 762 L+ RPA + IG++GVPG+GK+AII VTTRDLVASGKKENC++I V +RGL Sbjct 734 
LRKRPAHTHDVHTIGIYGVPGAGKTAIITTEVTTRDLVASGKKENCEDIKRCVLERRGLK 793 Query 763 VTARTVDSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFN 822 + ARTVDS+ + V LYVDEA+ACHSGTLLALIA VRP+GKVVLCGDPKQ G N Sbjct 794 IAARTVDSLFYGAYRGAVNTLYVDEAYACHSGTLLALIAAVRPTGKVVLCGDPKQVGCVN 853 Query 823 LMQLKVHYNHNICTRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSS 882 +Q+++HYNH I RVL K+ISRRCT +TAIVS L+Y+G+M+TTN C P+ IDTTGS+ Sbjct 854 QLQMRMHYNHEISDRVLRKNISRRCTHTLTAIVSNLNYEGRMKTTNPCKKPVLIDTTGST 913 Query 883 KPASGDIVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSE 942 KP +VLTCFRGWVK L+ Y +E+MTAAASQGLTR+ VYAVR +V NPLY P SE Sbjct 914 KPDKEALVLTCFRGWVKDLKFLYPHNELMTAAASQGLTREKVYAVRCRVTTNPLYEPTSE 973 Query 943 HVNVLLTRTENRLVWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEV 1002 H+ VLLTRT + LVWKTL DP I +L+ P+GD+SAT+E+W++EH+GI+ L E Sbjct 974 HITVLLTRTNDELVWKTLPNDPLIPILSKPPKGDYSATMEDWEDEHNGILAALREACVPR 1033 Query 1003 DPF-QNKAKVCWAKCLVQVLETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYG 1060 F K CWA +VL AG+++T +++N I AFRED+ +S AL+ + T +G Sbjct 1034 MNFAHGKRNTCWAVTSSRVLHEAGVQITPEDYNRIFPAFREDKPHSALAALDAVATLVWG 1093 Query 1061 VDLDSGLFSAQSVSLFYENNHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNS--GLQ- 1117 +D SG+ S + + EN+HW N G YG N + Y P + + G + Sbjct 1094 LDTSSGILSGKGSFMRLENSHWSNSNRGYEYGLNLDALEGYEIANPRMIKALKQRRGREC 1153 Query 1118 LNVPERKLQPFSAECNIVPSNRRLPHALVTSYQQCRGERVE-------W----------- 1159 ++ KL P VP NR +PH LV + + +E W Sbjct 1154 YDIETGKLVPLDPARVQVPINRIVPHVLVDTSAAAKPGFLENRLTVDRWDQVHSFKTRAA 1213 Query 1160 -----LLKKIPGHQML-------LVSEYNL----AIPHKRVFWIAPPRVSGADRTYDLDL 1203 L K++ + +L V++Y + + W PR GA D+ Sbjct 1214 VKFAELTKRVSYNSVLDLGAAPGGVTDYCVKKGKTVTSVSEQWDTKPR--GA-VVVTADI 1270 Query 1204 GLPM-DAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRAYGY 1262 P+ + G +DLVF + R HHY QC DH++ + GG +++AYG Sbjct 1271 NGPLNNLGIFDLVFCDAAGPRRYHHYAQCEDHAVLFTSACKHGVERTAKGGVFIVKAYGM 1330 Query 1263 ADRVSEMVVTALARKFSAFRVLRP-ACVTSNTEVFLLFSNFDNGR---RAVTLHQANQKL 1318 ADR +E V AR F + V +P + +N EVF FS GR A ++ +L Sbjct 1331 
ADRRTERAVEGTARYFRSVSVEKPVSSRITNVEVFFKFS----GRCRPHARSIAHLGPQL 1386 Query 1319 SSMYA--------------CNGLHTA-------GCAPSYRVRRADISGHGEEAVVNAANA 1357 + +YA + + A G AP YRV +I EE +VNAAN+ Sbjct 1387 TDIYARTWKAYKMLARGSVADKVKVAEILNSMVGAAPGYRVLNRNIITAEEEVLVNAANS 1446 Query 1358 KGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDREL 1417 G DGVC A+ + +F A G A ++R T+IHA G +F V E G R+L Sbjct 1447 NGRPGDGVCGALYGAFGDAFPNGAIGAGNAVLVRGLEATIIHAAGADFREVDEETGARQL 1506 Query 1418 AAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRD 1477 AAYRA A++++ N I S A+PLLST FS G++R+ QS + L A D T+ DV IYC Sbjct 1507 RAAYRAAATLVTANGITSAAIPLLSTHIFSNGRNRLEQSFSALVEAFDTTECDVTIYCLA 1566 Query 1478 KNWEKKIQEAIDRRT----------------AIELVSEDVTLET---DLVRVHPDSCLVG 1518 N +IQ+ ID + +S+ TL + + V V S L G Sbjct 1567 NNMAARIQQLIDAHAREEFDEEVVVEEEEEHEADAMSDTETLSSFGDETVWVPKHSTLAG 1626 Query 1519 RNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANEQICLYALGETMDSIRTKC 1578 R GYSA G S GT+FH+ AV M+ I WP+ ++AN ++ Y G+ + + C Sbjct 1627 RPGYSAYYGDRRSLFVGTKFHRAAVAMSSIEAAWPKTKEANAKLIEYIRGQHLVDVLKSC 1686 Query 1579 PVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCSSFPLPKYRIEGVQKVKC 1637 PV+D PP ++PC C YAMT ERV L+ + +VCS+F LP I+ V KV+C Sbjct 1687 PVDDIPVGRPPSSLPCGCIYAMTPERVTVLKQRPQEGFVVCSAFKLPLTNIQDVTKVEC 1745 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sleeping disease virus] Sequence ID: Q8QL53.1 Length: 2593 Range 1: 15 to 1744 Score:1299 bits(3361), Expect:0.0, Method:Compositional matrix adjust., Identities:759/1753(43%), Positives:1013/1753(57%), Gaps:141/1753(8%) Query 3 
VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPT-GVTIL 61 VTVD+ AD P L + AFP FEV + + NDHA ARAFSHLATK IE+++ V + Sbjct 15 VTVDLPADHPALNQFKTAFPGFEVVASNRSSNDHAAARAFSHLATKWIERDIDGRQVIVA 74 Query 62 DVGSAPARRLMS--DHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQ 119 D+GSAPARR+ + + TYH +CP K AEDPERLA+YARKL +A VS +ITDL+ Sbjct 75 DIGSAPARRVGAPDNVTYHSVCPRKCAEDPERLASYARKLVRAVEKGDGHLVSDRITDLK 134 Query 120 DVMATPD--LESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGF 177 DV+ PD LE+ + CL+ D +C+ +A++AVYQDVYAV AP+++Y QA KG R YWIGF Sbjct 135 DVLENPDTSLETTSICLNDDVSCKVKADIAVYQDVYAVDAPSTIYAQADKGTRVVYWIGF 194 Query 178 DTTPFMFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDR 237 + F +A+AG++P Y NW+D VL A+N+ LC +GLSE R K L PS Sbjct 195 EPFVFHTDAMAGSFPLYDANWSDSAVLAAKNLPLCYSGLSEDSIKWRFRFRDKPLVPSGE 254 Query 238 VMFSVGSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYG 297 + +SVGST Y E R L+SWHLPS FH N +TCRCDTVVSC GYVVKKITI GI G Sbjct 255 IHYSVGSTHYVEDRDKLKSWHLPSTFHFVAPNKYTCRCDTVVSCGGYVVKKITICEGIVG 314 Query 298 KTV--DYAVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQ 355 + + A ++H +G +V K +DT+ E+VSFPV TY+PA ICDQMT + A V D Sbjct 315 RPANEELATSYHRDGVVVTKFSDTINHEQVSFPVVTYIPAVICDQMTAMTADPVKYPDVV 374 Query 356 KLLVGLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPL-GTRERTL 414 KLLVGLNQRIVVNG T RN N+M N L+PV A+A WA EAR DMEDE+ + G T Sbjct 375 KLLVGLNQRIVVNGTTVRNVNSMDNSLIPVFARALCSWADEARRDMEDEQDMYGVTSVTT 434 Query 415 TCCCLWAFKSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSAR 474 C A+ + HT Y+RP+ + + VP+ F + SL ++ L++ ++Q + L Sbjct 435 WICICRAYDKRQQHTFYRRPKQSSGIYVPAKFTGSLRASLSATYLNLPLKQLLLNTLKRA 494 Query 475 MAQGLPYSGDRTearaaeeeekeaqeaeLTR--AALPPLVSGSCADDIAQVDVEEL---- 528 + G D TEARA + E E R AA P + AD + Q D EE+ Sbjct 495 IKPGDQALADETEARAHDAAEVHELTEEEGRQQAANPSYI----ADVLGQDDEEEVDDGM 550 Query 529 -------TFRAGAGVVETPRNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPI-HPLAE 580 G+ +++ R +KV D+ +G YL+LSP TVL++ KLA + PLAE Sbjct 551 SNVDLGEEDGVGSTIIDCQRGTVKVITAFGDNTMGEYLVLSPVTVLRTRKLAVLLGPLAE 610 Query 581 
QVTVMTHSGRSGRYPVDKYDGRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHI 640 +V H GR+GRY ++K + +VLIPTG ++ + FQAL+ESAT+ YN+ F R L + Sbjct 611 EVMQYVHKGRTGRYAIEKNNLKVLIPTGVSLKTAHFQALTESATLTYNDYLFTCRTLDQL 670 Query 641 ALYGPALNTDEESYEKVRAERAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAY 700 A G A NTDE Y+ V A +A+ EYV+++ K C+KKE+A+G VL GD+ NPP+H+FA+ Sbjct 671 ATRGSAKNTDEVYYKLVDAAKAKDEYVYELSSKQCVKKEDATGTVLQGDICNPPYHQFAF 730 Query 701 EGLKIRPAAPYHTTIIGVFGVPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRG 760 E L+ RPA + IG++GVPG+GK+AII VTTRDLVASGKKENC++I V +RG Sbjct 731 EALRKRPAHTHDVHTIGIYGVPGAGKTAIITTEVTTRDLVASGKKENCEDIKRCVLERRG 790 Query 761 LDVTARTVDSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGF 820 L + ARTVDS+L + V+ LYVDEA+ACHSGTLLALIA VRP+GKVVLCGDPKQ G Sbjct 791 LKIAARTVDSLLYGAYRGAVDTLYVDEAYACHSGTLLALIAAVRPTGKVVLCGDPKQVGC 850 Query 821 FNLMQLKVHYNHNICTRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTG 880 N +Q+++HYNH I RVL K+ISRRCT +TAIVS L+Y+G+M+TTN C P+ IDTTG Sbjct 851 VNQLQMRMHYNHEISDRVLRKNISRRCTHTLTAIVSNLNYEGRMKTTNPCKKPVLIDTTG 910 Query 881 SSKPASGDIVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPL 940 S+KP +VLTCFRGWVK L+I Y +E+MTAAASQGLTR+ VYAVR +V NPLY P Sbjct 911 STKPDKEALVLTCFRGWVKDLKILYPHNELMTAAASQGLTREKVYAVRCRVTSNPLYEPT 970 Query 941 SEHVNVLLTRTENRLVWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPA 1000 SEH+ VLLTRT + LVWKTL DP I +L+ P+GD+SAT+E+W++EH+GI+ L E Sbjct 971 SEHITVLLTRTNDELVWKTLPNDPLIPILSKPPKGDYSATMEDWEDEHNGILAALREACV 1030 Query 1001 EVDPF-QNKAKVCWAKCLVQVLETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRY 1058 F K CWA +VL AG+ +T +++N I AFRED+ +S AL+ + Sbjct 1031 PRMNFAHGKRNTCWAVTSSRVLHEAGVLITPEDFNRIFPAFREDKPHSALAALDAVAALV 1090 Query 1059 YGVDLDSGLFSAQSVSLFYENNHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNS--GL 1116 +G+D SG+ S + + EN+HW N G YG N + Y P + + G Sbjct 1091 WGLDTSSGILSGKGSFMRLENSHWSNSNRGYEYGLNLDALEGYEIANPRMIKALKQRRGR 1150 Query 1117 Q-LNVPERKLQPFSAECNIVPSNRRLPHALV----------------------------- 1146 + ++ KL P VP NR +PH LV Sbjct 1151 ECYDIETGKLVPMDPGRVQVPINRVVPHVLVDTSAAAKPGFLENRLSVDRWDQVHSFKTR 1210 
Query 1147 -----------TSYQQ------CRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAP 1189 SY RG ++ +KK G + VSE W + Sbjct 1211 AAVKFAELTKRVSYNSVLDLGAARGGVTDYCVKK--GKTVTCVSEQ----------WDSK 1258 Query 1190 PRVSGADRTYDLDLGLPM-DAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHL 1248 PR GA D+ P+ + G +DLVF + R HHY QC DH+ R + Sbjct 1259 PR--GA-VVITADINGPLNNLGIFDLVFCDAAGPRRYHHYAQCEDHARRSTSACKHGVER 1315 Query 1249 LRPGGSLLMRAYGYADRVSEMVVTALARKFSAFRVLRP-ACVTSNTEVFLLFSNFDNGR- 1306 GG +++AYG ADR +E V AR F + V +P + +N EVF FS GR Sbjct 1316 TAKGGVFIVKAYGMADRRTERAVECTARYFKSVSVEKPVSSRITNVEVFFKFS----GRC 1371 Query 1307 --RAVTLHQANQKLSSMYA--------------CNGLHTA-------GCAPSYRVRRADI 1343 A ++ +L+ +YA + + A G AP YRV +I Sbjct 1372 RPHARSIAHLGPQLTDIYARTRKAYKMLARGSVADKVKVAEILNSMVGAAPGYRVLNKNI 1431 Query 1344 SGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMTVIHAVGP 1403 EE +VNAAN+ G DGVC A+ + +F A G A ++R T+IHA G Sbjct 1432 ITAEEEVLVNAANSNGRPGDGVCGALYGAFGDAFPNGAIGAGNAVLVRGLEATIIHAAGA 1491 Query 1404 NFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTA 1463 +F V E G R+L AAYRA A++++ N I S A+PLLST FS G++R+ QS L A Sbjct 1492 DFREVDEETGARQLRAAYRAAATLVTANGITSAAIPLLSTHIFSNGRNRLEQSFGALVEA 1551 Query 1464 LDATDADVVIYCRDKNWEKKIQEAIDRRT----------------AIELVSEDVTLET-- 1505 D T+ DV IYC N +IQ+ ID + + TL + Sbjct 1552 FDTTECDVTIYCLANNMAARIQQLIDDHAREEFDEEVVVEEEEEHEANAMCDTETLSSFG 1611 Query 1506 -DLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANEQICL 1564 + V V S L GR GYSAT G S GT+FH+ AV M+ I WPR ++AN ++ Sbjct 1612 DETVWVPKHSTLAGRPGYSATYGDRRSLFVGTKFHRAAVAMSSIEAAWPRTKEANAKLIE 1671 Query 1565 YALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCSSFPL 1624 Y G+ + + CPV D PP ++PC C YAMT ERV L+ + +VCS+F L Sbjct 1672 YIRGQHLVDVLKSCPVNDIPVGRPPSSLPCGCIYAMTPERVTVLKQRPQEGFVVCSAFKL 1731 Query 1625 PKYRIEGVQKVKC 1637 P I+ V KV+C Sbjct 1732 PLTNIQDVTKVEC 1744 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: 
Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN T48)] Sequence ID: P13888.2 Length: 1149 Range 1: 1 to 578 Score:812 bits(2097), Expect:0.0, Method:Compositional matrix adjust., Identities:427/588(73%), Positives:462/588(78%), Gaps:34/588(5%) Query 1333 APSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRA 1392 APSYRVRR DISGH EEAVVNAANAKGTV DGVCRAVA+KWP SFKGAATPVGTAK+++A Sbjct 1 APSYRVRRTDISGHAEEAVVNAANAKGTVGDGVCRAVARKWPDSFKGAATPVGTAKLVQA 60 Query 1393 DGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDR 1452 +GM VIHAVGPNFSTVTEAEGDRELAAAYRAVA II+ +NIKSVA+PLLSTG FSGGKDR Sbjct 61 NGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPLLSTGVFSGGKDR 120 Query 1453 VMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSEDVTLETDLVRVHP 1512 VMQSLNHLFTA+D TDADVVIYCRDK WEKKIQEAIDRRTA+ELVSED++LE+DL+RVHP Sbjct 121 VMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSEDISLESDLIRVHP 180 Query 1513 DSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDANEQICLYALGETMD 1572 DSCLVGR GYS TDGKL+SYLEGTRFHQTAVDMAEISTLWP+LQDANEQICLYALGE+MD Sbjct 181 DSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDANEQICLYALGESMD 240 Query 1573 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVCSSFPLPKYRIEGV 1632 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTK IIVCSSFPLPKYRIEGV Sbjct 241 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVCSSFPLPKYRIEGV 300 Query 1633 QKVKCDRVLIFDQTVPSLVSPRKYIQQPPE-QLDNVSLTSTTSTGSAWSLPSETTYETME 1691 QKVKCDRVLIFDQTVPSLVSPRKYI D VSL ST STGSAWS PSE TYETME Sbjct 301 QKVKCDRVLIFDQTVPSLVSPRKYIPAAASTHADTVSLDSTVSTGSAWSFPSEATYETME 360 Query 1692 VVAEVH-TEppippprrrrAAVAQLRQDLEVTEEIEPYVIQQAEI------MVMERVAT- 1743 VVAEVH +EPP+PPPRRRRA V Q+L ++ + + EI +V+ERVA Sbjct 361 VVAEVHHSEPPVPPPRRRRAQVTMHHQELLEVSDMHTPIAARVEIPVYDTAVVVERVAIP 420 Query 1744 
-TDIRAIPVPARRAI-TMPVPAPRVRK-------------Vateppsepeapipaprkrr 1788 T A P+PA RA +PVPAPR+++ V Sbjct 421 CTSEYAKPIPAPRAARVVPVPAPRIQRASTYRVSPTPTPRVLRASVCSVTTSAGVEFPWA 480 Query 1789 ttsttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDWDVSTGIQFGDIDFNQ 1848 P R PVELPWEPED+DIQFGD E S IQFGDIDF+Q Sbjct 481 PEDLEVLTEPVHCKMREPVELPWEPEDVDIQFGDFE----------TSDKIQFGDIDFDQ 530 Query 1849 SXLGRAGAYIFSSDTGPGHLQQRSVRQHELPCETLYAHEDERIYPPAF 1896 XLGRAGAYIFSSDTGPGHLQQ+SVRQH LPCE LY HE+ER YPPA Sbjct 531 FXLGRAGAYIFSSDTGPGHLQQKSVRQHALPCEMLYVHEEERTYPPAL 578 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Middelburg virus] Sequence ID: P03318.2 Length: 995 Range 1: 1 to 423 Score:413 bits(1062), Expect:4e-123, Method:Compositional matrix adjust., Identities:241/485(50%), Positives:296/485(61%), Gaps:66/485(13%) Query 1414 DRELAAAYRAVASIISTNNIKSVAVPLLSTGTFSGGKDRVMQSLNHLFTALDATDADVVI 1473 D +LAA YRAVAS+ + ++++A+PLLSTGTF+GGKDRV+QSLNHLFTALD TD DV I Sbjct 1 DADLAAVYRAVASL-ADETVRTMAIPLLSTGTFAGGKDRVLQSLNHLFTALDTTDVDVTI 59 Query 1474 YCRDKNWEKKIQEAIDRRTAIELVSEDVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYL 1533 YCRDK+WEKKIQEAID RTA EL+ +D T+ +L RVHPDSCLVGR+G+S DG+L+SYL Sbjct 60 YCRDKSWEKKIQEAIDMRTATELLDDDTTVMKELTRVHPDSCLVGRSGFSTVDGRLHSYL 119 Query 1534 EGTRFHQTAVDMAEISTLWPRLQDANEQICLYALGETMDSIRTKCPVEDADSSTPPKTVP 1593 EGTRFHQTAVD+AE TLWPR ++ANEQI Y LGE+M++IRTKCPV+D DSS PP TVP Sbjct 120 EGTRFHQTAVDVAERPTLWPRREEANEQITHYVLGESMEAIRTKCPVDDTDSSAPPCTVP 179 Query 1594 CLCRYAMTAERVARLRMNNTKNIIVCSSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSP 1653 CLCRYAMT ERV RLR K VCSSFPLPKY+I GVQ+V C V++F+ VP+LVSP Sbjct 180 
CLCRYAMTPERVHRLRAAQVKQFTVCSSFPLPKYKIPGVQRVACSAVMLFNHDVPALVSP 239 Query 1654 RKYIQQPPEQLDNVSLTSTTSTGSAWSLPSETTYETMEVVAEVHTEppippprrrrAAVA 1713 RKY E + +S+ + + S++ YE ME V Sbjct 240 RKY----REPSISSESSSSGLSVFDLDIGSDSEYEPMEPVQ------------------P 277 Query 1714 QLRQDLEVTEEIEPYVIQQAEIMVMERVATTDIRAIPVPARRAITMPVPAPRVRKVatep 1773 + DL V EE P +++ + R A RA P + + PVPAPR V Sbjct 278 EPLIDLAVVEETAPVRLERVAPVAAPRRA----RATPFTLEQRVVAPVPAPRTMPVRP-- 331 Query 1774 psepeapipaprkrrttsttppHNPGDFVPRVPVELPWEPEDLDIQFGDLEPRRRNTRDW 1833 PR PE I FGDL+ + Sbjct 332 -----------------------------PRRKKAATRTPE--RISFGDLDAECMAIIND 360 Query 1834 DVSTGIQFGDIDFNQ---SXLGRAGAYIFSSDTGPGHLQQRSVRQHELP-CETLYAHEDE 1889 D++ G FG +F + +XL RAGAYIFSSDTGPGHLQQRSVRQ L C HE E Sbjct 361 DLTFG-DFGAGEFERLTSAXLDRAGAYIFSSDTGPGHLQQRSVRQTRLADCVAEDVHE-E 418 Query 1890 RIYPP 1894 R++ P Sbjct 419 RVFAP 423 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Odontoglossum ringspot virus (isolate Singapore 1)] Sequence ID: Q84133.2 Length: 1612 Range 1: 826 to 1100 Score:76.3 bits(186), Expect:1e-13, Method:Compositional matrix adjust., Identities:77/280(28%), Positives:121/280(43%), Gaps:27/280(9%) Query 720 GVPGSGKSA-IIKNMVTTRDLVASGKKENCQEIMNDVKRQ---RGLDVTARTVDSILLNG 775 GVPG GK+ I++ + DL+ KE C+ I+ + R RTVDS L++ Sbjct 826 GVPGCGKTKEILETVNFDEDLILVPGKEACKMIIKRANKSGHVRATRDNVRTVDSFLMHL 885 Query 776 CKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNIC 835 + L++DE H+G + L+AL +V GD +Q F N + + H Sbjct 886 KPKTYNKLFIDEGLMLHTGCVNFLVALSHCREAMVF-GDAEQIPFINRVANFPYPKHFRY 944 Query 836 TRVLHKSISR---RCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTG-------SSKPA 885 T + H+ + R RC VT +++ Y GK+ TN + + SKP Sbjct 945 TCLYHREVRRLSLRCPADVTHFMNS-KYDGKVLCTNDVIRSVDAEVVRGKGVFNPKSKPL 1003 Query 886 SGDIVLTCFRGWVKQLQIDYRGH-------EVMTAAASQGLTRKGVYAVRQKVNENPLYS 938 
G I+ + ++ RG+ E+ T QG T + V VR L S Sbjct 1004 KGKIITFT---QSDKAELKERGYEEVSTFGEINTVHEIQGETFEDVSVVRLTPTPLELIS 1060 Query 939 PLSEHVNVLLTRTENRLVWKTLSGDPWIKVLTNVPR-GDF 977 S HV V LTR + ++ DP +KV +++ + DF Sbjct 1061 KSSPHVLVALTRHTKSFKYYSVVLDPLVKVWSDLSKVSDF 1100 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Odontoglossum ringspot virus (isolate Korean Cy)] Sequence ID: P89659.2 Length: 1612 Range 1: 809 to 1100 Score:75.5 bits(184), Expect:2e-13, Method:Compositional matrix adjust., Identities:81/297(27%), Positives:124/297(41%), Gaps:27/297(9%) Query 703 LKIRPAAPYHTTIIGVFGVPGSGKSA-IIKNMVTTRDLVASGKKENCQEIMNDVKRQ--- 758 LK P + V GVPG GK+ I++ + DL+ KE C+ I+ + Sbjct 809 LKDGEPVPSDAKVTLVDGVPGCGKTKEILETVNFDEDLILVPGKEACKMIIKRANKSGHV 868 Query 759 RGLDVTARTVDSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQC 818 R RTVDS L++ + L++DE H+G + LIAL +V GD +Q Sbjct 869 RATKDNVRTVDSFLMHLKPKTYNKLFIDEGLMLHTGCVNFLIALSHCREAMVF-GDTEQI 927 Query 819 GFFNLMQLKVHYNHNICTRVLHKSISR---RCTLPVTAIVSTLHYQGKMRTTNRCNTPIQ 875 F N + + H H+ + R RC VT +++ Y GK+ TN + Sbjct 928 PFINRVANFPYPKHFATLVYDHREVRRLSLRCPADVTHFMNS-KYDGKVLCTNDVIRSVD 986 Query 876 IDTTG-------SSKPASGDIVLTCFRGWVKQLQIDYRGH-------EVMTAAASQGLTR 921 + SKP G I+ + ++ RG+ E+ T QG T Sbjct 987 AEVVRGKGVFNPKSKPLKGKIITFT---QSDKAELKERGYEEVSTFGEINTVHEIQGETF 1043 Query 922 KGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTLSGDPWIKVLTNVPR-GDF 977 + V VR L S S HV V LTR + ++ DP +KV +++ + DF Sbjct 1044 EDVSVVRLTPTPLELISKSSPHVLVALTRHTKSFKYYSVVLDPLVKVCSDLSKVSDF 1100 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Tobacco mild green mosaic virus] Sequence ID: P18339.2 
Length: 1609 Range 1: 814 to 1090 Score:64.7 bits(156), Expect:3e-10, Method:Compositional matrix adjust., Identities:73/282(26%), Positives:117/282(41%), Gaps:22/282(7%) Query 705 IRPAAPYHTT--IIGVFGVPGSGK-SAIIKNMVTTRDLVASGKKENCQEI---MNDVKRQ 758 +R P+ T ++ V GVPG GK + DL+ K+ I N Sbjct 814 MRDGEPHEPTAKMVLVDGVPGCGKYKGDFERFDLDEDLILVPGKQAAAMIRRRANSSGLI 873 Query 759 RGLDVTARTVDSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQC 818 R RTVDS+L++ R + L++DE H+G + L+ L+ + GD +Q Sbjct 874 RATMDNVRTVDSLLMHPKPRSHKRLFIDEGLMLHTGCVNFLV-LISGCDIAYIYGDTQQI 932 Query 819 GFFNLMQ---LKVHYNHNICTRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQ 875 F N +Q H+ V + + RC V + + Y+G + TT+ + Sbjct 933 PFINRVQNFPYPKHFEKLQVDEVEMRRTTLRCPGDVNFFLQS-KYEGAVTTTSTVQRSVS 991 Query 876 IDTTGS-------SKPASGDIVLTCFRGWVKQLQIDYRGHE-VMTAAASQGLTRKGVYAV 927 + G SKP G IV + +++ +G++ V T QG T + V V Sbjct 992 SEMIGGKGVLNSVSKPLKGKIVTFT---QADKFELEEKGYKNVNTVHEIQGETFEDVSLV 1048 Query 928 RQKVNENPLYSPLSEHVNVLLTRTENRLVWKTLSGDPWIKVL 969 R L S S HV V LTR + T+ DP ++++ Sbjct 1049 RLTATPLTLISKSSPHVLVALTRHTKSFKYYTVVLDPLVQII 1090 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Turnip vein-clearing virus] Sequence ID: Q88920.1 Length: 1601 Range 1: 815 to 1091 Score:60.8 bits(146), Expect:5e-09, Method:Compositional matrix adjust., Identities:71/282(25%), Positives:118/282(41%), Gaps:19/282(6%) Query 707 PAAPYHTTIIGVFGVPGSGKSA-IIKNMVTTRDLVASGKKENCQEIM---NDVKRQRGLD 762 P P + +I V GVPG GK+ II+ + + DL+ KE + I+ N R Sbjct 815 PPEP-NAKVILVDGVPGCGKTKEIIEKVNFSEDLILVPGKEASKMIIRRANQAGVIRADK 873 Query 763 VTARTVDSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFN 822 RTVDS L++ +R + L++DE H+G + L+ L + V GD KQ F Sbjct 874 DNVRTVDSFLMHPSRRVFKRLFIDEGLMLHTGCVNFLLLLSQCDVAYVY-GDTKQIPFIC 932 Query 823 
LMQ---LKVHYNHNICTRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTT 879 + H+ + + ++ RC VT ++ Y G + T+ ++ + Sbjct 933 RVANFPYPAHFAKLVADEKEVRRVTLRCPADVTYFLNK-KYDGAVMCTSAVERSVKAEVV 991 Query 880 GS-------SKPASGDIVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVN 932 + P G I+ + L+ Y+ +V T QG T + VR Sbjct 992 RGKGALNPITLPLEGKILTFTQADKFELLEKGYK--DVNTVHEVQGETYEKTAIVRLTST 1049 Query 933 ENPLYSPLSEHVNVLLTRTENRLVWKTLSGDPWIKVLTNVPR 974 + S S HV V LTR + T+ DP + V++ + + Sbjct 1050 PLEIISSASPHVLVALTRHTTCCKYYTVVLDPMVNVISEMEK 1091 >RecName: Full=Replicase large subunit; AltName: Full=182 kDa protein; Contains: RecName: Full=Replicase small subunit; AltName: Full=125 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Youcai mosaic virus] Sequence ID: Q66220.2 Length: 1597 Range 1: 818 to 1101 Score:58.2 bits(139), Expect:3e-08, Method:Compositional matrix adjust., Identities:74/290(26%), Positives:117/290(40%), Gaps:20/290(6%) Query 715 IIGVFGVPGSGKSA-IIKNMVTTRDLVASGKKENCQEIM---NDVKRQRGLDVTARTVDS 770 ++ V GVPG GK+ I++ + + DLV KE + I+ N R RTVDS Sbjct 818 LVLVDGVPGCGKTKEILEKVNFSEDLVLVPGKEASKMIIRRANQAGITRADKDNVRTVDS 877 Query 771 ILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQ---LK 827 L++ KR + L++DE H+G + L+ L V D +Q F + Sbjct 878 FLMHPPKRVFKRLFIDEGLMLHTGCVNFLMLLSHCDVAYVYV-DTQQIPFICRVANFPYP 936 Query 828 VHYNHNICTRVLHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGS------ 881 H+ + + ++ RC VT ++ Y G + T+ + + Sbjct 937 AHFAKLVVDEKEDRRVTLRCPADVTYFLNQ-KYDGSVLCTSSVERSVSAEVVRGKGALNP 995 Query 882 -SKPASGDIVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPL 940 + P G I+ F K +D +V T QG T + VR + S Sbjct 996 ITLPLEGKIL--TFTQADKFELLDKGYKDVNTVHEVQGETYEKTAIVRLTATPLEIISRA 1053 Query 941 SEHVNVLLTRTENRLVWKTLSGDPWIKVLTNVPRGDFSATLEEWQEEHDG 990 S HV V LTR R + T+ DP + V++ + G S L E + G Sbjct 1054 SPHVLVALTRHTTRCKYYTVVLDPMVNVISEL--GKLSNFLLEMYKVESG 1101 >RecName: Full=Replicase polyprotein 1ab; Contains: RecName: Full=Leader protease; Short=L-Pro; AltName: Full=Papain-like cysteine proteinase; 
Short=PCP; Contains: RecName: Full=Methyltransferase/helicase/RNA-directed RNA polymerase [Beet yellows virus isolate Ukraine] Sequence ID: Q08534.2 Length: 3094 Range 1: 2251 to 2520 Score:54.7 bits(130), Expect:4e-07, Method:Compositional matrix adjust., Identities:71/274(26%), Positives:112/274(40%), Gaps:37/274(13%) Query 722 PGSGKSA-IIKNMVTT----RDLVASGKKENCQEIMNDVKR------QRGLDVTAR--TV 768 PG GK+ +IK T L+ + K + +EI+ V R L R T+ Sbjct 2251 PGGGKTTTLIKVFCETFSKVNSLILTANKSSREEILAKVNRIVLDEGDTPLQTRDRILTI 2310 Query 769 DSILLNGCKRGVENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKV 828 DS L+N + LY+DE F H+G +A I + +L GD +Q + +L Sbjct 2311 DSYLMNNRGLTCKVLYLDECFMVHAGAAVACIEFTK-CDSAILFGDSRQIRYGRCSELDT 2369 Query 829 HYNHNICTRV-----LHKSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPI------QID 877 ++ V ++ +S RC V A +ST Y + TTN + +I+ Sbjct 2370 AVLSDLNRFVDDESRVYGEVSYRCPWDVCAWLSTF-YPKTVATTNLVSAGQSSMQVREIE 2428 Query 878 TTGSSKPASGDIVLTCFRGWVKQLQIDYRGHE--------VMTAAASQGLTRKGVYAVRQ 929 + + +S + LT + K L + V+T +QG T + V VR Sbjct 2429 SVDDVEYSSEFVYLTMLQSEKKDLLKSFGKRSRSSVEKPTVLTVHEAQGETYRKVNLVRT 2488 Query 930 KVNE-NPLYSPLSEHVNVLLTRTENRLVWKTLSG 962 K E +P S H+ V L+R L + LS Sbjct 2489 KFQEDDPFRS--ENHITVALSRHVESLTYSVLSS 2520 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Burma)] Sequence ID: P29324.1 Length: 1693 Range 1: 803 to 891 Score:41.6 bits(96), Expect:0.003, Method:Compositional matrix adjust., Identities:31/104(30%), Positives:46/104(44%), Gaps:23/104(22%) Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402 +VNA+N G+C A +++P+SF A+ + DG +IHAV Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAAS-------FVMRDGAAAYTLTPRPIIHAVA 855 Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 P++ + L AAYR S + T A PLL TG + Sbjct 856 
PDYRL---EHNPKRLEAAYRETCSRLGT-----AAYPLLGTGIY 891

>RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Pakistan)]
Sequence ID: P33424.2 Length: 1693
Range 1: 803 to 891
Score:41.6 bits(96), Expect:0.003, Method:Compositional matrix adjust., Identities:31/104(30%), Positives:46/104(44%), Gaps:23/104(22%)

Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402
+VNA+N G+C A +++P+SF A+ + DG +IHAV
Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAAS-------FVMRDGAAAYTLTPRPIIHAVA 855

Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446
P++ + L AAYR S + T A PLL TG +
Sbjct 856 PDYRL---EHNPKRLEAAYRETCSRLGT-----AAYPLLGTGIY 891

>RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus isolate Hetian]
Sequence ID: Q81862.1 Length: 1693
Range 1: 803 to 891
Score:41.2 bits(95), Expect:0.005, Method:Compositional matrix adjust., Identities:31/104(30%), Positives:46/104(44%), Gaps:23/104(22%)

Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402
+VNA+N G+C A +++P+SF A+ + DG +IHAV
Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAAS-------FVMRDGAAAYTLTPRPIIHAVA 855

Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446
P++ + L AAYR S + T A PLL TG +
Sbjct 856 PDYRL---EHNPKMLEAAYRETCSRLGT-----AAYPLLGTGIY 891

>RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Therien]
Sequence ID: P13889.5 Length: 2116
Range 1: 832 to 942
Score:40.0 bits(92), Expect:0.011, Method:Compositional matrix adjust., Identities:35/111(32%), Positives:47/111(42%), Gaps:10/111(9%) Query 1347 GEEAVVNAANAKGTVSDGVCRAV-----AKKWPSSFKGAATPVGTAKMIRADGMT---VI 1398 G + VVNAAN GVC A+ A + + A P G A G +I Sbjct 832 GCKVVVNAANEGLLAGSGVCGAIFANATAALAANCRRLAPCPTGEAVATPGHGCGYTHII 891 Query 1399 HAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFS 1447 HAV P A EG+ L AYR++ ++ + VA PLL G + Sbjct 892 HAVAPRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336] Sequence ID: Q99IE5.1 Length: 2116 Range 1: 832 to 942 Score:40.0 bits(92), Expect:0.011, Method:Compositional matrix adjust., Identities:35/111(32%), Positives:46/111(41%), Gaps:10/111(9%) Query 1347 GEEAVVNAANAKGTVSDGVCRAV-----AKKWPSSFKGAATPVGTAKMIRADGMT---VI 1398 G + VVNAAN GVC A+ A + A P G A G +I Sbjct 832 GCKVVVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHII 891 Query 1399 HAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFS 1447 HAV P A EG+ L AYR++ ++ + VA PLL G + Sbjct 892 HAVAPRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain RN-UK86] Sequence ID: Q8BCR0.1 Length: 2116 Range 1: 832 to 942 Score:40.0 bits(92), Expect:0.011, Method:Compositional matrix adjust., Identities:35/111(32%), Positives:46/111(41%), Gaps:10/111(9%) Query 1347 GEEAVVNAANAKGTVSDGVCRAV-----AKKWPSSFKGAATPVGTAKMIRADGMT---VI 1398 G + VVNAAN GVC A+ A + A P G A G +I Sbjct 832 GCKVVVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHII 891 Query 1399 HAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFS 1447 HAV P A EG+ L AYR++ ++ + VA PLL G + Sbjct 892 
HAVAPRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336 vaccine] Sequence ID: Q99IE7.1 Length: 2116 Range 1: 832 to 942 Score:40.0 bits(92), Expect:0.012, Method:Compositional matrix adjust., Identities:35/111(32%), Positives:46/111(41%), Gaps:10/111(9%) Query 1347 GEEAVVNAANAKGTVSDGVCRAV-----AKKWPSSFKGAATPVGTAKMIRADGMT---VI 1398 G + VVNAAN GVC A+ A + A P G A G +I Sbjct 832 GCKVVVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHII 891 Query 1399 HAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFS 1447 HAV P A EG+ L AYR++ ++ + VA PLL G + Sbjct 892 HAVAPRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus human/g1/India/Hyderabad] Sequence ID: Q9WC28.1 Length: 1693 Range 1: 803 to 891 Score:39.7 bits(91), Expect:0.016, Method:Compositional matrix adjust., Identities:30/104(29%), Positives:44/104(42%), Gaps:23/104(22%) Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402 +VNA+N G+C A +++P+SF A + DG +IH V Sbjct 803 LVNASNVDHCPGGGLCHAFYQRYPASFDAAC-------FVMRDGAAAYTLTPRPIIHRVA 855 Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 P++ + L AAYR S + T A PLL TG + Sbjct 856 PDYRL---EHNPKRLEAAYRETCSRLGT-----AAYPLLGTGIY 891 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRDII] Sequence ID: Q6X2U2.1 Length: 2116 Range 1: 822 to 942 Score:39.7 bits(91), Expect:0.016, 
Method:Compositional matrix adjust., Identities:36/121(30%), Positives:49/121(40%), Gaps:10/121(8%) Query 1337 RVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSF-----KGAATPVGTAKMIR 1391 RVR G + VVNAAN GVC A+ +S + A P G A Sbjct 822 RVRNIMDPPPGCKVVVNAANEGLLAGSGVCGAIFASAAASLAEDCRRLAPCPTGEAVATP 881 Query 1392 ADGMT---VIHAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 G +IHAV P A + + L AYR++ ++ + VA PLL G + Sbjct 882 GHGCGYAHIIHAVAPRRPQDPAALEQSEALLERAYRSIVALAAARRWTCVACPLLGAGIY 941 Query 1447 S 1447 Sbjct 942 G 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Cendehill] Sequence ID: Q9J6K9.2 Length: 2116 Range 1: 832 to 942 Score:39.3 bits(90), Expect:0.017, Method:Compositional matrix adjust., Identities:35/111(32%), Positives:46/111(41%), Gaps:10/111(9%) Query 1347 GEEAVVNAANAKGTVSDGVCRAV-----AKKWPSSFKGAATPVGTAKMIRADGMT---VI 1398 G + VVNAAN GVC A+ A + A P G A G +I Sbjct 832 GCKVVVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHII 891 Query 1399 HAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTFS 1447 HAV P A EG+ L AYR++ ++ + VA PLL G + Sbjct 892 HAVAPRRPRDPAALEEGEALLERAYRSIVALAAARRWAYVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Mexico)] Sequence ID: Q03495.1 Length: 1691 Range 1: 801 to 889 Score:39.3 bits(90), Expect:0.018, Method:Compositional matrix adjust., Identities:30/104(29%), Positives:44/104(42%), Gaps:23/104(22%) Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402 +VNA+NA G+C A +++P SF K + DG+ +IHAV Sbjct 801 LVNASNAGHRPGGGLCHAFFQRYPDSFD-------ATKFVMRDGLAAYTLTPRPIIHAVA 853 Query 1403 
PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 P++ + L AAYR + T A PLL G + Sbjct 854 PDYRL---EHNPKRLEAAYRETCARRGT-----AAYPLLGAGIY 889 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus Ct1] Sequence ID: Q9IVZ9.1 Length: 1707 Range 1: 817 to 905 Score:39.3 bits(90), Expect:0.018, Method:Compositional matrix adjust., Identities:31/104(30%), Positives:44/104(42%), Gaps:23/104(22%) Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402 +VNA+N G+C A +++P SF A+ I +DG +IHAV Sbjct 817 LVNASNPGHRPGGGLCHAFYQRFPESFD-------PAEFIMSDGFAAYTLTPRPIIHAVA 869 Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 P++ + L AAYR S T A PLL G + Sbjct 870 PDYRV---EHNPKRLEAAYRETCSRRGT-----AAYPLLGVGIY 905 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRD1] Sequence ID: Q6X2U4.1 Length: 2116 Range 1: 822 to 942 Score:38.9 bits(89), Expect:0.024, Method:Compositional matrix adjust., Identities:36/121(30%), Positives:49/121(40%), Gaps:10/121(8%) Query 1337 RVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSF-----KGAATPVGTAKMIR 1391 RVR G + VVNAAN GVC A+ ++ + A P G A Sbjct 822 RVRNIMDPPPGCKVVVNAANEGLLAGSGVCGAIFASAAATLAEDCRRLAPCPTGEAVATP 881 Query 1392 ADGMT---VIHAVGPNFSTVTEA--EGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 G +IHAV P A + + L AYR+V ++ + VA PLL G + Sbjct 882 GHGCGYTHIIHAVAPRRPQDPAALEQSEALLERAYRSVVALAAARRWACVACPLLGAGIY 941 Query 1447 S 1447 Sbjct 942 G 942 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; 
Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Myanmar)] Sequence ID: Q04610.1 Length: 1693 Range 1: 803 to 891 Score:38.5 bits(88), Expect:0.035, Method:Compositional matrix adjust., Identities:30/104(29%), Positives:45/104(43%), Gaps:23/104(22%) Query 1351 VVNAANAKGTVSDGVCRAVAKKWPSSFKGAATPVGTAKMIRADGMT--------VIHAVG 1402 +VNA+N G+C A +++P+SF A+ + DG +IHAV Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAAS-------FVMRDGAAAYTLTPRPIIHAVA 855 Query 1403 PNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVPLLSTGTF 1446 P++ + L AAYR S + T A LL TG + Sbjct 856 PDYRL---EHNPKRLEAAYRETCSRLGT-----AAYSLLGTGIY 891