RID: 9WWTTZH101N
Job Title:Q5XXP4:RecName: Full=Polyprotein P1234; Short=P1234;...
Program: BLASTP
Query: RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain Senegal 37997]
ID: Q5XXP4.1(amino acid) Length: 1856
Database: swissprot Non-redundant UniProtKB/SwissProt sequences

Sequences producing significant alignments:

                                                                   Scientific       Common               Max    Total  Query  E       Per.    Acc.
Description                                                        Name             Name         Taxid   Score  Score  cover  Value   Ident   Len   Accession
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Chikungunya ...  NA           371095  3893   3893   100%   0.0     100.00  2474  Q5XXP4.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Chikungunya ...  NA           371094  3700   3700   100%   0.0     95.42   2474  Q8JUX6.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          O'nyong-nyon...  NA           374989  3393   3393   100%   0.0     84.31   2513  O90368.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Igbo Ora virus   NA           79899   3386   3386   100%   0.0     84.15   2513  O90370.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          O'nyong-nyon...  NA           11028   3375   3375   100%   0.0     83.69   2514  P13886.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Semliki Fore...  NA           11033   2627   2627   98%    0.0     68.47   2432  P08411.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Getah virus      NA           59300   2537   2537   89%    0.0     72.42   2467  Q5Y389.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Sagiyama virus   NA           59303   2534   2534   89%    0.0     72.36   2467  Q9JGL0.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Ross river v...  NA           11031   2522   2522   98%    0.0     66.12   2480  P13887.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Mayaro virus...  NA           374990  2493   2493   98%    0.0     66.39   2437  Q8QZ73.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Barmah Fores...  NA           11020   2197   2240   90%    0.0     63.37   2411  P87515.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Ockelbo virus    NA           31699   2105   2105   99%    0.0     55.66   2515  P27283.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Sindbis virus    NA           11034   2098   2098   99%    0.0     55.27   2513  P03317.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Venezuelan e...  NA           11038   2063   2063   89%    0.0     59.53   2493  P27282.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Venezuelan e...  NA           36385   2061   2061   89%    0.0     59.53   2493  P36328.2
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Venezuelan e...  NA           36382   2061   2061   89%    0.0     59.31   2485  P36327.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Venezuelan e...  NA           376610  2043   2043   89%    0.0     59.60   2497  Q8V294.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Eastern equi...  NA           374598  2037   2037   89%    0.0     59.93   2494  Q4QXJ8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Venezuelan e...  NA           36384   2036   2036   89%    0.0     59.68   2499  Q9WJC7.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Eastern equi...  NA           374596  2036   2036   89%    0.0     59.27   2471  Q306W6.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Eastern equi...  NA           374597  2031   2031   89%    0.0     59.27   2474  Q306W8.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Western equi...  NA           11039   2019   2019   89%    0.0     58.83   2467  P13896.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Aura virus       NA           44158   2002   2002   89%    0.0     58.65   2499  Q86924.3
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Salmon pancr...  NA           84589   1291   1291   88%    0.0     43.04   2601  Q8JJX1.1
RecName: Full=Polyprotein P1234; Short=P1234; AltName:...          Sleeping dis...  NA           78540   1286   1286   88%    0.0     42.88   2593  Q8QL53.1
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...        Ross river v...  NA           11032   501    501    17%    1e-151  72.53   1149  P13888.2
RecName: Full=Polyprotein nsP1234; Short=P1234; AltName:...        Middelburg v...  NA           11023   365    365    23%    5e-105  48.96   995   P03318.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...    Odontoglossu...  NA           138661  79.7   79.7   14%    2e-13   28.92   1612  P89659.2
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...    Odontoglossu...  NA           138662  79.3   79.3   13%    3e-13   28.26   1612  Q84133.2
RecName: Full=Uncharacterized protein FN1951 [Fusobacterium...     Fusobacteriu...  NA           190304  64.3   64.3   7%     3e-10   31.93   175   Q8RHQ2.1
RecName: Full=Replicase large subunit; AltName: Full=186 kDa...    Cucumber gre...  NA           12236   67.4   67.4   13%    1e-09   28.73   1648  P69514.1
RecName: Full=Uncharacterized protein Saci_1252 [Sulfolobus...     Sulfolobus a...  NA           330779  56.6   56.6   5%     2e-07   35.04   181   Q4J9D2.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           509615  60.5   60.5   5%     2e-07   34.82   1708  Q9YLR1.1
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...    Tobacco mild...  NA           12241   59.3   59.3   13%    4e-07   26.47   1609  P18339.2
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...         Cronobacter ...  NA           693216  54.7   54.7   5%     6e-07   35.25   176   C9Y0V8.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           509627  58.5   58.5   5%     7e-07   37.86   1707  Q9IVZ9.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           512345  58.2   58.2   5%     7e-07   34.82   1708  Q6J8G2.1
RecName: Full=Uncharacterized protein STK_23830 [Sulfurisphaer...  Sulfurisphae...  NA           273063  54.3   54.3   5%     1e-06   31.97   182   Q96XY5.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           31767   57.4   57.4   5%     2e-06   38.78   1693  P29324.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           33774   57.0   57.0   5%     2e-06   38.78   1693  P33424.2
RecName: Full=Macro domain-containing protein DR_2288...           Deinococcus ...  NA           243230  53.1   53.1   5%     2e-06   33.61   170   Q9RS39.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           652674  56.6   56.6   5%     2e-06   38.78   1693  Q81862.1
RecName: Full=Replicase large subunit; AltName: Full=183 kDa...    Turnip vein-...  NA           29272   55.5   55.5   14%    5e-06   27.86   1601  Q88920.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           512346  55.1   55.1   5%     7e-06   37.76   1693  Q9WC28.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           31769   53.9   53.9   5%     2e-05   37.76   1693  Q04610.1
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Hepatitis E ...  NA           31768   52.4   52.4   5%     4e-05   35.58   1691  Q03495.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...         Cronobacter ...  NA           290339  48.9   48.9   5%     6e-05   33.33   180   A7MG20.1
RecName: Full=Uncharacterized protein SSO2899 [Saccharolobus...    Saccharolobu...  NA           273057  48.1   48.1   5%     1e-04   32.48   177   Q97UU4.1
RecName: Full=Replicase large subunit; AltName: Full=182 kDa...    Youcai mosai...  NA           228578  50.4   50.4   14%    2e-04   26.09   1597  Q66220.2
RecName: Full=Replicase polyprotein 1ab; Contains: RecName:...     Beet yellows...  NA           478555  50.4   50.4   12%    2e-04   24.29   3094  Q08534.2
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...         Klebsiella p...  NA           507522  47.4   47.4   6%     2e-04   29.63   175   B5XXK9.1
RecName: Full=Macro domain-containing protein in gbd 3'region;...  Cupriavidus ...  NA           106590  45.8   45.8   5%     6e-04   31.71   173   Q44020.1
RecName: Full=Macro domain-containing protein RSc0334 [Ralston...  Ralstonia so...  NA           267608  44.7   44.7   7%     0.002   26.85   171   Q8Y2K1.1
RecName: Full=Protein mono-ADP-ribosyltransferase PARP9;...        Mus musculus     house mouse  10090   44.7   44.7   5%     0.010   28.46   866   Q8CAS9.2
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           11045   43.9   43.9   5%     0.018   33.64   2116  P13889.5
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376267  43.9   43.9   5%     0.018   33.64   2116  Q8BCR0.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376265  43.9   43.9   5%     0.019   33.64   2116  Q99IE7.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376264  43.9   43.9   5%     0.019   33.64   2116  Q99IE5.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376263  43.5   43.5   5%     0.024   31.78   2116  Q6X2U2.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376266  43.1   43.1   5%     0.027   33.64   2116  Q9J6K9.2
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           11043   43.1   43.1   5%     0.031   33.64   2116  Q86500.2
RecName: Full=Non-structural polyprotein pORF1; Includes:...       Avian hepati...  NA           516993  42.4   42.4   5%     0.049   28.18   1531  Q6QLN1.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           376262  42.4   42.4   5%     0.051   31.78   2116  Q6X2U4.1
RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName:...         Salmonella e...  NA           454166  40.0   40.0   5%     0.065   31.82   179   B5F961.1
RecName: Full=Non-structural polyprotein p200; Short=p200;...      Rubella viru...  NA           11044   42.0   42.0   5%     0.065   33.64   2116  O40955.1
RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName:...  Human corona...  NA           277944  39.7   39.7   6%     0.36    28.06   6729  P0C6X5.1
RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName:...    Human corona...  NA           277944  39.3   39.3   6%     0.40    28.06   4060  P0C6U6.1
RecName: Full=Movement protein TGB1; AltName: Full=58 kDa...       Barley strip...  NA           12327   38.5   38.5   13%    0.62    23.38   528   P04867.1
RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName:...  Avian infect...  NA           160235  37.4   37.4   7%     1.8     26.24   6629  P0C6Y2.1
RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName:...  Avian infect...  NA           11122   37.4   37.4   7%     1.8     26.24   6629  P0C6Y1.1
RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName:...    Avian infect...  NA           11122   37.4   37.4   7%     1.9     26.24   3951  P0C6V3.1
RecName: Full=Protein mono-ADP-ribosyltransferase PARP9;...        Homo sapiens     human        9606    36.2   36.2   5%     3.3     26.05   854   Q8IXQ6.2
RecName: Full=RNA-directed RNA polymerase; AltName: Full=216.5...  Apple chloro...  NA           73472   36.2   36.2   6%     4.2     25.60   1885  P54891.1
RecName: Full=RNA-directed RNA polymerase; AltName: Full=216.5...  Apple chloro...  NA           73473   35.4   35.4   5%     6.2     26.50   1884  P27738.1

Alignments:

>RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain Senegal 37997]
Sequence ID: Q5XXP4.1 Length: 2474
Range 1: 1 to 1856

Score:3893 bits(10096), Expect:0.0, Method:Compositional matrix adjust., Identities:1856/1856(100%), Positives:1856/1856(100%), Gaps:0/1856(0%)

Query  1    MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST  60
            MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST
Sbjct  1    MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST  60

Query  61   ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ  120
            ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ
Sbjct  61   ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ  120

Query  121
AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT Sbjct 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL Sbjct 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT Sbjct 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV Sbjct 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL Sbjct 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET Sbjct 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY Sbjct 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA Sbjct 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 
ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF Sbjct 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP Sbjct 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY Sbjct 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK Sbjct 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT Sbjct 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP Sbjct 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA Sbjct 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN Sbjct 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT Sbjct 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR Sbjct 1201 
YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL Sbjct 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN Sbjct 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP Sbjct 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE Sbjct 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN Sbjct 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC Sbjct 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDL 1680 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDL Sbjct 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDL 1680 Query 1681 SVDGEELPAPSDLEADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRG 1740 SVDGEELPAPSDLEADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRG Sbjct 1681 SVDGEELPAPSDLEADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRG 1740 Query 1741 KNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPIS 1800 KNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPIS Sbjct 1741 KNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPIS 
1800 Query 1801 FGAPNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 FGAPNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL Sbjct 1801 FGAPNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Chikungunya virus strain S27-African prototype] Sequence ID: Q8JUX6.1 Length: 2474 Range 1: 1 to 1856 Score:3700 bits(9595), Expect:0.0, Method:Compositional matrix adjust., Identities:1771/1856(95%), Positives:1815/1856(97%), Gaps:0/1856(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST Sbjct 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNIS KIGDLQ Sbjct 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISGKIGDLQ 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVMAVPD ETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVR+AYW+GFDT Sbjct 121 AVMAVPDTETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRLAYWVGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKK++PCDRVL Sbjct 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKLEPCDRVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRIT+SPGLYGKT Sbjct 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITMSPGLYGKT 300 Query 
301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV Sbjct 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNY++PVVAQAFSKWAKECRKDMEDEKLLG+RERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYMIPVVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFKKQKTHTVYKRPDTQSIQKV AEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL Sbjct 421 WAFKKQKTHTVYKRPDTQSIQKVQAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 PYSGDA+EARDAEKEAEEEREAELT EALPPLQAAQ+DVQVEIDVEQLEDRAGAGIIET Sbjct 481 TPYSGDAQEARDAEKEAEEEREAELTLEALPPLQAAQEDVQVEIDVEQLEDRAGAGIIET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY Sbjct 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 DGR+LVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIA+HGPALNTDEESYELVRA Sbjct 601 DGRVLVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIAMHGPALNTDEESYELVRA 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGL+IRPACPYK AVIGVF Sbjct 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLKIRPACPYKIAVIGVF 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEI+TDVMRQR LEISARTVDSLLLNGCNRP Sbjct 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEITTDVMRQRGLEISARTVDSLLLNGCNRP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY Sbjct 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 
HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK Sbjct 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT Sbjct 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGIC+HQ+TFDTFQNKANVCWAKSLVP Sbjct 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICSHQMTFDTFQNKANVCWAKSLVP 1020 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 ILETAGIKLNDRQWSQIIQAFKED+AYSPEVALNEICTRMYGVDLDSGLFSKPLVSV+YA Sbjct 1021 ILETAGIKLNDRQWSQIIQAFKEDKAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVYYA 1080 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWN NKQICVTTRRIEDFNP TNIIPAN Sbjct 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNINKQICVTTRRIEDFNPTTNIIPAN 1140 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSG +L LPTKRVTWVAPLG+RGADYT Sbjct 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGCSLALPTKRVTWVAPLGVRGADYT 1200 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 YNLELGLPATLGRYDLV+INIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR Sbjct 1201 YNLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADRTSERV+CVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL Sbjct 1261 AYGYADRTSERVICVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN Sbjct 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP Sbjct 1381 
SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGVYSGGKDRLTQSLNHLFTA+DSTDADVVIYCRDKEWEKKI+EAIQMRTQVELLDE Sbjct 1441 LLSTGVYSGGKDRLTQSLNHLFTAMDSTDADVVIYCRDKEWEKKISEAIQMRTQVELLDE 1500 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 HIS+DCD++RVHPDSSLAGRKGYSTTEG+LYSYLEGTRFHQTAVDMAE+YTMWPKQTEAN Sbjct 1501 HISIDCDVVRVHPDSSLAGRKGYSTTEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEAN 1560 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC Sbjct 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDL 1680 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY+ QE+ QE S+TTSLTHSQFDL Sbjct 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYRPSQESVQEASTTTSLTHSQFDL 1680 Query 1681 SVDGEELPAPSDLEADAPIPEPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRG 1740 SVDG+ LP PSDL+ADAP EP DD A+ TLP N +AVSDWVM+T PVAPPRRRRG Sbjct 1681 SVDGKILPVPSDLDADAPALEPALDDGAIHTLPSATGNLAAVSDWVMSTVPVAPPRRRRG 1740 Query 1741 KNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPIS 1800 +NL VTCDEREGN+ PMASVRFFRA+L +VQETAE RDTA SLQAP S ATE + PIS Sbjct 1741 RNLTVTCDEREGNITPMASVRFFRAELCPVVQETAETRDTAMSLQAPPSTATELSHPPIS 1800 Query 1801 FGAPNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 FGAP+ETFPITFGDF+EGEIESLSSELLTFGDF PGEVDDLTDSDWSTCSDTDDEL Sbjct 1801 FGAPSETFPITFGDFNEGEIESLSSELLTFGDFLPGEVDDLTDSDWSTCSDTDDEL 1856 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; 
Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain SG650] Sequence ID: O90368.1 Length: 2513 Range 1: 1 to 1895 Score:3393 bits(8797), Expect:0.0, Method:Compositional matrix adjust., Identities:1612/1912(84%), Positives:1722/1912(90%), Gaps:73/1912(3%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 MD VYVDIDADSAFLKALQRAYPMFEVEP+QVTPNDHANARAFSHLAIKLIEQEIDPDST Sbjct 1 MDSVYVDIDADSAFLKALQRAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIG APARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKV D+NIS KI DLQ Sbjct 61 ILDIGPAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQ 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVMAVP+ ET TFCLHTD +C+QR DVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT Sbjct 121 AVMAVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDL+EGRRGKLSIMRGKK+KPCDRVL Sbjct 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKLKPCDRVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLYPESRKLL+SWHLPSVFHLKGKLSFTCRCDT+VSCEGYVVKR+T+SPG+YGKT Sbjct 241 FSVGSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKT 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV Sbjct 301 SGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWAKECRKDMEDEKLLG+RERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF+K 
KTHTVYKRPDTQSIQKVPAEFDSFV+PSLWSSGLSIPLRTRIKWLLSK PK + Sbjct 421 WAFRKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKHEQ 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 +P+SG+A+EA AE +A EEREAELTREA+PPLQA QDDVQVEIDVEQLEDRAGAGI+ET Sbjct 481 LPHSGNAEEAAQAEMDAAEEREAELTREAMPPLQATQDDVQVEIDVEQLEDRAGAGIVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRGAIKVTAQP+D VVGEYLVL+PQ VLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY Sbjct 541 PRGAIKVTAQPSDRVVGEYLVLTPQAVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 DGR+LVPSGYAI EDFQSLSESATMV+NEREFVNRKLHHIA+HGPALNTDEESYELVR Sbjct 601 DGRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRV 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 E+TEHEYVYDVDQ++CCK+EEA GLVLVGDLT+PPYHEFAYEGL+IRPACPYKTAVIGVF Sbjct 661 EKTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVF 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEIS DVMRQR LEISARTVDSLLLNGCN+P Sbjct 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+VLYVDEAFACHSGTLLALIA+VRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY Sbjct 781 VEVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCTLPVTAIVSSLHYE KMRTTNEYN+PIVVDTTG TKP+PGDLVLTCFRGWVK Sbjct 841 HKSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGITKPEPGDLVLTCFRGWVK 900 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKL+WKT Sbjct 901 QLQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLIWKT 960 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LSGDPWIK LQNPPKGNFKATIKEWE EHASIMAGICNHQ+ FDTFQNKANVCWAK LVP Sbjct 961 
LSGDPWIKILQNPPKGNFKATIKEWEAEHASIMAGICNHQMAFDTFQNKANVCWAKCLVP 1020 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 IL+TAGIKL+DRQWSQI+QAFKEDRAYSPEVALNEICTR+YGVDLDSGLFSKPL+SV+YA Sbjct 1021 ILDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYA 1080 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 DNHWDNRPGGKMFGFNPE A +LE+KYPFTKGKWN NKQIC+TTR++++FNP TNIIPAN Sbjct 1081 DNHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPAN 1140 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVAEH V+GERMEWLVNKINGHH+LLVSGYNL+LPTKRVTWVAPLG RGADYT Sbjct 1141 RRLPHSLVAEHHTVRGERMEWLVNKINGHHMLLVSGYNLILPTKRVTWVAPLGTRGADYT 1200 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 YNLELGLPATLGRYDLV+INIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR Sbjct 1201 YNLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADRTSERV+ VLGRKFRSSRALKP C+TSNTEMFFLFS FDNGRRNFTTHVMNNQL Sbjct 1261 AYGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQL 1320 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 NA + G ATRAGCAPSYRVKRMDIAKN EECVVNAANPRG+PGDGVCKAVY+KWPESF+N Sbjct 1321 NAVYAGLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRN 1380 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 SATPVGTAKT+MCG YPVIHAVGPNFSNYSE+EGDRELA+ YREVAKEV+RLGV+SVAIP Sbjct 1381 SATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAIP 1440 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGVYSGGKDRL QSLNHLFTA+DSTDADVVIYCRDKEWEKKI EAI +R+QVELLD+ Sbjct 1441 LLSTGVYSGGKDRLLQSLNHLFTAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDD 1500 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 HISVDCDI+RVHPDSSLAGRKGYST EG+LYSYLEGTRFHQTAVDMAE+YTMWPKQTEAN Sbjct 1501 HISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEAN 1560 
Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQVCLYALGESIES+RQKCPVDDADAS PPKTVPCLCRYAMTPERV RLRMNH TSIIVC Sbjct 1561 EQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVC 1620 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVS-STTSLTHSQFD 1679 SSFPLPKYKIEGVQKVKCSK +LFDHNVPSRVSPR Y+ E Q T + +QF Sbjct 1621 SSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTYRPADEIIQTPQIPTEACQDAQFV 1680 Query 1680 LSVDGEELPAPSDLEA-DAPIPEPTPD--------------------------------- 1705 S+ E +P PSDLEA DA + P+ D Sbjct 1681 QSITDEAVPVPSDLEACDATMDWPSIDIVPTRQRSDSFDSEYSSRSNIQLVTADVHAPMY 1740 Query 1706 -------------------DRAVLTLPPT--IDNFSAVSDWVMNTAPVAPPRRRRGKNLN 1744 ++ LP + D+ S VS P+APPRRR G+ +N Sbjct 1741 ANSLASSGGSVLSLSSEQAQNGIMILPDSEDTDSISRVS------TPIAPPRRRLGRTIN 1794 Query 1745 VTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPISFGAP 1804 VTCDEREG +LPMAS R F A +++ + TA +QAPL +T+P L Sbjct 1795 VTCDEREGKILPMASDRLFTAKPYTVALGVSTADITAYPIQAPLG-STQPPALE------ 1847 Query 1805 NETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 ITFGDF EGEI++L + LTFGDF PGEV++LTDS+WSTCSDTD+EL Sbjct 1848 ----QITFGDFAEGEIDNLLTGALTFGDFEPGEVEELTDSEWSTCSDTDEEL 1895 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Igbo Ora virus] Sequence ID: O90370.1 Length: 2513 Range 1: 1 to 1895 Score:3386 bits(8780), Expect:0.0, Method:Compositional matrix adjust., Identities:1609/1912(84%), Positives:1721/1912(90%), Gaps:73/1912(3%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 MD 
VYVDIDADSAFLKALQRAYPMFEVEP+QVTPNDHANARAFSHLAIKLIEQEIDP ST Sbjct 1 MDSVYVDIDADSAFLKALQRAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPGST 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 IL IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKV D+NIS KI DLQ Sbjct 61 ILGIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQ 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVMAVP+ ET TFCLHTD +C+QR DVAIYQDVYAVHAPTSLYHQAIKGV VAYWIGFDT Sbjct 121 AVMAVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVHVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDL+EGRRGKLSIMRGKK KPCDRVL Sbjct 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKFKPCDRVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLYPESRKLL+SWHLPSVFHLKGKLSFTCRCDT+VSCEGYVVKR+T+SPG+YGKT Sbjct 241 FSVGSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKT 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV Sbjct 301 SGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWAKECRKDMEDEKLLG+RERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF+K KTHTVYKRPDTQSIQKVPAEFDSFV+PSLWSSGLSIPLRTRIKWLLSK PK + Sbjct 421 WAFRKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKHEQ 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 +P+SG+A+EA AE +A EEREAELTREA+PPLQA QDDVQVEIDVEQLEDRAGAGI+ET Sbjct 481 LPHSGNAEEAAQAETDAVEEREAELTREAMPPLQATQDDVQVEIDVEQLEDRAGAGIVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRGAIKVTAQP+D VVGEYLVL+PQ VLRSQKL LIHALAEQVKTCTHSGRAGRYAVEAY Sbjct 541 
PRGAIKVTAQPSDLVVGEYLVLTPQAVLRSQKLGLIHALAEQVKTCTHSGRAGRYAVEAY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 DGR+LVPSGYAI EDFQSLSESATMV+NEREFVNRKLHHIA+HGPALNTDEESYELVR Sbjct 601 DGRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRV 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 E+TEHEYVYDVDQ++CCK+EEA GLVLVGDLT+PPYHEFAYEGL+IRPACPYKTAVIGVF Sbjct 661 EKTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVF 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEIS DVMRQR LEISARTVDSLLLNGCN+P Sbjct 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+VLYVDEAFACHSGTLLALIA+VRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY Sbjct 781 VEVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCTLPVTAIVSSLHYE KMRTTNEYN+PIVVDTTG+TKP+PGDLVLTCFRGWVK Sbjct 841 HKSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGTTKPEPGDLVLTCFRGWVK 900 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKL+WKT Sbjct 901 QLQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLIWKT 960 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LSGDPWIK LQNPPKGNFKATIKEWE EHASIMAGICN+Q+ FDTFQNKANVCWAK LVP Sbjct 961 LSGDPWIKILQNPPKGNFKATIKEWEAEHASIMAGICNYQMAFDTFQNKANVCWAKCLVP 1020 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 IL+TAGIKL+DRQWSQI+QAFKEDRAYSPEVALNEICTR+YGVDLDSGLFSKPL+SV+YA Sbjct 1021 ILDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYA 1080 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 DNHWDNRPGGKMFGFNPE A +LE+KYPFTKGKWN NKQIC+TTR++++FNP TNIIPAN Sbjct 1081 DNHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPAN 1140 Query 1141 
RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVAEH V+GERMEWLVNKINGHH+LLVSGYNL+LPTKRVTWVAPLG RGADYT Sbjct 1141 RRLPHSLVAEHHSVRGERMEWLVNKINGHHMLLVSGYNLILPTKRVTWVAPLGTRGADYT 1200 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 YNLELGLPATLGRYDLV+INIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR Sbjct 1201 YNLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADRTSERV+ VLGRKFRSSRALKP C+TSNTEMFFLFS FDNGRRNFTTHVMNNQL Sbjct 1261 AYGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQL 1320 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 NA + G ATRAGCAPSYRVKRMDIAKN EECVVNAANPRG+PGDGVCKAVY+KWPESF+N Sbjct 1321 NAVYAGLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRN 1380 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 SATPVGTAKT+MCG YPVIHAVGPNFSNYSE+EGDRELA+AYREVAKEV+RLGV+SVAIP Sbjct 1381 SATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASAYREVAKEVSRLGVSSVAIP 1440 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGVYSGGKDRL QSLNHLF A+DSTDADVVIYCRDKEWEKKI EAI +R+QVELLD+ Sbjct 1441 LLSTGVYSGGKDRLLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDD 1500 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 HISVDCDI+RVHPDSSLAGRKGYST EG+LYSYLEGTRFHQTAVDMAE+YTMWPKQTEAN Sbjct 1501 HISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEAN 1560 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQVCLYALGESIES+RQKCPVDDADAS PPKTVPCLCRYAMTPERV RLRMNH TSIIVC Sbjct 1561 EQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVC 1620 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVS-STTSLTHSQFD 1679 SSFPLPKYKIEGVQKVKCSK +LFDHNVPSRVSPR Y+ E Q ST + +Q Sbjct 1621 SSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTYRPADEIIQTPQISTEACQDAQLV 1680 Query 1680 LSVDGEELPAPSDLEA-DAPIPEPT----------------------------------- 1703 S++ E +P 
PSDLEA DA + P+ Sbjct 1681 QSINDEAVPVPSDLEACDATMDWPSIGTVPTRQRHDSFDSEYSSRSNIQLVTADVHAPMY 1740 Query 1704 -----------------PDDRAVLTLPPT--IDNFSAVSDWVMNTAPVAPPRRRRGKNLN 1744 P ++ LP + D+ S VS P+APPRRR G+ +N Sbjct 1741 ANSLASSGGSMLSLSSEPAQNGIMILPDSEDTDSISRVS------TPIAPPRRRLGRTIN 1794 Query 1745 VTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPISFGAP 1804 VTCDEREG +LPMAS RFF A +++ + TA +QAPL + T+P L Sbjct 1795 VTCDEREGKILPMASDRFFTAKPYTVALSVSTADITAYPIQAPLGL-TQPPTLE------ 1847 Query 1805 NETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 ITFGDF EGEI++L + LTFGDF PGEV++LTDS+WSTCSDTD+EL Sbjct 1848 ----QITFGDFAEGEIDNLLTGALTFGDFEPGEVEELTDSEWSTCSDTDEEL 1895 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [O'nyong-nyong virus strain Gulu] Sequence ID: P13886.2 Length: 2514 Range 1: 1 to 1896 Score:3375 bits(8750), Expect:0.0, Method:Compositional matrix adjust., Identities:1601/1913(84%), Positives:1714/1913(89%), Gaps:74/1913(3%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 MD VYVDIDADSAFLKALQ+AYPMFEVEP+QVTPNDHANARAFSHLAIKLIEQEIDPDST Sbjct 1 MDSVYVDIDADSAFLKALQQAYPMFEVEPKQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKV D+NIS KI DLQ Sbjct 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVTDKNISGKINDLQ 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVMAVP+ ET TFCLHTD +C+QR DVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT Sbjct 121 
AVMAVPNMETSTFCLHTDATCKQRGDVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDL+EGRRGKLSIMRGKK+KPCDRVL Sbjct 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLSEGRRGKLSIMRGKKLKPCDRVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLYPESRKLL+SWHLPSVFHLKGKLSFTCRCDT+VSCEGYVVKR+T+SPG+YGKT Sbjct 241 FSVGSTLYPESRKLLQSWHLPSVFHLKGKLSFTCRCDTIVSCEGYVVKRVTMSPGIYGKT 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYAVTHHA GFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV Sbjct 301 SGYAVTHHAGGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLP+VAQAFSKWAKECRKDMEDEKLLG+RERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPIVAQAFSKWAKECRKDMEDEKLLGVRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF+K KTHTVYKRPDTQSIQKVPAEFDSFV+PSLWSSGLSIPLRTRIKWLLSK PK + Sbjct 421 WAFRKHKTHTVYKRPDTQSIQKVPAEFDSFVIPSLWSSGLSIPLRTRIKWLLSKAPKYEQ 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 +P+SG+A+EA AE +A EE+EAELTREA+PPLQA QDD+QVEIDVEQLEDRAGAGI+ET Sbjct 481 LPHSGNAEEAAQAETDAVEEQEAELTREAMPPLQATQDDIQVEIDVEQLEDRAGAGIVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRGAIKVTAQP+D VVGEYLVL+PQ VLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY Sbjct 541 PRGAIKVTAQPSDLVVGEYLVLTPQAVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 DGR+LVPSGYAI EDFQSLSESATMV+NEREFVNRKLHHIA+HGPALNTDEESYELVR Sbjct 601 DGRVLVPSGYAIPQEDFQSLSESATMVFNEREFVNRKLHHIAMHGPALNTDEESYELVRV 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 E+TEHEYVYDVDQ++CCK+EEA GLVLVGDLT+PPYHEFAYEGL+IRPACPYKTAVIGVF Sbjct 661 EKTEHEYVYDVDQKKCCKREEATGLVLVGDLTSPPYHEFAYEGLKIRPACPYKTAVIGVF 720 Query 721 
GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEIS DVMRQR LEISARTVDSLLLNGCN+P Sbjct 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISNDVMRQRKLEISARTVDSLLLNGCNKP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+VLYVDEAFACHSGTLLALIA+VRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY Sbjct 781 VEVLYVDEAFACHSGTLLALIAMVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCTLPVTAIVSSLHYE KMRTTNEYN+PIVVDTTG TKP+PGDLVLTCFRGWVK Sbjct 841 HKSISRRCTLPVTAIVSSLHYESKMRTTNEYNQPIVVDTTGITKPEPGDLVLTCFRGWVK 900 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDYRG+EVMTAAASQGLTRKGVYAVRQKVNENPLYA TSEHVNVLLTRTEGKL WKT Sbjct 901 QLQIDYRGNEVMTAAASQGLTRKGVYAVRQKVNENPLYAPTSEHVNVLLTRTEGKLTWKT 960 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LSGDPWIK LQNPPKG+FKATIKEWE EHASIMAGICNHQ+ FDTFQNKANVCWAK LVP Sbjct 961 LSGDPWIKILQNPPKGDFKATIKEWEAEHASIMAGICNHQMAFDTFQNKANVCWAKCLVP 1020 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 IL+TAGIKL+DRQWSQI+QAFKEDRAYSPEVALNEICTR+YGVDLDSGLFSKPL+SV+YA Sbjct 1021 ILDTAGIKLSDRQWSQIVQAFKEDRAYSPEVALNEICTRIYGVDLDSGLFSKPLISVYYA 1080 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 DNHWDNRPGGKMFGFNPE A +LE+KYPFTKGKWN NKQIC+TTR++++FNP TNIIPAN Sbjct 1081 DNHWDNRPGGKMFGFNPEVALMLEKKYPFTKGKWNINKQICITTRKVDEFNPETNIIPAN 1140 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVAEH V+GERMEWLVNKI+GHH+LLVSG+NL+LPTKRVTWVAPLG RGADYT Sbjct 1141 RRLPHSLVAEHHSVRGERMEWLVNKISGHHMLLVSGHNLILPTKRVTWVAPLGTRGADYT 1200 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 YNLELGLPATLGRYDLV+INIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR Sbjct 1201 YNLELGLPATLGRYDLVVINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADRTSERV+ 
VLGRKFRSSRALKP C+TSNTEMFFLFS FDNGRRNFTTHVMNNQL Sbjct 1261 AYGYADRTSERVISVLGRKFRSSRALKPQCITSNTEMFFLFSRFDNGRRNFTTHVMNNQL 1320 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 NA + G ATRAGCAPSYRVKRMDIAKN EECVVNAANPRG+PGDGVCKAVY+KWPESF+N Sbjct 1321 NAVYAGLATRAGCAPSYRVKRMDIAKNTEECVVNAANPRGVPGDGVCKAVYRKWPESFRN 1380 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 SATPVGTAKT+MCG YPVIHAVGPNFSNYSE+EGDRELA+ YREVAKEV+RLGV+SVAIP Sbjct 1381 SATPVGTAKTIMCGQYPVIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAIP 1440 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGVYSGGKDRL QSLNHLF A+DSTDADVVIYCRDKEWEKKI EAI +R+QVELLD+ Sbjct 1441 LLSTGVYSGGKDRLLQSLNHLFAAMDSTDADVVIYCRDKEWEKKITEAISLRSQVELLDD 1500 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 HISVDCDI+RVHPDSSLAGRKGYST EG+LYSYLEGTRFHQTAVDMAE+YTMWPKQTEAN Sbjct 1501 HISVDCDIVRVHPDSSLAGRKGYSTVEGALYSYLEGTRFHQTAVDMAEIYTMWPKQTEAN 1560 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQVCLYALGESIES+RQKCPVDDADAS PPKTVPCLCRYAMTPERV RLRMNH TSIIVC Sbjct 1561 EQVCLYALGESIESVRQKCPVDDADASFPPKTVPCLCRYAMTPERVARLRMNHTTSIIVC 1620 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSS-TTSLTHSQFD 1679 SSFPLPKYKIEGVQKVKCSK +LFDHNVPSRVSPR Y+ E Q + T + +Q Sbjct 1621 SSFPLPKYKIEGVQKVKCSKALLFDHNVPSRVSPRTYRPADEIIQTPQTPTEACQDAQLV 1680 Query 1680 LSVDGEELPAPSDLE--------------------------------------ADAPIP- 1700 S++ E +P PSDLE AD P Sbjct 1681 QSINDEAVPVPSDLEACDATMDWPSIGTVSTRQRHDSSDSEYSGSRSNIQLVTADVHAPM 1740 Query 1701 -----------------EPTPDDRAVLTLPPTIDNFSAVSDWVMNTAPVAPPRRRRGKNL 1743 EP + +L D+ S VS P+APPRRR G+ + Sbjct 1741 YAHSLASSGGSMLSLSSEPAQNGTMILLDSEDTDSISRVS------TPIAPPRRRLGRTI 1794 Query 1744 NVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATEPNQLPISFGA 1803 NVTCDEREG +LPMAS RFF A +++ + T +QAPL + P Sbjct 1795 NVTCDEREGKILPMASDRFFTAKPYTVALSVSTADMTVYPIQAPLGLIPPPT-------- 1846 Query 1804 
PNETFPITFGDFDEGEIESLSSELLTFGDFSPGEVDDLTDSDWSTCSDTDDEL 1856 PITFGDF EGEI++L + LTFGDF PGEV++LTDS+WSTCSDTD+EL Sbjct 1847 ---LEPITFGDFAEGEIDNLLTGALTFGDFEPGEVEELTDSEWSTCSDTDEEL 1896 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Semliki Forest virus] Sequence ID: P08411.2 Length: 2432 Range 1: 5 to 1807 Score:2627 bits(6810), Expect:0.0, Method:Compositional matrix adjust., Identities:1268/1852(68%), Positives:1492/1852(80%), Gaps:71/1852(3%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V+VDI+ADS F+K+LQ+A+P FEVE QVTPNDHANARAFSHLA KLIEQE D D+ ILD Sbjct 5 VHVDIEADSPFIKSLQKAFPSFEVESLQVTPNDHANARAFSHLATKLIEQETDKDTLILD 64 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 IGSAP+RRMMS KYHCVCPMRSAEDPERL YA+KLA+A+GKVLDR I+ KI DLQ VM Sbjct 65 IGSAPSRRMMSTHKYHCVCPMRSAEDPERLVCYAKKLAAASGKVLDREIAGKITDLQTVM 124 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 A PDAE+PTFCLHTDV+CR A+VA+YQDVYAVHAPTSLYHQA+KGVR AYWIGFDTTPF Sbjct 125 ATPDAESPTFCLHTDVTCRTAAEVAVYQDVYAVHAPTSLYHQAMKGVRTAYWIGFDTTPF 184 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M++A+AGAYP+Y+TNWADEQVL+A+NIGLC+ LTEGR GKLSI+R K++KPCD V+FSV Sbjct 185 MFDALAGAYPTYATNWADEQVLQARNIGLCAASLTEGRLGKLSILRKKQLKPCDTVMFSV 244 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDT+VSCEGYVVK+IT+ PGLYGKT GY Sbjct 245 
GSTLYTESRKLLRSWHLPSVFHLKGKQSFTCRCDTIVSCEGYVVKKITMCPGLYGKTVGY 304 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVT+HA+GFL+CKTTDTV GERVSF VCTYVP+TICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 305 AVTYHAEGFLVCKTTDTVKGERVSFPVCTYVPSTICDQMTGILATDVTPEDAQKLLVGLN 364 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIVVNGRTQRNTNTMKNYLLP+VA AFSKWA+E + D++DEK LG+RER+LTCCCLWAF Sbjct 365 QRIVVNGRTQRNTNTMKNYLLPIVAVAFSKWAREYKADLDDEKPLGVRERSLTCCCLWAF 424 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 K +K HT+YK+PDTQ+I KVP+EF+SFV+PSLWS+GL+IP+R+RIK LL+K K +LIP Sbjct 425 KTRKMHTMYKKPDTQTIVKVPSEFNSFVIPSLWSTGLAIPVRSRIKMLLAKKTKRELIPV 484 Query 484 SGDAKEARDAEKEAEEEREAELTREALPPLQ--AAQDDVQVEIDVEQLEDRAGAGIIETP 541 DA ARDAE+E +E EAELTREALPPL A + V++DVE+LE AGAG++ETP Sbjct 485 L-DASSARDAEQEEKERLEAELTREALPPLVPIAPAETGVVDVDVEELEYHAGAGVVETP 543 Query 542 RGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 R A+KVTAQP D ++G Y+VLSPQTVL+S KL+ +H LAEQVK TH+GRAGRY V+ YD Sbjct 544 RSALKVTAQPNDVLLGNYVVLSPQTVLKSSKLAPVHPLAEQVKIITHNGRAGRYQVDGYD 603 Query 602 GRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRAE 661 GR+L+P G AI +FQ+LSESATMVYNEREFVNRKL+HIA+HGP+LNTDEE+YE VRAE Sbjct 604 GRVLLPCGSAIPVPEFQALSESATMVYNEREFVNRKLYHIAVHGPSLNTDEENYEKVRAE 663 Query 662 RTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVFG 721 RT+ EYV+DVD++ C K+EEA+GLVLVG+LTNPP+HEFAYEGL+IRP+ PYKT V+GVFG Sbjct 664 RTDAEYVFDVDKKCCVKREEASGLVLVGELTNPPFHEFAYEGLKIRPSAPYKTTVVGVFG 723 Query 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRPV 781 VPGSGKSAIIK+LVT+ DLVTSGKKENCQEI DV + R L+I A+TVDS+LLNGC R V Sbjct 724 VPGSGKSAIIKSLVTKHDLVTSGKKENCQEIVNDVKKHRGLDIQAKTVDSILLNGCRRAV 783 Query 782 DVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 D+LYVDEAFACHSGTLLALIALV+PR KVVLCGDPKQCGFFNMMQ+KVN+NHNICT+V H Sbjct 784 DILYVDEAFACHSGTLLALIALVKPRSKVVLCGDPKQCGFFNMMQLKVNFNHNICTEVCH 843 Query 842 
KSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQ 901 KSISRRCT PVTAIVS+LHY GKMRTTN NKPI++DTTG TKP PGD+VLTCFRGWVKQ Sbjct 844 KSISRRCTRPVTAIVSTLHYGGKMRTTNPCNKPIIIDTTGQTKPKPGDIVLTCFRGWVKQ 903 Query 902 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTL 961 LQ+DYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYA SEHVNVLLTRTE +LVWKTL Sbjct 904 LQLDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAPASEHVNVLLTRTEDRLVWKTL 963 Query 962 SGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVPI 1021 +GDPWIK L N P+GNF AT++EW+ EH IM I D FQNKANVCWAKSLVP+ Sbjct 964 AGDPWIKVLSNIPQGNFTATLEEWQEEHDKIMKVIEGPAAPVDAFQNKANVCWAKSLVPV 1023 Query 1022 LETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYAD 1081 L+TAGI+L +WS II AFKEDRAYSP VALNEICT+ YGVDLDSGLFS P VS++Y + Sbjct 1024 LDTAGIRLTAEEWSTIITAFKEDRAYSPVVALNEICTKYYGVDLDSGLFSAPKVSLYYEN 1083 Query 1082 NHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPANR 1141 NHWDNRPGG+M+GFN A+ LE ++ F KG+W+T KQ + R+I+ + N+IP NR Sbjct 1084 NHWDNRPGGRMYGFNAATAARLEARHTFLKGQWHTGKQAVIAERKIQPLSVLDNVIPINR 1143 Query 1142 RLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYTY 1201 RLPH+LVAE++ VKG R+EWLVNK+ G+HVLLVS YNL LP +RVTW++PL + GAD Y Sbjct 1144 RLPHALVAEYKTVKGSRVEWLVNKVRGYHVLLVSEYNLALPRRRVTWLSPLNVTGADRCY 1203 Query 1202 NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 +L LGLPA GR+DLV +NIHT FRIHHYQQCVDHAMKLQMLGGD+LRLLKPGGSLL+RA Sbjct 1204 DLSLGLPADAGRFDLVFVNIHTEFRIHHYQQCVDHAMKLQMLGGDALRLLKPGGSLLMRA 1263 Query 1262 YGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLN 1321 YGYAD+ SE VV L RKF S+R L+P CVTSNTE+F LFSNFDNG+R T H MN +L+ Sbjct 1264 YGYADKISEAVVSSLSRKFSSARVLRPDCVTSNTEVFLLFSNFDNGKRPSTLHQMNTKLS 1323 Query 1322 AAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 A + G+A AGCAPSYRVKR DIA E VVNAAN RG GDGVC+AV KKWP +FK Sbjct 1324 AVYAGEAMHTAGCAPSYRVKRADIATCTEAAVVNAANARGTVGDGVCRAVAKKWPSAFKG 1383 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 +ATPVGT KTVMCG+YPVIHAV PNFS 
+E+EGDRELAA YR VA EV RL ++SVAIP Sbjct 1384 AATPVGTIKTVMCGSYPVIHAVAPNFSATTEAEGDRELAAVYRAVAAEVNRLSLSSVAIP 1443 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGV+SGG+DRL QSLNHLFTA+D+TDADV IYCRDK WEKKI EAI MRT VELL++ Sbjct 1444 LLSTGVFSGGRDRLQQSLNHLFTAMDATDADVTIYCRDKSWEKKIQEAIDMRTAVELLND 1503 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 + + D++RVHPDSSL GRKGYSTT+GSLYSY EGT+F+Q A+DMAE+ T+WP+ EAN Sbjct 1504 DVELTTDLVRVHPDSSLVGRKGYSTTDGSLYSYFEGTKFNQAAIDMAEILTLWPRLQEAN 1563 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQ+CLYALGE++++IR KCPV+D+D+S+PP+TVPCLCRYAMT ER+ RLR + V S++VC Sbjct 1564 EQICLYALGETMDNIRSKCPVNDSDSSTPPRTVPCLCRYAMTAERIARLRSHQVKSMVVC 1623 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHSQFDL 1680 SSFPLPKY ++GVQKVKC K +LFD VPS VSPR+Y A + + + FDL Sbjct 1624 SSFPLPKYHVDGVQKVKCEKGLLFDPTVPSVVSPRKY------AASTTDHSDRSLRGFDL 1677 Query 1681 ---------SVDGEELPAPSDLEADAPIPEPT----------PDDRAVLTLPPTIDNFSA 1721 + D LP+ + D+ I EP P+ + L D Sbjct 1678 DWTTDSSSTASDTMSLPSLQSCDIDS-IYEPMAPIVVTADVHPEPAGIADL--AADVHPE 1734 Query 1722 VSDWVMNTAPVAPPRRRRGKNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTA 1781 +D V P+ PPR +R L ER Sbjct 1735 PADHVDLENPIPPPRPKRAAYLASRAAER------------------------------- 1763 Query 1782 ASLQAPLSVATEPNQLPISFGAPNETFPITFGDFDEGEIESLSSELLTFGDF 1833 P+ +P P + A P+TFGDFDE E+++L+S +TFGDF Sbjct 1764 -----PVPAPRKPTPAPRT--AFRNKLPLTFGDFDEHEVDALASG-ITFGDF 1807 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: 
RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Getah virus] Sequence ID: Q5Y389.3 Length: 2467 Range 1: 3 to 1667 Score:2537 bits(6575), Expect:0.0, Method:Compositional matrix adjust., Identities:1208/1668(72%), Positives:1396/1668(83%), Gaps:7/1668(0%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G VLD+N+S KI DLQ VM Sbjct 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 A PD E+PTFCLHTD +CR RA+VA+YQDVYAVHAPTSLYHQAIKGVR AYWIGFDTTPF Sbjct 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T L+EGRRGKLSIMR K ++P DRV+FSV Sbjct 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDTVVSCEGYVVK+ITISPG+YGKT Y Sbjct 243 GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVTHHA+GFLMCK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 303 AVTHHAEGFLMCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 K K HT+YKRP+TQ+I KVP+ FDSFV+PSLWSS LS+ +R RIK LLS L PY Sbjct 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGL-PY 481 Query 484 
SGDAKEARDAEKEAEEEREAELTREALPPLQAAQ--DDVQVEIDVEQLEDRAGAGIIETP 541 SGD EAR AE+E +E +EAELTR ALPPL + DD+ ++DVE+L RAGAG++ETP Sbjct 482 SGDRTEARAAEEEEKEVQEAELTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 Query 542 RGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 R A+KVT Q DH++G YL+LSPQTVL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 Query 602 GRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRAE 661 GR+L+P+G AI +FQ+LSESATMVYNEREF+NRKLHHIAL+GPALNTDEESYE VRAE Sbjct 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 Query 662 RTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVFG 721 R E EYV+DVD++ C KKEEA+GLVL GDL NPP+HEFAYEGL+IRPA PY T +IGVFG Sbjct 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 Query 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRPV 781 VPGSGKSAIIKN+VT +DLV SGKKENCQEI DV RQR L+++ARTVDS+LLNGC + V Sbjct 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKKGV 780 Query 782 DVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 + LYVDEAFACHSGTLLALIALVRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 Query 842 KSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQ 901 KSISRRCTLPVTAIVS+LHY+GKMRTTN N PI +DTTGS+KP GD+VLTCFRGWVKQ Sbjct 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 Query 902 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTL 961 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE +LVWKTL Sbjct 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 Query 962 SGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVPI 1021 SGDPWIK L N P+G+F AT++EW EH IM + D FQNKA VCWAK LV + Sbjct 961 SGDPWIKVLTNVPRGDFSATLEEWHEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 Query 1022 LETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYAD 1081 LETAGI++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS VS+ Y + 
Sbjct 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 Query 1082 NHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPANR 1141 NHWDNRPGG+M+GFN E A ++PF +G N+ Q+ V R+++ F+ NI+P+NR Sbjct 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 Query 1142 RLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYTY 1201 RLPH+LV ++ +GER+EWL+ KI GH +LLVS YNLV+P KRV W+AP + GAD TY Sbjct 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLVIPHKRVFWIAPPRVSGADRTY 1199 Query 1202 NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 Query 1262 YGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLN 1321 YGYADR SE VV L RKF + R L+P CVTSNTE+F LFSNFDNGRR T H N +L+ Sbjct 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 Query 1322 AAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 + + AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV KKWP SFK Sbjct 1320 SMYACNGLHTAGCAPSYRVRRADISGHSEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 +ATPVGTAK + VIHAVGPNFS +E+EGDRELAAAYR VA ++ + SVA+P Sbjct 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTG +SGGKDR+ QSLNHLFTALD+TDADVVIYCRDK WEKKI EAI RT +EL+ E Sbjct 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 ++++ D++RVHPDS L GR GYS T+G LYSYLEGTRFHQTAVDMAE+ T+WP+ +AN Sbjct 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQ+CLYALGE+++SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVC Sbjct 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 Query 1621 
SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY-KSPQETAQEV 1667 SSFPLPKY+IEGVQKVKC +V++FD VPS VSPR+Y + P E V Sbjct 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNV 1667 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sagiyama virus] Sequence ID: Q9JGL0.3 Length: 2467 Range 1: 3 to 1667 Score:2534 bits(6567), Expect:0.0, Method:Compositional matrix adjust., Identities:1207/1668(72%), Positives:1396/1668(83%), Gaps:7/1668(0%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ TILD Sbjct 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPTGVTILD 62 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA A+G VLD+N+S KI DLQ VM Sbjct 63 VGSAPARRLMSDHTYHCICPMKSAEDPERLANYARKLAKASGTVLDKNVSGKITDLQDVM 122 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 A PD E+PTFCLHTD +CR RA+VA+YQDVYAVHAPTSLYHQAIKGVR AYWIGFDTTPF Sbjct 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDVYAVHAPTSLYHQAIKGVRTAYWIGFDTTPF 182 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M+ A+AGAYP+YSTNWADEQVL+A+NIGLC+T L+EGRRGKLSIMR K ++P DRV+FSV Sbjct 183 MFEALAGAYPAYSTNWADEQVLQARNIGLCATGLSEGRRGKLSIMRKKCLRPSDRVMFSV 242 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLY ESRKLL+SWHLPSVFHLKGK SFTCRCDTVVSCEGYVVK+ITISPG+YGKT Y Sbjct 243 
GSTLYTESRKLLRSWHLPSVFHLKGKNSFTCRCDTVVSCEGYVVKKITISPGIYGKTVDY 302 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVTHHA+GFL+CK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 303 AVTHHAEGFLVCKITDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 362 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWA+E R DMEDEK LG RERTLTCCCLWAF Sbjct 363 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREARADMEDEKPLGTRERTLTCCCLWAF 422 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 K K HT+YKRP+TQ+I KVP+ FDSFV+PSLWSS LS+ +R RIK LLS L PY Sbjct 423 KSHKIHTMYKRPETQTIVKVPSTFDSFVIPSLWSSSLSMGIRQRIKLLLSARMAQGL-PY 481 Query 484 SGDAKEARDAEKEAEEEREAELTREALPPLQAAQ--DDVQVEIDVEQLEDRAGAGIIETP 541 SGD EAR AE+E +E +EAELTR ALPPL + DD+ ++DVE+L RAGAG++ETP Sbjct 482 SGDRTEARAAEEEEKEAQEAELTRAALPPLVSGSCADDI-AQVDVEELTFRAGAGVVETP 540 Query 542 RGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 R A+KVT Q DH++G YL+LSPQTVL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 541 RNALKVTPQAHDHLIGSYLILSPQTVLKSEKLAPIHPLAEQVTVMTHSGRSGRYPVDKYD 600 Query 602 GRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRAE 661 GR+L+P+G AI +FQ+LSESATMVYNEREF+NRKLHHIAL+GPALNTDEESYE VRAE Sbjct 601 GRVLIPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEESYEKVRAE 660 Query 662 RTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVFG 721 R E EYV+DVD++ C KKEEA+GLVL GDL NPP+HEFAYEGL+IRPA PY T +IGVFG Sbjct 661 RAETEYVFDVDKKACIKKEEASGLVLTGDLINPPFHEFAYEGLKIRPAAPYHTTIIGVFG 720 Query 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRPV 781 VPGSGKSAIIKN+VT +DLV SGKKENCQEI DV RQR L+++ARTVDS+LLNGC R V Sbjct 721 VPGSGKSAIIKNMVTTRDLVASGKKENCQEIMNDVKRQRGLDVTARTVDSILLNGCKRGV 780 Query 782 DVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 + LYVDEAFACHSGTLLALIALVRP KVVLCGDPKQCGFFN+MQ+KV+YNHNICT+V H Sbjct 781 ENLYVDEAFACHSGTLLALIALVRPSGKVVLCGDPKQCGFFNLMQLKVHYNHNICTRVLH 840 Query 842 
KSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQ 901 KSISRRCTLPVTAIVS+LHY+GKMRTTN N PI +DTTGS+KP GD+VLTCFRGWVKQ Sbjct 841 KSISRRCTLPVTAIVSTLHYQGKMRTTNRCNTPIQIDTTGSSKPASGDIVLTCFRGWVKQ 900 Query 902 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTL 961 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY+ SEHVNVLLTRTE +LVWKTL Sbjct 901 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYSPLSEHVNVLLTRTENRLVWKTL 960 Query 962 SGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVPI 1021 SGDPWIK L N P+G+F AT++EW+ EH IM + D FQNKA VCWAK LV + Sbjct 961 SGDPWIKVLTNVPRGDFSATLEEWQEEHDGIMRVLNERPAEVDPFQNKAKVCWAKCLVQV 1020 Query 1022 LETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYAD 1081 LETAGI++ +W+ I+ AF+EDRAYSPEVALNEICTR YGVDLDSGLFS VS+ Y + Sbjct 1021 LETAGIRMTADEWNTIL-AFREDRAYSPEVALNEICTRYYGVDLDSGLFSAQSVSLFYEN 1079 Query 1082 NHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPANR 1141 NHWDNRPGG+M+GFN E A ++PF +G N+ Q+ V R+++ F+ NI+P+NR Sbjct 1080 NHWDNRPGGRMYGFNHEVARKYAARFPFLRGNMNSGLQLNVPERKLQPFSAECNIVPSNR 1139 Query 1142 RLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYTY 1201 RLPH+LV ++ +GER+EWL+ KI GH +LLVS YNL +P KRV W+AP + GAD TY Sbjct 1140 RLPHALVTSYQQCRGERVEWLLKKIPGHQMLLVSEYNLAIPHKRVFWIAPPRVSGADRTY 1199 Query 1202 NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+M+LQMLGGDSL LL+PGGSLL+RA Sbjct 1200 DLDLGLPMDAGRYDLVFVNIHTEYRQHHYQQCVDHSMRLQMLGGDSLHLLRPGGSLLMRA 1259 Query 1262 YGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLN 1321 YGYADR SE VV L RKF + R L+P CVTSNTE+F LFSNFDNGRR T H N +L+ Sbjct 1260 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFSNFDNGRRAVTLHQANQKLS 1319 Query 1322 AAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 + + AGCAPSYRV+R DI+ + EE VVNAAN +G DGVC+AV KKWP SFK Sbjct 1320 SMYACNGLHTAGCAPSYRVRRADISGHGEEAVVNAANAKGTVSDGVCRAVAKKWPSSFKG 1379 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 +ATPVGTAK + VIHAVGPNFS +E+EGDRELAAAYR VA ++ + 
SVA+P Sbjct 1380 AATPVGTAKMIRADGMTVIHAVGPNFSTVTEAEGDRELAAAYRAVASIISTNNIKSVAVP 1439 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTG +SGGKDR+ QSLNHLFTALD+TDADVVIYCRDK WEKKI EAI RT +EL+ E Sbjct 1440 LLSTGTFSGGKDRVMQSLNHLFTALDATDADVVIYCRDKNWEKKIQEAIDRRTAIELVSE 1499 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 ++++ D++RVHPDS L GR GYS T+G LYSYLEGTRFHQTAVDMAE+ T+WP+ +AN Sbjct 1500 DVTLETDLVRVHPDSCLVGRNGYSATDGKLYSYLEGTRFHQTAVDMAEISTLWPRLQDAN 1559 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQ+CLYALGE+++SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVC Sbjct 1560 EQICLYALGETMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKNIIVC 1619 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY-KSPQETAQEV 1667 SSFPLPKY+IEGVQKVKC +V++FD VPS VSPR+Y + P E V Sbjct 1620 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIQQPPEQLDNV 1667 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN NB5092)] Sequence ID: P13887.2 Length: 2480 Range 1: 3 to 1857 Score:2522 bits(6537), Expect:0.0, Method:Compositional matrix adjust., Identities:1245/1883(66%), Positives:1465/1883(77%), Gaps:81/1883(4%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V VD++ADS FLKALQ+A+P FEVE +QVTPNDHANARAFSHLA KLIEQE+ + TILD Sbjct 3 VTVDVEADSPFLKALQKAFPAFEVESQQVTPNDHANARAFSHLATKLIEQEVPANITILD 62 Query 64 
IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 +GSAPARR+MSD YHC+CPM+SAEDPERLANYARKLA AG+VLD+N+S KI DLQ VM Sbjct 63 VGSAPARRLMSDHSYHCICPMKSAEDPERLANYARKLAKTAGEVLDKNVSGKITDLQDVM 122 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 A PD E+PTFCLHTD +CR RA+VA+YQDV HAPTSLYHQA+KGVR YWIGFDTTPF Sbjct 123 ATPDLESPTFCLHTDETCRTRAEVAVYQDV-XXHAPTSLYHQAMKGVRTVYWIGFDTTPF 181 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M+ +AGAYP+YSTNWADEQVL+A+NIGLC+T L+EG RGK+SIMR K+++P DR +FSV Sbjct 182 MFEVVAGAYPTYSTNWADEQVLQARNIGLCATSLSEGHRGKISIMRKKRLRPSDRXMFSV 241 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 G TLY ESR+LLKSWHLPSVFHLKGK SFTCRCDT+VSCEGYVVK+IT+SPG YGKT GY Sbjct 242 GXTLYIESRRLLKSWHLPSVFHLKGKNSFTCRCDTIVSCEGYVVKKITMSPGTYGKTVGY 301 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVTHHA+GFLMCK TDTV GERVSF VCTYVPATICDQMTGILAT+VTPEDAQKLLVGLN Sbjct 302 AVTHHAEGFLMCKVTDTVRGERVSFPVCTYVPATICDQMTGILATDVTPEDAQKLLVGLN 361 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWA+E + DMEDEK LG RERTLTCCCLWAF Sbjct 362 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAREAKADMEDEKPLGTRERTLTCCCLWAF 421 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 K KTHT+YKRPDTQ+I KVP+ FDSFV+PSLWSS LSI +R RIK LL DL PY Sbjct 422 KNHKTHTMYKRPDTQTIVKVPSTFDSFVIPSLWSSSLSIGIRQRIKLLLGPKLSRDL-PY 480 Query 484 SGDAKEARDAEKEAEEEREAELTREALPPLQAAQ--DDVQVEIDVEQLEDRAGAGIIETP 541 SGD EAR+AEKEAEE +EAELTREALPPL + DDV ++DVE+L RAGAG++ETP Sbjct 481 SGDRNEAREAEKEAEETKEAELTREALPPLVGSNCADDVD-QVDVEELTYRAGAGVVETP 539 Query 542 RGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAYD 601 R A+KVT Q D ++G YL+LSPQTVL+S+KL+ IH LAEQV THSGR+GRY V+ YD Sbjct 540 RNALKVTPQERDQLIGAYLILSPQTVLKSEKLTPIHPLAEQVTIMTHSGRSGRYPVDRYD 599 Query 602 GRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRAE 661 GR+LVP+G AI +FQ+LSESATMVYNEREF+NRKLHHIAL+GPALNTDEE+YE 
VRAE Sbjct 600 GRVLVPTGAAIPVSEFQALSESATMVYNEREFINRKLHHIALYGPALNTDEENYEKVRAE 659 Query 662 RTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVFG 721 R E EYV+DVD+R C K+E+A+GLVLVGDL NPP+HEFAYEGL+IRPA P++T VIGVFG Sbjct 660 RAEAEYVFDVDKRTCVKREDASGLVLVGDLINPPFHEFAYEGLKIRPATPFQTTVIGVFG 719 Query 722 VPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRPV 781 VPGSGKSAIIK++VT +DLV SGKKENCQEI DV +QR L+++ARTVDS+LLNGC R V Sbjct 720 VPGSGKSAIIKSVVTTRDLVASGKKENCQEIVNDVKKQRGLDVTARTVDSILLNGCRRGV 779 Query 782 DVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVYH 841 + LYVDEAFACHSGTLLALIA+V+P KV+LCGDPKQCGFFN+MQ+KVN+NH+ICTQV H Sbjct 780 ENLYVDEAFACHSGTLLALIAMVKPTGKVILCGDPKQCGFFNLMQLKVNFNHDICTQVLH 839 Query 842 KSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQ 901 KSISRRCTLP+TAIVS+LHY+GKMRTTN + PI +DTTG+TKP GD+VLTCFR WVKQ Sbjct 840 KSISRRCTLPITAIVSTLHYQGKMRTTNLCSAPIQIDTTGTTKPAKGDIVLTCFRXWVKQ 899 Query 902 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTL 961 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYA +SEHVNVLLTRTE +LVWKTL Sbjct 900 LQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAPSSEHVNVLLTRTENRLVWKTL 959 Query 962 SGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVPI 1021 SGDPWIK L N PKG+F AT++EW+ EH +IM + D FQNKA VCWAK LV + Sbjct 960 SGDPWIKVLTNIPKGDFSATLEEWQEEHDNIMNALRERSTAVDPFQNKAKVCWAKCLVQV 1019 Query 1022 LETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYAD 1081 LETAGI++ +W ++ AF+EDRAYSPEVALNEICT+ YGVDLDSGLFS VS++Y + Sbjct 1020 LETAGIRMTAEEWDTVL-AFREDRAYSPEVALNEICTKYYGVDLDSGLFSAQSVSLYYEN 1078 Query 1082 NHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPANR 1141 NHWDNRPGG+M+GFN E A E++YPF +GK ++ Q+ V R+++ FN NI+ NR Sbjct 1079 NHWDNRPGGRMYGFNREVARKFEQRYPFLRGKMDSGLQVNVPERKVQPFNAECNILLLNR 1138 Query 1142 RLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYTY 1201 RLPH+LV ++ +GER+EWL+ K+ G+H+LLVS YNL LP KRV W+AP + GAD Y Sbjct 1139 RLPHALVTSYQQCRGERVEWLLKKLPGYHLLLVSEYNLALPHKRVFWIAPPHVSGADRIY 1198 Query 1202 
NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRA 1261 +L+LGLP GRYDLV +NIHT +R HHYQQCVDH+MKLQMLGGDSL LL PGGSLLIRA Sbjct 1199 DLDLGLPLNAGRYDLVFVNIHTEYRTHHYQQCVDHSMKLQMLGGDSLHLLXPGGSLLIRA 1258 Query 1262 YGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQLN 1321 YGYADR SE VV L RKF + R L+P CVTSNTE+F LF+NFDNGRR T H N +L+ Sbjct 1259 YGYADRVSEMVVTALARKFSAFRVLRPACVTSNTEVFLLFTNFDNGRRAVTLHQANQRLS 1318 Query 1322 AAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 + F AGCAPSYRV+R DI+ + EE VVNAAN +G G GVC+AV +KWP+SFK Sbjct 1319 SMFACNGLHTAGCAPSYRVRRTDISGHAEEAVVNAANAKGTVGVGVCRAVARKWPDSFKG 1378 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 +ATPVGTAK V VIHAVGPNFS +E+EGDRELAAAYR VA + + SVAIP Sbjct 1379 AATPVGTAKLVQANGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIP 1438 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTGV+SGGKDR+ QSLNHLFTA+D+TDADVVIYCRDK WEKKI EAI RT VEL+ E Sbjct 1439 LLSTGVFSGGKDRVMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSE 1498 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 IS++ D+IRVHPDS L GRKGYS T+G L+SYLEGTRFHQTAVDMAE+ T+WPK +AN Sbjct 1499 DISLESDLIRVHPDSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDAN 1558 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQ+CLYALGES++SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVC Sbjct 1559 EQICLYALGESMDSIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVC 1618 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY----------------------- 1657 SSFPLPKY+IEGVQKVKC +V++FD VPS VSPR+Y Sbjct 1619 SSFPLPKYRIEGVQKVKCDRVLIFDQTVPSLVSPRKYIPAAASMHADTVSLDSTVSTGSA 1678 Query 1658 -KSPQETAQEVSSTTSLTH-----------SQFDLSVDGEELPAPSDLEA--DAPIPEPT 1703 P E E + H + +++ +EL SD+ A + P Sbjct 1679 WSFPSEATYETMEVVAEVHHSEPPVPPPRRRRAQVTMHHQELLEVSDMHTPIAARVEIPV 1738 Query 1704 PDDRAV---LTLPPTIDNFSAV-SDWVMNTAPVAPPRRRRGKNLNVTCDEREGNVLPMAS 1759 D V + +P T + + + + + PV PR +R V+ P + Sbjct 1739 
YDTAVVAERVAIPCTSEYATPIPTPRAVRVVPVPAPRIQRASTYRVS---------PTPT 1789 Query 1760 VRFFRADLHSIVQETAEIRDTAASLQAP-----LSVATEPNQL----PISFGAPNETFPI 1810 R RA + S+ T+A ++ P L V TEP P+ E I Sbjct 1790 PRVLRASVCSVT--------TSAGVEFPWAPEDLEVLTEPVHCEMREPVELPWEPEDVDI 1841 Query 1811 TFGDFDEGEIESLSSELLTFGDF 1833 FGDF+ + + + FGD Sbjct 1842 QFGDFE-------TPDKIQFGDI 1857 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Mayaro virus (strain Brazil)] Sequence ID: Q8QZ73.3 Length: 2437 Range 1: 1 to 1817 Score:2493 bits(6461), Expect:0.0, Method:Compositional matrix adjust., Identities:1217/1833(66%), Positives:1455/1833(79%), Gaps:24/1833(1%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M V+VDI+A+S FLK+LQRA+P FEVE +QVTPNDHANARAFSHLA KLIEQE + D+ Sbjct 1 MSKVFVDIEAESPFLKSLQRAFPAFEVEAQQVTPNDHANARAFSHLATKLIEQETEKDTL 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRMMS+ YHCVCPMRSAEDPERL YARKLA A+G+V+DRNI+ KI DLQ Sbjct 61 ILDIGSAPARRMMSEHTYHCVCPMRSAEDPERLLYYARKLAKASGEVVDRNIAAKIDDLQ 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 +VMA PD E+ TFCLHTD +CR +A+VA+YQDVYAVHAPTSLY QA+KGVR AYWIGFDT Sbjct 121 SVMATPDNESRTFCLHTDQTCRTQAEVAVYQDVYAVHAPTSLYFQAMKGVRTAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM++ MAGAYP+Y+TNWADEQVLKA+NIGLCS LTEG GKLSIMR KKM P D+++ Sbjct 181 
TPFMFDTMAGAYPTYATNWADEQVLKARNIGLCSASLTEGHLGKLSIMRKKKMTPSDQIM 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGSTLY ESR+LLKSWHLPSVFHLKG+ S+TCRCDT+VSCEGYVVK+IT+SPG++GKT Sbjct 241 FSVGSTLYIESRRLLKSWHLPSVFHLKGRQSYTCRCDTIVSCEGYVVKKITMSPGVFGKT 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYAVTHHA+GFL+CKTTDT+ GERVSF +CTYVP+TICDQMTGILATEVTPEDAQKLLV Sbjct 301 SGYAVTHHAEGFLVCKTTDTIAGERVSFPICTYVPSTICDQMTGILATEVTPEDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVV+QAFSKWAKE R D EDEK +G+RERTLTCCCL Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVSQAFSKWAKEYRLDQEDEKNMGMRERTLTCCCL 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFK K HT+YK+PDTQ+I KVP+EF+SFV+PSLWS+GLSI +R RI+ LL L Sbjct 421 WAFKTHKNHTMYKKPDTQTIVKVPSEFNSFVIPSLWSAGLSIGIRHRIRLLLQSRRVEPL 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPL---QAAQDDVQVEIDVEQLEDRAGAGI 537 +P S D EAR AE+EA E +EAE T ALPPL DD+ E+DVE+LE RAGAG+ Sbjct 481 VP-SMDVGEARAAEREAAEAKEAEDTLAALPPLIPTAPVLDDIP-EVDVEELEFRAGAGV 538 Query 538 IETPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAV 597 +ETPR A+KVT Q D +VG YLVLSPQTVL+S KL +H LAE VK TH GRAGRY V Sbjct 539 VETPRNALKVTPQDRDTMVGSYLVLSPQTVLKSVKLQALHPLAESVKIITHKGRAGRYQV 598 Query 598 EAYDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYEL 657 +AYDGR+L+P+G AI DFQ+LSESATMVYNEREF+NRKL+HIA+HG ALNTDEE YE Sbjct 599 DAYDGRVLLPTGAAIPVPDFQALSESATMVYNEREFINRKLYHIAVHGAALNTDEEGYEK 658 Query 658 VRAERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVI 717 VRAE T+ EYVYDVD+++C K+EEA GLV++GDL NPP+HEFAYEGL+ RPA PYKT V+ Sbjct 659 VRAESTDAEYVYDVDRKQCVKREEAEGLVMIGDLINPPFHEFAYEGLKRRPAAPYKTTVV 718 Query 718 GVFGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGC 777 GVFGVPGSGKS IIK+LVTR DLV SGKKENCQEI DV R R+L+++A+TVDS+LLNG Sbjct 719 GVFGVPGSGKSGIIKSLVTRGDLVASGKKENCQEIMLDVKRYRDLDMTAKTVDSVLLNGV 778 Query 778 
NRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICT 837 + VDVLYVDEAFACH+GTLLALIA VRPR+KVVLCGDPKQCGFFN+MQ++VN+NHNICT Sbjct 779 KQTVDVLYVDEAFACHAGTLLALIATVRPRKKVVLCGDPKQCGFFNLMQLQVNFNHNICT 838 Query 838 QVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRG 897 +V HKSISRRCTLP+TAIVS+LHYEG+MRTTN YNKP+++DTTG TKP+ D+VLTCFRG Sbjct 839 EVDHKSISRRCTLPITAIVSTLHYEGRMRTTNPYNKPVIIDTTGQTKPNREDIVLTCFRG 898 Query 898 WVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLV 957 WVKQLQ+DYRGHEVMTAAASQGLTRKGVYAVR KVNENPLYA +SEHVNVLLTRTEG+LV Sbjct 899 WVKQLQLDYRGHEVMTAAASQGLTRKGVYAVRMKVNENPLYAQSSEHVNVLLTRTEGRLV 958 Query 958 WKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKS 1017 WKTLSGDPWIKTL N PKGNF AT+++W+ EH +IM I D FQNKA VCWAK Sbjct 959 WKTLSGDPWIKTLSNIPKGNFTATLEDWQREHDTIMRAITQEAAPLDVFQNKAKVCWAKC 1018 Query 1018 LVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSV 1077 LVP+LETAGIKL+ WS II AFKEDRAYSPEVALNEICT++YGVDLDSGLFS P VS+ Sbjct 1019 LVPVLETAGIKLSATDWSAIILAFKEDRAYSPEVALNEICTKIYGVDLDSGLFSAPRVSL 1078 Query 1078 HYADNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNII 1137 HY NHWDN PGG+M+GF+ EAA+ LE+++PF +G+W + Q+ V R+ + + N+I Sbjct 1079 HYTTNHWDNSPGGRMYGFSVEAANRLEQQHPFYRGRWASG-QVLVAERKTQPIDVTCNLI 1137 Query 1138 PANRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGA 1197 P NRRLPH+LV E+ P+KGER+EWLVNKI G+HVLLVS YNL+LP ++VTW+AP + GA Sbjct 1138 PFNRRLPHTLVTEYHPIKGERVEWLVNKIPGYHVLLVSEYNLILPRRKVTWIAPPTVTGA 1197 Query 1198 DYTYNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSL 1257 D TY+L+LGLP GRYDLV +N+HTP+R+HHYQQCVDHAMKLQMLGGD+L LLKPGGSL Sbjct 1198 DLTYDLDLGLPPNAGRYDLVFVNMHTPYRLHHYQQCVDHAMKLQMLGGDALYLLKPGGSL 1257 Query 1258 LIRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMN 1317 L+ Y YADRTSE VV L R+F S RA+ CVTSNTE+F LF+NFDNGRR T H N Sbjct 1258 LLSTYAYADRTSEAVVTALARRFSSFRAVTVRCVTSNTEVFLLFTNFDNGRRTVTLHQTN 1317 Query 1318 NQLNAAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPE 1376 +L++ + G + AGCAP+Y VKR DIA E+ VVNAAN 
RG GDGVC+AV +KWP+ Sbjct 1318 GKLSSIYAGTVLQAAGCAPAYAVKRADIATAIEDAVVNAANHRGQVGDGVCRAVARKWPQ 1377 Query 1377 SFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNS 1436 +F+N+ATPVGTAKTV C +IHAVGPNF+N SE+EGDR+LAAAYR VA E+ RL ++S Sbjct 1378 AFRNAATPVGTAKTVKCDETYIIHAVGPNFNNTSEAEGDRDLAAAYRAVAAEINRLSISS 1437 Query 1437 VAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE 1496 VAIPLLSTG++S GKDR+ QSL+HL A+D+T+A V IYCRDK WE+KI +Q R+ E Sbjct 1438 VAIPLLSTGIFSAGKDRVHQSLSHLLAAMDTTEARVTIYCRDKTWEQKIKTVLQNRSATE 1497 Query 1497 LLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQ 1556 L+ + + + ++ RVHPDSSL GR GYSTT+G+LYSY+EGT+FHQ A+DMAE+ T+WP+ Sbjct 1498 LVSDELQFEVNLTRVHPDSSLVGRPGYSTTDGTLYSYMEGTKFHQAALDMAEITTLWPRV 1557 Query 1557 TEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTS 1616 +ANE +CLYALGE++++IR +CPV+D+D+S+PPKTVPCLCRYAMTPERVTRLRM+H Sbjct 1558 QDANEHICLYALGETMDNIRARCPVEDSDSSTPPKTVPCLCRYAMTPERVTRLRMHHTKD 1617 Query 1617 IIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSSTTSLTHS 1676 +VCSSF LPKY+I GVQ+VKC KVMLFD P+ VSP +Y + Q + +S ++ S Sbjct 1618 FVVCSSFQLPKYRIPGVQRVKCEKVMLFDAAPPASVSPVQYLTNQ-SETTISLSSFSITS 1676 Query 1677 QFDLSVDGEELPAPSDLEADAPIPEPT---PDDRAVLTLPPTIDNFSAVSDWVMNTAPVA 1733 +L + +L+ D+ P PDD PT A A Sbjct 1677 DSSSLSTFPDLESAEELDHDSQSVRPALNEPDDHQ-----PTPTAELATHPVPPPRPNRA 1731 Query 1734 PPRRRRGKNLNVTCDEREGNVLPMASVRFFRADLHSIVQETAEIRDTAASLQAPLSVATE 1793 + V + N P + R L + + L PL Sbjct 1732 RRLAAARVQVQVEVHQPPSN-QPTKPIPAPRTSLRPVPAPRRYVPRPVVELPWPLET--- 1787 Query 1794 PNQLPISFGAPN-ETFPITFGDFDEGEIESLSS 1825 + + FGAP E ITFGDF E E++S+ Sbjct 1788 ---IDVEFGAPTEEESDITFGDFSASEWETISN 1817 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 
2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Barmah Forest virus] Sequence ID: P87515.3 Length: 2411 Range 1: 6 to 1647 Score:2197 bits(5692), Expect:0.0, Method:Compositional matrix adjust., Identities:1045/1649(63%), Positives:1274/1649(77%), Gaps:10/1649(0%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V +D++ +S F K +Q +P FE+E Q TPNDHA+ARAFSHLA KLIE E D ILD Sbjct 6 VKIDVEPESHFAKQVQSCFPQFEIEAVQTTPNDHAHARAFSHLATKLIEMETAKDQIILD 65 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKL--ASAAGKVLDRNISEKIGDLQA 121 IGSAPARR+ S+ KYHCVCPM+ EDPER+ YARKL SA GK +EK+ DL+ Sbjct 66 IGSAPARRLYSEHKYHCVCPMKCTEDPERMLGYARKLIAGSAKGK------AEKLRDLRD 119 Query 122 VMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTT 181 V+A PD ET + CLHTD SCR R DVA+YQDVYA+ APT+LYHQA+KGVR AYWIGFDTT Sbjct 120 VLATPDIETQSLCLHTDASCRYRGDVAVYQDVYAIDAPTTLYHQALKGVRTAYWIGFDTT 179 Query 182 PFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLF 241 PFMY+A+AGAYP YSTNWADEQVL+++NIGLCS ++EG + SI+R K +K DRV+F Sbjct 180 PFMYDALAGAYPLYSTNWADEQVLESRNIGLCSDKVSEGGKKGRSILRKKFLKQSDRVMF 239 Query 242 SVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTT 301 SVGSTLY ESRKLL+SWHLPS FHLKGK SFTCRCDT+VSCEGYV+K+IT+ PG+ GK Sbjct 240 SVGSTLYTESRKLLQSWHLPSTFHLKGKSSFTCRCDTIVSCEGYVLKKITMCPGVTGKPI 299 Query 302 GYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVG 361 GYAVTHH +GF++ K TDT+ GERVSF+VCTYVP T+CDQMTGILATEVT +DAQKLLVG Sbjct 300 GYAVTHHKEGFVVGKVTDTIRGERVSFAVCTYVPTTLCDQMTGILATEVTADDAQKLLVG 359 Query 362 LNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLW 421 LNQRIVVNGRTQRNTNTMKNYLLP+VAQA +KWAKE ++DMEDE+ L R+RTLTC C W Sbjct 360 LNQRIVVNGRTQRNTNTMKNYLLPLVAQALAKWAKEAKQDMEDERPLNERQRTLTCLCCW 419 Query 422 AFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLI 481 AFK+ K H 
+YKRPDTQSI KVP EF SF + SLWS+G+SI LR ++K +L T + Sbjct 420 AFKRNKRHAIYKRPDTQSIVKVPCEFTSFPLVSLWSAGMSISLRQKLKMMLQARQPTQIA 479 Query 482 PYSGD-AKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 + + +EA E+EA + AEL A P + + VE++VE+L+ RAG G++ET Sbjct 480 AVTEELIQEAAAVEQEAVDTANAELDHAAWPSI-VDTTERHVEVEVEELDQRAGEGVVET 538 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PR +IKV+ Q D ++G YL+LSPQ VLRS+KL+ IH LAEQVK THSGR+GRYAV+ Y Sbjct 539 PRNSIKVSTQIGDALIGSYLILSPQAVLRSEKLACIHDLAEQVKLVTHSGRSGRYAVDKY 598 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 GR+LVP+G AI + FQ+LSESAT+VYNEREFVNRKL HIA++G ALNTDEE YE V Sbjct 599 XGRVLVPTGVAIDIQSFQALSESATLVYNEREFVNRKLWHIAVYGAALNTDEEGYEKVPV 658 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 ER E +YV+DVDQ+ C KKE+A+G VL G+L NPP+HEFAYEGLR RP+ PYK +GV+ Sbjct 659 ERAESDYVFDVDQKMCLKKEQASGWVLCGELVNPPFHEFAYEGLRTRPSAPYKVHTVGVY 718 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKSAIIKN VT DLV SGKKENC EI DV++ R L I+A+TVDS+LLNG Sbjct 719 GVPGSGKSAIIKNTVTMSDLVLSGKKENCLEIMNDVLKHRALRITAKTVDSVLLNGVKHT 778 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 ++LY+DEAF+CH+GTLLA IA+VRP+QKVVLCGDPKQCGFFNMMQ+KVNYNH+IC++V+ Sbjct 779 PNILYIDEAFSCHAGTLLATIAIVRPKQKVVLCGDPKQCGFFNMMQLKVNYNHDICSEVF 838 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT +TAIVS LHY+ +MRTTN I++DTTG+TKP DL+LTCFRGWVK Sbjct 839 HKSISRRCTQDITAIVSKLHYQDRMRTTNPRKGDIIIDTTGTTKPAKTDLILTCFRGWVK 898 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQ DYRG+EVMTAAASQGLTR VYAVR KVNENPLYA TSEHVNVLLTRTE KLVWKT Sbjct 899 QLQQDYRGNEVMTAAASQGLTRASVYAVRTKVNENPLYAQTSEHVNVLLTRTENKLVWKT 958 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 LS DPWIKTL NPP+G++ ATI EWE EH IM I + +TF NK NVCWAK+L P Sbjct 959 LSTDPWIKTLTNPPRGHYTATIAEWEAEHQGIMKAIQGYAPPVNTFMNKVNVCWAKTLTP 1018 Query 1021 
ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +LETAGI L+ WS+++ F +D AYSPEVALN ICT+MYG DLD+GLFS+P V + Y Sbjct 1019 VLETAGISLSAEDWSELLPPFAQDVAYSPEVALNIICTKMYGFDLDTGLFSRPSVPMTYT 1078 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 +HWDNR GGKM+GF+ +A L R++P+ +G+ + QI VT RI+ + NIIP N Sbjct 1079 KDHWDNRVGGKMYGFSQQAYDQLARRHPYLRGREKSGMQIVVTEMRIQRPRSDANIIPIN 1138 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLVA H + R E G+ +LLVS YN+ LP K++TW+AP+G +GA +T Sbjct 1139 RRLPHSLVATHEYRRAARAEEFFTTTRGYTMLLVSEYNMNLPNKKITWLAPIGTQGAHHT 1198 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 NL LG+P LG +D V++N+ TPFR HHYQQC DHAMKLQML GD+LR +KPGGSL ++ Sbjct 1199 ANLNLGIPPLLGSFDAVVVNMPTPFRNHHYQQCEDHAMKLQMLAGDALRHIKPGGSLWVK 1258 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 AYGYADR SE VV L RKF+S R +P CVTSNTE+F FS FDNG+R H N + Sbjct 1259 AYGYADRHSEHVVLALARKFKSFRVTQPSCVTSNTEVFLHFSIFDNGKRAIALHSANRKA 1318 Query 1321 NAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKN 1380 N+ F AG AP+YRVKR DI+ E+ VVNAAN +G+ G GVC A+Y+KWP++F + Sbjct 1319 NSIFQNTFLPAGSAPAYRVKRGDISNAPEDAVVNAANQQGVKGAGVCGAIYRKWPDAFGD 1378 Query 1381 SATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIP 1440 ATP GTA + VIHAVGPNFS SE EGDR+LA+AYR A+ V + +VA+P Sbjct 1379 VATPTGTAVSKSVQDKLVIHAVGPNFSKCSEEEGDRDLASAYRAAAEIVMDKKITTVAVP 1438 Query 1441 LLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDE 1500 LLSTG+Y+GGK+R+ QSLNHLFTA D+TDADV IYC DK WEKKI EAI RT VE++ + Sbjct 1439 LLSTGIYAGGKNRVEQSLNHLFTAFDNTDADVTIYCMDKTWEKKIKEAIDHRTSVEMVQD 1498 Query 1501 HISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEAN 1560 + ++ +++RVHP SSLAGRKGYST G ++SYLEGT+FHQTAVD+AE+ +WP E+N Sbjct 1499 DVQLEEELVRVHPLSSLAGRKGYSTDSGRVFSYLEGTKFHQTAVDIAEMQVLWPALKESN 1558 Query 1561 EQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVC 1620 EQ+ Y LGES++ IR KCP +D DAS+PP+TVPCLCRYAMTPERV RL+ + T 
VC Sbjct 1559 EQIVAYTLGESMDQIRGKCPTEDTDASTPPRTVPCLCRYAMTPERVYRLKCTNTTQFTVC 1618 Query 1621 SSFPLPKYKIEGVQKVKCSKVMLFDHNVP 1649 SSF LPKY I+GVQ+VKC ++++ D VP Sbjct 1619 SSFELPKYHIQGVQRVKCERIIILDPTVP 1647 Range 2: 1758 to 1789 Score:43.1 bits(100), Expect:0.030, Method:Compositional matrix adjust., Identities:17/32(53%), Positives:23/32(71%), Gaps:0/32(0%) Query 1810 ITFGDFDEGEIESLSSELLTFGDFSPGEVDDL 1841 TFGDF E E+E L++ LTFGDF+ GE+ + Sbjct 1758 FTFGDFGEHEVEELTASPLTFGDFAEGEIQGM 1789 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ockelbo virus] Sequence ID: P27283.2 Length: 2515 Range 1: 6 to 1876 Score:2105 bits(5453), Expect:0.0, Method:Compositional matrix adjust., Identities:1053/1892(56%), Positives:1345/1892(71%), Gaps:72/1892(3%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V VD+D S F+ LQ+++P FEV +Q TPNDHANARAFSHLA KLIE E+ +TILD Sbjct 6 VNVDVDPQSPFVVQLQKSFPQFEVVAQQATPNDHANARAFSHLASKLIELEVPTTATILD 65 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 IGSAPARRM S+ +YHCVCPMRS EDP+R+ YA KLA A K+ ++N+ EKI DL+ V+ Sbjct 66 IGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAEKACKITNKNLHEKIKDLRTVL 125 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 PDAETP+ C H DV+C RA+ ++ QDVY ++AP ++YHQA+KGVR YWIGFDTT F Sbjct 126 DTPDAETPSLCFHNDVTCNTRAEYSVMQDVY-INAPGTIYHQAMKGVRTLYWIGFDTTQF 184 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 
M++AMAG+YP+Y+TNWADE+VL+A+NIGLCST L+EGR GKLSIMR K++KP RV FSV Sbjct 185 MFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGRTGKLSIMRKKELKPGSRVYFSV 244 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLYPE R L+SWHLPSVFHLKGK S+TCRCDTVVSCEGYVVK+ITISPG+ G+T GY Sbjct 245 GSTLYPEHRASLQSWHLPSVFHLKGKQSYTCRCDTVVSCEGYVVKKITISPGITGETVGY 304 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVT++++GFL+CK TDTV GERVSF VCTY+PATICDQMTGI+AT+++P+DAQKLLVGLN Sbjct 305 AVTNNSEGFLLCKVTDTVKGERVSFPVCTYIPATICDQMTGIMATDISPDDAQKLLVGLN 364 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIV+NG+T RNTNTM+NYLLP +AQ FSKWAKE ++D+++EK+LG RER LT CLWAF Sbjct 365 QRIVINGKTNRNTNTMQNYLLPTIAQGFSKWAKERKEDLDNEKMLGTRERKLTYGCLWAF 424 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 + +K H+ Y+ P TQ+ KVPA F +F + S+W++ L + LR ++K L + L+ Sbjct 425 RTKKVHSFYRPPGTQTSVKVPASFSAFPMSSVWTTSLPMSLRQKMKLALQPKKEEKLLQV 484 Query 484 SGD-AKEARDAEKEAEEEREAELTREALPPLQAAQD---DVQVEIDVEQLEDRAGAGIIE 539 + EA+ A ++A+EE AE REALPPL A +D +V +VE L+ GA ++E Sbjct 485 PEELVMEAKAAFEDAQEEARAEKLREALPPLVADKDIEAAAEVVCEVEGLQADIGAALVE 544 Query 540 TPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEA 599 TPRG +++ Q D ++G+Y+V+SP +VL++ KL+ H LA+QVK THSGRAGRYAVE Sbjct 545 TPRGHVRIIPQANDRMIGQYIVVSPTSVLKNAKLAPAHPLADQVKIITHSGRAGRYAVEP 604 Query 600 YDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVR 659 YD ++L+P+G A+ +F +LSESAT+VYNEREFVNRKL+HIA+HGPA NT+EE Y++ + Sbjct 605 YDAKVLMPAGSAVPWPEFLALSESATLVYNEREFVNRKLYHIAMHGPAKNTEEEQYKVTK 664 Query 660 AERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGV 719 AE E EYV+DVD++RC KKEEA+GLVL G+LTNPPYHE A EGL+ RPA PYK IGV Sbjct 665 AELAETEYVFDVDKKRCVKKEEASGLVLSGELTNPPYHELALEGLKTRPAVPYKVETIGV 724 Query 720 FGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNR 779 G PGSGKSAIIK+ VT +DLVTSGKKENC+EI DV+R R ++I+++TVDS++LNGC++ Sbjct 725 IGTPGSGKSAIIKSTVTARDLVTSGKKENCREIEADVLRLRGMQITSKTVDSVMLNGCHK 784 
Query 780 PVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNH---NIC 836 V+VLYVDEAFACH+G LLALIA+VRPR+KVVLCGDPKQCGFFNMMQ+KV++NH +IC Sbjct 785 AVEVLYVDEAFACHAGALLALIAIVRPRKKVVLCGDPKQCGFFNMMQLKVHFNHPERDIC 844 Query 837 TQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFR 896 T+ ++K ISRRCT PVTAIVS+LHY+GKM+TTN K I +D TG+TKP PGD++LTCFR Sbjct 845 TKTFYKFISRRCTQPVTAIVSTLHYDGKMKTTNPCKKNIEIDITGATKPKPGDIILTCFR 904 Query 897 GWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKL 956 GWVKQLQIDY GHEVMTAAASQGLTRKGVYAVRQKVNEN LYA TSEHVNVLLTRTE +L Sbjct 905 GWVKQLQIDYPGHEVMTAAASQGLTRKGVYAVRQKVNENALYAITSEHVNVLLTRTEDRL 964 Query 957 VWKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAK 1016 VWKTL GDPWIK L N PKGNF+ATI++WE EH I+A I + + F K NVCWAK Sbjct 965 VWKTLQGDPWIKQLTNVPKGNFQATIEDWEAEHKGIIAAINSPAPRTNPFSCKTNVCWAK 1024 Query 1017 SLVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVS 1076 +L PIL TAGI L QWS++ F +D+ +S AL+ IC + +G+DL SGLFSK + Sbjct 1025 ALEPILATAGIVLTGCQWSELFPQFADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIP 1084 Query 1077 VHY--ADN-----HWDNRPGGKMFGFNPEAASILERKYPFTK--GKWNTNKQICVTTRRI 1127 + Y AD+ HWDN PG + +G++ A+ L R++P + GK Q+ + T R Sbjct 1085 LTYHPADSARPVAHWDNSPGTRKYGYDHAVAAELSRRFPVFQLAGK---GTQLDLQTGRT 1141 Query 1128 EDFNPNTNIIPANRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVT 1187 + N++P NR LPH+LV EH+ + +E +N+ H VL+VS + P KR+ Sbjct 1142 RVISAQHNLVPVNRNLPHALVPEHKEKQPGPVEKFLNQFKHHSVLVVSEEKIEAPHKRIE 1201 Query 1188 WVAPLGIRGADYTYNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDS 1247 W+AP+GI GAD YNL G P RYDLV INI T +R HH+QQC DHA L+ L + Sbjct 1202 WIAPIGIAGADKNYNLAFGFPPQ-ARYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSA 1260 Query 1248 LRLLKPGGSLLIRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNG 1307 L L PGG+L++++YGYADR SE VV L RKF A +P CV+SNTEM+ +F DN Sbjct 1261 LNCLNPGGTLVVKSYGYADRNSEDVVTALARKFVRVSAARPECVSSNTEMYLIFRQLDNS 1320 Query 1308 R-RNFTTHVMNNQLNAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGV 1366 R R FT H +N +++ + G G APSYR KR +IA EE VVNAANP G PG+GV Sbjct 1321 
RTRQFTPHHLNCVISSVYEGTRDGVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGV 1380 Query 1367 CKAVYKKWPESFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVA 1426 C+A+YK+WP SF +SAT GTAK +C VIHAVGP+F + E+E + L AY VA Sbjct 1381 CRAIYKRWPNSFTDSATETGTAKLTVCHGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVA 1440 Query 1427 KEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIA 1486 V + SVAIPLLSTG+Y+ GKDRL SLN L TALD TDADV IYC DK+W+++I Sbjct 1441 DLVNEHNIKSVAIPLLSTGIYAAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERID 1500 Query 1487 EAIQMRTQV-ELLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVD 1545 +Q++ V EL DE + +D +++ +HPDS L GRKG+STT+G LYSY EGT+FHQ A D Sbjct 1501 AVLQLKESVTELKDEDMEIDDELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKD 1560 Query 1546 MAEVYTMWPKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPER 1605 MAE+ ++P E+NEQ+C Y LGE++E+IR+KCPVD +SSPPKT+PCLC YAMTPER Sbjct 1561 MAEIKVLFPNDQESNEQLCAYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPER 1620 Query 1606 VTRLRMNHVTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY-KSPQETA 1664 V RLR N+V + VCSS PLPKYKI+ VQKV+C+KV+LF+ + P+ V R+Y + P++ A Sbjct 1621 VHRLRSNNVKEVTVCSSTPLPKYKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPA 1680 Query 1665 -----------------QEVSSTTSLTHSQFDLSVD--GEELPAPSDLEADAPIPEPTPD 1705 + TSL + L +D E S +D I T Sbjct 1681 APPAQDEEAPEAVATPAPPAADNTSLDVTDISLDMDDSSEGSLFSSFSGSDNSI---TCM 1737 Query 1706 DRAVLTLPPTIDNFSAV---SDWVMNTAPVAPPRRRRGKNLNVTCDEREGNVLPMASVRF 1762 DR + P ++D V V AP+ PPR ++ L +E +P AS Sbjct 1738 DRWS-SGPSSLDRRQVVVADVHAVQEPAPIPPPRLKKMARLAAASKTQE-EPIPPASTSS 1795 Query 1763 FRADLH----SIVQETAEIRD-TAASLQAPLSVATEPNQLPISFGAPNETFPITFGDFDE 1817 LH + + D A L A AT P +P+SFG+ F + Sbjct 1796 ADESLHLSFGGVSMSFGSLLDGEMARLAAAQPPATGPTDVPMSFGS-----------FSD 1844 Query 1818 GEIESLS-----SELLTFGDFSPGEVDDLTDS 1844 GEIE LS SE + FG F PGEV+ + S Sbjct 1845 GEIEELSRRVTESEPVLFGSFEPGEVNSIISS 1876 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; AltName: Full=p270 nonstructural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: 
RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sindbis virus] Sequence ID: P03317.2 Length: 2513 Range 1: 6 to 1874 Score:2098 bits(5435), Expect:0.0, Method:Compositional matrix adjust., Identities:1043/1887(55%), Positives:1337/1887(70%), Gaps:64/1887(3%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V VD+D S F+ LQ+++P FEV +QVTPNDHANARAFSHLA KLIE E+ +TILD Sbjct 6 VNVDVDPQSPFVVQLQKSFPQFEVVAQQVTPNDHANARAFSHLASKLIELEVPTTATILD 65 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 IGSAPARRM S+ +YHCVCPMRS EDP+R+ YA KLA A K+ ++N+ EKI DL+ V+ Sbjct 66 IGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAEKACKITNKNLHEKIKDLRTVL 125 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 PDAETP+ C H DV+C RA+ ++ QDVY ++AP ++YHQA+KGVR YWIGFDTT F Sbjct 126 DTPDAETPSLCFHNDVTCNMRAEYSVMQDVY-INAPGTIYHQAMKGVRTLYWIGFDTTQF 184 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M++AMAG+YP+Y+TNWADE+VL+A+NIGLCST L+EGR GKLSIMR K++KP RV FSV Sbjct 185 MFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGRTGKLSIMRKKELKPGSRVYFSV 244 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLYPE R L+SWHLPSVFHL GK S+TCRCDTVVSCEGYVVK+ITISPG+ G+T GY Sbjct 245 GSTLYPEHRASLQSWHLPSVFHLNGKQSYTCRCDTVVSCEGYVVKKITISPGITGETVGY 304 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 AVTH+++GFL+CK TDTV GERVSF VCTY+PATICDQMTGI+AT+++P+DAQKLLVGLN Sbjct 305 AVTHNSEGFLLCKVTDTVKGERVSFPVCTYIPATICDQMTGIMATDISPDDAQKLLVGLN 364 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 
QRIV+NGRT RNTNTM+NYLLP++AQ FSKWAKE + D+++EK+LG RER LT CLWAF Sbjct 365 QRIVINGRTNRNTNTMQNYLLPIIAQGFSKWAKERKDDLDNEKMLGTRERKLTYGCLWAF 424 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 + +K H+ Y+ P TQ+ KVPA F +F + S+W++ L + LR ++K L + L+ Sbjct 425 RTKKVHSFYRPPGTQTCVKVPASFSAFPMSSVWTTSLPMSLRQKLKLALQPKKEEKLLQV 484 Query 484 SGD-AKEARDAEKEAEEEREAELTREALPPLQA---AQDDVQVEIDVEQLEDRAGAGIIE 539 S + EA+ A ++A+EE AE REALPPL A + +V +VE L+ GA ++E Sbjct 485 SEELVMEAKAAFEDAQEEARAEKLREALPPLVADKGIEAAAEVVCEVEGLQADIGAALVE 544 Query 540 TPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEA 599 TPRG +++ Q D ++G+Y+V+SP +VL++ KL+ H LA+QVK THSGR+GRYAVE Sbjct 545 TPRGHVRIIPQANDRMIGQYIVVSPNSVLKNAKLAPAHPLADQVKIITHSGRSGRYAVEP 604 Query 600 YDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVR 659 YD ++L+P+G A+ +F +LSESAT+VYNEREFVNRKL+HIA+HGPA NT+EE Y++ + Sbjct 605 YDAKVLMPAGGAVPWPEFLALSESATLVYNEREFVNRKLYHIAMHGPAKNTEEEQYKVTK 664 Query 660 AERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGV 719 AE E EYV+DVD++RC KKEEA+GLVL G+LTNPPYHE A EGL+ RPA PYK IGV Sbjct 665 AELAETEYVFDVDKKRCVKKEEASGLVLSGELTNPPYHELALEGLKTRPAVPYKVETIGV 724 Query 720 FGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNR 779 G PGSGKSAIIK+ VT +DLVTSGKKENC+EI DV+R R ++I+++TVDS++LNGC++ Sbjct 725 IGTPGSGKSAIIKSTVTARDLVTSGKKENCREIEADVLRLRGMQITSKTVDSVMLNGCHK 784 Query 780 PVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNH---NIC 836 V+VLYVDEAFACH+G LLALIA+VRPR+KVVLCGDP QCGFFNMMQ+KV++NH +IC Sbjct 785 AVEVLYVDEAFACHAGALLALIAIVRPRKKVVLCGDPMQCGFFNMMQLKVHFNHPEKDIC 844 Query 837 TQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFR 896 T+ ++K ISRRCT PVTAIVS+LHY+GKM+TTN K I +D TG+TKP PGD++LTCFR Sbjct 845 TKTFYKYISRRCTQPVTAIVSTLHYDGKMKTTNPCKKNIEIDITGATKPKPGDIILTCFR 904 Query 897 GWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKL 956 GWVKQLQIDY GHEVMTAAASQGLTRKGVYAVRQKVNENPLYA TSEHVNVLLTRTE +L Sbjct 905 GWVKQLQIDYPGHEVMTAAASQGLTRKGVYAVRQKVNENPLYAITSEHVNVLLTRTEDRL 964 
Query 957 VWKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAK 1016 VWKTL GDPWIK N PKGNF+ATI++WE EH I+A I + + F K NVCWAK Sbjct 965 VWKTLQGDPWIKQPTNIPKGNFQATIEDWEAEHKGIIAAINSPTPRANPFSCKTNVCWAK 1024 Query 1017 SLVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVS 1076 +L PIL TAGI L QWS++ F +D+ +S AL+ IC + +G+DL SGLFSK + Sbjct 1025 ALEPILATAGIVLTGCQWSELFPQFADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIP 1084 Query 1077 VHY--ADN-----HWDNRPGGKMFGFNPEAASILERKYPFTK--GKWNTNKQICVTTRRI 1127 + Y AD+ HWDN PG + +G++ A+ L R++P + GK Q+ + T R Sbjct 1085 LTYHPADSARPVAHWDNSPGTRKYGYDHAIAAELSRRFPVFQLAGK---GTQLDLQTGRT 1141 Query 1128 EDFNPNTNIIPANRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVT 1187 + N++P NR LPH+LV E++ + ++ +N+ H VL+VS + P KR+ Sbjct 1142 RVISAQHNLVPVNRNLPHALVPEYKEKQPGPVKKFLNQFKHHSVLVVSEEKIEAPRKRIE 1201 Query 1188 WVAPLGIRGADYTYNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDS 1247 W+AP+GI GAD YNL G P RYDLV INI T +R HH+QQC DHA L+ L + Sbjct 1202 WIAPIGIAGADKNYNLAFGFPPQ-ARYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSA 1260 Query 1248 LRLLKPGGSLLIRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNG 1307 L L PGG+L++++YGYADR SE VV L RKF A +P CV+SNTEM+ +F DN Sbjct 1261 LNCLNPGGTLVVKSYGYADRNSEDVVTALARKFVRVSAARPDCVSSNTEMYLIFRQLDNS 1320 Query 1308 R-RNFTTHVMNNQLNAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGV 1366 R R FT H +N +++ + G G APSYR KR +IA EE VVNAANP G PG+GV Sbjct 1321 RTRQFTPHHLNCVISSVYEGTRDGVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGV 1380 Query 1367 CKAVYKKWPESFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVA 1426 C+A+YK+WP SF +SAT GTA+ +C VIHAVGP+F + E+E + L AY VA Sbjct 1381 CRAIYKRWPTSFTDSATETGTARMTVCLGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVA 1440 Query 1427 KEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIA 1486 V + SVAIPLLSTG+Y+ GKDRL SLN L TALD TDADV IYC DK+W+++I Sbjct 1441 DLVNEHNIKSVAIPLLSTGIYAAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERID 1500 Query 1487 EAIQMRTQV-ELLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVD 1545 A+Q++ V EL DE + +D +++ +HPDS L GRKG+STT+G LYSY EGT+FHQ A D Sbjct 1501 
AALQLKESVTELKDEDMEIDDELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKD 1560 Query 1546 MAEVYTMWPKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPER 1605 MAE+ ++P E+NEQ+C Y LGE++E+IR+KCPVD +SSPPKT+PCLC YAMTPER Sbjct 1561 MAEIKVLFPNDQESNEQLCAYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPER 1620 Query 1606 VTRLRMNHVTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY-------- 1657 V RLR N+V + VCSS PLPK+KI+ VQKV+C+KV+LF+ + P+ V R+Y Sbjct 1621 VHRLRSNNVKEVTVCSSTPLPKHKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPT 1680 Query 1658 --KSPQETAQEVSSTTS----------LTHSQFDLSVDGEELPAPSDLEADAPIPEPTPD 1705 + E A EV +T S +T D+ E S +D I Sbjct 1681 APPAQAEEAPEVVATPSPSTADNTSLDVTDISLDMDDSSEGSLFSSFSGSDNSITSMDSW 1740 Query 1706 DRAVLTLPPTIDNFSAVSD--WVMNTAPVAPPRRRRGKNLNVTCDEREGNVLPMASVRFF 1763 +L V+D V AP+ PPR ++ L +E S Sbjct 1741 SSGPSSLEIVDRRQVVVADVHAVQEPAPIPPPRLKKMARLAAA--RKEPTPPASNSSESL 1798 Query 1764 RADLHSIVQETAEIRDTAASLQAPLS-VATEPNQLPISFGAPNETFPITFGDFDEGEIES 1822 + I D + QA + +AT P +P+SFG+ F +GEI+ Sbjct 1799 HLSFGGVSMSLGSIFDGETARQAAVQPLATGPTDVPMSFGS-----------FSDGEIDE 1847 Query 1823 LS-----SELLTFGDFSPGEVDDLTDS 1844 LS SE + FG F PGEV+ + S Sbjct 1848 LSRRVTESEPVLFGSFEPGEVNSIISS 1874 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Trinidad donkey)] Sequence ID: P27282.3 Length: 2493 Range 1: 1 to 1658 Score:2063 bits(5344), Expect:0.0, Method:Compositional matrix adjust., Identities:990/1663(60%), 
Positives:1234/1663(74%), Gaps:11/1663(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VDI+ DS FL+ALQR++P FEVE +QVT NDHANARAFSHLA KLIE E+DP T Sbjct 1 MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDT 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRM S KYHC+CPMR AEDP+RL YA KL ++ D+ + +K+ +L Sbjct 61 ILDIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELA 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVM+ PD ET T CLH D SCR VA+YQDVYAV PTSLYHQA KGVRVAYWIGFDT Sbjct 121 AVMSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM+ +AGAYPSYSTNWADE VL A+NIGLCS+D+ E R +SI+R K +KP + VL Sbjct 181 TPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLPSVFHL+GK ++TCRC+T+VSC+GYVVKRI ISPGLYGK Sbjct 241 FSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKP 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYA T H +GFL CK TDT++GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 301 SGYAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WAKE ++D EDE+ LG+R+R L C Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCC 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF++ K ++YKRPDTQ+I KV ++F SFV+P + S+ L I LRTRI+ +L + + Sbjct 421 WAFRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSP 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 + + D +EA+ A EA+E REAE R ALPPL A ++ +E DV+ + AGAG +ET Sbjct 481 LITAEDVQEAKCAADEAKEVREAEELRAALPPLAADVEEPTLEADVDLMLQEAGAGSVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRG IKVT+ + +G Y 
VLSPQ VL+S+KLS IH LAEQV THSGR GRYAVE Y Sbjct 541 PRGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 G+++VP G+AI +DFQ+LSESAT+VYNEREFVNR LHHIA HG ALNTDEE Y+ V+ Sbjct 601 HGKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKP 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + EY+YD+D+++C KKE GL L G+L +PP+HEFAYE LR RPA PY+ IGV+ Sbjct 661 SEHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVY 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV + + L+++ARTVDS+LLNGC P Sbjct 721 GVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+ LY+DEAFACH+GTL ALIA++RP+ K VLCGDPKQCGFFNMM +KV++NH ICTQV+ Sbjct 781 VETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVF 839 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VT++VS+L Y+ KMRTTN IV+DTTGSTKP DL+LTCFRGWVK Sbjct 840 HKSISRRCTKSVTSVVSTLFYDKKMRTTNPKETKIVIDTTGSTKPKQDDLILTCFRGWVK 899 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 900 QLQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKT 959 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL GNF ATI+EW+ EH +IM I D FQNKANVCWAK+LVP Sbjct 960 LAGDPWIKTLTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVP 1019 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L+TAGI + QW+ + F+ D+A+S E+ LN++C R +G+DLDSGLFS P V + Sbjct 1020 VLKTAGIDMTTEQWN-TVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIR 1078 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 +NHWDN P M+G N E L R+YP T + + T + +++P N++P N Sbjct 1079 NNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNTGTLRNYDPRINLVPVN 1138 Query 1141 
RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPH+LV H V+K+ G VL+V G L +P K V W++ A + Sbjct 1139 RRLPHALVLHHNEHPQSDFSSFVSKLKGRTVLVV-GEKLSVPGKMVDWLSDRP--EATFR 1195 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 L+LG+P + +YD++ +N+ TP++ HHYQQC DHA+KL ML + L PGG+ + Sbjct 1196 ARLDLGIPGDVPKYDIIFVNVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSI 1255 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 YGYADR SE ++ + R+F+ SR KP TE+ F+F +D R + +++ L Sbjct 1256 GYGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRKARTHNPYKLSSTL 1315 Query 1321 NAAFVG-QATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFK 1379 + G + AGCAPSY V R DIA E ++NAAN +G PG GVC A+YKK+PESF Sbjct 1316 TNIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFD 1375 Query 1380 NSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAI 1439 VG A+ V +IHAVGPNF+ SE EGD++LA AY +AK V SVAI Sbjct 1376 LQPIEVGKARLVKGAAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAI 1435 Query 1440 PLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE--L 1497 PLLSTG++SG KDRLTQSLNHL TALD+TDADV IYCRDK+WE + EA+ R VE Sbjct 1436 PLLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEIC 1495 Query 1498 LDEHISV---DCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWP 1554 + + SV D +++RVHP SSLAGRKGYST++G +SYLEGT+FHQ A D+AE+ MWP Sbjct 1496 ISDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWP 1555 Query 1555 KQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHV 1614 TEANEQVC+Y LGES+ SIR KCPV++++AS+PP T+PCLC +AMTPERV RL+ + Sbjct 1556 VATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRP 1615 Query 1615 TSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY 1657 I VCSSFPLPKY+I GVQK++CS+ +LF VP+ + PR+Y Sbjct 1616 EQITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKY 1658 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: 
RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain P676)] Sequence ID: P36328.2 Length: 2493 Range 1: 1 to 1658 Score:2061 bits(5341), Expect:0.0, Method:Compositional matrix adjust., Identities:990/1663(60%), Positives:1234/1663(74%), Gaps:11/1663(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VDI+ DS FL+ALQR++P FEVE +QVT NDHANARAFSHLA KLIE E+DP T Sbjct 1 MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDT 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRM S KYHC+CPMR AEDP+RL YA KL ++ D+ + +K+ +L Sbjct 61 ILDIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELA 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVM+ PD ET T CLH D SCR VA+YQDVYAV PTSLYHQA KGVRVAYWIGFDT Sbjct 121 AVMSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM+ +AGAYPSYSTNWADE VL A+NIGLCS+D+ E R +SI+R K +KP + VL Sbjct 181 TPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLPSVFHL+GK ++TCRC+T+VSC+GYVVKRI ISPGLYGK Sbjct 241 FSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKP 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYA T H +GFL CK TDT++GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 301 SGYAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 
GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WAKE ++D EDE+ LG+R+R L C Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCC 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF++ K ++YKRPDTQ+I KV ++F SFV+P + S+ L I LRTRI+ +L + + Sbjct 421 WAFRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSP 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 + + D +EA+ A EA+E REAE R ALPPL A ++ +E DV+ + AGAG +ET Sbjct 481 LITAEDIQEAKCAADEAKEVREAEELRAALPPLAADFEEPTLEADVDLMLQEAGAGSVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRG IKVT+ + +G Y VLSPQ VL+S+KLS IH LAEQV THSGR GRYAVE Y Sbjct 541 PRGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 G+++VP G+AI +DFQ+LSESAT+VYNEREFVNR LHHIA HG ALNTDEE Y+ V+ Sbjct 601 HGKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKP 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + EY+YD+D+++C KKE GL L G+L +PP+HEFAYE LR RPA PY+ IGV+ Sbjct 661 SEHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVY 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV + + L+++ARTVDS+LLNGC P Sbjct 721 GVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHP 780 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+ LY+DEAFACH+GTL ALIA++RP+ K VLCGDPKQCGFFNMM +KV++NH ICTQV+ Sbjct 781 VETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVF 839 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VT++VS+L Y+ +MRTTN IV+DTTGSTKP DL+LTCFRGWVK Sbjct 840 HKSISRRCTKSVTSVVSTLFYDKRMRTTNPKETKIVIDTTGSTKPKQDDLILTCFRGWVK 899 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 900 QLQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKT 959 
Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIK L GNF ATI+EW+ EH +IM I D FQNKANVCWAK+LVP Sbjct 960 LAGDPWIKILTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVP 1019 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L+TAGI + QW+ + F+ D+A+S E+ LN++C R +G+DLDSGLFS P V + Sbjct 1020 VLKTAGIDMTTEQWN-TVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIR 1078 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 +NHWDN P M+G N E L R+YP T + + T + +++P N++P N Sbjct 1079 NNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNTGTLRNYDPRINLVPVN 1138 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPH+LV H V+K+ G VL+V G L +P K+V W++ A + Sbjct 1139 RRLPHALVLHHNEHPQSDFSSFVSKLKGRTVLVV-GEKLSVPGKKVDWLSDQP--EATFR 1195 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 L+LG+P + +YD+V IN+ TP++ HHYQQC DHA+KL ML + L PGG+ + Sbjct 1196 ARLDLGIPGDVPKYDIVFINVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSI 1255 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 YGYADR SE ++ + R+F+ SR KP TE+ F+F +D R + +++ L Sbjct 1256 GYGYADRASESIIGAIARQFKFSRVCKPKSSHEETEVLFVFIGYDRKARTHNPYKLSSTL 1315 Query 1321 NAAFVG-QATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFK 1379 + G + AGCAPSY V R DIA E ++NAAN +G PG GVC A+YKK+PESF Sbjct 1316 TNIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFD 1375 Query 1380 NSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAI 1439 VG A+ V +IHAVGPNF+ SE EGD++LA AY +AK V SVAI Sbjct 1376 LQPIEVGKARLVKGAAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAI 1435 Query 1440 PLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE--L 1497 PLLSTG++SG KDRLTQSLNHL TALD+TDADV IYCRDK+WE + EA+ R VE Sbjct 1436 PLLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEIC 1495 Query 1498 LDEHISV---DCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWP 1554 + + SV D +++RVHP SSLAGRKGYST++G +SYLEGT+FHQ A D+AE+ MWP Sbjct 1496 
ISDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWP 1555 Query 1555 KQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHV 1614 TEANEQVC+Y LGES+ SIR KCPV++++AS+PP T+PCLC +AMTPERV RL+ + Sbjct 1556 VATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRP 1615 Query 1615 TSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY 1657 I VCSSFPLPKY+I GVQK++CS+ +LF VP+ + PR+Y Sbjct 1616 EQITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKY 1658 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain 3880)] Sequence ID: P36327.3 Length: 2485 Range 1: 1 to 1671 Score:2061 bits(5339), Expect:0.0, Method:Compositional matrix adjust., Identities:994/1676(59%), Positives:1237/1676(73%), Gaps:11/1676(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VDI+ DS FL+ALQR++P FEVE +QVT NDHANARAFSHLA KLIE E+DP T Sbjct 1 MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDT 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRM S KYHC+CPMR AEDP+RL YA KL ++ D+ + +K+ +L Sbjct 61 ILDIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELA 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 AVM+ PD ET T CLH D SCR VA+YQDVYAV PTSLYHQA KGVRVAYWIGFDT Sbjct 121 AVMSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM+ 
+AGAYPSYSTNWADE VL A+NIGLCS+D+ E R +SI+R K +KP + VL Sbjct 181 TPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLPSVFHL+GK ++TCRC+T+VSC+GYVVKRI ISPGLYGK Sbjct 241 FSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKP 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYA T H +GFL CK TDT++GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 301 SGYAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WAKE ++D EDE+ LG+R+R L C Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCC 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAF++ K ++YKRPDTQ+I KV ++F SFV+P + S+ L I LRTRI+ +L + + Sbjct 421 WAFRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSP 480 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 + + D +EA+ A EA+E REAE R LPPL A ++ +E DV+ + AGAG +ET Sbjct 481 LITAEDIQEAKCAADEAKEVREAEELRAVLPPLAADVEEPTLEADVDLMLQEAGAGSVET 540 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRG IKVT+ + +G Y VLSPQ VL+S+KLS IH LAEQV THSGR GRYAVE Y Sbjct 541 PRGLIKVTSYAGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPY 600 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 G+++VP G+AI +DFQ+LSESAT+VYNEREFVNR LHHIA HG ALNTDEE Y+ V+ Sbjct 601 HGKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKP 660 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + EY+YD+D+++C KKE GL L G+L +PP+HEFAYE LR RPA PY+ IGV+ Sbjct 661 SEHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVY 720 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV R + L+++ARTVDS+LLNGC P Sbjct 721 GVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKRIKGLDVNARTVDSVLLNGCKYP 780 Query 
781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+ LY+DEAFACH+GTL ALIA++RP+ K VLCGDPKQCGFFNMM +KV++NH ICTQV+ Sbjct 781 VETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQVF 839 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VT++VS+L Y+ +MRTTN I +DTTGSTKP DL+LTCFRGWVK Sbjct 840 HKSISRRCTKSVTSVVSTLFYDKRMRTTNPKETKIEIDTTGSTKPKQDDLILTCFRGWVK 899 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+G+EVMTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 900 QLQIDYKGNEVMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKT 959 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL GNF ATI+EW+ EH +IM I D FQNKANVCWAK+LVP Sbjct 960 LAGDPWIKTLTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVP 1019 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L+TAGI + QW+ + F+ D+A+S E+ LN++C R +G+DLDSGLFS P V + Sbjct 1020 VLKTAGIDMTTEQWN-TVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIR 1078 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 +NHWDN P M+G N E L R+YP T + + T + +++P N++P N Sbjct 1079 NNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVTTGRVYDMNTGTLRNYDPRINLVPVN 1138 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPH+LV H V+K+ G VL+V G L +P K V W++ A + Sbjct 1139 RRLPHALVLHHNEHPQSDFSSFVSKLKGRTVLVV-GEKLSVPGKTVDWLSDRP--EATFR 1195 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 L+LG+P + +YD++ IN+ TP++ HHYQQC DHA+KL ML + L PGG+ + Sbjct 1196 ARLDLGIPGDVPKYDIIFINVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSI 1255 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 YGYADR SE ++ + R+F+ SR KP TE+ F+F +D R + +++ L Sbjct 1256 GYGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRKARTHNPYKLSSTL 1315 Query 1321 NAAFVG-QATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFK 1379 + G + AGCAPSY V R DIA E ++NAAN +G PG GVC A+YKK+PESF Sbjct 1316 
TNIYTGSRLHEAGCAPSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFD 1375 Query 1380 NSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAI 1439 VG A+ V +IHAVGPNF+ SE EGD++LA AY +AK V SVAI Sbjct 1376 LQPIEVGKARLVKGAAKHIIHAVGPNFNKVSEIEGDKQLAEAYESIAKIVNDNNYKSVAI 1435 Query 1440 PLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE--L 1497 PLLSTG++SG KDRLTQSLNHL TALD+TDADV IYCRDK+WE + EA+ R VE Sbjct 1436 PLLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEIC 1495 Query 1498 LDEHISV---DCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWP 1554 + + SV D +++RVHP SSLAGRKGYST++G +SYLEGT+FHQ A D+AE+ MWP Sbjct 1496 ISDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWP 1555 Query 1555 KQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHV 1614 TEANEQVC+Y LGES+ SIR KCPV++++AS+PP T+PCLC +AMTPERV RL+ + Sbjct 1556 VATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRP 1615 Query 1615 TSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQETAQEVSST 1670 I VCSSFPLPKY+I GVQK++CS+ +LF VP+ + PR+Y T +E ST Sbjct 1616 EQITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPTVEENQST 1671 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain CPA201)] Sequence ID: Q8V294.3 Length: 2497 Range 1: 1 to 1658 Score:2043 bits(5294), Expect:0.0, Method:Compositional matrix adjust., Identities:993/1666(60%), Positives:1234/1666(74%), Gaps:17/1666(1%) Query 1 
MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VDI+ DS FL+ALQR++P FEVE +QVT NDHANARAFSHLA KLIE E++P T Sbjct 1 MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVEPSDT 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRM S KYHC+CPM+ AEDP+RL YA KL + D+ + +K+ +L Sbjct 61 ILDIGSAPARRMYSKHKYHCICPMKCAEDPDRLFKYAAKLKKNCKDITDKELDKKMKELA 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM+ PD ET T CLH D +CR VA+YQDVYAV PTSLYHQA KGVRVAYWIGFDT Sbjct 121 EVMSDPDLETETICLHDDETCRFEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM+ +AGAYPSYSTNWADE VL A+NIGLCS+D+ E R +SI+R K +KP + VL Sbjct 181 TPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKFLKPSNNVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLPSVFHL+GK ++TCRC+T+VSC+GYVVKRI ISPGLYGK Sbjct 241 FSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKP 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYA T H +GFL CK TDT+DGERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 301 SGYAATMHREGFLCCKVTDTLDGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WAKE ++D EDE+ LG+R+R L C Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCC 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSK-VPKTD 479 WAF+K K +VYKRPDTQ+I KV ++F SFV+P + S+ L I LRTRI+ LL + V + Sbjct 421 WAFRKHKITSVYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKLLEEPVDRPP 480 Query 480 LIPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIE 539 LI + D +EA++A EA+E +EAE R ALPPL A ++ +E DV+ + AGAG +E Sbjct 481 LIT-ADDIQEAKNAADEAKEVKEAEELRAALPPLSADVEEPALEADVDLMLQEAGAGSVE 539 Query 540 TPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEA 599 TPRG IKVT+ + +G Y VLSPQ VLRS+KL+ IH LAEQV THSGR GRYAVE Sbjct 540 
TPRGLIKVTSYAGEDKIGSYAVLSPQAVLRSEKLTCIHPLAEQVIVITHSGRKGRYAVEP 599 Query 600 YDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVR 659 Y G+++VP G AI +DFQ+LSESAT+VYNEREFVNR LHHIA HG ALNTDEE Y +V+ Sbjct 600 YHGKVVVPEGQAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYRVVK 659 Query 660 AERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGV 719 E EY+YD+D+++C KKE +GL L G+L +PP+HEFAYE LR RPA PY+ IGV Sbjct 660 PSEHEGEYLYDIDKKQCVKKELVSGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGV 719 Query 720 FGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNR 779 +GVPGSGKS IIK+ VT++DLV S KKENC EI DV + + L+++ARTVDS+LLNGC Sbjct 720 YGVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKH 779 Query 780 PVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQV 839 PV+ LY+DEAFACH+GTL ALIA++RP+ K VLCGDPKQCGFFNMM +KV++NH ICTQV Sbjct 780 PVETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQV 838 Query 840 YHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWV 899 +HKSISRRCT VT++VS+L Y+ +MRTTN + I +DTTGSTKP DL+LTCFRGWV Sbjct 839 FHKSISRRCTKSVTSVVSTLFYDKRMRTTNPRDSKIEIDTTGSTKPKKDDLILTCFRGWV 898 Query 900 KQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWK 959 KQLQIDY+G+E+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE K+VWK Sbjct 899 KQLQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDKIVWK 958 Query 960 TLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLV 1019 TL+GDPWIKTL G+F AT++EW+ EH +IM I D FQNKANVCWAK+LV Sbjct 959 TLAGDPWIKTLTAKYPGDFTATMEEWQAEHDAIMRHILEKPDPTDVFQNKANVCWAKALV 1018 Query 1020 PILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHY 1079 P+L+TAGI L QW+ + FKED+A+S E+ LN++C R +G+DLDSGLFS P V + Sbjct 1019 PVLKTAGIDLTTEQWN-TVDYFKEDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSI 1077 Query 1080 ADNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPA 1139 +NHWDN P M+G N E L R+YP T + + T + +++P N++P Sbjct 1078 RNNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVTTGRAYDMNTGTLRNYDPRINLVPV 1137 Query 1140 NRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADY 
1199 NRRLPH+LV +H V+K+ G VL+V G + + K V W++ D Sbjct 1138 NRRLPHALVTQHADYPPSDFSAFVSKLKGRTVLVV-GEKMSISGKTVDWLS----ETPDS 1192 Query 1200 TY--NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSL 1257 T+ L+LG+P+ L +YD+V +N+ T +R HHYQQC DHA+KL ML + L PGG+ Sbjct 1193 TFRARLDLGIPSELPKYDIVFVNVRTQYRYHHYQQCEDHAIKLSMLTKKACLHLNPGGTC 1252 Query 1258 LIRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMN 1317 + YGYADR SE ++ + R+F+ SR KP TE+ F+F FD R + ++ Sbjct 1253 VSIGYGYADRASESIIGAVARQFKFSRVCKPKVSKEETEVLFVFIGFDRKTRTHNPYKLS 1312 Query 1318 NQLNAAFVG-QATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPE 1376 + L + G + AGCAPSY V R DIA E +VNAAN +G PG GVC A+Y+K+PE Sbjct 1313 STLTNIYTGSRLHEAGCAPSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPE 1372 Query 1377 SFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNS 1436 SF VG A+ V + +IHAVGPNF+ SE EGD++LA AY +A+ + S Sbjct 1373 SFDLQPIEVGKARLVKGNSKHLIHAVGPNFNKVSEVEGDKQLAEAYESIARIINDNNYRS 1432 Query 1437 VAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE 1496 VAIPLLSTG+++G KDRL QSLNHL TALD+TDADV IYCRDK+WE + E + R VE Sbjct 1433 VAIPLLSTGIFAGNKDRLMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVE 1492 Query 1497 --LLDEHISV---DCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYT 1551 + E SV D +++RVHP SSLAGRKGYST++G +SYLEGT+FHQ A DMAE+ Sbjct 1493 EICISEDSSVAEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINA 1552 Query 1552 MWPKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRM 1611 MWP TEANEQVCLY LGES+ SIR KCPV++++AS+PP T+PCLC +AMTPERV RL+ Sbjct 1553 MWPAATEANEQVCLYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKA 1612 Query 1612 NHVTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY 1657 + I VCSSFPLPKY+I GVQK++CS +LF VP + PR+Y Sbjct 1613 SRPEQITVCSSFPLPKYRITGVQKIQCSHPILFSPKVPEYIHPRKY 1658 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: 
Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain Florida 91-469)] Sequence ID: Q4QXJ8.3 Length: 2494 Range 1: 1 to 1653 Score:2037 bits(5278), Expect:0.0, Method:Compositional matrix adjust., Identities:996/1662(60%), Positives:1222/1662(73%), Gaps:12/1662(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VD+DADS F+K+LQR +P FE+E QVT NDHANARAFSHLA KLIE E+D D Sbjct 1 MEKVHVDLDADSPFVKSLQRCFPHFEIEATQVTDNDHANARAFSHLATKLIEGEVDTDQV 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAP R S KYHC+CPM+SAEDP+RL YA KL + V D+ I+ K DL Sbjct 61 ILDIGSAPVRHTHSKHKYHCICPMKSAEDPDRLYRYADKLRKS--DVTDKCIASKAADLL 118 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM+ PDAETP+ C+HTD +CR VA+YQDVYAVHAPTS+Y+QA+KGVR YWIGFDT Sbjct 119 TVMSTPDAETPSLCMHTDSTCRYHGSVAVYQDVYAVHAPTSIYYQALKGVRTIYWIGFDT 178 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMY MAGAYP+Y+TNWADE VL+A+NIGL S+DL E GK+SIMR KK++P ++V+ Sbjct 179 TPFMYKNMAGAYPTYNTNWADESVLEARNIGLGSSDLHEKSFGKVSIMRKKKLQPTNKVI 238 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLP+VFHLKGK SFT RC+T+VSCEGYVVK+IT+SPG+YGK Sbjct 239 FSVGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCEGYVVKKITLSPGIYGKV 298 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 A T H +GFL CK TDT+ GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 299 DNLASTMHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLV 358 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WA+E R D+EDEK LG+RER+L C Sbjct 359 
GLNQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREHRADLEDEKGLGVRERSLVMGCC 418 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFK K ++YKRP TQ+I+KVPA F+SFV+P S GL I LR RIK L Sbjct 419 WAFKTHKITSIYKRPGTQTIKKVPAVFNSFVIPQPTSYGLDIGLRRRIKMLFDAKKAPAP 478 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 I D + + EAE EAE R ALPPL D VE D++ + AGAG +ET Sbjct 479 IITEADVAHLKGLQDEAEAVAEAEAVRAALPPLLPEVDKETVEADIDLIMQEAGAGSVET 538 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PR IKVT P + ++G Y VLSPQ VL S+KL+ IH LAEQV TH GRAGRY VE Y Sbjct 539 PRRHIKVTTYPGEEMIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPY 598 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 GR++VPSG AI DFQ+LSESAT+V+NEREFVNR LHHIA++G ALNTDEE Y++V++ Sbjct 599 HGRVIVPSGTAIPILDFQALSESATIVFNEREFVNRYLHHIAVNGGALNTDEEYYKVVKS 658 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 T+ EYV+D+D ++C KK +A + LVG+L +PP+HEFAYE L+ RPA P+K IGV+ Sbjct 659 TETDSEYVFDIDAKKCVKKGDAGPMCLVGELVDPPFHEFAYESLKTRPAAPHKVPTIGVY 718 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV R R ++I+ARTVDS+LLNG Sbjct 719 GVPGSGKSGIIKSAVTKRDLVVSAKKENCMEIIKDVKRMRGMDIAARTVDSVLLNGVKHS 778 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 VD LY+DEAFACH+GTLLALIA+V+P+ KVVLCGDPKQCGFFNMM +KV++NH ICT+VY Sbjct 779 VDTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVY 837 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VT+IVS+L Y+ +MRT N N I++DTT +TKP D++LTCFRGWVK Sbjct 838 HKSISRRCTKTVTSIVSTLFYDKRMRTVNPCNDKIIIDTTSTTKPLKDDIILTCFRGWVK 897 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 898 QLQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKT 957 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL 
GNF AT++EW+ EH +IMA I + D FQNK NVCWAK+L P Sbjct 958 LAGDPWIKTLTASYPGNFTATLEEWQAEHDAIMAKILETPASSDVFQNKVNVCWAKALEP 1017 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L TA I L QW + I AFK+D+AYSPE+ALN CTR +GVD+DSGLFS P V + Y Sbjct 1018 VLATANITLTRSQW-ETIPAFKDDKAYSPEMALNFFCTRFFGVDIDSGLFSAPTVPLTYT 1076 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 + HWDN PG M+G A L R+YP +T + V T I+D+NP N++P N Sbjct 1077 NEHWDNSPGPNMYGLCMRTAKELARRYPCILKAVDTGRVADVRTDTIKDYNPLINVVPLN 1136 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLV HR LV K+ G VL+V G + +P KRV + P Y Sbjct 1137 RRLPHSLVVTHRYTGNGDYSQLVTKMTGKTVLVV-GTPMNIPGKRVETLGPSP--QCTYK 1193 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 L+LG+PA LG+YD++ IN+ TP+R HHYQQC DHA+ ML ++ L GG+ + Sbjct 1194 AELDLGIPAALGKYDIIFINVRTPYRHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCIAL 1253 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 YG ADR +E ++ + R FR SR +P C NTE+ F+F DNG ++ L Sbjct 1254 GYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLQDQDRLSVVL 1313 Query 1321 NAAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFK 1379 N + G AG AP+YRV R DI K+++E +VNAAN +G PG GVC A+Y+KWP +F Sbjct 1314 NNIYQGSTQHEAGRAPAYRVVRGDITKSNDEVIVNAANNKGQPGSGVCGALYRKWPGAFD 1373 Query 1380 NSATPVGTAKTVMCGTYP-VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVA 1438 PV T K + P VIHAVGPNFS SE+EGD++L+ Y ++A+ + V+ Sbjct 1374 KQ--PVATGKAHLVKHSPNVIHAVGPNFSRLSENEGDQKLSEVYMDIARIINNERFTKVS 1431 Query 1439 IPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQV-EL 1497 IPLLSTG+Y+GGKDR+ QSLNHLFTA+D+TDAD+ IYC DK+WE +I EAI + V EL Sbjct 1432 IPLLSTGIYAGGKDRVMQSLNHLFTAMDTTDADITIYCLDKQWESRIKEAITRKESVEEL 1491 Query 1498 LDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQT 1557 ++ VD +++RVHP SSLAGR GYSTTEG +YSYLEGTRFHQTA D+AE+Y MWP + Sbjct 1492 TEDDRPVDIELVRVHPLSSLAGRPGYSTTEGKVYSYLEGTRFHQTAKDIAEIYAMWPNKQ 1551 Query 1558 
EANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSI 1617 EANEQ+CLY LGES+ SIR KCPV++++ASSPP T+PCLC YAMT ERV RLRM Sbjct 1552 EANEQICLYVLGESMNSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQF 1611 Query 1618 IVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKS 1659 VCSSF LPKY+I GVQK++CSK ++F VP + PR++ S Sbjct 1612 AVCSSFQLPKYRITGVQKIQCSKPVIFSGTVPPAIHPRKFAS 1653 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Venezuelan equine encephalitis virus (strain Mena II)] Sequence ID: Q9WJC7.3 Length: 2499 Range 1: 1 to 1658 Score:2036 bits(5275), Expect:0.0, Method:Compositional matrix adjust., Identities:993/1664(60%), Positives:1229/1664(73%), Gaps:13/1664(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VDI+ DS FL+ALQR++P FEVE +QVT NDHANARAFSHLA KLIE E++P T Sbjct 1 MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVEPSDT 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAPARRM S KYHC+CPM+ AEDP+RL YA KL ++ D+ + +K+ +L Sbjct 61 ILDIGSAPARRMYSKHKYHCICPMKCAEDPDRLFKYAAKLKKNCKEITDKELDKKMKELA 120 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM PD ET T CLH D +CR VA+YQDVYAV PTSLYHQA KGVRVAYWIGFDT Sbjct 121 EVMNDPDLETETICLHDDETCRFEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDT 180 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFM+ +AGAYPSYSTNWADE VL A+NIGLCS+D+ E R +SI+R K +KP + VL Sbjct 181 
TPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKFLKPSNNVL 240 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLPSVFHL+GK ++TCRC+T+VSC+GYVVKRI ISPGLYGK Sbjct 241 FSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKP 300 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 +GYA T H +GFL CK TDT++GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 301 SGYAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLV 360 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAF++WAKE ++D EDE+ LG+R+R L C Sbjct 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCC 420 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSK-VPKTD 479 WAF+K K +VYKRPDTQ+I KV ++F SFV+P + SS L I LRTRIK LL + V + Sbjct 421 WAFRKHKITSVYKRPDTQTIIKVNSDFHSFVLPRIGSSTLEIGLRTRIKKLLEEPVDRPP 480 Query 480 LIPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIE 539 LI + D +EA++A EA+E +EAE R ALPPL A ++ +E DV+ + AGAG +E Sbjct 481 LIT-ADDIQEAKNAADEAKEVKEAEELRAALPPLSADVEEPALEADVDLMLQEAGAGSVE 539 Query 540 TPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEA 599 TPRG IKVT+ + +G Y VLSPQ VLRS+KL+ IH LAEQV THSGR GRYAVE Sbjct 540 TPRGLIKVTSYGGEDKIGSYAVLSPQAVLRSEKLTCIHPLAEQVIVITHSGRKGRYAVEP 599 Query 600 YDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVR 659 Y G+++VP G AI +DFQ+LSESAT+VYNEREFVNR LHHIA HG ALNTDEE Y +V+ Sbjct 600 YHGKVVVPEGQAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYRVVK 659 Query 660 AERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGV 719 E EY+YD+D+++C KKE +GL L G+L +PP+HEFAYE LR RPA PY+ IGV Sbjct 660 PSEHEGEYLYDIDKKQCVKKELVSGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGV 719 Query 720 FGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNR 779 +GVPGSGKS IIK+ VT++DLV S KKENC EI DV + + L+++ARTVDS+LLNGC Sbjct 720 YGVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKH 779 Query 780 
PVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQV 839 PV+ LY+DEAFACH+GTL ALIA++RP+ K VLCGDPKQCGFFNMM +KV++NH ICTQV Sbjct 780 PVETLYIDEAFACHAGTLRALIAIIRPK-KAVLCGDPKQCGFFNMMCLKVHFNHEICTQV 838 Query 840 YHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWV 899 +HKSISRRCT VT++VS+L Y+ +MRTTN + I +DTTGSTK DL+LTCFRGWV Sbjct 839 FHKSISRRCTKSVTSVVSTLFYDKRMRTTNPRDSKIEIDTTGSTKSKKEDLILTCFRGWV 898 Query 900 KQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWK 959 KQLQIDY+G+E+MTAAASQGLTRK VYAVR KVNENPLYA TSEHVNVLLTRTE K+VWK Sbjct 899 KQLQIDYKGNEIMTAAASQGLTRKSVYAVRYKVNENPLYAPTSEHVNVLLTRTEDKIVWK 958 Query 960 TLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLV 1019 TL+GDPWIKTL G+F AT++EW+ EH +IM I D FQNKANVCWAK+LV Sbjct 959 TLAGDPWIKTLTAKYPGDFTATMEEWQAEHDAIMRHILEKPDPTDVFQNKANVCWAKALV 1018 Query 1020 PILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHY 1079 P+L+TAGI L QW+ + FKED+A+S E+ LN++C R +G+DLDSGLFS P V + Sbjct 1019 PVLKTAGIDLTTEQWN-TVDYFKEDKAHSAEIVLNQLCVRYFGLDLDSGLFSAPTVPLSI 1077 Query 1080 ADNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPA 1139 +NHWDN P M+G N E L R+YP T + + T + +++P N++P Sbjct 1078 RNNHWDNSPSPNMYGLNHEVVRQLSRRYPQLPRAVTTGRVYDMNTGTLRNYDPRINLVPV 1137 Query 1140 NRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADY 1199 NRRLPH+LV +H V+K+ G VL+V G + + K V W++ A + Sbjct 1138 NRRLPHALVTQHADHPPSDFSAFVSKLKGRTVLVV-GEKMNISGKAVDWLSETP--DATF 1194 Query 1200 TYNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLI 1259 L+LG+P L +YD+V +N+ T +R HHYQQC DHA+KL ML + L PGG+ + Sbjct 1195 RARLDLGIPTELPKYDIVFVNVRTQYRYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVS 1254 Query 1260 RAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQ 1319 YGYADR SE ++ + R+F+ SR KP TE+ F+F FD R + +++ Sbjct 1255 IGYGYADRASESIIGAVARQFKFSRVCKPKVSKEETEVLFVFIGFDRKTRTHNPYKLSST 1314 Query 1320 LNAAFVGQATR-AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESF 1378 L + G AGCAPSY V R DIA E +VNAAN +G PG GVC A+Y+K+PESF Sbjct 1315 
LTNIYTGSGLHEAGCAPSYHVVRGDIATATEGVIVNAANSKGQPGSGVCGALYRKYPESF 1374 Query 1379 KNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVA 1438 VG A+ V + +IHAVGPNFS SE EGD++LA AY +AK + SVA Sbjct 1375 DLQPIEVGKARLVKGSSKHIIHAVGPNFSKVSEVEGDKQLAEAYESIAKIINDNNYRSVA 1434 Query 1439 IPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVE-- 1496 IPLLSTG+++G KDRL QSLNHL TALD+TDADV IYCRDK+WE + E + R VE Sbjct 1435 IPLLSTGIFAGNKDRLMQSLNHLLTALDTTDADVAIYCRDKKWEVTLKEVVARREAVEEI 1494 Query 1497 LLDEHISV---DCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMW 1553 + E SV D +++RVHP SSLAGRKGYST++G +SYLEGT+FHQ A DMAE+ MW Sbjct 1495 CISEDSSVAEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDMAEINAMW 1554 Query 1554 PKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNH 1613 P TEANEQVCLY LGES+ SIR KCPV++++AS+PP T+PCLC +AMTPERV RL+ + Sbjct 1555 PTATEANEQVCLYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASR 1614 Query 1614 VTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY 1657 I VCSSFPLPKY+I GVQK++CS +LF VP + PR+Y Sbjct 1615 PEQITVCSSFPLPKYRITGVQKIQCSHPILFSPKVPEYIHPRKY 1658 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-0.0155)] Sequence ID: Q306W6.3 Length: 2471 Range 1: 1 to 1657 Score:2036 bits(5274), Expect:0.0, Method:Compositional matrix adjust., Identities:988/1667(59%), Positives:1228/1667(73%), Gaps:14/1667(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ 
V+VD+DADS ++K+LQ+ +P FE+E QVT NDHANARAFSHLA KLIE E+DPD Sbjct 1 MEKVHVDLDADSPYVKSLQKCFPHFEIEATQVTDNDHANARAFSHLATKLIESEVDPDQV 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAP R S KYHC+CPM SAEDP+RL YA KL + V DR I+ K DL Sbjct 61 ILDIGSAPVRHTHSKHKYHCICPMISAEDPDRLHRYADKLRKS--DVTDRFIASKAADLL 118 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM+ PD ETP+ C+HTD +CR VA+YQDVYAVHAPTS+YHQA+KGVR YWIGFDT Sbjct 119 TVMSTPDVETPSLCMHTDSTCRYHGTVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDT 178 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMY MAGAYP+Y+TNWADE VL+A+NIGLCS+DL E R GK+SIMR KK++P ++V+ Sbjct 179 TPFMYKNMAGAYPTYNTNWADESVLEARNIGLCSSDLHEQRFGKISIMRKKKLQPTNKVV 238 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLP+VFHLKGK SFT RC+T+VSCEGYVVK+ITISPG+YGK Sbjct 239 FSVGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCEGYVVKKITISPGIYGKV 298 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 A T H +GFL CK TDT+ GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 299 DNLASTMHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLV 358 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTM NYLLP+VAQAFS+WA+E D+EDEK LG+RER+L C Sbjct 359 GLNQRIVVNGRTQRNTNTMPNYLLPIVAQAFSRWAREYHADLEDEKDLGVRERSLVMGCC 418 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFK K ++YK+P TQ+ +KVPA F+SFVVP L S GL I LR RIK LL + K Sbjct 419 WAFKTHKITSIYKKPGTQTTKKVPAVFNSFVVPQLTSYGLDIELRRRIKMLLEEKKKPAP 478 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 I D + ++EAE EAE R ALPPL + VE D++ + AGAG +ET Sbjct 479 IITEADVAHLKGMQEEAEVVAEAEAIRAALPPLLPEVERETVEADIDLIMQEAGAGSVET 538 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PR IKVT P + ++G Y VLSPQ VL S+KL+ IH LAEQV TH GRAGRY VE Y Sbjct 539 PRRHIKVTTYPGEEMIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPY 598 Query 601 
DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 GR++VPSG AI DFQ+LSESAT+VYNEREFVNR LHHIA++G ALNTDEE Y+++R+ Sbjct 599 HGRVIVPSGTAIPIPDFQALSESATIVYNEREFVNRYLHHIAINGGALNTDEEYYKVLRS 658 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 E EYV+D+D ++C KK EA + LVGDL +PP+HEFAYE L+ RPA P+K IGV+ Sbjct 659 GEAESEYVFDIDAKKCVKKAEAGPMCLVGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVY 718 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV R R ++++ARTVDS+LLNG P Sbjct 719 GVPGSGKSGIIKSAVTKRDLVVSAKKENCTEIIKDVKRMRGMDVAARTVDSVLLNGVKHP 778 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 VD LY+DEAFACH+GTLLALIA+V+P+ KVVLCGDPKQCGFFNMM +KV++NH ICT+VY Sbjct 779 VDTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVY 837 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VTAIVS+L Y+ +MRT N + I++DTT +TKP D++LTCFRGWVK Sbjct 838 HKSISRRCTRTVTAIVSTLFYDKRMRTVNPCSDKIIIDTTSTTKPLKDDIILTCFRGWVK 897 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 898 QLQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKT 957 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL G F AT++EW+ EH +IM + D +QNK +VCWAK+L P Sbjct 958 LAGDPWIKTLTAHYPGEFSATLEEWQAEHDAIMKRVLETPANSDVYQNKVHVCWAKALEP 1017 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L TA I L QW + I AFK+D+A+SPE+ALN +CTR +GVD+DSGLFS P V + Y Sbjct 1018 VLATANITLTRSQW-ETIPAFKDDKAFSPEMALNFLCTRFFGVDIDSGLFSAPTVPLTYT 1076 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 + HWDN PG +G A L R+YP +T + V T I D+NP N++P N Sbjct 1077 NEHWDNSPGPNRYGLCMRTAKELARRYPCILKAVDTGRVADVRTNTIRDYNPMINVVPLN 1136 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLV HR L++K+ G +L++ G + +P KRV + P G T Sbjct 1137 
RRLPHSLVVSHRYTGDGNYSQLLSKLTGKTILVI-GTPINVPGKRVETLGP----GPQCT 1191 Query 1201 Y--NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLL 1258 Y +L+LG+P+ +G+YD++ +N+ TP++ HHYQQC DHA+ ML ++ L GG+ + Sbjct 1192 YKADLDLGIPSMIGKYDIIFVNVRTPYKHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCV 1251 Query 1259 IRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNN 1318 YG ADR +E ++ + R FR SR +P C NTE+ F+F DNG ++ Sbjct 1252 ALGYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLRDQDQLSV 1311 Query 1319 QLNAAFVGQAT-RAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPES 1377 LN + G AG AP+YRV R DI+K+ +E +VNAAN +G PG GVC A+YKKWP + Sbjct 1312 VLNNIYQGSTQYEAGRAPAYRVIRGDISKSTDEVIVNAANNKGQPGAGVCGALYKKWPGA 1371 Query 1378 FKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSV 1437 F + GTA V T +IHAVGPNFS SE EG+++L+ Y ++AK + + N V Sbjct 1372 FDKAPIATGTAHLVK-HTPNIIHAVGPNFSRMSEVEGNQKLSEVYMDIAKIINKERYNKV 1430 Query 1438 AIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQV-E 1496 +IPLLSTGVY+GGKDR+ QSLNHLFTA+D+TDADV IYC DK+WE +I +AI + V E Sbjct 1431 SIPLLSTGVYAGGKDRVMQSLNHLFTAMDTTDADVTIYCLDKQWETRIKDAIARKESVEE 1490 Query 1497 LLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQ 1556 L+++ VD +++RVHP SSL GR GYST EG ++SYLEGTRFHQTA D+AE+Y MWP + Sbjct 1491 LVEDDKPVDIELVRVHPQSSLVGRPGYSTNEGKVHSYLEGTRFHQTAKDIAEIYAMWPNK 1550 Query 1557 TEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTS 1616 EANEQ+CLY LGES+ SIR KCPV++++ASSPP T+PCLC YAMT ERV RLRM Sbjct 1551 QEANEQICLYVLGESMTSIRSKCPVEESEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQ 1610 Query 1617 IIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQET 1663 VCSSF LPKY+I GVQK++C+K ++F VP + PR++ + +ET Sbjct 1611 FAVCSSFQLPKYRITGVQKIQCNKPVIFSGVVPPAIHPRKFSTVEET 1657 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: 
Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Eastern equine encephalitis virus (strain PE-3.0815)] Sequence ID: Q306W8.3 Length: 2474 Range 1: 1 to 1657 Score:2031 bits(5261), Expect:0.0, Method:Compositional matrix adjust., Identities:988/1667(59%), Positives:1231/1667(73%), Gaps:14/1667(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ V+VD+DADS ++K LQ+ +P FE+E QVT NDHANARAFSHLA KLIE E+DPD Sbjct 1 MEKVHVDLDADSPYVKLLQKCFPHFEIEATQVTDNDHANARAFSHLATKLIESEVDPDQV 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAP R S KYHC+CPM SAEDP+RL YA KL + V DR I+ K DL Sbjct 61 ILDIGSAPVRHTHSKHKYHCICPMISAEDPDRLQRYADKLRKS--DVTDRFIASKAADLL 118 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM+ PD ETP+ C+HTD +CR VA+YQDVYAVHAPTS+YHQA+KGVR YWIGFDT Sbjct 119 TVMSTPDVETPSLCMHTDSTCRYHGTVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDT 178 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMY MAGAYP+Y+TNWADE VL+A+NIGLCS+DL E R GK+SIMR KK++P ++V+ Sbjct 179 TPFMYKNMAGAYPTYNTNWADESVLEARNIGLCSSDLHEKRLGKISIMRKKKLQPTNKVV 238 Query 241 FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLP+VFHLKGK SFT RC+T+VSC+GYVVK+ITISPG+YGK Sbjct 239 FSVGSTIYTEERILLRSWHLPNVFHLKGKTSFTGRCNTIVSCDGYVVKKITISPGIYGKV 298 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 A T H +GFL CK TDT+ GERVSF VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 299 DNLASTLHREGFLSCKVTDTLRGERVSFPVCTYVPATLCDQMTGILATDVSVDDAQKLLV 358 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WA+E R D+EDEK LG+RER+L C Sbjct 359 GLNQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREYRADLEDEKDLGVRERSLVMGCC 
418 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFK K ++YK+P TQ+I+KVPA F+SFV+P S GL+I LR RIK LL + K Sbjct 419 WAFKTHKITSIYKKPGTQTIKKVPAVFNSFVIPQFNSYGLNIGLRRRIKMLLEEKRKPAP 478 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 I D + ++EAE EAE R ALPPL + +E D++ + AGAG +ET Sbjct 479 IITEADVAHLKGMQEEAEAVAEAEAVRAALPPLLPEVERETIEADIDLIMQEAGAGSVET 538 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PR IKVT P + +G Y VLSPQ VL S+KL+ IH LAEQV TH GRAGRY VE Y Sbjct 539 PRRHIKVTTYPGEETIGSYAVLSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPY 598 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 GR++VPSG AI DFQ+LSESAT+VYNEREFVNR LHHIA++G A+NTDEE Y+++R+ Sbjct 599 HGRVVVPSGTAIPIPDFQALSESATIVYNEREFVNRYLHHIAINGGAINTDEEYYKVLRS 658 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + EYV+D+D R+C KK +A + LVG+L +PP+HEFAYE L+ RPA P+K IGV+ Sbjct 659 SEADSEYVFDIDARKCVKKADAGPMCLVGELVDPPFHEFAYESLKTRPAAPHKVPTIGVY 718 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV R R ++I+ARTVDS+LLNG P Sbjct 719 GVPGSGKSGIIKSAVTKRDLVVSAKKENCTEIIKDVKRMRGMDIAARTVDSVLLNGVKHP 778 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 VD LY+DEAFACH+GTLLALIA+V+P+ KVVLCGDPKQCGFFNMM +KV++NH ICT+VY Sbjct 779 VDTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHEICTEVY 837 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VTAIVS+L Y+ +MRT N + I++DTT +TKP D++LTCFRGWVK Sbjct 838 HKSISRRCTKTVTAIVSTLFYDKRMRTVNPCSDKIIIDTTSTTKPQRDDIILTCFRGWVK 897 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLYA TSEHVNVLLTRTE ++VWKT Sbjct 898 QLQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYAQTSEHVNVLLTRTEKRIVWKT 957 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL G F AT++EW+ EH +IM I + D +QNK +VCWAK+L P Sbjct 958 
LAGDPWIKTLTAHYPGEFSATLEEWQAEHDAIMERILETPASSDVYQNKVHVCWAKALEP 1017 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L TA I L QW + I AFK+D+A+SPE+ALN +CTR +GVD+DSGLFS P V + Y Sbjct 1018 VLATANITLTRSQW-ETIPAFKDDKAFSPEMALNFLCTRFFGVDIDSGLFSAPTVPLTYT 1076 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 + HWDN PG +G A L R+YP +T + V T I+D++P N++P N Sbjct 1077 NEHWDNSPGPNRYGLCMRTAKELARRYPCILKAVDTGRLADVRTNTIKDYSPLINVVPLN 1136 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSLV HR L++K+ G VL++ G + +P KRV + P G T Sbjct 1137 RRLPHSLVVSHRYTGDGNYSQLLSKLIGKTVLVI-GTPISVPGKRVETLGP----GPQCT 1191 Query 1201 Y--NLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLL 1258 Y +L+LG+P+T+G+YD++ +N+ TP++ HHYQQC DHA+ ML ++ L GG+ + Sbjct 1192 YKADLDLGIPSTIGKYDIIFVNVRTPYKHHHYQQCEDHAIHHSMLTRKAVDHLNKGGTCV 1251 Query 1259 IRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNN 1318 YG ADR +E ++ + R FR SR +P C NTE+ F+F DNG ++ Sbjct 1252 ALGYGTADRATENIISAVARSFRFSRVCQPKCAWENTEVAFVFFGKDNGNHLRDQDQLSI 1311 Query 1319 QLNAAFVGQAT-RAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPES 1377 LN + G AG AP+YRV R DI+K+ +E +VNAAN +G PG GVC A+YKKWP + Sbjct 1312 VLNNIYQGSTQYEAGRAPAYRVIRGDISKSTDEAIVNAANNKGQPGAGVCGALYKKWPGA 1371 Query 1378 FKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSV 1437 F GTA V T +IHAVGPNFS SE EG+++L+ Y ++AK + R N V Sbjct 1372 FDKVPIATGTAHLVK-HTPNIIHAVGPNFSRVSEVEGNQKLSEVYMDIAKIINRERYNKV 1430 Query 1438 AIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQV-E 1496 +IPLLSTG+Y+GGKDR+ QSLNHLFTA+D+TDADV IYC DK+WE +I +AI + V E Sbjct 1431 SIPLLSTGIYAGGKDRVMQSLNHLFTAMDTTDADVTIYCLDKQWEARIKDAIARKESVEE 1490 Query 1497 LLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQ 1556 L+++ VD +++RVHP SSL GR GYST EG ++SYLEGTRFHQTA D+AE+Y MWP + Sbjct 1491 LVEDDKPVDIELVRVHPLSSLVGRPGYSTDEGKVHSYLEGTRFHQTAKDIAEIYAMWPNK 1550 Query 1557 TEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTS 1616 
EANEQ+CLY LGES+ SIR KCPV+D++ASSPP T+PCLC YAMT ERV RLRM Sbjct 1551 QEANEQICLYVLGESMTSIRSKCPVEDSEASSPPHTIPCLCNYAMTAERVYRLRMAKNEQ 1610 Query 1617 IIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQET 1663 VCSSF LPKY+I GVQK++C+K ++F VP + PR++ + +ET Sbjct 1611 FAVCSSFQLPKYRITGVQKIQCNKPVIFSGVVPPAIHPRKFSAIEET 1657 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Western equine encephalitis virus] Sequence ID: P13896.3 Length: 2467 Range 1: 1 to 1650 Score:2019 bits(5232), Expect:0.0, Method:Compositional matrix adjust., Identities:976/1659(59%), Positives:1234/1659(74%), Gaps:11/1659(0%) Query 1 MDPVYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDST 60 M+ ++VD+DADS ++K+LQR +P FE+E RQVT NDHANARAFSH+A KLIE E+D D Sbjct 1 MERIHVDLDADSPYVKSLQRTFPQFEIEARQVTDNDHANARAFSHVATKLIESEVDRDQV 60 Query 61 ILDIGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 ILDIGSAP R S+ +YHC+CPM SAEDP+RL YA +L + + D+NI+ K DL Sbjct 61 ILDIGSAPVRHAHSNHRYHCICPMISAEDPDRLQRYAERLKKS--DITDKNIASKAADLL 118 Query 121 AVMAVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDT 180 VM+ PDAETP+ C+HTD +CR VA+YQDVYAVHAPTS+YHQA+KGVR YWIGFDT Sbjct 119 EVMSTPDAETPSLCMHTDATCRYFGSVAVYQDVYAVHAPTSIYHQALKGVRTIYWIGFDT 178 Query 181 TPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVL 240 TPFMY MAG+YP+Y+TNWADE+VL+A+NIGL ++DL E R GKLSI+R K+++P ++++ Sbjct 179 TPFMYKNMAGSYPTYNTNWADERVLEARNIGLGNSDLQESRLGKLSILRKKRLQPTNKII 238 Query 241 
FSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKT 300 FSVGST+Y E R LL+SWHLP+VFHLKGK +FT RC T+VSCEGYV+K+ITISPGLYGK Sbjct 239 FSVGSTIYTEDRSLLRSWHLPNVFHLKGKSNFTGRCGTIVSCEGYVIKKITISPGLYGKV 298 Query 301 TGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLV 360 A T H +GFL CK TDT+ GERVSF+VCTYVPAT+CDQMTGILAT+V+ +DAQKLLV Sbjct 299 ENLASTMHREGFLSCKVTDTLRGERVSFAVCTYVPATLCDQMTGILATDVSVDDAQKLLV 358 Query 361 GLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCL 420 GLNQRIVVNGRTQRNTNTM+NYLLPVVAQAFS+WA+E R D++DEK LG+RERTLT C Sbjct 359 GLNQRIVVNGRTQRNTNTMQNYLLPVVAQAFSRWAREHRADLDDEKELGVRERTLTMGCC 418 Query 421 WAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDL 480 WAFK QK ++YK+P TQ+I+KVPA FDSFV+P L S GL + R R+K LL K Sbjct 419 WAFKTQKITSIYKKPGTQTIKKVPAVFDSFVIPRLTSHGLDMGFRRRLKLLLEPTVKPAP 478 Query 481 IPYSGDAKEARDAEKEAEEEREAELTREALPPLQAAQDDVQVEIDVEQLEDRAGAGIIET 540 D + R ++EAEE AE REALPPL + VE +V+ + AGAG +ET Sbjct 479 AITMADVEHLRGLQQEAEEVAAAEEIREALPPLLPEIEKETVEAEVDLIMQEAGAGSVET 538 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRG I+VT+ P + +G Y +LSPQ VL S+KL+ IH LAEQV TH GRAGRY VE Y Sbjct 539 PRGHIRVTSYPGEEKIGSYAILSPQAVLNSEKLACIHPLAEQVLVMTHKGRAGRYKVEPY 598 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 G+++VP G A+ +DFQ+LSESAT+V+NEREFVNR LHHIA++G ALNTDEE Y+ V+ Sbjct 599 HGKVIVPEGTAVPVQDFQALSESATIVFNEREFVNRYLHHIAINGGALNTDEEYYKTVKT 658 Query 661 ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + T+ EYV+D+D R+C K+E+A L L GDL +PP+HEFAYE L+ RPA P+K IGV+ Sbjct 659 QDTDSEYVFDIDARKCVKREDAGPLCLTGDLVDPPFHEFAYESLKTRPAAPHKVPTIGVY 718 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 GVPGSGKS IIK+ VT++DLV S KKENC EI DV R R ++++ARTVDS+LLNG P Sbjct 719 GVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVRRMRRMDVAARTVDSVLLNGVKHP 778 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNICTQVY 840 V+ LY+DEAFACH+GTLLALIA+V+P+ KVVLCGDPKQCGFFNMM +KV++NH+ICT+VY Sbjct 779 
VNTLYIDEAFACHAGTLLALIAIVKPK-KVVLCGDPKQCGFFNMMCLKVHFNHDICTEVY 837 Query 841 HKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVK 900 HKSISRRCT VTAIVS+L Y+ +M+T N I++DTTG+TKP DL+LTCFRGWVK Sbjct 838 HKSISRRCTQTVTAIVSTLFYDKRMKTVNPCADKIIIDTTGTTKPHKDDLILTCFRGWVK 897 Query 901 QLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKT 960 QLQIDY+ HE+MTAAASQGLTRKGVYAVR KVNENPLY+ TSEHVNVLLTRTE ++VWKT Sbjct 898 QLQIDYKNHEIMTAAASQGLTRKGVYAVRYKVNENPLYSQTSEHVNVLLTRTEKRIVWKT 957 Query 961 LSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKSLVP 1020 L+GDPWIKTL G+F A++ +W+ EH +IMA + + T D FQNK NVCWAK+L P Sbjct 958 LAGDPWIKTLTAKYPGDFTASLDDWQREHDAIMARVLDKPQTADVFQNKVNVCWAKALEP 1017 Query 1021 ILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLVSVHYA 1080 +L TA I L +QW + + FK DRAYSPE+ALN CTR +GVDLDSGLFS P V++ Y Sbjct 1018 VLATANIVLTRQQW-ETLHPFKHDRAYSPEMALNFFCTRFFGVDLDSGLFSAPTVALTYR 1076 Query 1081 DNHWDNRPGGKMFGFNPEAASILERKYPFTKGKWNTNKQICVTTRRIEDFNPNTNIIPAN 1140 D HWDN PG M+G N E A L R+YP +T + + I+D++P N++P N Sbjct 1077 DQHWDNSPGKNMYGLNREVAKELSRRYPCITKAVDTGRVADIRNNTIKDYSPTINVVPLN 1136 Query 1141 RRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTWVAPLGIRGADYT 1200 RRLPHSL+ +H+ ++K+ G VL++ G + +P K+V + PL Sbjct 1137 RRLPHSLIVDHKGQGTTDHSGFLSKMKGKSVLVI-GDPISIPGKKVESMGPLPTN--TIR 1193 Query 1201 YNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 +L+LG+P+ +G+YD++ +N+ TP+R HHYQQC DHA+ ML ++ L GG+ + Sbjct 1194 CDLDLGIPSHVGKYDIIFVNVRTPYRNHHYQQCEDHAIHHSMLTCKAVHHLNTGGTCVAI 1253 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGRRNFTTHVMNNQL 1320 YG ADR +E ++ + R FR +R +P NTE+ F+F DNG + L Sbjct 1254 GYGLADRATENIITAVARSFRFTRVCQPKNTAENTEVLFVFFGKDNGNHTHDQDRLGVVL 1313 Query 1321 NAAFVGQATR--AGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESF 1378 + + G +TR AG AP+YRV R DI+K+ ++ +VNAAN +G PG GVC A+Y+KWP +F Sbjct 1314 DNIYQG-STRYEAGRAPAYRVIRGDISKSADQAIVNAANSKGQPGSGVCGALYRKWPAAF 1372 Query 1379 KNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVA 1438 VGTA+ V 
+IHAVGPNFS E EGD +LAAAY +A V + ++ Sbjct 1373 DRQPIAVGTARLVKHEPL-IIHAVGPNFSKMPEPEGDLKLAAAYMSIASIVNAERITKIS 1431 Query 1439 IPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELL 1498 +PLLSTG+YSGGKDR+ QSL+HLFTA D+TDADV IYC DK+WE +I EAI + VE+L Sbjct 1432 VPLLSTGIYSGGKDRVMQSLHHLFTAFDTTDADVTIYCLDKQWETRIIEAIHRKESVEIL 1491 Query 1499 DEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTE 1558 D+ VD D++RVHP+SSLAGR GYS EG LYSYLEGTRFHQTA D+AE++ MWP ++E Sbjct 1492 DDDKPVDIDLVRVHPNSSLAGRPGYSVNEGKLYSYLEGTRFHQTAKDIAEIHAMWPNKSE 1551 Query 1559 ANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSII 1618 ANEQ+CLY LGES+ SIR KCPV++++AS+PP T+PCLC YAMT ERV RLR Sbjct 1552 ANEQICLYILGESMSSIRSKCPVEESEASAPPHTLPCLCNYAMTAERVYRLRSAKKEQFA 1611 Query 1619 VCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREY 1657 VCSSF LPKY+I GVQK++CSK +LF VP V PR+Y Sbjct 1612 VCSSFLLPKYRITGVQKLQCSKPVLFSGVVPPAVHPRKY 1650 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Aura virus] Sequence ID: Q86924.3 Length: 2499 Range 1: 6 to 1675 Score:2002 bits(5186), Expect:0.0, Method:Compositional matrix adjust., Identities:983/1676(59%), Positives:1228/1676(73%), Gaps:23/1676(1%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTILD 63 V+VD+D S F+ LQ+++P FE+ +QVTPNDHANARAFSHLA KLIE EI TILD Sbjct 6 VHVDVDPQSPFVLQLQKSFPQFEIVAQQVTPNDHANARAFSHLASKLIEHEIPTSVTILD 65 Query 64 IGSAPARRMMSDRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQAVM 123 IGSAPARRM 
S+ KYHCVCPMRS EDP+RL NYA +LA AG++ ++ + +K+ DL++V+ Sbjct 66 IGSAPARRMYSEHKYHCVCPMRSPEDPDRLMNYASRLADKAGEITNKRLHDKLADLKSVL 125 Query 124 AVPDAETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGFDTTPF 183 PDAET T C H DV CR A+V++ Q+VY ++AP+++YHQA+KGVR YWIGFDTT F Sbjct 126 ESPDAETGTICFHNDVICRTTAEVSVMQNVY-INAPSTIYHQALKGVRKLYWIGFDTTQF 184 Query 184 MYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDRVLFSV 243 M+++MAG+YPSY+TNWADE+VL+A+NIGLCST L EG GKLS R K +KP V FSV Sbjct 185 MFSSMAGSYPSYNTNWADERVLEARNIGLCSTKLREGTMGKLSTFRKKALKPGTNVYFSV 244 Query 244 GSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYGKTTGY 303 GSTLYPE+R L+SWHLPSVFHLKGK SFTCRCDT V+CEGYVVK+ITISPG+ G+ Y Sbjct 245 GSTLYPENRADLQSWHLPSVFHLKGKQSFTCRCDTAVNCEGYVVKKITISPGITGRVNRY 304 Query 304 AVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQKLLVGLN 363 VT++++GFL+CK TDTV GERVSF VCTY+P +ICDQMTGILAT++ PEDAQKLLVGLN Sbjct 305 TVTNNSEGFLLCKITDTVKGERVSFPVCTYIPPSICDQMTGILATDIQPEDAQKLLVGLN 364 Query 364 QRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEKLLGIRERTLTCCCLWAF 423 QRIVVNG+T RNTNTM+NYLLP VA SKWAKE + D DEK L +RER L CLWAF Sbjct 365 QRIVVNGKTNRNTNTMQNYLLPAVATGLSKWAKERKADCSDEKPLNVRERKLAFGCLWAF 424 Query 424 KKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKVPKTDLIPY 483 K +K H+ Y+ P TQ+I KV AEF +F + S+W++ L + LR ++K LL K ++ Sbjct 425 KTKKIHSFYRPPGTQTIVKVAAEFSAFPMSSVWTTSLPMSLRQKVKLLLVKKTNKPVVTI 484 Query 484 SGDA-KEARDAEKEAEEEREAELTREALPPLQAAQDDV--QVEIDVEQLEDRAGAGIIET 540 + A K A++A EA E EAE +ALPPL+ V V+ +V L D AGA ++ET Sbjct 485 TDTAVKNAQEAYNEAVETAEAEEKAKALPPLKPTAPPVAEDVKCEVTDLVDDAGAALVET 544 Query 541 PRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLSLIHALAEQVKTCTHSGRAGRYAVEAY 600 PRG IK+ Q D +G Y V+SP VLR+Q+L IH LAEQVK TH GR GRY+VE Y Sbjct 545 PRGKIKIIPQEGDVRIGSYTVISPAAVLRNQQLEPIHELAEQVKIITHGGRTGRYSVEPY 604 Query 601 DGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIALHGPALNTDEESYELVRA 660 D ++L+P+G +S + F +LSESAT+VYNEREF+NRKLHHIA G A NT+EE Y++ +A Sbjct 605 DAKVLLPTGCPMSWQHFAALSESATLVYNEREFLNRKLHHIATKGAAKNTEEEQYKVCKA 664 Query 661 
ERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEGLRIRPACPYKTAVIGVF 720 + T+HEYVYDVD R+C K+E A GLVLVG+LTNPPYHE AYEGLR RPA PY +GV Sbjct 665 KDTDHEYVYDVDARKCVKREHAQGLVLVGELTNPPYHELAYEGLRTRPAAPYHIETLGVI 724 Query 721 GVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEISARTVDSLLLNGCNRP 780 G PGSGKSAIIK+ VT +DLVTSGKKENC+EI DV + R + I+ RTVDS+LLNG + Sbjct 725 GTPGSGKSAIIKSTVTLKDLVTSGKKENCKEIENDVQKMRGMTIATRTVDSVLLNGWKKA 784 Query 781 VDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNH---NICT 837 VDVLYVDEAFACH+GTL+ALIA+V+PR+KVVLCGDPKQ FFN+MQ+KVN+N+ ++CT Sbjct 785 VDVLYVDEAFACHAGTLMALIAIVKPRRKVVLCGDPKQWPFFNLMQLKVNFNNPERDLCT 844 Query 838 QVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGSTKPDPGDLVLTCFRG 897 ++K ISRRCT PVTAIVS+LHY+GKMRTTN + I +D GSTKP GD+VLTCFRG Sbjct 845 STHYKYISRRCTQPVTAIVSTLHYDGKMRTTNPCKRAIEIDVNGSTKPKKGDIVLTCFRG 904 Query 898 WVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLV 957 WVKQ QIDY G AASQGLTR+GVYAVRQKVNENPLYA SEHVNVLLTRTE ++V Sbjct 905 WVKQGQIDYPGPGGHDRAASQGLTRRGVYAVRQKVNENPLYAEKSEHVNVLLTRTEDRIV 964 Query 958 WKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGICNHQVTFDTFQNKANVCWAKS 1017 WKTL GDPWIK L N PKGNF AT++EW+ EH IM I + D F +K N CWAK+ Sbjct 965 WKTLQGDPWIKYLTNVPKGNFTATLEEWQAEHEDIMKAINSTSTVSDPFASKVNTCWAKA 1024 Query 1018 LVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRMYGVDLDSGLFSKPLV-- 1075 ++PIL TAGI+L QW + F+ D+ YS AL+ ICT+M+G+DL SG+FS+P + Sbjct 1025 IIPILRTAGIELTFEQWEDLFPQFRNDQPYSVMYALDVICTKMFGMDLSSGIFSRPEIPL 1084 Query 1076 SVHYAD-----NHWDNRPGGKMFGFNPEAASILERKYP--FTKGKWNTNKQICVTTRRIE 1128 + H AD HWDN PGG+ FG+N +A +KYP GK + QI R+ Sbjct 1085 TFHPADVGRVRAHWDNSPGGQKFGYN-KAVIPTCKKYPVYLRAGKGD---QILPIYGRVS 1140 Query 1129 DFNPNTNIIPANRRLPHSLVAEHRPVKGERMEWLVNKINGHHVLLVSGYNLVLPTKRVTW 1188 + N++P NR LPHSL A + + + +N++ GH +LLVS +KR+TW Sbjct 1141 VPSARNNLVPLNRNLPHSLTASLQKKEAAPLHKFLNQLPGHSMLLVSKETCYCVSKRITW 1200 Query 1189 VAPLGIRGADYTYNLELGLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSL 1248 VAPLG+RGAD+ ++L G P L RYDLV++N+ P+R HHYQQC +HA ++ L +L Sbjct 1201 
VAPLGVRGADHNHDLHFGFPP-LSRYDLVVVNMGQPYRFHHYQQCEEHAGLMRTLARSAL 1259 Query 1249 RLLKPGGSLLIRAYGYADRTSERVVCVLGRKFRSSRALKPPCVTSNTEMFFLFSNFDNGR 1308 LKPGG+L ++AYG+AD SE VV L RKF + A++P C NTEMFF+F DN R Sbjct 1260 NCLKPGGTLALKAYGFADSNSEDVVLSLARKFVRASAVRPSCTQFNTEMFFVFRQLDNDR 1319 Query 1309 -RNFTTHVMNNQLNAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLPGDGVC 1367 R FT H +N ++ F +G APSYRVKRM+IA EE VVNAAN RG PGDGVC Sbjct 1320 ERQFTQHHLNLAVSNIFDNYKDGSGAAPSYRVKRMNIADCTEEAVVNAANARGKPGDGVC 1379 Query 1368 KAVYKKWPESFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAYREVAK 1427 +A++KKWP+SF+N+ T V TA C VIHAVGP+F Y+ E + L AY +VAK Sbjct 1380 RAIFKKWPKSFENATTEVETAVMKPCHNKVVIHAVGPDFRKYTLEEATKLLQNAYHDVAK 1439 Query 1428 EVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAE 1487 V G++SVAIPLLSTG+Y+ G DRL SL LFTALD TDADV IYC DK+WE++IA+ Sbjct 1440 IVNEKGISSVAIPLLSTGIYAAGADRLDLSLRCLFTALDRTDADVTIYCLDKKWEQRIAD 1499 Query 1488 AIQMRTQV-ELLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDM 1546 AI+MR QV EL D I +D + RVHPDS L GYST G LYSY EGT+FHQTA D+ Sbjct 1500 AIRMREQVTELKDPDIEIDEGLTRVHPDSCLKDHIGYSTQYGKLYSYFEGTKFHQTAKDI 1559 Query 1547 AEVYTMWPKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERV 1606 AE+ ++P ANEQ+CLY LGE +ESIR+KCPV+D+ AS+PPKT+PCLC YAMT ER+ Sbjct 1560 AEIRALFPDVQAANEQICLYTLGEPMESIREKCPVEDSPASAPPKTIPCLCMYAMTAERI 1619 Query 1607 TRLRMNHVTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSPREYKSPQE 1662 R+R N VT+I VCSSFPLPKY+I+ VQK++C+KV+LF+ +VP + R Y + E Sbjct 1620 CRVRSNSVTNITVCSSFPLPKYRIKNVQKIQCTKVVLFNPDVPPYIPARVYINKDE 1675 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; 
Short=nsP4 [Salmon pancreas disease virus] Sequence ID: Q8JJX1.1 Length: 2601 Range 1: 15 to 1746 Score:1291 bits(3340), Expect:0.0, Method:Compositional matrix adjust., Identities:751/1745(43%), Positives:1005/1745(57%), Gaps:122/1745(6%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTIL- 62 V V++ AD L + A+P FEV + NDHA ARAFSHLA K IE++I I+ Sbjct 15 VTVNLPADHPALNQFKTAFPGFEVVASNRSSNDHAAARAFSHLATKWIERDIGGRQVIVA 74 Query 63 DIGSAPARRMMS--DRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 DIGSAPARR+ + + YH VCP + AEDPERLA+YARKL A + ++EKI DL+ Sbjct 75 DIGSAPARRIGAPDNVTYHSVCPRKCAEDPERLASYARKLVRAVERGDGHLVNEKITDLK 134 Query 121 AVMAVPDA--ETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGF 178 V+ PD ET + CL+ DVSC+ +AD+A+YQDVYAV AP+++Y QA KG RV YWIGF Sbjct 135 DVLENPDTSLETTSICLNDDVSCKVKADIAVYQDVYAVDAPSTIYAQADKGTRVVYWIGF 194 Query 179 DTTPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDR 238 + F +AMAG++P Y NW+D VL AKN+ LC + L+E R K + P Sbjct 195 EPFVFHTDAMAGSFPLYDANWSDSAVLAAKNLPLCYSGLSEDSIKWRFRFRDKPLVPSGE 254 Query 239 VLFSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYG 298 + +SVGST Y E R LKSWHLPS FH +TCRCDTVVSC GYVVK+ITI G+ G Sbjct 255 IHYSVGSTHYVEDRDKLKSWHLPSTFHFVAPNKYTCRCDTVVSCGGYVVKKITICEGIVG 314 Query 299 --KTTGYAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQ 356 A ++H DG ++ K +DT++ E+VSF V TY+PA ICDQMT + A V DA Sbjct 315 IPAKEELATSYHRDGVVVTKFSDTINHEQVSFPVVTYIPAVICDQMTAMTANPVKYSDAV 374 Query 357 KLLVGLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEK-LLGIRERTL 415 KLLVGLNQRIVVNG T RN N+M N L+PV A+A WA E R+DMEDE+ L GI T Sbjct 375 KLLVGLNQRIVVNGTTVRNVNSMDNSLIPVFARALCSWADEVRRDMEDEQDLYGITSVTT 434 Query 416 TCCCLWAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKV 475 C A+ K++ HT Y+RP S VPA+F + +L ++ L++PL+ + L + Sbjct 435 WICICRAYDKRQQHTFYRRPKQSSGIYVPAKFTGSLRAALSATYLNLPLKQLLLNTLKRA 494 Query 476 --PKTDLIPYSGDA--KEARDAEKEAEEEREAELTREALPPLQAAQDDVQVE-------I 524 P I +A +A + + EEER + + QDD + E + Sbjct 495 
IKPMDQAIADETEALAHDAAEVHELTEEERRQQAANPSYIADVLGQDDDEEEAGDGMSDV 554 Query 525 DVEQLEDRAGAGIIETPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLS-LIHALAEQV 583 D+ + ED AGA II+ RG +KV D+++GEYLVLSP TVLR++KL+ L+ LAE+V Sbjct 555 DLGE-EDGAGATIIDCQRGTVKVITAFGDNMMGEYLVLSPVTVLRTRKLAILLGPLAEEV 613 Query 584 KTCTHSGRAGRYAVEAYDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKLHHIAL 643 H GR GRYA+E + ++L+P+G ++ + FQ+L+ESAT+ YN+ F R L +A Sbjct 614 MQYVHKGRTGRYAIEKNNLKVLIPTGVSLKTDHFQALAESATLTYNDYLFTCRTLDQLAT 673 Query 644 HGPALNTDEESYELVRAERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHEFAYEG 703 G A NTDE Y+LV A + EYVY++ ++C KKE+A G VL GD+ NPPYH+FAYE Sbjct 674 RGSARNTDEVYYKLVDAAKARDEYVYELSSKQCVKKEDATGTVLQGDICNPPYHQFAYEA 733 Query 704 LRIRPACPYKTAVIGVFGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLE 763 LR RPA + IG++GVPG+GK+AII VT +DLV SGKKENC++I V+ +R L+ Sbjct 734 LRKRPAHTHDVHTIGIYGVPGAGKTAIITTEVTTRDLVASGKKENCEDIKRCVLERRGLK 793 Query 764 ISARTVDSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFN 823 I+ARTVDSL V+ LYVDEA+ACHSGTLLALIA VRP KVVLCGDPKQ G N Sbjct 794 IAARTVDSLFYGAYRGAVNTLYVDEAYACHSGTLLALIAAVRPTGKVVLCGDPKQVGCVN 853 Query 824 MMQMKVNYNHNICTQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGST 883 +QM+++YNH I +V K+ISRRCT +TAIVS+L+YEG+M+TTN KP+++DTTGST Sbjct 854 QLQMRMHYNHEISDRVLRKNISRRCTHTLTAIVSNLNYEGRMKTTNPCKKPVLIDTTGST 913 Query 884 KPDPGDLVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSE 943 KPD LVLTCFRGWVK L+ Y +E+MTAAASQGLTR+ VYAVR +V NPLY TSE Sbjct 914 KPDKEALVLTCFRGWVKDLKFLYPHNELMTAAASQGLTREKVYAVRCRVTTNPLYEPTSE 973 Query 944 HVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGI---CNHQ 1000 H+ VLLTRT +LVWKTL DP I L PPKG++ AT+++WE EH I+A + C + Sbjct 974 HITVLLTRTNDELVWKTLPNDPLIPILSKPPKGDYSATMEDWEDEHNGILAALREACVPR 1033 Query 1001 VTFDTFQNKANVCWAKSLVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNEICTRM 1060 + F K N CWA + +L AG+++ +++I AF+ED+ +S AL+ + T + Sbjct 1034 MNF--AHGKRNTCWAVTSSRVLHEAGVQITPEDYNRIFPAFREDKPHSALAALDAVATLV 1091 Query 1061 YGVDLDSGLFSKPLVSVHYADNHWDNRPGGKMFGFNPEAASILERKYP-FTKGKWNTNKQ 1119 +G+D SG+ S + ++HW N G +G 
N +A E P K + Sbjct 1092 WGLDTSSGILSGKGSFMRLENSHWSNSNRGYEYGLNLDALEGYEIANPRMIKALKQRRGR 1151 Query 1120 IC--VTTRRIEDFNPNTNIIPANRRLPHSLVAEHRPVKGERME--WLVNKINGHHVLLV- 1174 C + T ++ +P +P NR +PH LV K +E V++ + H Sbjct 1152 ECYDIETGKLVPLDPARVQVPINRIVPHVLVDTSAAAKPGFLENRLTVDRWDQVHSFKTR 1211 Query 1175 SGYNLVLPTKRVTW-------VAPLGI----------------------RGADYTYNLEL 1205 + TKRV++ AP G+ RGA Sbjct 1212 AAVKFAELTKRVSYNSVLDLGAAPGGVTDYCVKKGKTVTSVSEQWDTKPRGAVVVTADIN 1271 Query 1206 GLPATLGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIRAYGYA 1265 G LG +DLV + P R HHY QC DHA+ + GG +++AYG A Sbjct 1272 GPLNNLGIFDLVFCDAAGPRRYHHYAQCEDHAVLFTSACKHGVERTAKGGVFIVKAYGMA 1331 Query 1266 DRTSERVVCVLGRKFRSSRALKP-PCVTSNTEMFFLFSN----------------FDNGR 1308 DR +ER V R FRS KP +N E+FF FS D Sbjct 1332 DRRTERAVEGTARYFRSVSVEKPVSSRITNVEVFFKFSGRCRPHARSIAHLGPQLTDIYA 1391 Query 1309 RNFTTHVM------NNQLNAAFVGQATRAGCAPSYRVKRMDIAKNDEECVVNAANPRGLP 1362 R + + M +++ A + + G AP YRV +I +EE +VNAAN G P Sbjct 1392 RTWKAYKMLARGSVADKVKVAEILNSM-VGAAPGYRVLNRNIITAEEEVLVNAANSNGRP 1450 Query 1363 GDGVCKAVYKKWPESFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDRELAAAY 1422 GDGVC A+Y + ++F N A G A V +IHA G +F E G R+L AAY Sbjct 1451 GDGVCGALYGAFGDAFPNGAIGAGNAVLVRGLEATIIHAAGADFREVDEETGARQLRAAY 1510 Query 1423 REVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRDKEWE 1482 R A VT G+ S AIPLLST ++S G++RL QS + L A D+T+ DV IYC Sbjct 1511 RAAATLVTANGITSAAIPLLSTHIFSNGRNRLEQSFSALVEAFDTTECDVTIYC------ 1564 Query 1483 KKIAEAIQMRTQVELLDEHI----------------------------SVDCDIIRVHPD 1514 +A + R Q +L+D H S + + V Sbjct 1565 --LANNMAARIQ-QLIDAHAREEFDEEVVVEEEEEHEADAMSDTETLSSFGDETVWVPKH 1621 Query 1515 SSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEANEQVCLYALGESIES 1574 S+LAGR GYS G S GT+FH+ AV M+ + WPK EAN ++ Y G+ + Sbjct 1622 STLAGRPGYSAYYGDRRSLFVGTKFHRAAVAMSSIEAAWPKTKEANAKLIEYIRGQHLVD 1681 Query 1575 IRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKIEGVQ 1634 + + CPVDD PP ++PC C YAMTPERVT L+ +VCS+F LP I+ V Sbjct 1682 
VLKSCPVDDIPVGRPPSSLPCGCIYAMTPERVTVLKQRPQEGFVVCSAFKLPLTNIQDVT 1741 Query 1635 KVKCS 1639 KV+C+ Sbjct 1742 KVECT 1746 >RecName: Full=Polyprotein P1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=mRNA-capping enzyme nsP1; AltName: Full=Non-structural protein 1; Contains: RecName: Full=Protease nsP2; AltName: Full=Non-structural protein 2; Short=nsP2; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Sleeping disease virus] Sequence ID: Q8QL53.1 Length: 2593 Range 1: 15 to 1745 Score:1286 bits(3327), Expect:0.0, Method:Compositional matrix adjust., Identities:750/1749(43%), Positives:1005/1749(57%), Gaps:131/1749(7%) Query 4 VYVDIDADSAFLKALQRAYPMFEVEPRQVTPNDHANARAFSHLAIKLIEQEIDPDSTIL- 62 V VD+ AD L + A+P FEV + NDHA ARAFSHLA K IE++ID I+ Sbjct 15 VTVDLPADHPALNQFKTAFPGFEVVASNRSSNDHAAARAFSHLATKWIERDIDGRQVIVA 74 Query 63 DIGSAPARRMMS--DRKYHCVCPMRSAEDPERLANYARKLASAAGKVLDRNISEKIGDLQ 120 DIGSAPARR+ + + YH VCP + AEDPERLA+YARKL A K +S++I DL+ Sbjct 75 DIGSAPARRVGAPDNVTYHSVCPRKCAEDPERLASYARKLVRAVEKGDGHLVSDRITDLK 134 Query 121 AVMAVPDA--ETPTFCLHTDVSCRQRADVAIYQDVYAVHAPTSLYHQAIKGVRVAYWIGF 178 V+ PD ET + CL+ DVSC+ +AD+A+YQDVYAV AP+++Y QA KG RV YWIGF Sbjct 135 DVLENPDTSLETTSICLNDDVSCKVKADIAVYQDVYAVDAPSTIYAQADKGTRVVYWIGF 194 Query 179 DTTPFMYNAMAGAYPSYSTNWADEQVLKAKNIGLCSTDLTEGRRGKLSIMRGKKMKPCDR 238 + F +AMAG++P Y NW+D VL AKN+ LC + L+E R K + P Sbjct 195 EPFVFHTDAMAGSFPLYDANWSDSAVLAAKNLPLCYSGLSEDSIKWRFRFRDKPLVPSGE 254 Query 239 VLFSVGSTLYPESRKLLKSWHLPSVFHLKGKLSFTCRCDTVVSCEGYVVKRITISPGLYG 298 + +SVGST Y E R LKSWHLPS FH +TCRCDTVVSC GYVVK+ITI G+ G Sbjct 255 IHYSVGSTHYVEDRDKLKSWHLPSTFHFVAPNKYTCRCDTVVSCGGYVVKKITICEGIVG 314 Query 299 KTTG--YAVTHHADGFLMCKTTDTVDGERVSFSVCTYVPATICDQMTGILATEVTPEDAQ 356 + A ++H DG ++ K +DT++ E+VSF V TY+PA ICDQMT + A V D Sbjct 315 
RPANEELATSYHRDGVVVTKFSDTINHEQVSFPVVTYIPAVICDQMTAMTADPVKYPDVV 374 Query 357 KLLVGLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFSKWAKECRKDMEDEK-LLGIRERTL 415 KLLVGLNQRIVVNG T RN N+M N L+PV A+A WA E R+DMEDE+ + G+ T Sbjct 375 KLLVGLNQRIVVNGTTVRNVNSMDNSLIPVFARALCSWADEARRDMEDEQDMYGVTSVTT 434 Query 416 TCCCLWAFKKQKTHTVYKRPDTQSIQKVPAEFDSFVVPSLWSSGLSIPLRTRIKWLLSKV 475 C A+ K++ HT Y+RP S VPA+F + SL ++ L++PL+ + L + Sbjct 435 WICICRAYDKRQQHTFYRRPKQSSGIYVPAKFTGSLRASLSATYLNLPLKQLLLNTLKRA 494 Query 476 PKTDLIPYSGDAKEARDAEKEAEEEREA-ELTRE-----ALPPLQAAQ----------DD 519 K GD A + E A + E ELT E A P A DD Sbjct 495 IK------PGDQALADETEARAHDAAEVHELTEEEGRQQAANPSYIADVLGQDDEEEVDD 548 Query 520 VQVEIDVEQLEDRAGAGIIETPRGAIKVTAQPTDHVVGEYLVLSPQTVLRSQKLS-LIHA 578 +D+ + ED G+ II+ RG +KV D+ +GEYLVLSP TVLR++KL+ L+ Sbjct 549 GMSNVDLGE-EDGVGSTIIDCQRGTVKVITAFGDNTMGEYLVLSPVTVLRTRKLAVLLGP 607 Query 579 LAEQVKTCTHSGRAGRYAVEAYDGRILVPSGYAISPEDFQSLSESATMVYNEREFVNRKL 638 LAE+V H GR GRYA+E + ++L+P+G ++ FQ+L+ESAT+ YN+ F R L Sbjct 608 LAEEVMQYVHKGRTGRYAIEKNNLKVLIPTGVSLKTAHFQALTESATLTYNDYLFTCRTL 667 Query 639 HHIALHGPALNTDEESYELVRAERTEHEYVYDVDQRRCCKKEEAAGLVLVGDLTNPPYHE 698 +A G A NTDE Y+LV A + + EYVY++ ++C KKE+A G VL GD+ NPPYH+ Sbjct 668 DQLATRGSAKNTDEVYYKLVDAAKAKDEYVYELSSKQCVKKEDATGTVLQGDICNPPYHQ 727 Query 699 FAYEGLRIRPACPYKTAVIGVFGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMR 758 FA+E LR RPA + IG++GVPG+GK+AII VT +DLV SGKKENC++I V+ Sbjct 728 FAFEALRKRPAHTHDVHTIGIYGVPGAGKTAIITTEVTTRDLVASGKKENCEDIKRCVLE 787 Query 759 QRNLEISARTVDSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQ 818 +R L+I+ARTVDSLL VD LYVDEA+ACHSGTLLALIA VRP KVVLCGDPKQ Sbjct 788 RRGLKIAARTVDSLLYGAYRGAVDTLYVDEAYACHSGTLLALIAAVRPTGKVVLCGDPKQ 847 Query 819 CGFFNMMQMKVNYNHNICTQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVD 878 G N +QM+++YNH I +V K+ISRRCT +TAIVS+L+YEG+M+TTN KP+++D Sbjct 848 VGCVNQLQMRMHYNHEISDRVLRKNISRRCTHTLTAIVSNLNYEGRMKTTNPCKKPVLID 907 Query 879 TTGSTKPDPGDLVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLY 938 TTGSTKPD LVLTCFRGWVK L+I Y +E+MTAAASQGLTR+ VYAVR +V NPLY 
Sbjct 908 TTGSTKPDKEALVLTCFRGWVKDLKILYPHNELMTAAASQGLTREKVYAVRCRVTSNPLY 967 Query 939 ASTSEHVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPKGNFKATIKEWEVEHASIMAGI-- 996 TSEH+ VLLTRT +LVWKTL DP I L PPKG++ AT+++WE EH I+A + Sbjct 968 EPTSEHITVLLTRTNDELVWKTLPNDPLIPILSKPPKGDYSATMEDWEDEHNGILAALRE 1027 Query 997 -CNHQVTFDTFQNKANVCWAKSLVPILETAGIKLNDRQWSQIIQAFKEDRAYSPEVALNE 1055 C ++ F K N CWA + +L AG+ + +++I AF+ED+ +S AL+ Sbjct 1028 ACVPRMNF--AHGKRNTCWAVTSSRVLHEAGVLITPEDFNRIFPAFREDKPHSALAALDA 1085 Query 1056 ICTRMYGVDLDSGLFSKPLVSVHYADNHWDNRPGGKMFGFNPEAASILERKYP-FTKGKW 1114 + ++G+D SG+ S + ++HW N G +G N +A E P K Sbjct 1086 VAALVWGLDTSSGILSGKGSFMRLENSHWSNSNRGYEYGLNLDALEGYEIANPRMIKALK 1145 Query 1115 NTNKQIC--VTTRRIEDFNPNTNIIPANRRLPHSLVAEHRPVKGERME--WLVNKINGHH 1170 + C + T ++ +P +P NR +PH LV K +E V++ + H Sbjct 1146 QRRGRECYDIETGKLVPMDPGRVQVPINRVVPHVLVDTSAAAKPGFLENRLSVDRWDQVH 1205 Query 1171 VLLV-SGYNLVLPTKRVTWVAPLGI---RGADYTYNLELGLPAT---------------- 1210 + TKRV++ + L + RG Y ++ G T Sbjct 1206 SFKTRAAVKFAELTKRVSYNSVLDLGAARGGVTDYCVKKGKTVTCVSEQWDSKPRGAVVI 1265 Query 1211 ----------LGRYDLVIINIHTPFRIHHYQQCVDHAMKLQMLGGDSLRLLKPGGSLLIR 1260 LG +DLV + P R HHY QC DHA + + GG +++ Sbjct 1266 TADINGPLNNLGIFDLVFCDAAGPRRYHHYAQCEDHARRSTSACKHGVERTAKGGVFIVK 1325 Query 1261 AYGYADRTSERVVCVLGRKFRSSRALKP-PCVTSNTEMFFLFSN---------------- 1303 AYG ADR +ER V R F+S KP +N E+FF FS Sbjct 1326 AYGMADRRTERAVECTARYFKSVSVEKPVSSRITNVEVFFKFSGRCRPHARSIAHLGPQL 1385 Query 1304 ---FDNGRRNFTTHVMNNQLNAAFVGQA--TRAGCAPSYRVKRMDIAKNDEECVVNAANP 1358 + R+ + + + V + + G AP YRV +I +EE +VNAAN Sbjct 1386 TDIYARTRKAYKMLARGSVADKVKVAEILNSMVGAAPGYRVLNKNIITAEEEVLVNAANS 1445 Query 1359 RGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMCGTYPVIHAVGPNFSNYSESEGDREL 1418 G PGDGVC A+Y + ++F N A G A V +IHA G +F E G R+L Sbjct 1446 NGRPGDGVCGALYGAFGDAFPNGAIGAGNAVLVRGLEATIIHAAGADFREVDEETGARQL 1505 Query 1419 AAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVIYCRD 1478 AAYR A VT G+ S AIPLLST ++S G++RL QS L A D+T+ DV IYC Sbjct 1506 
RAAYRAAATLVTANGITSAAIPLLSTHIFSNGRNRLEQSFGALVEAFDTTECDVTIYC-- 1563 Query 1479 KEWEKKIAEAIQMRTQVELLDEH------------------ISVDCD----------IIR 1510 +A + R Q +L+D+H + CD + Sbjct 1564 ------LANNMAARIQ-QLIDDHAREEFDEEVVVEEEEEHEANAMCDTETLSSFGDETVW 1616 Query 1511 VHPDSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEANEQVCLYALGE 1570 V S+LAGR GYS T G S GT+FH+ AV M+ + WP+ EAN ++ Y G+ Sbjct 1617 VPKHSTLAGRPGYSATYGDRRSLFVGTKFHRAAVAMSSIEAAWPRTKEANAKLIEYIRGQ 1676 Query 1571 SIESIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKI 1630 + + + CPV+D PP ++PC C YAMTPERVT L+ +VCS+F LP I Sbjct 1677 HLVDVLKSCPVNDIPVGRPPSSLPCGCIYAMTPERVTVLKQRPQEGFVVCSAFKLPLTNI 1736 Query 1631 EGVQKVKCS 1639 + V KV+C+ Sbjct 1737 QDVTKVECT 1745 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Ross river virus (STRAIN T48)] Sequence ID: P13888.2 Length: 1149 Range 1: 1 to 324 Score:501 bits(1290), Expect:1e-151, Method:Compositional matrix adjust., Identities:235/324(73%), Positives:273/324(84%), Gaps:0/324(0%) Query 1334 APSYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMC 1393 APSYRV+R DI+ + EE VVNAAN +G GDGVC+AV +KWP+SFK +ATPVGTAK V Sbjct 1 APSYRVRRTDISGHAEEAVVNAANAKGTVGDGVCRAVARKWPDSFKGAATPVGTAKLVQA 60 Query 1394 GTYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDR 1453 VIHAVGPNFS +E+EGDRELAAAYR VA + + SVAIPLLSTGV+SGGKDR Sbjct 61 NGMNVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPLLSTGVFSGGKDR 120 Query 1454 LTQSLNHLFTALDSTDADVVIYCRDKEWEKKIAEAIQMRTQVELLDEHISVDCDIIRVHP 1513 + QSLNHLFTA+D+TDADVVIYCRDK WEKKI EAI RT VEL+ E IS++ D+IRVHP Sbjct 121 VMQSLNHLFTAMDTTDADVVIYCRDKAWEKKIQEAIDRRTAVELVSEDISLESDLIRVHP 180 Query 
1514 DSSLAGRKGYSTTEGSLYSYLEGTRFHQTAVDMAEVYTMWPKQTEANEQVCLYALGESIE 1573 DS L GRKGYS T+G L+SYLEGTRFHQTAVDMAE+ T+WPK +ANEQ+CLYALGES++ Sbjct 181 DSCLVGRKGYSITDGKLHSYLEGTRFHQTAVDMAEISTLWPKLQDANEQICLYALGESMD 240 Query 1574 SIRQKCPVDDADASSPPKTVPCLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKIEGV 1633 SIR KCPV+DAD+S+PPKTVPCLCRYAMT ERV RLRMN+ +IIVCSSFPLPKY+IEGV Sbjct 241 SIRTKCPVEDADSSTPPKTVPCLCRYAMTAERVARLRMNNTKAIIVCSSFPLPKYRIEGV 300 Query 1634 QKVKCSKVMLFDHNVPSRVSPREY 1657 QKVKC +V++FD VPS VSPR+Y Sbjct 301 QKVKCDRVLIFDQTVPSLVSPRKY 324 >RecName: Full=Polyprotein nsP1234; Short=P1234; AltName: Full=Non-structural polyprotein; Contains: RecName: Full=Polyprotein P123'; Short=P123'; Contains: RecName: Full=Polyprotein P123; Short=P123; Contains: RecName: Full=Non-structural protein 3'; Short=nsP3'; Contains: RecName: Full=Non-structural protein 3; Short=nsP3; Contains: RecName: Full=RNA-directed RNA polymerase nsP4; AltName: Full=Non-structural protein 4; Short=nsP4 [Middelburg virus] Sequence ID: P03318.2 Length: 995 Range 1: 1 to 378 Score:365 bits(936), Expect:5e-105, Method:Compositional matrix adjust., Identities:211/431(49%), Positives:264/431(61%), Gaps:54/431(12%) Query 1415 DRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLNHLFTALDSTDADVVI 1474 D +LAA YR VA + V ++AIPLLSTG ++GGKDR+ QSLNHLFTALD+TD DV I Sbjct 1 DADLAAVYRAVAS-LADETVRTMAIPLLSTGTFAGGKDRVLQSLNHLFTALDTTDVDVTI 59 Query 1475 YCRDKEWEKKIAEAIQMRTQVELLDEHISVDCDIIRVHPDSSLAGRKGYSTTEGSLYSYL 1534 YCRDK WEKKI EAI MRT ELLD+ +V ++ RVHPDS L GR G+ST +G L+SYL Sbjct 60 YCRDKSWEKKIQEAIDMRTATELLDDDTTVMKELTRVHPDSCLVGRSGFSTVDGRLHSYL 119 Query 1535 EGTRFHQTAVDMAEVYTMWPKQTEANEQVCLYALGESIESIRQKCPVDDADASSPPKTVP 1594 EGTRFHQTAVD+AE T+WP++ EANEQ+ Y LGES+E+IR KCPVDD D+S+PP TVP Sbjct 120 EGTRFHQTAVDVAERPTLWPRREEANEQITHYVLGESMEAIRTKCPVDDTDSSAPPCTVP 179 Query 1595 CLCRYAMTPERVTRLRMNHVTSIIVCSSFPLPKYKIEGVQKVKCSKVMLFDHNVPSRVSP 1654 CLCRYAMTPERV RLR V VCSSFPLPKYKI GVQ+V CS VMLF+H+VP+ VSP Sbjct 180 CLCRYAMTPERVHRLRAAQVKQFTVCSSFPLPKYKIPGVQRVACSAVMLFNHDVPALVSP 
239 Query 1655 REYKSPQETAQEVSSTTSLTHSQFDLSVDGEELPAPSDLEADAPI-PEPTPDDRAVLTLP 1713 R+Y+ P +++ SS S+ FDL + + S+ E P+ PEP D V Sbjct 240 RKYREPSISSESSSSGLSV----FDLDIGSD-----SEYEPMEPVQPEPLIDLAVVEETA 290 Query 1714 PTIDNFSAVSDWVMNTAPVAPPRRRRGKNLNVTCDEREGNVLPMASVRFFRADLHSIVQE 1773 P + APVA PRR R T ++R Sbjct 291 PV---------RLERVAPVAAPRRARATPF--TLEQR----------------------- 316 Query 1774 TAEIRDTAASLQAPLSVATEPNQLPISFGAPNETFPITFGDFDEGEIESLSSELLTFGDF 1833 A + AP ++ P + + E I+FGD D E ++ ++ LTFGDF Sbjct 317 ------VVAPVPAPRTMPVRPPRRKKAATRTPER--ISFGDLD-AECMAIINDDLTFGDF 367 Query 1834 SPGEVDDLTDS 1844 GE + LT + Sbjct 368 GAGEFERLTSA 378 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Odontoglossum ringspot virus (isolate Korean Cy)] Sequence ID: P89659.2 Length: 1612 Range 1: 816 to 1096 Score:79.7 bits(195), Expect:2e-13, Method:Compositional matrix adjust., Identities:83/287(29%), Positives:131/287(45%), Gaps:28/287(9%) Query 711 PYKTAVIGVFGVPGSGKSA-IIKNLVTRQDLVTSGKKENCQEISTDVMRQ---RNLEISA 766 P V V GVPG GK+ I++ + +DL+ KE C+ I + R + + Sbjct 816 PSDAKVTLVDGVPGCGKTKEILETVNFDEDLILVPGKEACKMIIKRANKSGHVRATKDNV 875 Query 767 RTVDSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQ 826 RTVDS L++ + + L++DE H+G + LIAL R+ +V GD +Q F N + Sbjct 876 RTVDSFLMHLKPKTYNKLFIDEGLMLHTGCVNFLIALSHCREAMVF-GDTEQIPFINRVA 934 Query 827 MKVNYNHNICTQVY-HKSISR---RCTLPVTAIVSSLHYEGKMRTTNEYNKPI---VVDT 879 Y + T VY H+ + R RC VT ++S Y+GK+ TN+ + + VV Sbjct 935 -NFPYPKHFATLVYDHREVRRLSLRCPADVTHFMNS-KYDGKVLCTNDVIRSVDAEVVRG 992 Query 880 TG----STKPDPGDLVLTCFRGWVKQLQIDYRGH-------EVMTAAASQGLTRKGVYAV 928 G +KP G ++ + ++ RG+ E+ T QG T + V V Sbjct 993 KGVFNPKSKPLKGKIITFT---QSDKAELKERGYEEVSTFGEINTVHEIQGETFEDVSVV 1049 Query 929 RQKVNENPLYASTSEHVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPK 975 R L + +S HV V LTR + ++ DP +K + K Sbjct 
1050 RLTPTPLELISKSSPHVLVALTRHTKSFKYYSVVLDPLVKVCSDLSK 1096 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Odontoglossum ringspot virus (isolate Singapore 1)] Sequence ID: Q84133.2 Length: 1612 Range 1: 826 to 1096 Score:79.3 bits(194), Expect:3e-13, Method:Compositional matrix adjust., Identities:78/276(28%), Positives:126/276(45%), Gaps:26/276(9%) Query 721 GVPGSGKSA-IIKNLVTRQDLVTSGKKENCQEISTDVMRQ---RNLEISARTVDSLLLNG 776 GVPG GK+ I++ + +DL+ KE C+ I + R + RTVDS L++ Sbjct 826 GVPGCGKTKEILETVNFDEDLILVPGKEACKMIIKRANKSGHVRATRDNVRTVDSFLMHL 885 Query 777 CNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNYNHNIC 836 + + L++DE H+G + L+AL R+ +V GD +Q F N + H Sbjct 886 KPKTYNKLFIDEGLMLHTGCVNFLVALSHCREAMVF-GDAEQIPFINRVANFPYPKHFRY 944 Query 837 TQVYHKSISR---RCTLPVTAIVSSLHYEGKMRTTNEYNKPI---VVDTTG----STKPD 886 T +YH+ + R RC VT ++S Y+GK+ TN+ + + VV G +KP Sbjct 945 TCLYHREVRRLSLRCPADVTHFMNS-KYDGKVLCTNDVIRSVDAEVVRGKGVFNPKSKPL 1003 Query 887 PGDLVLTCFRGWVKQLQIDYRGH-------EVMTAAASQGLTRKGVYAVRQKVNENPLYA 939 G ++ + ++ RG+ E+ T QG T + V VR L + Sbjct 1004 KGKIITFT---QSDKAELKERGYEEVSTFGEINTVHEIQGETFEDVSVVRLTPTPLELIS 1060 Query 940 STSEHVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPK 975 +S HV V LTR + ++ DP +K + K Sbjct 1061 KSSPHVLVALTRHTKSFKYYSVVLDPLVKVWSDLSK 1096 >RecName: Full=Uncharacterized protein FN1951 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] Sequence ID: Q8RHQ2.1 Length: 175 Range 1: 12 to 174 Score:64.3 bits(155), Expect:3e-10, Method:Composition-based stats., Identities:53/166(32%), Positives:76/166(45%), Gaps:24/166(14%) Query 1343 DIAKNDE-ECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVG---TAKTVMCGTYP- 1397 DI K E E +VNAAN G GVC A++K +G T + V+ Y Sbjct 12 DITKIPEVEAIVNAANSSLEMGGGVCGAIFKAAGSELAQECKEIGGCNTGEAVITKGYNL 71 Query 1398 ----VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY-----S 1448 +IH VGP +S E +R LA+AY E K G+ +A P +STG+Y Sbjct 72 PNKYIIHTVGPRYSTGENREAER-LASAYYESLKLANEKGIRRIAFPSISTGIYRFPVDE 130 Query 1449 GGKDRLTQSLNHLFTALDSTDADVVIYCRD-------KEWEKKIAE 1487 G K LT ++ F + + D++++ D KE KK+ E Sbjct 131 GAKIALTTAIK--FLDKNPSSFDLILWVLDEKTYIVYKEKYKKLLE 174 >RecName: Full=Replicase large subunit; AltName: Full=186 kDa protein; Contains: RecName: Full=Replicase small subunit; AltName: Full=129 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Cucumber green mottle mosaic virus (watermelon strain SH)] Sequence ID: P69514.1 Length: 1648 Range 1: 863 to 1117 Score:67.4 bits(163), Expect:1e-09, Method:Compositional matrix adjust., Identities:77/268(29%), Positives:118/268(44%), Gaps:37/268(13%) Query 721 GVPGSGKSA-IIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEI---------SARTVD 770 GVPG GK+A II + + DLV + +E + ++R+R + + RT D Sbjct 863 GVPGCGKTAEIIARVNWKTDLVLTPGRE-----AAAMIRRRACALHKSPVATNDNVRTFD 917 Query 771 SLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVN 830 S ++N D +YVDE H+G LL + +K + GD KQ F N + M + Sbjct 918 SFVMNRKIFKFDAVYVDEGLMVHTG-LLNFALKISGCKKAFVFGDAKQIPFINRV-MNFD 975 Query 831 YNHN----ICTQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTN---EYNKPIVVDTTGST 883 Y I V + ++ RC VT+ ++++ Y+ + TT+ K I V G Sbjct 976 YPKELRTLIVDNVERRYVTHRCPRDVTSFLNTI-YKAAVATTSPVVHSVKAIKVSGAGIL 1034 Query 884 KPDPGDLVLTCFRGWV-------KQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENP 936 +P+ LT +G + KQ I ++V T QG T + VR Sbjct 1035 RPE-----LTKIKGKIITFTQSDKQSLIKSGYNDVNTVHEIQGETFEETAVVRATPTPIG 1089 Query 937 
LYASTSEHVNVLLTRTEGKLVWKTLSGD 964 L A S HV V LTR +V+ T+ D Sbjct 1090 LIARDSPHVLVALTRHTKAMVYYTVVFD 1117 >RecName: Full=Uncharacterized protein Saci_1252 [Sulfolobus acidocaldarius DSM 639] Sequence ID: Q4J9D2.1 Length: 181 Range 1: 19 to 131 Score:56.6 bits(135), Expect:2e-07, Method:Composition-based stats., Identities:41/117(35%), Positives:57/117(48%), Gaps:16/117(13%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAV-----YKKWPES----FKNSATPVGTAKTVMC 1393 DI K + + +VNAAN G GV A+ Y ES +N PVG Sbjct 19 DITKVEADAIVNAANSYLSHGGGVALAIVRSGGYIIQEESDEYVRRNGPVPVGEVAVTTA 78 Query 1394 GTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 G VIHAVGP + EGD +L +A R ++ L ++S+A+P +STG+Y Sbjct 79 GKLKARYVIHAVGPRYG----IEGDDKLESAIRRSLEKADELKLSSIALPAISTGIY 131 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus US2] Sequence ID: Q9YLR1.1 Length: 1708 Range 1: 804 to 906 Score:60.5 bits(145), Expect:2e-07, Method:Compositional matrix adjust., Identities:39/112(35%), Positives:63/112(56%), Gaps:11/112(9%) Query 1338 RVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGT 1395 +V + ++D + +VNA+NP PG G+C A Y+++PE+F ++ + G A + Sbjct 804 KVYAGSLXESDCDWLVNASNPGHRPGGGLCHAFYQRFPEAFYSTEFIMREGLAAYTLT-P 862 Query 1396 YPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 P+IHAV P +Y + + L AAYRE +R G + A PLL +G+Y Sbjct 863 RPIIHAVAP---DYRVEQNPKRLEAAYRETC---SRRG--TAAYPLLGSGIY 906 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Tobacco mild green mosaic virus] Sequence ID: P18339.2 Length: 1609 Range 1: 826 to 1090 Score:59.3 bits(142), Expect:4e-07, 
Method:Compositional matrix adjust., Identities:72/272(26%), Positives:117/272(43%), Gaps:24/272(8%) Query 716 VIGVFGVPGSGK-SAIIKNLVTRQDLVTSGKKENCQEI-----STDVMRQRNLEISARTV 769 ++ V GVPG GK + +DL+ K+ I S+ ++R + RTV Sbjct 826 MVLVDGVPGCGKYKGDFERFDLDEDLILVPGKQAAAMIRRRANSSGLIRATMDNV--RTV 883 Query 770 DSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKV 829 DSLL++ R L++DE H+G + L+ L+ + GD +Q F N +Q Sbjct 884 DSLLMHPKPRSHKRLFIDEGLMLHTGCVNFLV-LISGCDIAYIYGDTQQIPFINRVQNFP 942 Query 830 NYNHNICTQVYHKSISR---RCTLPVTAIVSSLHYEGKMRTTNEYNKPIVVDTTGS---- 882 H QV + R RC V + S YEG + TT+ + + + G Sbjct 943 YPKHFEKLQVDEVEMRRTTLRCPGDVNFFLQS-KYEGAVTTTSTVQRSVSSEMIGGKGVL 1001 Query 883 ---TKPDPGDLVLTCFRGWVKQLQIDYRGHE-VMTAAASQGLTRKGVYAVRQKVNENPLY 938 +KP G +V + +++ +G++ V T QG T + V VR L Sbjct 1002 NSVSKPLKGKIVTFT---QADKFELEEKGYKNVNTVHEIQGETFEDVSLVRLTATPLTLI 1058 Query 939 ASTSEHVNVLLTRTEGKLVWKTLSGDPWIKTL 970 + +S HV V LTR + T+ DP ++ + Sbjct 1059 SKSSPHVLVALTRHTKSFKYYTVVLDPLVQII 1090 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Cronobacter turicensis z3032] Sequence ID: C9Y0V8.1 Length: 176 Range 1: 7 to 127 Score:54.7 bits(130), Expect:6e-07, Method:Compositional matrix adjust., Identities:43/122(35%), Positives:59/122(48%), Gaps:13/122(10%) Query 1339 VKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV---------GTAK 1389 V + DI + D + +VNAANP + G GV A+++ S + V G A Sbjct 7 VVQGDITRIDTDVIVNAANPSLMGGGGVDGAIHRAAGPSLLAACKVVRQQQGECQPGHAV 66 Query 1390 TVMCG---TYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGV 1446 G VIH VGP + ++E + LA AYR + VT G NSVA P +STG+ Sbjct 67 ITEAGDLAAKAVIHTVGPIWRGGHDNE-PQLLADAYRNSLELVTANGYNSVAFPAISTGI 125 Query 1447 YS 1448 Y Sbjct 126 YG 127 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp 
[Hepatitis E virus Ct1] Sequence ID: Q9IVZ9.1 Length: 1707 Range 1: 811 to 905 Score:58.5 bits(140), Expect:7e-07, Method:Compositional matrix adjust., Identities:39/103(38%), Positives:54/103(52%), Gaps:11/103(10%) Query 1348 DEEC--VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMCGT-YPVIHAVGP 1404 + EC +VNA+NP PG G+C A Y+++PESF + + T P+IHAV P Sbjct 811 ESECTWLVNASNPGHRPGGGLCHAFYQRFPESFDPAEFIMSDGFAAYTLTPRPIIHAVAP 870 Query 1405 NFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 +Y + L AAYRE +R G + A PLL G+Y Sbjct 871 ---DYRVEHNPKRLEAAYRETC---SRRG--TAAYPLLGVGIY 905 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus swine/g3/United States/swUS1] Sequence ID: Q6J8G2.1 Length: 1708 Range 1: 804 to 906 Score:58.2 bits(139), Expect:7e-07, Method:Compositional matrix adjust., Identities:39/112(35%), Positives:61/112(54%), Gaps:11/112(9%) Query 1338 RVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGT 1395 +V + ++D +VNA+NP PG G+C A Y+++PE+F + + G A + Sbjct 804 KVYAGSLFESDCNWLVNASNPGHRPGGGLCHAFYQRFPEAFYPTEFIMREGLAAYTLT-P 862 Query 1396 YPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 P+IHAV P +Y + + L AAYRE +R G + A PLL +G+Y Sbjct 863 RPIIHAVAP---DYRVEQNPKRLEAAYRETC---SRRG--TAAYPLLGSGIY 906 >RecName: Full=Uncharacterized protein STK_23830 [Sulfurisphaera tokodaii str. 
7] Sequence ID: Q96XY5.1 Length: 182 Range 1: 4 to 121 Score:54.3 bits(129), Expect:1e-06, Method:Composition-based stats., Identities:39/122(32%), Positives:59/122(48%), Gaps:16/122(13%) Query 1338 RVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKK-----WPESF----KNSATPVGTA 1388 ++ + DI + + E +VNAAN G GV +A+ +K ES K P G Sbjct 4 KIIKGDITEIEAEAIVNAANSYLEHGGGVARAIVEKGGYIIQKESREYVRKYGPVPTGGV 63 Query 1389 KTVMCGTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTG 1445 G VIHAVGP + EG+ +L A R ++ L ++S+A+P +STG Sbjct 64 AVTSAGKLKAKYVIHAVGPRYG----IEGEEKLEEAIRNALRKAEELKLSSIALPAISTG 119 Query 1446 VY 1447 +Y Sbjct 120 IY 121 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Burma)] Sequence ID: P29324.1 Length: 1693 Range 1: 803 to 891 Score:57.4 bits(137), Expect:2e-06, Method:Compositional matrix adjust., Identities:38/98(39%), Positives:54/98(55%), Gaps:11/98(11%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVGPNFSNY 1409 +VNA+N PG G+C A Y+++P SF ++ + G A + P+IHAV P +Y Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAASFVMRDGAAAYTLT-PRPIIHAVAP---DY 858 Query 1410 SESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 + L AAYRE +RLG + A PLL TG+Y Sbjct 859 RLEHNPKRLEAAYRETC---SRLG--TAAYPLLGTGIY 891 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Pakistan)] Sequence ID: P33424.2 Length: 1693 Range 1: 803 to 891 Score:57.0 bits(136), Expect:2e-06, Method:Compositional matrix adjust., Identities:38/98(39%), Positives:54/98(55%), Gaps:11/98(11%) Query 1352 
VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVGPNFSNY 1409 +VNA+N PG G+C A Y+++P SF ++ + G A + P+IHAV P +Y Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAASFVMRDGAAAYTLT-PRPIIHAVAP---DY 858 Query 1410 SESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 + L AAYRE +RLG + A PLL TG+Y Sbjct 859 RLEHNPKRLEAAYRETC---SRLG--TAAYPLLGTGIY 891 >RecName: Full=Macro domain-containing protein DR_2288 [Deinococcus radiodurans R1] Sequence ID: Q9RS39.1 Length: 170 Range 1: 9 to 129 Score:53.1 bits(126), Expect:2e-06, Method:Composition-based stats., Identities:41/122(34%), Positives:53/122(43%), Gaps:12/122(9%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMCGTYP----- 1397 DIA + VV AAN + + G GV +++ + P+G T P Sbjct 9 DIAHQPVDAVVTAANKQLMGGGGVDGVIHRAAGPRLLQAIRPIGGTPTGTAVITPAFDLE 68 Query 1398 ------VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGK 1451 VIHAVGP + E + LA AYRE + G SVA P +STGVY Sbjct 69 RQGVKYVIHAVGPIWRGGQHGEAEL-LAGAYRESLRLGVENGCRSVAFPSISTGVYGYPL 127 Query 1452 DR 1453 DR Sbjct 128 DR 129 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus isolate Hetian] Sequence ID: Q81862.1 Length: 1693 Range 1: 803 to 891 Score:56.6 bits(135), Expect:2e-06, Method:Compositional matrix adjust., Identities:38/98(39%), Positives:54/98(55%), Gaps:11/98(11%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVGPNFSNY 1409 +VNA+N PG G+C A Y+++P SF ++ + G A + P+IHAV P +Y Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAASFVMRDGAAAYTLT-PRPIIHAVAP---DY 858 Query 1410 SESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 + L AAYRE +RLG + A PLL TG+Y Sbjct 859 RLEHNPKMLEAAYRETC---SRLG--TAAYPLLGTGIY 891 >RecName: Full=Replicase large subunit; AltName: Full=183 kDa protein; AltName: Full=RNA-directed RNA polymerase; Contains: RecName: Full=Replicase small subunit; 
AltName: Full=126 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Turnip vein-clearing virus] Sequence ID: Q88920.1 Length: 1601 Range 1: 822 to 1095 Score:55.5 bits(132), Expect:5e-06, Method:Compositional matrix adjust., Identities:78/280(28%), Positives:117/280(41%), Gaps:23/280(8%) Query 716 VIGVFGVPGSGKSA-IIKNLVTRQDLVTSGKKENCQEI-----STDVMRQRNLEISARTV 769 VI V GVPG GK+ II+ + +DL+ KE + I V+R + RTV Sbjct 822 VILVDGVPGCGKTKEIIEKVNFSEDLILVPGKEASKMIIRRANQAGVIRADKDNV--RTV 879 Query 770 DSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKV 829 DS L++ R L++DE H+G + L+ L + V GD KQ F + Sbjct 880 DSFLMHPSRRVFKRLFIDEGLMLHTGCVNFLLLLSQCDVAYVY-GDTKQIPFICRVA-NF 937 Query 830 NYNHNICTQVYHKSISRRCTLPVTAIVSSL---HYEGKMRTTNEYNKPI---VVDTTGS- 882 Y + V + RR TL A V+ Y+G + T+ + + VV G+ Sbjct 938 PYPAHFAKLVADEKEVRRVTLRCPADVTYFLNKKYDGAVMCTSAVERSVKAEVVRGKGAL 997 Query 883 ---TKPDPGDLVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYA 939 T P G ++ + L+ Y+ +V T QG T + VR + + Sbjct 998 NPITLPLEGKILTFTQADKFELLEKGYK--DVNTVHEVQGETYEKTAIVRLTSTPLEIIS 1055 Query 940 STSEHVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPK-GNF 978 S S HV V LTR + T+ DP + + K NF Sbjct 1056 SASPHVLVALTRHTTCCKYYTVVLDPMVNVISEMEKLSNF 1095 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus human/g1/India/Hyderabad] Sequence ID: Q9WC28.1 Length: 1693 Range 1: 803 to 891 Score:55.1 bits(131), Expect:7e-06, Method:Compositional matrix adjust., Identities:37/98(38%), Positives:52/98(53%), Gaps:11/98(11%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVGPNFSNY 1409 +VNA+N PG G+C A Y+++P SF + + G A + P+IH V P +Y Sbjct 803 LVNASNVDHCPGGGLCHAFYQRYPASFDAACFVMRDGAAAYTLT-PRPIIHRVAP---DY 858 Query 1410 SESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 + L AAYRE +RLG + A PLL TG+Y 
Sbjct 859 RLEHNPKRLEAAYRETC---SRLG--TAAYPLLGTGIY 891 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Myanmar)] Sequence ID: Q04610.1 Length: 1693 Range 1: 803 to 891 Score:53.9 bits(128), Expect:2e-05, Method:Compositional matrix adjust., Identities:37/98(38%), Positives:53/98(54%), Gaps:11/98(11%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVGPNFSNY 1409 +VNA+N PG G+C A Y+++P SF ++ + G A + P+IHAV P +Y Sbjct 803 LVNASNVDHRPGGGLCHAFYQRYPASFDAASFVMRDGAAAYTLT-PRPIIHAVAP---DY 858 Query 1410 SESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 + L AAYRE +RLG + A LL TG+Y Sbjct 859 RLEHNPKRLEAAYRETC---SRLG--TAAYSLLGTGIY 891 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Hepatitis E virus (strain Mexico)] Sequence ID: Q03495.1 Length: 1691 Range 1: 795 to 889 Score:52.4 bits(124), Expect:4e-05, Method:Compositional matrix adjust., Identities:37/104(36%), Positives:54/104(51%), Gaps:13/104(12%) Query 1348 DEEC--VVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV--GTAKTVMCGTYPVIHAVG 1403 + EC +VNA+N PG G+C A ++++P+SF + + G A + P+IHAV Sbjct 795 ESECTWLVNASNAGHRPGGGLCHAFFQRYPDSFDATKFVMRDGLAAYTLT-PRPIIHAVA 853 Query 1404 PNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 P +Y + L AAYRE R G + A PLL G+Y Sbjct 854 P---DYRLEHNPKRLEAAYRETC---ARRG--TAAYPLLGAGIY 889 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Cronobacter sakazakii ATCC BAA-894] Sequence ID: A7MG20.1 Length: 180 Range 1: 11 to 126 Score:48.9 bits(115), Expect:6e-05, Method:Composition-based stats., 
Identities:39/117(33%), Positives:55/117(47%), Gaps:13/117(11%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPV---------GTAKTVMC 1393 DI D + +VNAANP + G GV A+++ + + V G A Sbjct 11 DITLIDVDVIVNAANPSLMGGGGVDGAIHRAAGPALLAACRQVRQQQGECQPGHAVITEA 70 Query 1394 G---TYPVIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 G V+H VGP + ++E + LA AYR + V G NSVA P +STG+Y Sbjct 71 GDLAAKAVVHTVGPVWRGGQDNE-PQLLADAYRNSLQLVAANGYNSVAFPAISTGIY 126 >RecName: Full=Uncharacterized protein SSO2899 [Saccharolobus solfataricus P2] Sequence ID: Q97UU4.1 Length: 177 Range 1: 16 to 128 Score:48.1 bits(113), Expect:1e-04, Method:Composition-based stats., Identities:38/117(32%), Positives:56/117(47%), Gaps:16/117(13%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKK-----WPES----FKNSATPVGTAKTVMC 1393 DI + + + +VNAAN G GV A+ +K ES K PVG Sbjct 16 DITEIEADAIVNAANSYLQHGGGVAYAIVRKGGYIIQKESDEYVKKFGPVPVGEVAVTSA 75 Query 1394 GTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 G VIHAVGP + EG+ +L +A + + L ++S+A+P +STG+Y Sbjct 76 GKLKAKYVIHAVGPRYG----IEGEDKLESAIFKSLLKADELSLSSIAMPAISTGIY 128 >RecName: Full=Replicase large subunit; AltName: Full=182 kDa protein; Contains: RecName: Full=Replicase small subunit; AltName: Full=125 kDa protein; AltName: Full=Methyltransferase/RNA helicase; Short=MT/HEL [Youcai mosaic virus] Sequence ID: Q66220.2 Length: 1597 Range 1: 818 to 1091 Score:50.4 bits(119), Expect:2e-04, Method:Compositional matrix adjust., Identities:72/276(26%), Positives:113/276(40%), Gaps:15/276(5%) Query 716 VIGVFGVPGSGKSA-IIKNLVTRQDLVTSGKKENCQEI---STDVMRQRNLEISARTVDS 771 ++ V GVPG GK+ I++ + +DLV KE + I + R + + RTVDS Sbjct 818 LVLVDGVPGCGKTKEILEKVNFSEDLVLVPGKEASKMIIRRANQAGITRADKDNVRTVDS 877 Query 772 LLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQMKVNY 831 L++ R L++DE H+G + L+ L V D +Q F + Y Sbjct 878 FLMHPPKRVFKRLFIDEGLMLHTGCVNFLMLLSHCDVAYVYV-DTQQIPFICRVA-NFPY 935 Query 832 NHNICTQVYHKSISRRCTLPVTAIVSSL---HYEGKMRTTNEYNKPI---VVDTTGSTKP 885 + V + RR TL A V+ Y+G + T+ + + VV G+ P 
Sbjct 936 PAHFAKLVVDEKEDRRVTLRCPADVTYFLNQKYDGSVLCTSSVERSVSAEVVRGKGALNP 995 Query 886 D--PGDLVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRKGVYAVRQKVNENPLYASTSE 943 P + + F K +D +V T QG T + VR + + S Sbjct 996 ITLPLEGKILTFTQADKFELLDKGYKDVNTVHEVQGETYEKTAIVRLTATPLEIISRASP 1055 Query 944 HVNVLLTRTEGKLVWKTLSGDPWIKTLQNPPK-GNF 978 HV V LTR + + T+ DP + + K NF Sbjct 1056 HVLVALTRHTTRCKYYTVVLDPMVNVISELGKLSNF 1091 >RecName: Full=Replicase polyprotein 1ab; Contains: RecName: Full=Leader protease; Short=L-Pro; AltName: Full=Papain-like cysteine proteinase; Short=PCP; Contains: RecName: Full=Methyltransferase/helicase/RNA-directed RNA polymerase [Beet yellows virus isolate Ukraine] Sequence ID: Q08534.2 Length: 3094 Range 1: 2251 to 2520 Score:50.4 bits(119), Expect:2e-04, Method:Compositional matrix adjust., Identities:68/280(24%), Positives:115/280(41%), Gaps:49/280(17%) Query 723 PGSGKSAIIKNLVTR-----QDLVTSGKKENCQEISTDVMR----QRNLEISAR----TV 769 PG GK+ + + L+ + K + +EI V R + + + R T+ Sbjct 2251 PGGGKTTTLIKVFCETFSKVNSLILTANKSSREEILAKVNRIVLDEGDTPLQTRDRILTI 2310 Query 770 DSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQCGFFNMMQM-- 827 DS L+N VLY+DE F H+G +A I + +L GD +Q + ++ Sbjct 2311 DSYLMNNRGLTCKVLYLDECFMVHAGAAVACIEFTKC-DSAILFGDSRQIRYGRCSELDT 2369 Query 828 ----KVNYNHNICTQVYHKSISRRCTLPVTAIVSSLHYEGKMRTTN-------------- 869 +N + ++VY + +S RC V A +S+ Y + TTN Sbjct 2370 AVLSDLNRFVDDESRVYGE-VSYRCPWDVCAWLSTF-YPKTVATTNLVSAGQSSMQVREI 2427 Query 870 ------EYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQLQIDYRGHEVMTAAASQGLTRK 923 EY+ V T + + DL+ + + K+ + V+T +QG T + Sbjct 2428 ESVDDVEYSSEFVYLTM--LQSEKKDLL----KSFGKRSRSSVEKPTVLTVHEAQGETYR 2481 Query 924 GVYAVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTLSG 963 V VR K E+ + S + H+ V L+R L + LS Sbjct 2482 KVNLVRTKFQEDDPFRSEN-HITVALSRHVESLTYSVLSS 2520 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Klebsiella pneumoniae 342] Sequence ID: B5XXK9.1 Length: 175 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase 
III activity [Klebsiella variicola At-22] Sequence ID: D3RKJ0.1 Length: 175 Range 1: 11 to 144 Score:47.4 bits(111), Expect:2e-04, Method:Compositional matrix adjust., Identities:40/135(30%), Positives:57/135(42%), Gaps:13/135(9%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESF---------KNSATPVGTAKTVMC 1393 DI + + +VNAANP L G GV A+++ + + P G A + Sbjct 11 DITTLEVDVIVNAANPSLLGGGGVDGAIHRAAGPALLAACKQVLQQQGECPPGHAVITIA 70 Query 1394 GTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGG 1450 G P VIH VGP + E + LA AY+ + S+A P +STGVY Sbjct 71 GDLPASAVIHTVGPVWHGGDRMEA-QTLADAYKNSLQLAAANNYRSIAFPAISTGVYGYP 129 Query 1451 KDRLTQSLNHLFTAL 1465 K+ + TA Sbjct 130 KEEAAEIAVRTVTAF 144 >RecName: Full=Macro domain-containing protein in gbd 3'region; AltName: Full=ORF2 [Cupriavidus necator] Sequence ID: Q44020.1 Length: 173 Range 1: 12 to 133 Score:45.8 bits(107), Expect:6e-04, Method:Composition-based stats., Identities:39/123(32%), Positives:58/123(47%), Gaps:13/123(10%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNS---------ATPVGTAKTVMC 1393 DI + + + +VNAAN L G GV A++ + K + P G A Sbjct 12 DITRMEVDAIVNAANSGLLGGGGVDGAIHGAGGSAIKEACRAIRDTQGGCPTGEAVITTG 71 Query 1394 GTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGG 1450 G P VIHAVGP + + E D LA AYR + + + +A P +STG+Y+ Sbjct 72 GHLPAPYVIHAVGPVWQGGDQGE-DELLANAYRNSIRLAAQHHLRRLAFPNISTGIYAFP 130 Query 1451 KDR 1453 ++R Sbjct 131 RER 133 >RecName: Full=Macro domain-containing protein RSc0334 [Ralstonia solanacearum GMI1000] Sequence ID: Q8Y2K1.1 Length: 171 Range 1: 7 to 153 Score:44.7 bits(104), Expect:0.002, Method:Composition-based stats., Identities:40/149(27%), Positives:61/149(40%), Gaps:10/149(6%) Query 1336 SYRVKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSATPVGTAKTVMCGT 1395 + R R DI + +VNAAN L G GV A+++ + + +T Sbjct 7 TLRALRADITTLACDAIVNAANSALLGGGGVDGAIHRAAGPELLEACRALHGCRTGQAKI 66 Query 1396 YP--------VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 P +IH VGP + + E LAA YR + V ++A P +STGVY Sbjct 67 
TPGFLLPARYIIHTVGPIWRGGRQDEAAL-LAACYRNSLALAKQHDVRTIAFPCISTGVY 125 Query 1448 SGGKDRLTQSLNHLFTALDSTDADVVIYC 1476 G +L + D D +++C Sbjct 126 -GFPPQLAAPIAVRTVREHGADLDDIVFC 153 >RecName: Full=Protein mono-ADP-ribosyltransferase PARP9; AltName: Full=ADP-ribosyltransferase diphtheria toxin-like 9; Short=ARTD9; AltName: Full=B aggressive lymphoma protein homolog; AltName: Full=Poly [ADP-ribose] polymerase 9; Short=PARP-9 [Mus musculus] Sequence ID: Q8CAS9.2 Length: 866 Range 1: 124 to 246 Score:44.7 bits(104), Expect:0.010, Method:Compositional matrix adjust., Identities:35/123(28%), Positives:56/123(45%), Gaps:14/123(11%) Query 1339 VKRMDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSA---------TPVGTAK 1389 V + D+ ++ + VVNAAN L G G+ ++ K + + VG Sbjct 124 VWKDDLTRHVVDAVVNAANENLLHGSGLAGSLVKTGGFEIQEESKRIIANVGKISVGGIA 183 Query 1390 TVMCGTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTR--LGVNSVAIPLLST 1444 G P +IHAVGP ++ + L A R + VT+ L + +VAIP LS+ Sbjct 184 ITGAGRLPCHLIIHAVGPRWTVTNSQTAIELLKFAIRNILDYVTKYDLRIKTVAIPALSS 243 Query 1445 GVY 1447 G++ Sbjct 244 GIF 246 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Therien] Sequence ID: P13889.5 Length: 2116 Range 1: 836 to 942 Score:43.9 bits(102), Expect:0.018, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSA---TPVGTAKTVM-----CGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + P T + V CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAANCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; 
Short=p90 [Rubella virus strain RN-UK86] Sequence ID: Q8BCR0.1 Length: 2116 Range 1: 836 to 942 Score:43.9 bits(102), Expect:0.018, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336 vaccine] Sequence ID: Q99IE7.1 Length: 2116 Range 1: 836 to 942 Score:43.9 bits(102), Expect:0.019, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain TO-336] Sequence ID: Q99IE5.1 Length: 2116 Range 1: 836 to 942 Score:43.9 bits(102), Expect:0.019, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 
PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWACVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRDII] Sequence ID: Q6X2U2.1 Length: 2116 Range 1: 836 to 942 Score:43.5 bits(101), Expect:0.024, Method:Compositional matrix adjust., Identities:34/107(32%), Positives:43/107(40%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSA---TPVGTAKTVM-----CGTYPVIHAVG 1403 VVNAAN L G GVC A++ S P T + V CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFASAAASLAEDCRRLAPCPTGEAVATPGHGCGYAHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + + + L AYR + VA PLL G+Y Sbjct 896 PRRPQDPAALEQSEALLERAYRSIVALAAARRWTCVACPLLGAGIYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain Cendehill] Sequence ID: Q9J6K9.2 Length: 2116 Range 1: 836 to 942 Score:43.1 bits(100), Expect:0.027, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWAYVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain M33] Sequence ID: Q86500.2 Length: 2116 Range 1: 836 to 942 Score:43.1 bits(100), Expect:0.031, 
Method:Compositional matrix adjust., Identities:36/107(34%), Positives:46/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P+G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPIGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWARVACPLLGAGVYG 942 >RecName: Full=Non-structural polyprotein pORF1; Includes: RecName: Full=Methyltransferase; Includes: RecName: Full=Putative papain-like cysteine protease; Short=PLP; Includes: RecName: Full=NTPase/helicase; Includes: RecName: Full=RNA-directed RNA polymerase; Short=RdRp [Avian hepatitis E virus chicken/United States/Meng] Sequence ID: Q6QLN1.1 Length: 1531 Range 1: 633 to 730 Score:42.4 bits(98), Expect:0.049, Method:Compositional matrix adjust., Identities:31/110(28%), Positives:49/110(44%), Gaps:16/110(14%) Query 1342 MDIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSA----TPVGTAKTVMCGTYP 1397 +D+A + +VN AN PG G+C +++WP + P G G Sbjct 633 LDVAA---DWLVNPANRDHQPGGGLCGMFHRRWPHLWPVCGEVQDLPTGPV-IFQQGPPK 688 Query 1398 VIHAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVY 1447 VIHA GP++ + +G R + A + +VA PL+S G+Y Sbjct 689 VIHAPGPDYRIKPDPDGLRRVYAVVHQAH--------GTVASPLISAGIY 730 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus strain BRD1] Sequence ID: Q6X2U4.1 Length: 2116 Range 1: 836 to 942 Score:42.4 bits(98), Expect:0.051, Method:Compositional matrix adjust., Identities:34/107(32%), Positives:43/107(40%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVYKKWPESFKNSA---TPVGTAKTVM-----CGTYPVIHAVG 1403 VVNAAN L G GVC A++ + P T + V CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFASAAATLAEDCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + + + L AYR V 
VA PLL G+Y Sbjct 896 PRRPQDPAALEQSEALLERAYRSVVALAAARRWACVACPLLGAGIYG 942 >RecName: Full=O-acetyl-ADP-ribose deacetylase; AltName: Full=Regulator of RNase III activity [Salmonella enterica subsp. enterica serovar Agona str. SL483] Sequence ID: B5F961.1 Length: 179 Range 1: 6 to 127 Score:40.0 bits(92), Expect:0.065, Method:Compositional matrix adjust., Identities:42/132(32%), Positives:56/132(42%), Gaps:31/132(23%) Query 1338 RVKRMDIAKNDEECVVNAANPRGLPGDGV---------------CKAVYKKWPESFKNSA 1382 +V + DI + + +VNAAN + G GV CK + ++ E A Sbjct 6 QVIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGECQTGHA 65 Query 1383 --TPVG--TAKTVMCGTYPVIHAVGPNF--SNYSESEGDRELAAAYREVAKEVTRLGVNS 1436 TP G +AK V IH VGP + Y E+E L AAYR S Sbjct 66 VITPAGKLSAKAV-------IHTVGPVWRGGEYQEAE---LLEAAYRNCLLLAEANHFRS 115 Query 1437 VAIPLLSTGVYS 1448 +A P +STGVY Sbjct 116 IAFPAISTGVYG 127 >RecName: Full=Non-structural polyprotein p200; Short=p200; Contains: RecName: Full=Protease/methyltransferase p150; Short=p150; Contains: RecName: Full=RNA-directed RNA polymerase p90; Short=p90 [Rubella virus vaccine strain RA27/3] Sequence ID: O40955.1 Length: 2116 Range 1: 836 to 942 Score:42.0 bits(97), Expect:0.065, Method:Compositional matrix adjust., Identities:36/107(34%), Positives:45/107(42%), Gaps:10/107(9%) Query 1352 VVNAANPRGLPGDGVCKAVY-----KKWPESFKNSATPVGTAKTV---MCGTYPVIHAVG 1403 VVNAAN L G GVC A++ + + + P G A CG +IHAV Sbjct 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query 1404 PNFSN--YSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYS 1448 P + EG+ L AYR + VA PLL GVY Sbjct 896 PRRPRDPAALEEGEALLERAYRSIVALAAARRWARVACPLLGAGVYG 942 >RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName: Full=ORF1ab polyprotein; Contains: RecName: Full=Non-structural protein 1; Short=nsp1; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=PL1-PRO/PL2-PRO; AltName: Full=PLP1/PLP2; 
AltName: Full=Papain-like proteinases 1/2; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p5; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p23; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p12; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p14; Contains: RecName: Full=RNA-directed RNA polymerase; Short=Pol; Short=RdRp; AltName: Full=nsp12; AltName: Full=p100; Contains: RecName: Full=Helicase; Short=Hel; AltName: Full=nsp13; AltName: Full=p66; AltName: Full=p66-HEL; Contains: RecName: Full=Exoribonuclease; Short=ExoN; AltName: Full=nsp14; Contains: RecName: Full=Uridylate-specific endoribonuclease; AltName: Full=NendoU; AltName: Full=nsp15; AltName: Full=p41; Contains: RecName: Full=Putative 2'-O-methyl transferase; AltName: Full=nsp16 [Human coronavirus NL63] Sequence ID: P0C6X5.1 Length: 6729 Range 1: 1285 to 1410 Score:39.7 bits(91), Expect:0.36, Method:Compositional matrix adjust., Identities:39/139(28%), Positives:60/139(43%), Gaps:23/139(16%) Query 1352 VVNAANPRGLPGDGVCKAV-------YKKWPESFKNSATPVGTAKTVM--CGTYPVIHAV 1402 VVNAAN L G GV +A+ + + + +S P+ VM C + V + V Sbjct 1285 VVNAANENLLHGGGVARAIDILTEGQLQSLSKDYISSNGPLKVGAGVMLECEKFNVFNVV 1344 Query 1403 GPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSV-AIPLLSTGVYSGGKDRLTQSLNHL 1461 GP + S L AY + E N + +PLLS G++ R+ SL L Sbjct 1345 GPRTGKHEHS----LLVEAYNSILFE------NGIPLMPLLSCGIFG---VRIENSLKAL 1391 Query 1462 FTALDSTDADVVIYCRDKE 1480 F+ + V +Y ++E Sbjct 1392 FSCDINKPLQVFVYSSNEE 1410 >RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName: Full=ORF1a 
polyprotein; Contains: RecName: Full=Non-structural protein 1; Short=nsp1; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=PL1-PRO/PL2-PRO; AltName: Full=PLP1/PLP2; AltName: Full=Papain-like proteinases 1/2; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p5; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p23; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p12; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p14; Contains: RecName: Full=Non-structural protein 11; Short=nsp11 [Human coronavirus NL63] Sequence ID: P0C6U6.1 Length: 4060 Range 1: 1285 to 1410 Score:39.3 bits(90), Expect:0.40, Method:Compositional matrix adjust., Identities:39/139(28%), Positives:60/139(43%), Gaps:23/139(16%) Query 1352 VVNAANPRGLPGDGVCKAV-------YKKWPESFKNSATPVGTAKTVM--CGTYPVIHAV 1402 VVNAAN L G GV +A+ + + + +S P+ VM C + V + V Sbjct 1285 VVNAANENLLHGGGVARAIDILTEGQLQSLSKDYISSNGPLKVGAGVMLECEKFNVFNVV 1344 Query 1403 GPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSV-AIPLLSTGVYSGGKDRLTQSLNHL 1461 GP + S L AY + E N + +PLLS G++ R+ SL L Sbjct 1345 GPRTGKHEHS----LLVEAYNSILFE------NGIPLMPLLSCGIFG---VRIENSLKAL 1391 Query 1462 FTALDSTDADVVIYCRDKE 1480 F+ + V +Y ++E Sbjct 1392 FSCDINKPLQVFVYSSNEE 1410 >RecName: Full=Movement protein TGB1; AltName: Full=58 kDa protein; AltName: Full=Beta-B protein; AltName: Full=Triple gene block 1 protein; Short=TGBp1 [Barley stripe mosaic virus] Sequence ID: P04867.1 Length: 528 Range 1: 256 
to 501 Score:38.5 bits(88), Expect:0.62, Method:Compositional matrix adjust., Identities:65/278(23%), Positives:112/278(40%), Gaps:51/278(18%) Query 706 IRPACPYKTAVIGVFGVPGSGKSAIIKNLVTRQDLVTSGKKENCQEISTDVMRQRNLEIS 765 ++PA + T +I GVPGSGKS I++ L+ + C + +M + Sbjct 256 LKPAVDFLTGIIS--GVPGSGKSTIVRTLLKGEFPAV------CALANPALMNDYSGIEG 307 Query 766 ARTVDSLLLNGCNRPVDVLYVDEAFACHSGTLLALIALVRPRQKVVLCGDPKQ-----CG 820 +D LLL+ D+L +DE S +L L +R V+L GD Q Sbjct 308 VYGLDDLLLSAVPITSDLLIIDEYTLAESAEILLLQRRLRASM-VLLVGDVAQGKATTAS 366 Query 821 FFNMMQMKVNYNH-----------NICTQVYHKSISR--RCTLPVTAIVSSLHYEGKMRT 867 + + V Y ++C++ ++ +S+ R T+ +T Y+G+ Sbjct 367 SIEYLTLPVIYRSETTYRLGQETASLCSKQGNRMVSKGGRDTVIIT------DYDGETDE 420 Query 868 TNEYNKPIVVDTTGSTKPDPGDLVLTCFRGWVKQLQIDYRGHEVMTAAAS-QGLTRKGVY 926 T E N VDT K C G+ L ID +G E + + RK + Sbjct 421 T-EKNIAFTVDTVRDVKD--------C--GYDCALAIDVQGKEFDSVTLFLRNEDRKAL- 468 Query 927 AVRQKVNENPLYASTSEHVNVLLTRTEGKLVWKTLSGD 964 +++ + S H + L+ R + ++ L+GD Sbjct 469 -----ADKHLRLVALSRHKSKLIIRADAEIRQAFLTGD 501 >RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName: Full=ORF1ab polyprotein; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=Papain-like proteinase; Short=PL-PRO; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; AltName: Full=p41; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p33; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p24; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p10; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; 
Short=GFL; AltName: Full=p16; Contains: RecName: Full=RNA-directed RNA polymerase; Short=Pol; Short=RdRp; AltName: Full=nsp12; AltName: Full=p100; Contains: RecName: Full=Helicase; Short=Hel; AltName: Full=nsp13; AltName: Full=p68; Contains: RecName: Full=Exoribonuclease; Short=ExoN; AltName: Full=nsp14; AltName: Full=p58; Contains: RecName: Full=Uridylate-specific endoribonuclease; AltName: Full=NendoU; AltName: Full=nsp15; AltName: Full=p39; Contains: RecName: Full=Putative 2'-O-methyl transferase; AltName: Full=nsp16; AltName: Full=p35 [Avian infectious bronchitis virus (strain Beaudette CK)] Sequence ID: P0C6Y2.1 Length: 6629 Range 1: 1036 to 1166 Score:37.4 bits(85), Expect:1.8, Method:Compositional matrix adjust., Identities:37/141(26%), Positives:64/141(45%), Gaps:19/141(13%) Query 1349 EECVVNAANPRGLPGDGVCKAV-------YKKWPESFKNSATPVGTAKT--VMCGTYPVI 1399 E C+VNAAN G GV KA+ + ++ E + P T + G V Sbjct 1036 EFCIVNAANEHMTHGSGVAKAIADFCGLDFVEYCEDYVKKHGPQQRLVTPSFVKGIQCVN 1095 Query 1400 HAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLN 1459 + VGP + ++ +L AAY+ V + GV + +P+LS G++ G D S++ Sbjct 1096 NVVGP---RHGDNNLHEKLVAAYKNVLVD----GVVNYVVPVLSLGIF--GVD-FKMSID 1145 Query 1460 HLFTALDSTDADVVIYCRDKE 1480 + A + V+++ +E Sbjct 1146 AMREAFEGCTIRVLLFSLSQE 1166 >RecName: Full=Replicase polyprotein 1ab; Short=pp1ab; AltName: Full=ORF1ab polyprotein; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=Papain-like proteinase; Short=PL-PRO; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; AltName: Full=p41; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p33; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p9; Contains: RecName: 
Full=Non-structural protein 8; Short=nsp8; AltName: Full=p24; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p10; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p16; Contains: RecName: Full=RNA-directed RNA polymerase; Short=Pol; Short=RdRp; AltName: Full=nsp12; AltName: Full=p100; Contains: RecName: Full=Helicase; Short=Hel; AltName: Full=nsp13; AltName: Full=p68; Contains: RecName: Full=Exoribonuclease; Short=ExoN; AltName: Full=nsp14; AltName: Full=p58; Contains: RecName: Full=Uridylate-specific endoribonuclease; AltName: Full=NendoU; AltName: Full=nsp15; AltName: Full=p39; Contains: RecName: Full=Putative 2'-O-methyl transferase; AltName: Full=nsp16; AltName: Full=p35 [Avian infectious bronchitis virus (strain Beaudette)] Sequence ID: P0C6Y1.1 Length: 6629 Range 1: 1036 to 1166 Score:37.4 bits(85), Expect:1.8, Method:Compositional matrix adjust., Identities:37/141(26%), Positives:64/141(45%), Gaps:19/141(13%) Query 1349 EECVVNAANPRGLPGDGVCKAV-------YKKWPESFKNSATPVGTAKT--VMCGTYPVI 1399 E C+VNAAN G GV KA+ + ++ E + P T + G V Sbjct 1036 EFCIVNAANEHMTHGSGVAKAIADFCGLDFVEYCEDYVKKHGPQQRLVTPSFVKGIQCVN 1095 Query 1400 HAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLN 1459 + VGP + ++ +L AAY+ V + GV + +P+LS G++ G D S++ Sbjct 1096 NVVGP---RHGDNNLHEKLVAAYKNVLVD----GVVNYVVPVLSLGIF--GVD-FKMSID 1145 Query 1460 HLFTALDSTDADVVIYCRDKE 1480 + A + V+++ +E Sbjct 1146 AMREAFEGCTIRVLLFSLSQE 1166 >RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName: Full=ORF1a polyprotein; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=Papain-like proteinase; Short=PL-PRO; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; AltName: Full=p41; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; 
Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p33; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p24; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p10; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p16; Contains: RecName: Full=Non-structural protein 11; Short=nsp11 [Avian infectious bronchitis virus (strain Beaudette)] Sequence ID: P0C6V3.1 Length: 3951 >RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName: Full=ORF1a polyprotein; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p87; Contains: RecName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=Papain-like proteinase; Short=PL-PRO; AltName: Full=p195; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; AltName: Full=p41; Contains: RecName: Full=3C-like proteinase; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p33; Contains: RecName: Full=Non-structural protein 6; Short=nsp6; AltName: Full=p34; Contains: RecName: Full=Non-structural protein 7; Short=nsp7; AltName: Full=p9; Contains: RecName: Full=Non-structural protein 8; Short=nsp8; AltName: Full=p24; Contains: RecName: Full=Non-structural protein 9; Short=nsp9; AltName: Full=p10; Contains: RecName: Full=Non-structural protein 10; Short=nsp10; AltName: Full=Growth factor-like peptide; Short=GFL; AltName: Full=p16; Contains: RecName: Full=Non-structural protein 11; Short=nsp11 [Avian infectious bronchitis virus (strain Beaudette CK)] Sequence ID: P0C6V4.1 Length: 3951 Range 1: 1036 to 1166 Score:37.4 bits(85), Expect:1.9, Method:Compositional matrix adjust., Identities:37/141(26%), Positives:64/141(45%), Gaps:19/141(13%) 
Query 1349 EECVVNAANPRGLPGDGVCKAV-------YKKWPESFKNSATPVGTAKT--VMCGTYPVI 1399 E C+VNAAN G GV KA+ + ++ E + P T + G V Sbjct 1036 EFCIVNAANEHMTHGSGVAKAIADFCGLDFVEYCEDYVKKHGPQQRLVTPSFVKGIQCVN 1095 Query 1400 HAVGPNFSNYSESEGDRELAAAYREVAKEVTRLGVNSVAIPLLSTGVYSGGKDRLTQSLN 1459 + VGP + ++ +L AAY+ V + GV + +P+LS G++ G D S++ Sbjct 1096 NVVGP---RHGDNNLHEKLVAAYKNVLVD----GVVNYVVPVLSLGIF--GVD-FKMSID 1145 Query 1460 HLFTALDSTDADVVIYCRDKE 1480 + A + V+++ +E Sbjct 1146 AMREAFEGCTIRVLLFSLSQE 1166 >RecName: Full=Protein mono-ADP-ribosyltransferase PARP9; AltName: Full=ADP-ribosyltransferase diphtheria toxin-like 9; Short=ARTD9; AltName: Full=B aggressive lymphoma protein; AltName: Full=Poly [ADP-ribose] polymerase 9; Short=PARP-9 [Homo sapiens] Sequence ID: Q8IXQ6.2 Length: 854 Range 1: 126 to 244 Score:36.2 bits(82), Expect:3.3, Method:Compositional matrix adjust., Identities:31/119(26%), Positives:49/119(41%), Gaps:14/119(11%) Query 1343 DIAKNDEECVVNAANPRGLPGDGVCKAVYKKWPESFKNSA---------TPVGTAKTVMC 1393 D+ + + VVNAAN L G G+ A+ K + + G Sbjct 126 DLTTHAVDAVVNAANEDLLHGGGLALALVKAGGFEIQEESKQFVARYGKVSAGEIAVTGA 185 Query 1394 GTYP---VIHAVGPNFSNYSESEGDRELAAAYREVAKEVT--RLGVNSVAIPLLSTGVY 1447 G P +IHAVGP + + + +L A + V + +VAIP LS+G++ Sbjct 186 GRLPCKQIIHAVGPRWMEWDKQGCTGKLQRAIVSILNYVIYKNTHIKTVAIPALSSGIF 244 >RecName: Full=RNA-directed RNA polymerase; AltName: Full=216.5 kDa protein; AltName: Full=ORF1 protein; AltName: Full=RNA replicase [Apple chlorotic leaf spot virus (isolate APPLE)] Sequence ID: P54891.1 Length: 1885 Range 1: 1044 to 1167 Score:36.2 bits(82), Expect:4.2, Method:Compositional matrix adjust., Identities:32/125(26%), Positives:57/125(45%), Gaps:8/125(6%) Query 705 RIRPACPYKTAVIGVFGVPGSGKSAIIKNLVTRQDLVTSGKKENC--QEISTDVMRQRNL 762 R++ K + G+FG GSGKS I+NL+ + + G C + ++ D + Sbjct 1044 RVQELNFMKVKIYGIFGFAGSGKSHAIQNLIQTEFKGSQGVMVICPRRFLAKDWSEKGVD 1103 Query 763 EISARTVDSLLLNGCNRPVDVLYVDEAFACHSG-TLLALIAL----VRPRQKVVLCGDPK 817 E +T +S L + + + +DE G T L ++ + + + +V GDP Sbjct 1104 
EKDIKTFESALKSDV-KGKRLFILDEISLLPKGFTDLLMLKMHMEGILKKSTIVCIGDPL 1162 Query 818 QCGFF 822 Q G+F Sbjct 1163 QAGYF 1167 >RecName: Full=RNA-directed RNA polymerase; AltName: Full=216.5 kDa protein; AltName: Full=ORF1 protein; AltName: Full=RNA replicase [Apple chlorotic leaf spot virus (isolate PLUM P863)] Sequence ID: P27738.1 Length: 1884 Range 1: 1051 to 1166 Score:35.4 bits(80), Expect:6.2, Method:Compositional matrix adjust., Identities:31/117(26%), Positives:54/117(46%), Gaps:8/117(6%) Query 713 KTAVIGVFGVPGSGKSAIIKNLVTRQDLVTSGKKENC--QEISTDVMRQRNLEISARTVD 770 K + G+FG GSGKS I+NL+ + + G C + ++ D + E +T + Sbjct 1051 KVKIYGIFGFAGSGKSHAIQNLIQTEFKGSQGIMVICPRRFLAKDWSEKGVDEKDIKTFE 1110 Query 771 SLLLNGCNRPVDVLYVDEAFACHSG-TLLALIAL----VRPRQKVVLCGDPKQCGFF 822 S L + + + +DE G T L ++ + + + +V GDP Q G+F Sbjct 1111 SALKSDV-KGKRLFILDEISLLPKGFTDLLMLKMHMEGILKKSTIVCIGDPLQAGYF 1166