RID: 00SJJ422016 Job Title:Protein Sequence Program: BLASTP Query: unnamed protein product ID: lcl|Query_1713845(amino acid) Length: 496 Database: swissprot Non-redundant UniProtKB/SwissProt sequences Sequences producing significant alignments: Scientific Common Max Total Query E Per. Acc. Description Name Name Taxid Score Score cover Value Ident Len Accession RecName: Full=Envelopment polyprotein; AltName: Full=M... Uukuniemi vi... NA 487099 1036 1036 100% 0.0 100.00 1008 P09613.2 RecName: Full=Envelopment polyprotein; AltName: Full=M... Rift valley ... NA 11589 100 100 77% 3e-21 20.74 1197 P21401.1 RecName: Full=Envelopment polyprotein; AltName: Full=M... Rift Valley ... NA 11588 98.6 98.6 77% 2e-20 20.74 1206 P03518.2 RecName: Full=Envelopment polyprotein; AltName: Full=M... SFTS virus HB29 NA 992212 84.7 84.7 79% 6e-16 22.41 1073 A0A0B5A886.2 RecName: Full=Envelopment polyprotein; AltName: Full=M... Punta Toro v... NA 11587 82.4 82.4 78% 3e-15 22.72 1313 P03517.1 RecName: Full=Envelopment polyprotein; AltName: Full=M... Severe fever... NA 1003835 82.0 82.0 79% 4e-15 21.69 1073 R4V2Q5.1 Alignments: >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; AltName: Full=p110; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Uukuniemi virus S23] Sequence ID: P09613.2 Length: 1008 Range 1: 18 to 513 Score:1036 bits(2680), Expect:0.0, Method:Compositional matrix adjust., Identities:496/496(100%), Positives:496/496(100%), Gaps:0/496(0%) Query 1 FFNHLMDVTRRLLDSSNATWQRDQPDTHRLSRLDAHVMSMLGVGSHIDEVSVNHSQHLHN 60 FFNHLMDVTRRLLDSSNATWQRDQPDTHRLSRLDAHVMSMLGVGSHIDEVSVNHSQHLHN Sbjct 18 FFNHLMDVTRRLLDSSNATWQRDQPDTHRLSRLDAHVMSMLGVGSHIDEVSVNHSQHLHN 77 Query 61 FRSYNCEEGRRTLTMMDPKSGKFKRLKCNENQTLSKDCASCIEKKSSIMKSEHLVYDDAI 120 FRSYNCEEGRRTLTMMDPKSGKFKRLKCNENQTLSKDCASCIEKKSSIMKSEHLVYDDAI Sbjct 78 FRSYNCEEGRRTLTMMDPKSGKFKRLKCNENQTLSKDCASCIEKKSSIMKSEHLVYDDAI 137 Query 121 CQSDYSSPEAMPDHETHLCRIGPLHIQHCTHEAKRVQHVSWFWIDGKLRVYDDFSVSWTE 180 CQSDYSSPEAMPDHETHLCRIGPLHIQHCTHEAKRVQHVSWFWIDGKLRVYDDFSVSWTE Sbjct 138 CQSDYSSPEAMPDHETHLCRIGPLHIQHCTHEAKRVQHVSWFWIDGKLRVYDDFSVSWTE 197 Query 181 GKFLSLFDCLNETSKDHNCNKAVCLEGRCSGDLQFCTEFTCSYAKADCNCKRNQVSGVAV 240 GKFLSLFDCLNETSKDHNCNKAVCLEGRCSGDLQFCTEFTCSYAKADCNCKRNQVSGVAV Sbjct 198 GKFLSLFDCLNETSKDHNCNKAVCLEGRCSGDLQFCTEFTCSYAKADCNCKRNQVSGVAV 257 Query 241 VHTKHGSFMPECMGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQ 300 VHTKHGSFMPECMGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQ Sbjct 258 VHTKHGSFMPECMGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQ 317 Query 301 ACLGSTCLTGRAKDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAI 360 ACLGSTCLTGRAKDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAI Sbjct 318 ACLGSTCLTGRAKDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAI 377 Query 361 TCWFCRANWANIHCFSKEQVLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILS 420 TCWFCRANWANIHCFSKEQVLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILS Sbjct 378 TCWFCRANWANIHCFSKEQVLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILS 437 Query 421 LLCRTCSKRLNKRAERLKESIHSLEEGLNNVDEGPREQNNPARAVARPNVRQKMFNLTRL 480 LLCRTCSKRLNKRAERLKESIHSLEEGLNNVDEGPREQNNPARAVARPNVRQKMFNLTRL Sbjct 438 LLCRTCSKRLNKRAERLKESIHSLEEGLNNVDEGPREQNNPARAVARPNVRQKMFNLTRL 497 Query 481 SPVVVGMLCLACPVES 496 SPVVVGMLCLACPVES Sbjct 498 SPVVVGMLCLACPVES 513 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=NSm-Gn protein; AltName: Full=p78; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Rift valley fever virus (STRAIN ZH-548 M12)] Sequence ID: P21401.1 Length: 1197 Range 1: 264 to 664 Score:100 bits(250), Expect:3e-21, Method:Compositional matrix adjust., Identities:84/405(21%), Positives:154/405(38%), Gaps:27/405(6%) Query 81 GKFKRLKCNENQTLSKDCASCIEKKSSIMKSEHLVYDDAICQSDYSSPEAMPDHETHLCR 140 GK +KC L++DC C + + +K D CQS + +C Sbjct 264 GKMASVKCPPKYELTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCE 323 Query 141 IGPLHIQHCTHEAKRVQHVSWFWI--DGKLRVYDDFSVSWTEGKFLSLFDCLNETS---- 194 +G ++ C + V F + + K D + E F C Sbjct 324 VGVQALKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKG 383 Query 195 -----------KDHNCNKAVCLEG----RCSGDLQFCTEFTCSYAKADCNCKRNQVSGVA 239 K + ++ + G +C+GD FC+ + C+ A+ C SG+ Sbjct 384 TMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIV 443 Query 240 VVHTKHGSFMPECMGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHY 299 + P C+G V++ LS + + +PC C + C+ ++V F Sbjct 444 QIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSA 503 Query 300 QACLGSTCLTG-RAKDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACA 358 AC C+TG ++ E + + + S + + ++S + + C D C Sbjct 504 VACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCL 563 Query 359 AITCWFCRANWANIHCFSKEQVLILVAV-SSLCILLLASVLRALKVIATFTWKIIKPFWW 417 C C N C + ++V V SS+ I+ LA + R LK + K++ P W Sbjct 564 VHDCIVCAHGLINYQCHTALSAFVVVFVFSSIAIICLAILYRVLKCLKIAPRKVLNPLMW 623 Query 418 ILSLLCRTCSKRLNKRAERLKESIHSLEEGLNNVDEGPREQNNPA 462 I + + R K++ R+ ++I+ + + ++ G NPA Sbjct 624 ITAFI-RWIYKKM---VARVADNINQVNREIGWMEGGQLVLGNPA 664 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=NSm-Gn protein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Rift Valley fever virus] Sequence ID: P03518.2 Length: 1206 Range 1: 264 to 664 Score:98.6 bits(244), Expect:2e-20, Method:Compositional matrix adjust., Identities:84/405(21%), Positives:152/405(37%), Gaps:27/405(6%) Query 81 GKFKRLKCNENQTLSKDCASCIEKKSSIMKSEHLVYDDAICQSDYSSPEAMPDHETHLCR 140 GK +KC L++DC C + + +K D CQS + +C Sbjct 264 GKMASVKCPPKYGLTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCE 323 Query 141 IGPLHIQHCTHEAKRVQHVSWFWI--DGKLRVYDDFSVSWTEGKFLSLFDCLNETS---- 194 +G + C + V F + + K D + E F C Sbjct 324 VGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKG 383 Query 195 -----------KDHNCNKAVCLEG----RCSGDLQFCTEFTCSYAKADCNCKRNQVSGVA 239 K + ++ + G +C+GD FC+ + C+ A+ C SG+ Sbjct 384 TMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIV 443 Query 240 VVHTKHGSFMPECMGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHY 299 + P C+G V++ LS + + +PC C + C+ ++V F Sbjct 444 QIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSA 503 Query 300 QACLGSTCLTG-RAKDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACA 358 AC C+TG ++ E + + + S + + ++S + + C D C Sbjct 504 VACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCL 563 Query 359 AITCWFCRANWANIHCFSKEQVLILVAV-SSLCILLLASVLRALKVIATFTWKIIKPFWW 417 C C N C + ++V V SS+ I+ LA + R LK + K++ P W Sbjct 564 VHGCIVCAHGLINYQCHTALSAFVVVFVFSSIAIICLAVLYRVLKCLKIAPRKVLNPLMW 623 Query 418 ILSLLCRTCSKRLNKRAERLKESIHSLEEGLNNVDEGPREQNNPA 462 I + + R K++ R+ +I+ + + ++ G NPA Sbjct 624 ITAFI-RWIYKKM---VARVAHNINQVNREIGWMEGGQLVLGNPA 664 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [SFTS virus HB29] Sequence ID: A0A0B5A886.2 Length: 1073 Range 1: 156 to 559 Score:84.7 bits(208), Expect:6e-16, Method:Compositional matrix adjust., Identities:93/415(22%), Positives:163/415(39%), Gaps:36/415(8%) Query 98 CASCIEKKSSIMKSEHLVY-DDAICQSDYSSPEA------------MPDHETHLCRIGPL 144 C+S S ++ S+ +++ D CQ PE PD +C+I + Sbjct 156 CSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEFPD----ICKIDGI 211 Query 145 HIQHCTHEA-KRVQHVSWFWIDGKLRV-YDDFSVSWTEGKFLSLFDCLNETSK--DHNCN 200 C E+ + V+W + ++ + W + F C E + + Sbjct 212 VFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEGTGPCSESEE 271 Query 201 KAVCLEGRCSGDLQFCTEFTCSYA----KADCNCKRNQVSGVAVVHTKHGSFMPECMGQS 256 KA G C GD+QFC C + +A C C G VV P+C G S Sbjct 272 KACKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMRVRPKCYGFS 331 Query 257 LWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQACLGSTCLTGRAKDKE 316 ++ + Q C C +C + +I C C + + K Sbjct 332 RMMATLEVNPPEQRIGQ-CTGCHLECINGGVRLITLTSELRSATVCASHFCSSASSGKKS 390 Query 317 FKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAITCWFCRANWANIHCFS 376 +I F + + ++ E+ E C D C A+ C FCR N C+ Sbjct 391 TEIHFHSGSLVGKTAIHVK-GALVDGTEFTFEGSCMFPDGCDAVDCTFCREFLKNPQCYP 449 Query 377 KEQ---VLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILSLLCRTCSKRLNKR 433 ++ ++I++ + ++LL +VL+A+ V ++ +K + I+ L RT S + K Sbjct 450 AKKWLFIIIVILLGYAGLMLLTNVLKAIGVWGSWVIAPVKLMFAIIKKLMRTVSCLVGKL 509 Query 434 AERLKESIHSLEEGLNNVDEGPREQNNPARAVARP-NVRQKMFNLTRLSPVVVGM 487 +R ++ IH E G N G Q++ +ARP VR M++ L+ + +G+ Sbjct 510 MDRGRQVIHE-EIGEN----GEGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL 559 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=NSm-Gn protein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Punta Toro virus] Sequence ID: P03517.1 Length: 1313 Range 1: 353 to 772 Score:82.4 bits(202), Expect:3e-15, Method:Compositional matrix adjust., Identities:97/427(23%), Positives:165/427(38%), Gaps:49/427(11%) Query 49 EVSVNHSQHLHNFRSY---NCEEGRRTLTMMDPKSGKFKRLKCNENQTLSKDCASC--IE 103 E+ N R+Y C + +D K GK + +KC EN +++DCA C I+ Sbjct 353 EIGTNKEFKCFEERAYIKGTCPTNINAVHYIDNK-GKLRYVKCKENLEMTEDCAFCRKIK 411 Query 104 KK---SSIMKSEHLVYDDAICQSD---YSSPEAMPDHETHLCRIGPLHIQHCTHEAKRVQ 157 KK S ++ + DAICQ + YS P+ +P +C+IG + + C + + Sbjct 412 KKAGQSVQVQKTSVPLQDAICQENSDTYSGPK-IPFK--GVCKIGLIKYKECKFKTSSYE 468 Query 158 HVSWFWIDGKLRVY-DDFSVSWTEGKFLSLFDCLNETSKD-----HNCNKAVCL------ 205 VS+ + K ++Y + + E F C +D H K V + Sbjct 469 TVSFITLKEKGKIYIEHLMLKNIEVVTNVSFVCYEHVGQDEQEVEHRALKRVSVNDCKIV 528 Query 206 ----EGRCSGDLQFCTEFTCSYAKADCNCKRNQVSGVAVVHTKHGSFMPECMGQSLWSVR 261 + C+GD FC ++ CS + D C SG ++ P+C+G V Sbjct 529 DNSKQKICTGDHVFCEKYDCSTSYPDVTCIHAPGSGPLYINLMGSWIKPQCVGYERVLVD 588 Query 262 KPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQACLGSTCLTG-RAKDKEFKIP 320 + + + + +Q C C S+C + + + F AC +C++ + +P Sbjct 589 REVKQPLLAPEQNCDTCVSECLDEGVHIKSTGFEITSAVACSHGSCISAHQEPSTSVIVP 648 Query 321 FKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAITCWFCRANWANIHCFSKEQV 380 + I + S + C D+CAA C C N C S Sbjct 649 YPGLLASVGGRIGIHLSHTSDSASVHMVVVCPPRDSCAAHNCLLCYHGILNYQCHS---T 705 Query 381 LILVAVSSLCILLLASVLRA----LKVIATFTWKIIKPFWWI----------LSLLCRTC 426 L + S L IL + +V L V+ ++ P W+ L + R Sbjct 706 LSAILTSFLLILFIYTVFSVTTNILYVLRLIPKQLKSPVGWLKLFINWLLTALRIKTRNV 765 Query 427 SKRLNKR 433 +R+N+R Sbjct 766 MRRINQR 772 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Severe fever with thrombocytopenia syndrome virus] Sequence ID: R4V2Q5.1 Length: 1073 Range 1: 156 to 559 Score:82.0 bits(201), Expect:4e-15, Method:Compositional matrix adjust., Identities:90/415(22%), Positives:163/415(39%), Gaps:36/415(8%) Query 98 CASCIEKKSSIMKSEHLVY-DDAICQSDYSSPEA------------MPDHETHLCRIGPL 144 C+S S ++ S+ +++ D CQ PE PD +C+I + Sbjct 156 CSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEFPD----ICKIDGI 211 Query 145 HIQHCTHEA-KRVQHVSWFWIDGKLRV-YDDFSVSWTEGKFLSLFDCLNETSK--DHNCN 200 C E+ + V+W + ++ + W + F C E + + Sbjct 212 VFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEGTGPCSESEE 271 Query 201 KAVCLEGRCSGDLQFCTEFTCSYA----KADCNCKRNQVSGVAVVHTKHGSFMPECMGQS 256 K G C GD+QFC C + +A C C G VV P+C G S Sbjct 272 KTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMRVRPKCYGFS 331 Query 257 LWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQACLGSTCLTGRAKDKE 316 +++ + Q C C +C + +I C C + + K Sbjct 332 RMMATLEVNQPEQRLGQ-CTGCHLECINGGVRLITLTSELKSATVCASHFCSSATSGKKS 390 Query 317 FKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAITCWFCRANWANIHCFS 376 +I F + + ++ E+ E C D C A+ C FCR N C+ Sbjct 391 TEIQFHSGSLVGKTAIHVK-GALVDGTEFTFEGSCMFPDGCDAVDCTFCREFLKNPQCYP 449 Query 377 KEQ---VLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILSLLCRTCSKRLNKR 433 ++ ++I++ + ++LL +VL+A+ + ++ +K + I+ L RT S + K Sbjct 450 AKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLMRTVSCLMRKL 509 Query 434 AERLKESIHSLEEGLNNVDEGPREQNNPARAVARP-NVRQKMFNLTRLSPVVVGM 487 +R ++ IH E + EG Q++ +ARP VR M++ L+ + +G+ Sbjct 510 MDRGRQVIH---EEIGENREG--NQDDVRIEMARPRRVRHWMYSPVILTILAIGL 559