RID: 32AM7YCM016
Job Title:GP_SFTS R4V2Q5 Envelopment polyprotein (M...
Program: BLASTP
Query: GP_SFTS R4V2Q5 Envelopment polyprotein (M polyprotein) (Glycoprotein N {ECO:0000250|UniProtKB:P21401}) (Gn) (Glycoprotein G1) (Glycoprotein C {ECO:0000250|UniProtKB:P21401}) (Gc) (Glycoprotein G2) (Precursor) ID: lcl|Query_7038482(amino acid) Length: 543
Database: swissprot Non-redundant UniProtKB/SwissProt sequences

Sequences producing significant alignments:
                                                           Scientific       Common           Max    Total  Query  E      Per.    Acc.
Description                                                Name             Name    Taxid    Score  Score  cover  Value  Ident   Len   Accession
RecName: Full=Envelopment polyprotein; AltName: Full=M...  Severe fever...  NA      1003835  1126   1126   100%   0.0    100.00  1073  R4V2Q5.1
RecName: Full=Envelopment polyprotein; AltName: Full=M...  SFTS virus HB29  NA      992212   1100   1100   99%    0.0    97.79   1073  A0A0B5A886.2
RecName: Full=Envelopment polyprotein; AltName: Full=M...  Heartland virus  NA      1216928  692    692    98%    0.0    58.63   1076  J3WAX0.1
RecName: Full=Envelopment polyprotein; AltName: Full=M...  Bhanja virus     NA      1213620  140    140    70%    7e-34  25.76   1070  L7V0S7.1
RecName: Full=Envelopment polyprotein; AltName: Full=M...  Uukuniemi vi...  NA      487099   83.6   83.6   74%    2e-15  21.72   1008  P09613.2

Alignments:

>RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Severe fever with thrombocytopenia syndrome virus]
Sequence ID: R4V2Q5.1 Length: 1073
Range 1: 20 to 562
Score:1126 bits(2913), Expect:0.0, Method:Compositional matrix adjust., Identities:543/543(100%), Positives:543/543(100%), Gaps:0/543(0%)

Query  1    DSGPIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ  60
            DSGPIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ
Sbjct  20   DSGPIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ  79

Query  61   VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYV  120
            VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYV
Sbjct  80   VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYV  139

Query  121  GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ  180
            GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ
Sbjct  140  GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ  199

Query  181  SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY  240
            SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY
Sbjct  200  SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY  259

Query  241  KEGTGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG  300
            KEGTGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG
Sbjct  260  KEGTGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG  319

Query  301  GMRVRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASH  360
            GMRVRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASH
Sbjct  320  GMRVRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASH  379

Query  361  FCSSATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR  420
            FCSSATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR
Sbjct  380  FCSSATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR  439

Query  421  EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLM  480
            EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLM
Sbjct  440  EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLM  499

Query  481  RTVSCLMRKLMDRGRQVIHEEIGENREGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL  540
            RTVSCLMRKLMDRGRQVIHEEIGENREGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL
Sbjct  500  RTVSCLMRKLMDRGRQVIHEEIGENREGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL  559

Query  541  AES  543
            AES
Sbjct  560  AES  562

>RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [SFTS virus HB29]
Sequence ID: A0A0B5A886.2 Length: 1073
Range 1: 20 to 561
Score:1100 bits(2844), Expect:0.0, Method:Compositional matrix adjust., Identities:530/542(98%), Positives:535/542(98%), Gaps:0/542(0%)

Query  1    DSGPIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ  60
            DSGPIICAGPIHSNKSA IPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ
Sbjct  20   DSGPIICAGPIHSNKSAGIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQ  79

Query  61   VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYV  120
            VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAK SDMIVPGPSYKGKVFFERPTFDGYV
Sbjct  80   VSYYPAENSYSRWSGLLSPCDADWLGMLVVKKAKESDMIVPGPSYKGKVFFERPTFDGYV  139

Query  121  GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ  180
            GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ
Sbjct  140  GWGCGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQ  199

Query  181  SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY  240
            SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY
Sbjct  200  SEFPDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCY  259

Query  241  KEGTGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG  300
            KEGTGPCSESEEK CKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG
Sbjct  260
KEGTGPCSESEEKACKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYG 319 Query 301 GMRVRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASH 360 GMRVRPKCYGFSRMMATLEVN PEQR+GQCTGCHLECINGGVRLITLTSEL+SATVCASH Sbjct 320 GMRVRPKCYGFSRMMATLEVNPPEQRIGQCTGCHLECINGGVRLITLTSELRSATVCASH 379 Query 361 FCSSATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR 420 FCSSA+SGKKSTEI FHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR Sbjct 380 FCSSASSGKKSTEIHFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCR 439 Query 421 EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLM 480 EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIG+WGSWVIAPVKLMFAIIKKLM Sbjct 440 EFLKNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGVWGSWVIAPVKLMFAIIKKLM 499 Query 481 RTVSCLMRKLMDRGRQVIHEEIGENREGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL 540 RTVSCL+ KLMDRGRQVIHEEIGEN EGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL Sbjct 500 RTVSCLVGKLMDRGRQVIHEEIGENGEGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL 559 Query 541 AE 542 AE Sbjct 560 AE 561 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Heartland virus] Sequence ID: J3WAX0.1 Length: 1076 Range 1: 25 to 563 Score:692 bits(1785), Expect:0.0, Method:Compositional matrix adjust., Identities:316/539(59%), Positives:411/539(76%), Gaps:2/539(0%) Query 4 PIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQVSY 63 PI+C +NKS I G SEK+CQIDRL HV+SWLRNHS FQG +GQ GR VSY Sbjct 25 PIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKGRPSVSY 84 Query 64 YPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYVGWG 123 +P SY RWSGLLSPCDA+WLG++ V KA +DMIVPGP+YKGK+F ERPT++GY GWG Sbjct 85 FPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYNGYKGWG 144 Query 124 CGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEF 183 C GKS + SG C +DS SSGL+ DRVLW+G+V CQ TP+PE+ F EL S SQSEF Sbjct 145 
CADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVSLSQSEF 204 Query 184 PDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEG 243 PD+CKIDG+ NQCE ES+PQP DVAW+DVG SHK++MREHKTKWVQESS+KDFVC+K G Sbjct 205 PDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDFVCFKVG 264 Query 244 TGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMR 303 GPCS+ EE C + G+C GD FC++AGC + ++ CRC L+ KPGE++V+YGG+ Sbjct 265 QGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIVNYGGVS 324 Query 304 VRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASHFCS 363 VRP CYGFSRMMATLEV++P++ L CTGCHLECI GGV+++TLTSEL+SATVCASHFC+ Sbjct 325 VRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVCASHFCA 384 Query 364 SATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCREFL 423 SA G K+T+I FH+G+LVG +I + G L+DG++F+F+G C+FPDGC A+DCTFC+EFL Sbjct 385 SAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCTFCKEFL 444 Query 424 KNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLMRTV 483 +NPQCYP KKWLF+++VI+ Y LMLLTN+L+AIG+WG+WV AP+KL A+ +L + Sbjct 445 RNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGLRLAKLS 504 Query 484 SCLMRKLMDRGRQVIHEEIGENR--EGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL 540 + ++ RG+ ++++E+ + R G Q++ R +RHW+YSP ++ IL + Sbjct 505 KKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILILTTSI 563 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Bhanja virus] Sequence ID: L7V0S7.1 Length: 1070 Range 1: 177 to 555 Score:140 bits(353), Expect:7e-34, Method:Compositional matrix adjust., Identities:102/396(26%), Positives:168/396(42%), Gaps:28/396(7%) Query 153 VLWIGDVACQPMTPIPEETFLELKSFSQSEFPDICKIDGIVFNQCEGESLPQPFDVAWMD 212 V+ +G Q + E + +S P+IC IDG+ NQC+ L P W+ Sbjct 177 VISVGVQHAQEANQVDEHEARYISEARKSINPEICSIDGVEINQCD---LASPG--RWLM 231 Query 213 
VGHSHKIIMREH---------KTKWVQ-ESSSKDFVCYKEGTGPCSESEEKTCKTSGSCR 262 + H ++E KW Q + DF C + + +C+ Sbjct 232 L-HYASFRLQEGSLVYLSPGLNIKWSQINVPASDFYCINVSDHLNTHYRPCEVNCTDNCQ 290 Query 263 GDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMRVRPKCYGFSRMMATLEVNQ 322 GD +C V C A A+C+CS + G V G +P G + +V Sbjct 291 GDELYCSVHQC-----ARSAECKCSFIGSRGMAEVQIGDRWFKPAVVGSQQFFVKEDVPV 345 Query 323 PEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASHFCSSATS-GKKSTEIQFHSGSL 381 +Q CT C + C G+ + ++ ELK TVC FCS+ S G K +I+FH+ Sbjct 346 LQQPSADCTTCSMTCTAEGIAISSIKDELKDVTVCVEGFCSTRVSKGSKVWKIEFHNQYP 405 Query 382 VGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCREFLKNPQCYPAKKWLFIIIVI 441 + +G V G F C GC+ ++C FCRE L NPQCYP KW + +++ Sbjct 406 SSGSVALARGTTVSGETFELTAECGRRTGCEQINCLFCREMLSNPQCYPYGKWFLLFLIL 465 Query 442 LLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLMRTVSCLMRKLMDRGRQVIHEE 501 Y + LL +++ S + P F II K+ R + L ++ +R + E Sbjct 466 ATLYIIVALLKTIMRIFMACLSVLYGP----FIIIIKISRCLGRLGKRKGERTYVRLMEA 521 Query 502 IGENREGNQDDVRIEMARPRRVRHWMYSPVILTILA 537 + + R+ + + R ++ R ++ ++L +L Sbjct 522 LDDERKPEVVRAPVSLGRTKQPRIVLF--IVLALLV 555 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; AltName: Full=p110; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Uukuniemi virus S23] Sequence ID: P09613.2 Length: 1008 Range 1: 115 to 504 Score:83.6 bits(205), Expect:2e-15, Method:Compositional matrix adjust., Identities:91/419(22%), Positives:166/419(39%), Gaps:44/419(10%) Query 137 CSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEFPD----ICKIDGI 192 C+S S ++ S+ +++ D CQ PE PD +C+I + Sbjct 115 CASCIEKKSSIMKSEHLVY-DDAICQSDYSSPEA------------MPDHETHLCRIGPL 161 Query 193 VFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEGTGPCSESEE 252 C E+ + V+W + ++ + W + F C E S++ Sbjct 162 HIQHCTHEA-KRVQHVSWFWIDGKLRV-YDDFSVSWTEGKFLSLFDCLNET------SKD 213 Query 253 KTCKTS----GSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMRVRPKC 308 C + G C 
GD+QFC C + +A C C G VV P+C Sbjct 214 HNCNKAVCLEGRCSGDLQFCTEFTCSYA----KADCNCKRNQVSGVAVVHTKHGSFMPEC 269 Query 309 YGFSRMMATLEVNQPEQRLGQ-CTGCHLECINGGVRLITLTSELKSATVCASHFCSSATS 367 G S +++ + Q C C +C + +I C C + + Sbjct 270 MGQSLWSVRKPLSKRSVTVQQPCMDCESDCKVDHILVIVRHFYPDHYQACLGSTCLTGRA 329 Query 368 GKKSTEIQFHSGSLVGKTAIHVKGALVD-GTEFTFEGSCMFPDGCDAVDCTFCREFLKNP 426 K +I F + + ++ + E+ E C D C A+ C FCR N Sbjct 330 KDKEFKIPFKMADRLSDSHFEIRIWDKERSNEYFLESRCESVDACAAITCWFCRANWANI 389 Query 427 QCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLMRTVSCL 486 C+ ++ ++I++ + ++LL +VL+A+ + ++ +K + I+ L RT S Sbjct 390 HCFSKEQ---VLILVAVSSLCILLLASVLRALKVIATFTWKIIKPFWWILSLLCRTCSKR 446 Query 487 MRKLMDRGRQVIH---EEIGENREG--NQDDVRIEMARPRRVRHWMYSPVILTILAIGL 540 + K +R ++ IH E + EG Q++ +ARP VR M++ L+ + +G+ Sbjct 447 LNKRAERLKESIHSLEEGLNNVDEGPREQNNPARAVARP-NVRQKMFNLTRLSPVVVGM 504