RID: EXPGED0D016 Job Title:Protein Sequence Program: BLASTP Query: unnamed protein product ID: lcl|Query_5446427(amino acid) Length: 548 Database: swissprot Non-redundant UniProtKB/SwissProt sequences Sequences producing significant alignments: Scientific Common Max Total Query E Per. Acc. Description Name Name Taxid Score Score cover Value Ident Len Accession RecName: Full=Envelopment polyprotein; AltName: Full=M... Heartland virus NA 1216928 1135 1135 100% 0.0 100.00 1076 J3WAX0.1 RecName: Full=Envelopment polyprotein; AltName: Full=M... Severe fever... NA 1003835 701 701 98% 0.0 58.63 1073 R4V2Q5.1 RecName: Full=Envelopment polyprotein; AltName: Full=M... SFTS virus HB29 NA 992212 698 698 98% 0.0 58.49 1073 A0A0B5A886.2 RecName: Full=Envelopment polyprotein; AltName: Full=M... Bhanja virus NA 1213620 136 136 64% 2e-32 25.98 1070 L7V0S7.1 Alignments: >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Heartland virus] Sequence ID: J3WAX0.1 Length: 1076 Range 1: 19 to 566 Score:1135 bits(2937), Expect:0.0, Method:Compositional matrix adjust., Identities:548/548(100%), Positives:548/548(100%), Gaps:0/548(0%) Query 1 WGSPGDPIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKG 60 WGSPGDPIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKG Sbjct 19 WGSPGDPIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKG 78 Query 61 RPSVSYFPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYN 120 RPSVSYFPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYN Sbjct 79 RPSVSYFPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYN 138 Query 121 GYKGWGCADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVS 180 GYKGWGCADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVS Sbjct 139 GYKGWGCADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVS 198 Query 181 LSQSEFPDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDF 240 LSQSEFPDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDF Sbjct 199 LSQSEFPDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDF 258 Query 241 VCFKVGQGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIV 300 VCFKVGQGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIV Sbjct 259 VCFKVGQGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIV 318 Query 301 NYGGVSVRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVC 360 NYGGVSVRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVC Sbjct 319 NYGGVSVRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVC 378 Query 361 ASHFCASAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCT 420 ASHFCASAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCT Sbjct 379 ASHFCASAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCT 438 Query 421 FCKEFLRNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGL 480 FCKEFLRNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGL Sbjct 439 FCKEFLRNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGL 498 Query 481 RLAKLSKKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILI 540 RLAKLSKKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILI Sbjct 499 RLAKLSKKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILI 558 Query 541 LTTSICSG 548 LTTSICSG Sbjct 559 LTTSICSG 566 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Severe fever with thrombocytopenia syndrome virus] Sequence ID: R4V2Q5.1 Length: 1073 Range 1: 23 to 559 Score:701 bits(1808), Expect:0.0, Method:Compositional matrix adjust., Identities:316/539(59%), Positives:411/539(76%), Gaps:2/539(0%) Query 7 PIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKGRPSVSY 66 PI+C +NKS I G SEK+CQIDRL HV+SWLRNHS FQG +GQ GR VSY Sbjct 23 PIICAGPIHSNKSADIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQVSY 82 Query 67 FPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYNGYKGWG 126 +P SY RWSGLLSPCDA+WLG++ V KA +DMIVPGP+YKGK+F ERPT++GY GWG Sbjct 83 YPAENSYSRWSGLLSPCDADWLGMLVVKKAKGSDMIVPGPSYKGKVFFERPTFDGYVGWG 142 Query 127 CADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVSLSQSEF 186 C GKS + SG C +DS SSGL+ DRVLW+G+V CQ TP+PE+ F EL S SQSEF Sbjct 143 CGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEF 202 Query 187 PDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDFVCFKVG 246 PD+CKIDG+ NQCE ES+PQP DVAW+DVG SHK++MREHKTKWVQESS+KDFVC+K G Sbjct 203 PDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEG 262 Query 247 QGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIVNYGGVS 306 GPCS+ EE C + G+C GD FC++AGC + ++ CRC L+ KPGE++V+YGG+ Sbjct 263 TGPCSESEEKTCKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMR 322 Query 307 VRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVCASHFCA 366 VRP CYGFSRMMATLEV++P++ L CTGCHLECI GGV+++TLTSEL+SATVCASHFC+ Sbjct 323 VRPKCYGFSRMMATLEVNQPEQRLGQCTGCHLECINGGVRLITLTSELKSATVCASHFCS 382 Query 367 SAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCTFCKEFL 426 SA G K+T+I FH+G+LVG +I + G L+DG++F+F+G C+FPDGC A+DCTFC+EFL Sbjct 383 SATSGKKSTEIQFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCREFL 442 Query 427 RNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGLRLAKLS 486 +NPQCYP KKWLF+++VI+ Y LMLLTN+L+AIG+WG+WV AP+KL A+ +L + Sbjct 443 KNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGIWGSWVIAPVKLMFAIIKKLMRTV 502 Query 487 KKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILILTTSI 545 + ++ RG+ ++++E+ + R G Q++ R +RHW+YSP ++ IL + Sbjct 503 SCLMRKLMDRGRQVIHEEIGENR--EGNQDDVRIEMARPRRVRHWMYSPVILTILAIGL 559 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [SFTS virus HB29] Sequence ID: A0A0B5A886.2 Length: 1073 Range 1: 23 to 562 Score:698 bits(1802), Expect:0.0, Method:Compositional matrix adjust., Identities:317/542(58%), Positives:411/542(75%), Gaps:2/542(0%) Query 7 PIVCGVRTETNKSIQIEWKEGRSEKLCQIDRLGHVTSWLRNHSSFQGLIGQVKGRPSVSY 66 PI+C +NKS I G SEK+CQIDRL HV+SWLRNHS FQG +GQ GR VSY Sbjct 23 PIICAGPIHSNKSAGIPHLLGYSEKICQIDRLIHVSSWLRNHSQFQGYVGQRGGRSQVSY 82 Query 67 FPEGASYPRWSGLLSPCDAEWLGLIAVSKAGDTDMIVPGPTYKGKIFVERPTYNGYKGWG 126 +P SY RWSGLLSPCDA+WLG++ V KA ++DMIVPGP+YKGK+F ERPT++GY GWG Sbjct 83 YPAENSYSRWSGLLSPCDADWLGMLVVKKAKESDMIVPGPSYKGKVFFERPTFDGYVGWG 142 Query 127 CADGKSLSHSGTYCETDSSVSSGLIQGDRVLWVGEVVCQRGTPVPEDVFSELVSLSQSEF 186 C GKS + SG C +DS SSGL+ DRVLW+G+V CQ TP+PE+ F EL S SQSEF Sbjct 143 CGSGKSRTESGELCSSDSGTSSGLLPSDRVLWIGDVACQPMTPIPEETFLELKSFSQSEF 202 Query 187 PDVCKIDGVALNQCEQESIPQPLDVAWIDVGRSHKVLMREHKTKWVQESSAKDFVCFKVG 246 PD+CKIDG+ NQCE ES+PQP DVAW+DVG SHK++MREHKTKWVQESS+KDFVC+K G Sbjct 203 PDICKIDGIVFNQCEGESLPQPFDVAWMDVGHSHKIIMREHKTKWVQESSSKDFVCYKEG 262 Query 247 QGPCSKQEEDDCMSKGNCHGDEVFCRMAGCSARMQDNQEGCRCELLQKPGEIIVNYGGVS 306 GPCS+ EE C + G+C GD FC++AGC + ++ CRC L+ KPGE++V+YGG+ Sbjct 263 TGPCSESEEKACKTSGSCRGDMQFCKVAGCEHGEEASEAKCRCSLVHKPGEVVVSYGGMR 322 Query 307 VRPTCYGFSRMMATLEVHKPDRELTGCTGCHLECIEGGVKIVTLTSELRSATVCASHFCA 366 VRP CYGFSRMMATLEV+ P++ + CTGCHLECI GGV+++TLTSELRSATVCASHFC+ Sbjct 323 VRPKCYGFSRMMATLEVNPPEQRIGQCTGCHLECINGGVRLITLTSELRSATVCASHFCS 382 Query 367 SAKGGSKTTDILFHTGALVGPNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCTFCKEFL 426 SA G K+T+I FH+G+LVG +I + G L+DG++F+F+G C+FPDGC A+DCTFC+EFL Sbjct 383 SASSGKKSTEIHFHSGSLVGKTAIHVKGALVDGTEFTFEGSCMFPDGCDAVDCTFCREFL 442 Query 427 RNPQCYPVKKWLFLVVVIMCCYCALMLLTNILRAIGVWGTWVFAPIKLALALGLRLAKLS 486 +NPQCYP KKWLF+++VI+ Y LMLLTN+L+AIGVWG+WV AP+KL A+ +L + Sbjct 443 KNPQCYPAKKWLFIIIVILLGYAGLMLLTNVLKAIGVWGSWVIAPVKLMFAIIKKLMRTV 502 Query 487 KKGLVAVVTRGQMIVNDELHQVRVERGEQNEGRQGYGPRGPIRHWLYSPALILILTTSIC 546 + ++ RG+ ++++E+ + G Q++ R +RHW+YSP ++ IL + Sbjct 503 SCLVGKLMDRGRQVIHEEIGE--NGEGNQDDVRIEMARPRRVRHWMYSPVILTILAIGLA 560 Query 547 SG 548 G Sbjct 561 EG 562 >RecName: Full=Envelopment polyprotein; AltName: Full=M polyprotein; Contains: RecName: Full=Glycoprotein N; Short=Gn; AltName: Full=Glycoprotein G1; Contains: RecName: Full=Glycoprotein C; Short=Gc; AltName: Full=Glycoprotein G2; Flags: Precursor [Bhanja virus] Sequence ID: L7V0S7.1 Length: 1070 Range 1: 173 to 525 Score:136 bits(342), Expect:2e-32, Method:Compositional matrix adjust., Identities:93/358(26%), Positives:145/358(40%), Gaps:10/358(2%) Query 152 QGDRVLWVGEVVCQRGTPVPEDVFSELVSLSQSEFPDVCKIDGVALNQCEQESIPQPLDV 211 + V+ VG Q V E + +S P++C IDGV +NQC+ S + L + Sbjct 173 EATNVISVGVQHAQEANQVDEHEARYISEARKSINPEICSIDGVEINQCDLASPGRWLML 232 Query 212 AWIDVGRSHKVLMREH---KTKWVQ-ESSAKDFVCFKVGQGPCSKQEEDDCMSKGNCHGD 267 + L+ KW Q A DF C V + + NC GD Sbjct 233 HYASFRLQEGSLVYLSPGLNIKWSQINVPASDFYCINVSDHLNTHYRPCEVNCTDNCQGD 292 Query 268 EVFCRMAGCSARMQDNQEGCRCELLQKPGEIIVNYGGVSVRPTCYGFSRMMATLEVHKPD 327 E++C + C+ + C+C + G V G +P G + +V Sbjct 293 ELYCSVHQCARSAE-----CKCSFIGSRGMAEVQIGDRWFKPAVVGSQQFFVKEDVPVLQ 347 Query 328 RELTGCTGCHLECIEGGVKIVTLTSELRSATVCASHFCAS-AKGGSKTTDILFHTGALVG 386 + CT C + C G+ I ++ EL+ TVC FC++ GSK I FH Sbjct 348 QPSADCTTCSMTCTAEGIAISSIKDELKDVTVCVEGFCSTRVSKGSKVWKIEFHNQYPSS 407 Query 387 PNSIRITGQLLDGSKFSFDGHCIFPDGCMALDCTFCKEFLRNPQCYPVKKWLFLVVVIMC 446 + G + G F C GC ++C FC+E L NPQCYP KW L +++ Sbjct 408 GSVALARGTTVSGETFELTAECGRRTGCEQINCLFCREMLSNPQCYPYGKWFLLFLILAT 467 Query 447 CYCALMLLTNILRAIGVWGTWVFAPIKLALALGLRLAKLSKKGLVAVVTRGQMIVNDE 504 Y + LL I+R + ++ P + + + L +L K+ R ++DE Sbjct 468 LYIIVALLKTIMRIFMACLSVLYGPFIIIIKISRCLGRLGKRKGERTYVRLMEALDDE 525