RID: 3DJ9W8TT013 Job Title:POL_CAEVC P33459 Pol polyprotein (Protease)... Program: BLASTP Query: unnamed protein product ID: lcl|Query_7277924(amino acid) Length: 713 Database: swissprot Non-redundant UniProtKB/SwissProt sequences Sequences producing significant alignments: Scientific Common Max Total Query E Per. Acc. Description Name Name Taxid Score Score cover Value Ident Len Accession RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Equine infec... NA 11670 489 489 100% 5e-158 39.89 1146 P11204.1 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... SIVcpz EK505 NA 388912 380 380 75% 5e-115 42.44 1448 Q1A249.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11695 379 379 78% 2e-114 41.22 1432 P18802.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_89.6 NA 401671 374 374 78% 1e-112 41.04 1435 Q73368.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_MN NA 11696 370 370 75% 4e-111 40.98 1441 P05961.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 N_YBF30 NA 388818 369 369 75% 7e-111 40.50 1449 O91080.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 82834 369 369 75% 7e-111 41.35 1435 P0C6F2.1 Alignments: >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Integrase; Short=IN [Equine infectious anemia virus (CLONE 1369)] Sequence ID: P11204.1 Length: 1146 Range 1: 195 to 911 Score:489 bits(1260), Expect:5e-158, Method:Compositional matrix adjust., Identities:288/722(40%), Positives:434/722(60%), Gaps:17/722(2%) Query 4 KVKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKKSG 63 K++LKEG GP +PQWPLT+EKL+G EI+ +L+ EGK+ +A + N+PIF IKK+SG Sbjct 195 KIELKEGTMGPKIPQWPLTKEKLEGAKEIVQRLLSEGKISEASDNNPYNSPIFVIKKRSG 254 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR+L D RELNK + TE GLPHPGGL K KH+T+LDIGDAYFTIPL +R YT Sbjct 255 KWRLLQDLRELNKTVQVGTEISRGLPHPGGLIKCKHMTVLDIGDAYFTIPLDPEFRPYTA 314 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S N+ P KRY W LPQG+ LSP +YQ T+QEIL+ + +++PE+Q YMDD+++ Sbjct 315 FTIPSINHQEPDKRYVWNCLPQGFVLSPYIYQKTLQEILQPFRERYPEVQLYQYMDDLFV 374 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GS+ K+H+E++ +L + + GF P++K Q+ P WLG++L P+ WK QK L ++ Sbjct 375 GSNGSKKQHKELIIELRAILLEEGFETPDDKLQEVPPYSWLGYQLCPENWKVQKMQL-DM 433 Query 244 TKGTITLNKLQKLVGELVWRQS-IIGKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K TLN +QKL+G + W S + G ++ +I +G EL + E KE E Sbjct 434 VKNP-TLNDVQKLMGNITWMSSGVPGLTVKHIAATTKGCLELNQKVIWTEEAQKELEENN 492 Query 303 KKLEEMEG-NYYNKDKDVYGQLAWG-DKAIEYIVYQEKGKPLWVN--VVHNIKNLSIPQQ 358 +K++ +G YYN ++++ ++ + Y++ Q +G LW ++ K S + Sbjct 493 EKIKNAQGLQYYNPEEEMLCEVEITKNYEATYVIKQSQGI-LWAGKKIMKANKGWSTVKN 551 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRLELQLG-NITWMPKFWSCYR-GHTRWRK 416 ++ Q + E I R GK P +P +E E+Q G +W+P+ ++ H WR Sbjct 552 LMLLLQHVATESITRVGKCPTFKVPFTKEQVMWEMQKGWYYSWLPEIVYTHQVVHDDWRM 611 Query 417 RNIIEEVVEGPTYYTDGGKKNKVGSLGFIVSTGE-KFRKHEEGTNQQLELRAIEEALKQG 475 + ++EE G T YTDGGK+N G ++ S G K ++ T+Q E AI+ AL+ Sbjct 612 K-LVEEPTSGITIYTDGGKQNGEGIAAYVTSNGRTKQKRLGPVTHQVAERMAIQMALEDT 670 Query 476 -PQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQN- 533 + +N+VTDS Y ++ + E ++P I I +K+ + WVPGHKGI N Sbjct 671 RDKQVNIVTDSYYCWKNITEGLGLEGPQSPWWPIIQNI-REKEIVYFAWVPGHKGICGNQ 729 Query 534 --EEIDKYISEIFLAKEGEGILPKREEDAGYDLICPEEVTIEPGQVKCIPIELRLNLKKS 591 +E K EI LA +G I KR+EDAG+DL P ++ I K IP ++++ + + Sbjct 730 LADEAAKIKEEIMLAYQGTQIKEKRDEDAGFDLCVPYDIMIPVSDTKIIPTDVKIQVPPN 789 Query 592 QWAMIATKSSMAAKGVFTQGGIIDSGYQGQIQVIMYNSNKIAVVIPQGRKFAQLILMDKK 651 + + KSSMA +G+ GGIID GY G+IQVI N K + + +G+KFAQLI++ Sbjct 790 SFGWVTGKSSMAKQGLLINGGIIDEGYTGEIQVICTNIGKSNIKLIEGQKFAQLIILQHH 849 Query 652 HGKLEPWGESRKTERGEKGFGSTGMYWIENIPLAEEDHTKWHQDARSLHLEFEIPRTAAE 711 +PW E++ ++RG+KGFGSTG++W+ENI A+++H WH + L ++IP T A+ Sbjct 850 SNSRQPWDENKISQRGDKGFGSTGVFWVENIQEAQDEHENWHTSPKILARNYKIPLTVAK 909 Query 712 DI 713 I Sbjct 910 QI 911 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [SIVcpz EK505] Sequence ID: Q1A249.3 Length: 1448 Range 1: 606 to 1149 Score:380 bits(977), Expect:5e-115, Method:Compositional matrix adjust., Identities:233/549(42%), Positives:327/549(59%), Gaps:17/549(3%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLTEEK+K LTEI ++ +EGK+ + P NTPIF IKKK S Sbjct 606 VKLKPGMDGPRVKQWPLTEEKIKALTEICTEMEKEGKISRIGPENPYNTPIFAIKKKDST 665 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL+KKK VT+LD+GDAYF+ PL E +R+YT Sbjct 666 KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSCPLDENFRKYTA 725 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q TM +ILE + + +PE+ YMDD+Y+ Sbjct 726 FTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQSTMTKILEPFRKNNPELVIYQYMDDLYV 785 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLEI +HRE V+ L +++ +GFT P++K QK P W+G+ELHP W Q LPE Sbjct 786 GSDLEITQHREAVERLRSHLLTWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQTIQLPE- 844 Query 244 TKGTITLNKLQKLVGELVWRQSII-GKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K T T+N +Q+LVG+L W I G + + KL+ G + L + E R Sbjct 845 -KDTWTVNDIQQLVGKLNWASQIYPGIKVKQLCKLIRGAKALTEVVTLTREAELELAENR 903 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YYN DK++ ++ G Y +YQ+ K L +++ +Q Sbjct 904 EILKEPVHGAYYNPDKELIAEIQKQGQGQWTYQIYQDLHKNLKTGKYAKMRSTHTNDIRQ 963 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL-ELQLGNITWMP--KFWSCYRGHTRWR 415 + + QK+ E I+ GK P LP ++E W + TW+P +F + W Sbjct 964 LTEVVQKVALESIVIWGKTPKFRLPVQKEVWETWWTEYWQATWIPDWEFVNTPPLVKLWY 1023 Query 416 KRNIIEEVVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQLELRAIEEAL 472 + E + TYY DG ++ K+G GF+ G +K E TNQQ EL+A+ AL Sbjct 1024 QLE-TEPISGAETYYVDGAANRETKLGKAGFVTDRGRQKVTSISETTNQQAELQAVLMAL 1082 Query 473 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 532 + Q +N+VTDS+Y + D+ ++ + +I+E KK+RI + WVP HKGI Sbjct 1083 QDAGQEVNIVTDSQYVLGIIHSQPDKS--ESELVNQIIEELIKKERIYLSWVPAHKGIGG 1140 Query 533 NEEIDKYIS 541 NE+IDK +S Sbjct 1141 NEQIDKLVS 1149 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (NDK ISOLATE)] Sequence ID: P18802.3 Length: 1432 Range 1: 594 to 1159 Score:379 bits(973), Expect:2e-114, Method:Compositional matrix adjust., Identities:237/575(41%), Positives:343/575(59%), Gaps:26/575(4%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLTEEK+K LTEI ++ +EGK+ + P NTPIF IKKK S Sbjct 594 VKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISRIGPENPYNTPIFAIKKKDST 653 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL+KKK VT+LD+GDAYF++PL E +R+YT Sbjct 654 KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDEDFRKYTA 713 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q +M +ILE + +Q+PEI YMDD+Y+ Sbjct 714 FTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEIVIYQYMDDLYV 773 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLEI +HR +++L ++ ++GFT P++K QK P W+G+ELHP W Q LPE Sbjct 774 GSDLEIGQHRTKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPINLPE- 832 Query 244 TKGTITLNKLQKLVGELVWRQSI-IGKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K + T+N +QKLVG+L W I G + + KL+ G + L + E E R Sbjct 833 -KESWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVVPLTEEAELELAENR 891 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YY+ KD+ +L GD Y +YQE K L + +Q Sbjct 892 EILKEPVHGVYYDPSKDLIAELQKQGDGQWTYQIYQEPFKNLKTGKYARTRGAHTNDVKQ 951 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL-ELQLGNITWMP--KFWSCYRGHTRWR 415 + +A QK+ E I+ GK P LP ++E W ++ TW+P +F + W Sbjct 952 LTEAVQKIATESIVIWGKTPKFKLPIQKETWETWWIEYWQATWIPEWEFVNTPPLVKLWY 1011 Query 416 KRNIIEEVVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQLELRAIEEAL 472 + E ++ T+Y DG ++ K+G G++ G +K + TNQ+ EL+AI AL Sbjct 1012 QLE-KEPIIGAETFYVDGAANRETKLGKAGYVTDRGRQKVVPFTDTTNQKTELQAINLAL 1070 Query 473 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 532 + +N+VTDS+YA + D+ ++ + ++I+E KK+++ + WVP HKGI Sbjct 1071 QDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYLAWVPAHKGIGG 1128 Query 533 NEEIDKYISE-----IFLAKEGEGILPKREEDAGY 562 NE++DK +S+ +FL +GI +EE Y Sbjct 1129 NEQVDKLVSQGIRKVLFL----DGIDKAQEEHEKY 1159 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_89.6] Sequence ID: Q73368.3 Length: 1435 Range 1: 597 to 1162 Score:374 bits(960), Expect:1e-112, Method:Compositional matrix adjust., Identities:236/575(41%), Positives:337/575(58%), Gaps:26/575(4%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLTEEK+K L EI ++ +EGK+ K P NTP+F IKKK S Sbjct 597 VKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAIKKKDST 656 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL+KKK VT+LD+GDAYF++PL E +R+YT Sbjct 657 KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDEDFRKYTA 716 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q +M +ILE + +Q+P+I YMDD+Y+ Sbjct 717 FTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYV 776 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLEI +HR ++DL ++ ++GFT P++K QK P W+G+ELHP W Q LPE Sbjct 777 GSDLEIGQHRAKIEDLRQHLLKWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPIVLPE- 835 Query 244 TKGTITLNKLQKLVGELVWRQSI-IGKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K + T+N +QKLVG+L W I G + + KL+ G + L + E E R Sbjct 836 -KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVVPLTEEAELELAENR 894 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YY+ KD+ +L G Y +YQE K L ++ +Q Sbjct 895 EILKEPVHGVYYDPTKDLIAELQKQGQGQWTYQIYQEPYKNLKTGKYARMRGAHTNDVKQ 954 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL-ELQLGNITWMP--KFWSCYRGHTRWR 415 + +A QK+ E I+ GK P LP ++E W TW+P +F + W Sbjct 955 LTEAVQKIATESIVIWGKTPKFKLPIQKETWEAWWTDYWQATWIPEWEFVNTPPLVKLWY 1014 Query 416 KRNIIEEVVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQLELRAIEEAL 472 + E +V T+Y DG + K G G++ G +K + TNQ+ EL+AI AL Sbjct 1015 QLE-KEPIVGAETFYVDGAANRDTKSGKAGYVTDRGRQKVVSLADTTNQKTELQAIHLAL 1073 Query 473 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 532 + +N+VTDS+YA + D+ ++ + ++I+E KK+++ + WVP HKGI Sbjct 1074 QDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYLAWVPAHKGIGG 1131 Query 533 NEEIDKYISE-----IFLAKEGEGILPKREEDAGY 562 NE++DK +S +FL +GI +EE Y Sbjct 1132 NEQVDKLVSAGIRKVLFL----DGIDKAQEEHEKY 1162 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_MN] Sequence ID: P05961.3 Length: 1441 Range 1: 603 to 1146 Score:370 bits(949), Expect:4e-111, Method:Compositional matrix adjust., Identities:225/549(41%), Positives:330/549(60%), Gaps:17/549(3%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLTEEK+K L EI ++ +EGK+ K P NTP+F IKKK S Sbjct 603 VKLKPGMDGPKVKQWPLTEEKIKALIEICTEMEKEGKISKIGPENPYNTPVFAIKKKDST 662 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL+KKK VT+LD+GDAYF++PL + +R+YT Sbjct 663 KWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDFRKYTA 722 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q +M +ILE + +Q+P+I YMDD+Y+ Sbjct 723 FTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYV 782 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLEI +HR +++L ++ ++GFT P++K QK P W+G+ELHP W Q LPE Sbjct 783 GSDLEIGQHRAKIEELRRHLLRWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPIVLPE- 841 Query 244 TKGTITLNKLQKLVGELVWRQSI-IGKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K + T+N +QKLVG+L W I G + + KL+ G + L + E E R Sbjct 842 -KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVIPLTEEAELELAENR 900 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YY+ KD+ ++ G Y +YQE K L ++ +Q Sbjct 901 EILKEPVHGVYYDPSKDLIAEVQKQGQGQWTYQIYQEPFKNLKTGKYARMRGAHTNDVKQ 960 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL-ELQLGNITWMPKFWSCYRGHTRWRKR 417 + +A QK+ E I+ GK P LP ++E W + TW+P+ W + Sbjct 961 LTEAVQKIATESIVIWGKTPKFRLPIQKETWETWWTEYTXATWIPE-WEVVNTPPLVKLW 1019 Query 418 NIIEE--VVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQLELRAIEEAL 472 +E+ +V T+Y DG ++ K G G++ + G +K + TNQ+ EL+AI AL Sbjct 1020 YQLEKEPIVGAETFYVDGAANRETKKGKAGYVTNRGRQKVVSLTDTTNQKTELQAIHLAL 1079 Query 473 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 532 + +N+VTDS+YA + D+ ++ + ++I+E KK+++ + WVP HKGI Sbjct 1080 QDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYLAWVPAHKGIGG 1137 Query 533 NEEIDKYIS 541 NE++DK +S Sbjct 1138 NEQVDKLVS 1146 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 N_YBF30] Sequence ID: O91080.3 Length: 1449 Range 1: 607 to 1150 Score:369 bits(948), Expect:7e-111, Method:Compositional matrix adjust., Identities:226/558(41%), Positives:329/558(58%), Gaps:35/558(6%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLT EK++ L EI ++ +EGK+ + P NTPIF IKKK S Sbjct 607 VKLKPGMDGPKVKQWPLTTEKIEALREICTEMEKEGKISRIGPENPYNTPIFAIKKKDST 666 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF+ PL + +R+YT Sbjct 667 KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKQKKSVTVLDVGDAYFSCPLDKDFRKYTA 726 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q TM +ILE + ++HPEI YMDD+Y+ Sbjct 727 FTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSTMTKILEPFREKHPEIIIYQYMDDLYV 786 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLE+ +HRE V+DL +++ ++GFT P++K QK P W+G+ELHP W Q LPE Sbjct 787 GSDLELAQHREAVEDLRDHLLKWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPIKLPE- 845 Query 244 TKGTITLNKLQKLVGELVWRQSII-GKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K T+N +QKLVG+L W I G + + KL+ G + L E E R Sbjct 846 -KDVWTVNDIQKLVGKLNWASQIYPGIRVKQLCKLIRGTKALTEVVNFTEEAELELAENR 904 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YY+ K++ ++ G Y +YQE K L +++ +Q Sbjct 905 EILKEPLHGVYYDPGKELVAEIQKQGQGQWTYQIYQELHKNLKTGKYAKMRSAHTNDIKQ 964 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL------------ELQLGNITWMPKFWS 406 +++ +K+ E I+ GK P LP ++E W E + N + K W Sbjct 965 LVEVVRKVATESIVIWGKTPKFRLPVQKEVWEAWWTDHWQATWIPEWEFVNTPPLVKLW- 1023 Query 407 CYRGHTRWRKRNIIEEVVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQL 463 Y+ T E + T+Y DG ++ K+G GF+ G +K + TNQ+ Sbjct 1024 -YQLET--------EPISGAETFYVDGAANRETKLGKAGFVTDRGRQKVVSIADTTNQKA 1074 Query 464 ELRAIEEALKQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHW 523 EL+AI AL++ + +N+VTDS+YA + D+ ++ + ++I+E KK+R+ + W Sbjct 1075 ELQAILMALQESGRDVNIVTDSQYAMGIIHSQPDKS--ESELVSQIIEELIKKERVYLSW 1132 Query 524 VPGHKGIPQNEEIDKYIS 541 VP HKGI NE++DK +S Sbjct 1133 VPAHKGIGGNEQVDKLVS 1150 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 lw12.3 isolate] Sequence ID: P0C6F2.1 Length: 1435 Range 1: 597 to 1140 Score:369 bits(947), Expect:7e-111, Method:Compositional matrix adjust., Identities:227/549(41%), Positives:329/549(59%), Gaps:17/549(3%) Query 5 VKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCNTPIFCIKKK-SG 63 VKLK G GP V QWPLTEEK+K L EI ++ +EGK+ K P NTP+F IKKK S Sbjct 597 VKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNTPVFAIKKKDST 656 Query 64 KWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTIPLYEPYREYTC 123 KWR L+DFRELNK+T+D E QLG+PHP GL+KKK VT+LD+GDAYF++PL E +R+YT Sbjct 657 KWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDEDFRKYTA 716 Query 124 FTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEIQFGIYMDDIYI 183 FT+ S NN P RY + VLPQGWK SP+++Q +M +ILE + +Q+P+I YMDD+Y+ Sbjct 717 FTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDIVIYQYMDDLYV 776 Query 184 GSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQTWKFQKHTLPEL 243 GSDLEI +HR +++L ++ ++G T P++K QK P W+G+ELHP W Q LPE Sbjct 777 GSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDKWTVQPIVLPE- 835 Query 244 TKGTITLNKLQKLVGELVWRQSII-GKSIPNILKLMEGDRELQSERKIEEVHVKEWEACR 302 K + T+N +QKLVG+L W I G + + KL+ G + L + E E R Sbjct 836 -KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLTEEAELELAENR 894 Query 303 KKLEE-MEGNYYNKDKDVYGQL-AWGDKAIEYIVYQEKGKPLWVNVVHNIKNLSIP--QQ 358 + L+E + G YY+ KD+ ++ G Y +YQE K L ++ +Q Sbjct 895 EILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYARMRGTHTNDVKQ 954 Query 359 VIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRL-ELQLGNITWMP--KFWSCYRGHTRWR 415 + +A QK+T E I+ GK P LP ++E W + TW+P +F + W Sbjct 955 LTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEFVNTPPLVKLWY 1014 Query 416 KRNIIEEVVEGPTYYTDGG--KKNKVGSLGFIVSTG-EKFRKHEEGTNQQLELRAIEEAL 472 + E +V T+Y DG ++ K+G G++ + G +K TNQ+ EL+AI AL Sbjct 1015 QLE-KEPIVGAETFYVDGAANRETKLGKAGYVTNKGRQKVVPLTNTTNQKTELQAIYLAL 1073 Query 473 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 532 + +N+VTDS+YA + D+ ++ + +I+E KK+++ + WVP HKGI Sbjct 1074 QDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKEKVYLAWVPAHKGIGG 1131 Query 533 NEEIDKYIS 541 NE++DK +S Sbjct 1132 NEQVDKLVS 1140