RID: 6E1UZA2J013 Job Title:virus Program: BLASTP Query: unnamed protein product ID: lcl|Query_33410(amino acid) Length: 571 Database: swissprot Non-redundant UniProtKB/SwissProt sequences Sequences producing significant alignments: Scientific Common Max Total Query E Per. Acc. Description Name Name Taxid Score Score cover Value Ident Len Accession RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Visna lentiv... NA 11742 1164 1164 100% 0.0 100.00 1506 P03370.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Visna lentiv... NA 11744 1164 1164 100% 0.0 99.82 1506 P23427.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Visna/maedi ... NA 36374 1162 1162 100% 0.0 99.65 1506 P35956.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Visna lentiv... NA 11743 1151 1151 100% 0.0 98.77 1506 P23426.2 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Ovine lentiv... NA 11664 1056 1056 100% 0.0 90.02 1086 P16901.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Caprine arth... NA 11661 995 995 100% 0.0 82.84 1109 P33459.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Feline immun... NA 31676 408 408 99% 4e-129 44.03 1124 P31822.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Feline immun... NA 11674 403 403 98% 2e-127 44.56 1124 P16088.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Feline immun... NA 11675 401 401 98% 1e-126 43.94 1124 P19028.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Equine infec... NA 31675 357 357 99% 5e-110 39.69 1146 P32542.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Equine infec... NA 11670 357 357 99% 6e-110 39.69 1146 P11204.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Proteas... Equine infec... NA 11672 357 357 99% 8e-110 40.14 1145 P03371.1 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:F2_M... NA 388823 357 357 95% 3e-108 39.29 1434 Q9QBZ1.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11732 357 357 95% 4e-108 39.75 1441 P22382.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11695 356 356 96% 7e-108 39.75 1432 P18802.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11699 355 355 95% 8e-108 39.37 1434 P20892.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... SIVcpz MB66 NA 388911 355 355 95% 1e-107 40.25 1438 Q1A267.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11697 355 355 95% 2e-107 39.64 1440 P04588.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:F2_M... NA 388815 354 354 95% 2e-107 39.36 1430 Q9QBZ5.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11683 353 353 96% 5e-107 38.77 1436 P12499.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11718 353 353 96% 6e-107 41.59 1462 P12451.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_MN NA 11696 353 353 95% 8e-107 39.29 1441 P05961.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:G_SE... NA 388824 353 353 95% 8e-107 39.18 1433 O89940.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... SIVcpz TAN1 NA 388910 353 353 96% 1e-106 39.68 1462 Q8AII1.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_89.6 NA 401671 352 352 95% 1e-106 38.84 1435 Q73368.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11688 352 352 95% 1e-106 39.64 1439 P20875.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 O_MVP5180 NA 388816 352 352 96% 3e-106 39.57 1446 Q79666.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11701 351 351 95% 4e-106 39.02 1436 P05959.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 N_YBF30 NA 388818 351 351 95% 4e-106 38.95 1449 O91080.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:H_90... NA 388826 351 351 95% 5e-106 39.29 1435 O93215.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:J_SE... NA 388905 351 351 95% 5e-106 38.66 1432 Q9WC54.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11703 350 350 95% 5e-106 39.50 1428 P24740.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11686 350 350 95% 1e-105 38.66 1447 P03367.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_HXB2R NA 11706 349 349 95% 2e-105 39.29 1435 P04585.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:F1_9... NA 388814 349 349 94% 2e-105 38.99 1430 O89290.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:J_SE... NA 388904 349 349 95% 2e-105 38.78 1432 Q9WC63.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:B_AR... NA 11685 349 349 95% 2e-105 39.29 1437 P03369.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11698 348 348 95% 3e-105 39.02 1435 P12497.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 362651 348 348 95% 4e-105 38.66 1435 P35963.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:K_96... NA 388906 348 348 95% 6e-105 39.11 1430 Q9QBY3.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11678 347 347 95% 8e-105 39.29 1447 P03366.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11714 348 348 97% 8e-105 40.07 1550 P18096.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... SIVcpz EK505 NA 388912 347 347 95% 8e-105 39.00 1448 Q1A249.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 82834 347 347 95% 9e-105 39.29 1435 P0C6F2.1 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11715 347 347 97% 1e-104 40.49 1462 P24107.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr170Gag-Pol;... Jembrana dis... NA 36370 347 347 93% 1e-104 40.04 1432 Q82851.1 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:C_ET... NA 388796 347 347 95% 1e-104 38.64 1439 Q75002.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:H_VI991 NA 388888 347 347 95% 1e-104 38.82 1436 Q9Q720.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11682 347 347 95% 2e-104 39.11 1447 P04587.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:K_97... NA 388907 346 346 95% 2e-104 38.39 1429 Q9QBZ9.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 N_YBF106 NA 388819 345 345 95% 4e-104 39.18 1449 Q9IDV9.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11717 346 346 96% 4e-104 39.89 1464 P18042.4 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11721 345 345 97% 4e-104 40.17 1463 P20876.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:G_92... NA 388825 345 345 95% 5e-104 38.64 1435 O41798.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11713 345 345 97% 6e-104 38.74 1462 P17757.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11689 343 343 96% 2e-103 38.07 1435 P04589.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 73484 343 343 97% 3e-103 40.17 1463 Q74120.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 O_ANT70 NA 327105 342 342 96% 5e-103 38.93 1435 Q77373.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11719 342 342 97% 7e-103 40.00 1461 P05962.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11720 341 341 97% 2e-102 40.14 1464 P04584.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:C_92... NA 388812 340 340 95% 3e-102 38.64 1431 O12158.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr170Gag-Pol;... Bovine immun... NA 417296 338 338 91% 2e-101 40.52 1475 P19560.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... SIVcpz GAB1 NA 402771 337 337 94% 2e-101 39.42 1384 P17283.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-2 B_UC1 NA 388822 337 337 97% 5e-101 38.82 1471 Q76634.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-1 M:F1_V... NA 388813 337 337 95% 6e-101 37.88 1430 Q9QSR3.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... HIV-2 B_EHO NA 388821 335 335 97% 2e-100 38.95 1464 Q89928.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11737 334 334 97% 4e-100 38.60 1449 P12502.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11733 333 333 97% 8e-100 38.95 1448 P05896.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11738 333 333 97% 9e-100 38.77 1449 P19505.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11727 325 325 95% 9e-97 38.24 1470 P27973.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 31684 315 315 95% 3e-93 36.04 1472 Q02836.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Human immuno... NA 11716 313 313 97% 2e-92 37.59 1465 P15833.3 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11731 310 310 96% 2e-91 36.57 1467 P05895.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11730 306 306 96% 4e-90 37.04 1465 P27980.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol;... Simian immun... NA 11735 300 300 96% 4e-88 37.46 1446 P05897.2 RecName: Full=Intracisternal A-particle Pol-related polyprotei... Mouse intrac... NA 11753 160 160 90% 2e-40 27.24 867 P11368.1 RecName: Full=Endogenous retrovirus group K member 18 Pol... Homo sapiens human 9606 159 159 93% 3e-40 25.04 812 Q9QC07.2 RecName: Full=Endogenous retrovirus group K member 11 Pol... Homo sapiens human 9606 159 159 93% 4e-40 25.61 969 Q9UQG0.2 RecName: Full=Endogenous retrovirus group K member 8 Pol... Homo sapiens human 9606 158 158 83% 1e-39 26.98 956 P63133.1 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Jaagsiekte s... NA 11746 158 158 82% 1e-39 26.63 1726 P31623.2 RecName: Full=Endogenous retrovirus group K member 7 Pol... Homo sapiens human 9606 158 158 87% 1e-39 26.58 1459 P63135.1 RecName: Full=Intracisternal A-particle Pol-related polyprotei... Golden hamst... NA 11752 156 156 90% 3e-39 27.26 863 P04026.1 RecName: Full=Endogenous retrovirus group K member 113 Pol... Homo sapiens human 9606 155 155 87% 1e-38 26.42 956 P63132.1 RecName: Full=Endogenous retrovirus group K member 6 Pol... Homo sapiens human 9606 154 154 87% 3e-38 25.95 956 Q9BXR3.2 RecName: Full=Endogenous retrovirus group K member 10 Pol... Homo sapiens human 9606 153 153 93% 5e-38 24.91 1014 P10266.2 RecName: Full=Endogenous retrovirus group K member 25 Pol... Homo sapiens human 9606 153 153 52% 6e-38 30.94 954 P63136.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Reverse... Avian leukos... NA 11864 152 152 90% 8e-38 28.49 895 Q7SQ98.1 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Mouse mammar... NA 11758 152 152 94% 1e-37 25.51 1755 P03365.3 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Rous sarcoma... NA 269446 151 151 90% 2e-37 28.49 1603 Q04095.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Rous sarcoma... NA 11888 151 151 90% 3e-37 28.34 1603 P03354.2 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Mouse mammar... NA 11759 151 151 94% 4e-37 24.91 1755 P11283.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Rous sarcoma... NA 269447 150 150 90% 4e-37 28.32 1603 O92956.2 RecName: Full=Endogenous retrovirus group K member 19 Pol... Homo sapiens human 9606 147 147 87% 5e-36 25.61 959 Q9WJR5.2 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Squirrel mon... NA 11856 144 144 43% 9e-35 36.44 1880 P03364.3 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Simian retro... NA 39068 142 142 57% 4e-34 30.63 1768 P51517.2 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... Human T-cell... NA 406769 140 140 59% 9e-34 29.51 1440 Q4U0X6.4 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... HTLV-3 strai... NA 402036 138 138 59% 5e-33 29.31 1440 Q0R5R2.3 RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr180;... Mason-Pfizer... NA 11855 136 136 51% 3e-32 31.44 1771 P07572.2 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Simian retro... NA 11942 132 132 48% 8e-31 32.51 1772 P04025.2 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... HTLV-1 isola... NA 402046 126 126 57% 4e-29 27.65 1462 P0C211.2 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Bovine leuke... NA 11903 126 126 56% 4e-29 27.49 1416 P25059.2 RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName:... Bovine leuke... NA 11907 125 125 56% 7e-29 27.79 1416 P03361.2 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... Human T-cell... NA 11927 124 124 57% 2e-28 28.24 1462 P14078.3 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... Human T-lymp... NA 11909 124 124 59% 2e-28 27.20 1461 P03363.4 RecName: Full=Gag-Pro-Pol polyprotein; AltName:... Human T-cell... NA 11926 124 124 57% 3e-28 28.95 1462 P03362.3 RecName: Full=Endogenous retrovirus group K member 9 Pol... Homo sapiens human 9606 109 109 25% 1e-23 38.51 1117 P63128.3 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Walleye derm... NA 39720 95.1 95.1 35% 5e-19 30.62 1752 O92815.2 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;... Human spumar... NA 11963 89.7 89.7 45% 2e-17 29.26 1143 P14350.2 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;... Pan troglody... NA 298339 89.0 89.0 45% 4e-17 28.89 1146 Q87040.1 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Reticuloendo... NA 11636 84.3 84.3 41% 1e-15 26.42 1152 P03360.2 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;... Macaque simi... NA 338478 82.4 82.4 43% 5e-15 29.46 1149 P23074.3 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;... Simian foamy... NA 11644 82.0 82.0 45% 5e-15 28.52 1143 P27401.2 RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol;... Feline foamy... NA 53182 80.5 80.5 38% 2e-14 27.80 1156 O93209.1 RecName: Full=Intracisternal A-particle Pol-related polyprotei... Mouse intrac... NA 11754 78.2 78.2 69% 8e-14 23.95 814 P12894.1 RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contain... Cas-Br-E mur... NA 11792 76.6 76.6 38% 3e-13 27.88 1733 P08361.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Friend murin... NA 11796 75.1 75.1 40% 9e-13 27.00 1739 P26810.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Friend murin... NA 11797 74.7 74.7 38% 1e-12 27.43 1738 P26809.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... AKR (endogen... NA 11791 74.7 74.7 31% 1e-12 29.19 1734 P03356.3 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Woolly monke... NA 11970 74.3 74.3 38% 2e-12 24.55 1687 P03359.2 RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contain... Moloney muri... NA 928306 74.3 74.3 38% 2e-12 27.43 1738 P03355.5 RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contain... Xenotropic M... NA 356663 73.6 73.6 38% 3e-12 27.43 1733 Q2F7J3.1 RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contain... Xenotropic M... NA 373193 73.6 73.6 38% 3e-12 27.43 1733 A1Z651.1 RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contain... Xenotropic M... NA 356664 73.6 73.6 38% 3e-12 27.43 1733 Q2F7J0.1 RecName: Full=Pol polyprotein; Contains: RecName: Full=Reverse... Feline endog... NA 11766 73.2 73.2 31% 3e-12 27.27 1046 P31792.1 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Friend murin... NA 11798 73.2 73.2 38% 3e-12 26.99 1738 P26808.2 RecName: Full=Gag-pol polyprotein; Contains: RecName:... Murine leuke... NA 31687 73.2 73.2 38% 4e-12 27.43 1734 Q7SVK7.2 RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr125Pol;... Koala retrov... NA 394239 72.4 72.4 38% 6e-12 24.55 1687 Q9TTC1.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Gibbon ape l... NA 11840 71.6 71.6 38% 1e-11 23.66 1686 P21414.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Feline leuke... NA 11768 70.1 70.1 31% 3e-11 27.27 1712 P10273.2 RecName: Full=Transposon Ty3-G Gag-Pol polyprotein; AltName:... Saccharomyce... NA 559292 69.7 69.7 48% 5e-11 22.61 1547 Q99315.3 RecName: Full=Putative enzymatic polyprotein; Includes: RecNam... Cassava vein... NA 38062 68.9 68.9 36% 5e-11 27.80 652 Q89703.1 RecName: Full=Transposon Ty3-I Gag-Pol polyprotein; AltName:... Saccharomyce... NA 559292 69.3 69.3 48% 6e-11 22.61 1498 Q7LHG5.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Radiation mu... NA 11787 69.3 69.3 38% 6e-11 26.99 1734 P11227.2 RecName: Full=Gag-Pol polyprotein; Contains: RecName:... Baboon endog... NA 11764 68.6 68.6 29% 1e-10 27.27 1727 P10272.2 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cauliflower ... NA 10648 63.9 63.9 43% 2e-09 26.92 679 P03554.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cauliflower ... NA 10644 63.9 63.9 43% 2e-09 26.92 679 P03555.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cauliflower ... NA 31556 63.9 63.9 43% 2e-09 26.92 679 Q02964.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Figwort mosa... NA 10650 62.8 62.8 38% 5e-09 25.99 666 P09523.1 RecName: Full=Polyprotein P3; Includes: RecName: Full=Putative... Commelina ye... NA 10653 63.2 63.2 32% 5e-09 28.12 1886 P19199.2 RecName: Full=Genome polyprotein; Includes: RecName:... Petunia vein... NA 492095 62.4 62.4 44% 1e-08 22.73 2180 Q6XKE6.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cauliflower ... NA 10645 61.6 61.6 43% 1e-08 26.85 674 P03556.1 RecName: Full=Genome polyprotein; Includes: RecName:... Petunia vein... NA 492094 60.8 60.8 45% 2e-08 21.77 2179 Q91DM0.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cauliflower ... NA 31557 58.9 58.9 45% 8e-08 25.75 680 Q00962.1 RecName: Full=Retrovirus-related Pol polyprotein from transpos... Drosophila m... fruit fly 7227 58.9 58.9 40% 9e-08 25.00 1059 P20825.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Soybean chlo... NA 10651 58.5 58.5 37% 1e-07 26.55 692 P15629.2 RecName: Full=Retrovirus-related Pol polyprotein from transpos... Drosophila m... fruit fly 7227 57.8 57.8 34% 2e-07 28.04 1058 P04323.1 RecName: Full=Retrovirus-related Pol polyprotein from transpos... Drosophila m... fruit fly 7227 55.1 55.1 37% 1e-06 25.66 1237 P10394.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Carnation et... NA 10640 54.3 54.3 39% 2e-06 24.58 659 P05400.1 RecName: Full=Pol polyprotein [Simian immunodeficiency virus... Simian immun... NA 11728 47.4 47.4 11% 1e-05 45.07 100 P12500.1 RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus necat... Cupriavidus ... NA 381666 48.1 48.1 19% 2e-05 30.23 145 Q0K8W6.1 RecName: Full=Transposon Tf2-1 polyprotein; AltName:... Schizosaccha... NA 284812 51.2 51.2 44% 2e-05 21.46 1333 P0CT34.1 RecName: Full=Uncharacterized protein K02A2.6 [Caenorhabditis... Caenorhabdit... NA 6239 51.2 51.2 42% 2e-05 25.79 1268 Q09575.1 RecName: Full=Transposon Tf2-3 polyprotein; AltName:... Schizosaccha... NA 284812 51.2 51.2 44% 2e-05 21.46 1333 P0CT36.1 RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus... Cupriavidus ... NA 264198 47.8 47.8 18% 3e-05 31.50 145 Q46Z81.1 RecName: Full=Retrovirus-related Pol polyprotein from transpos... Drosophila m... fruit fly 7227 50.8 50.8 54% 3e-05 24.70 1035 P10401.1 RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus... Cupriavidus ... NA 266264 47.4 47.4 19% 3e-05 31.01 145 Q1LL89.1 RecName: Full=Enzymatic polyprotein; Includes: RecName:... Cestrum yell... NA 175814 50.4 50.4 29% 4e-05 27.84 643 Q7TD08.1 RecName: Full=Transposon Tf2-7 polyprotein; AltName:... Schizosaccha... NA 284812 50.4 50.4 44% 4e-05 21.07 1333 P0CT42.1 RecName: Full=Transposon Tf2-11 polyprotein; AltName:... Schizosaccha... NA 284812 50.1 50.1 44% 5e-05 21.07 1333 Q9UR07.1 RecName: Full=Retrovirus-related Pol polyprotein from transpos... Drosophila m... fruit fly 7227 49.7 49.7 26% 6e-05 27.61 1003 Q8I7P9.1 RecName: Full=Polyprotein P3; AltName: Full=P194 protein;... Rice tungro ... NA 10655 48.5 48.5 42% 1e-04 22.44 1675 P27502.1 RecName: Full=Ribonuclease H; Short=RNase H [Rhizobium etli CF... Rhizobium et... NA 347834 45.1 45.1 18% 3e-04 28.35 151 Q2KBL2.1 RecName: Full=Ribonuclease H; Short=RNase H [Methylococcus... Methylococcu... NA 243233 45.1 45.1 20% 3e-04 28.87 155 Q60AW8.1 RecName: Full=Ribonuclease H; Short=RNase H [Ralstonia... Ralstonia so... NA 267608 44.3 44.3 19% 4e-04 29.46 151 Q8XZ91.1 RecName: Full=Ribonuclease H; Short=RNase H [Bradyrhizobium... Bradyrhizobi... NA 224911 44.3 44.3 23% 4e-04 28.03 154 Q89UU3.1 RecName: Full=Ribonuclease H; Short=RNase H [Bordetella... Bordetella p... NA 257313 43.9 43.9 21% 6e-04 27.86 155 Q7VRX8.1 RecName: Full=Ribonuclease H; Short=RNase H [Bordetella... Bordetella b... NA 257310 43.5 43.5 21% 7e-04 27.86 155 Q7WCJ8.1 RecName: Full=Ribonuclease H; Short=RNase H [Novosphingobium... Novosphingob... NA 279238 42.7 42.7 20% 0.001 29.93 143 Q2G9E3.1 RecName: Full=Ribonuclease H; Short=RNase H [Agrobacterium... Agrobacteriu... NA 176299 42.7 42.7 22% 0.001 26.71 146 Q8UHA7.1 RecName: Full=Ribonuclease H; Short=RNase H [Psychromonas... Psychromonas... NA 357804 42.7 42.7 18% 0.001 26.77 153 A1SS86.2 RecName: Full=Ribonuclease H; Short=RNase H [Oleidesulfovibrio... Oleidesulfov... NA 207559 42.7 42.7 13% 0.002 30.11 154 Q30X61.1 RecName: Full=Ribonuclease H; Short=RNase H [Campylobacter... Campylobacte... NA 360104 42.4 42.4 19% 0.002 29.23 144 A8Z6F7.1 RecName: Full=Ribonuclease H; Short=RNase H [Syntrophobacter... Syntrophobac... NA 335543 42.4 42.4 18% 0.002 29.55 164 A0LGJ7.1 RecName: Full=Ribonuclease H; Short=RNase H [Trichodesmium... Trichodesmiu... NA 203124 42.0 42.0 19% 0.003 27.27 157 Q115G0.1 RecName: Full=Ribonuclease H; Short=RNase H [Caulobacter... Caulobacter ... NA 565050 41.6 41.6 21% 0.003 28.06 149 B8H4W7.1 RecName: Full=Ribonuclease H; Short=RNase H [Zymomonas mobilis... Zymomonas mo... NA 264203 41.6 41.6 20% 0.004 27.01 156 O69014.1 RecName: Full=Ribonuclease H; Short=RNase H [Lachnoclostridium... Lachnoclostr... NA 357809 41.6 41.6 19% 0.004 30.43 158 A9KLJ9.1 RecName: Full=Ribonuclease HI; Short=RNase HI [Synechocystis s... Synechocysti... NA 1111708 41.6 41.6 19% 0.004 27.27 160 Q55801.1 RecName: Full=Ribonuclease H; Short=RNase H [Yersinia... Yersinia ent... NA 393305 41.2 41.2 23% 0.005 28.39 154 A1JKB1.1 RecName: Full=Ribonuclease HI; Short=RNase HI [Helicobacter... Helicobacter... NA 85962 40.8 40.8 18% 0.005 29.23 143 P56120.1 RecName: Full=Ribonuclease H; Short=RNase H [Campylobacter... Campylobacte... NA 360105 40.8 40.8 20% 0.006 29.10 149 A7H185.1 RecName: Full=Ribonuclease H; Short=RNase H [Rhizobium... Rhizobium le... NA 216596 40.8 40.8 13% 0.006 28.26 151 Q1MKH6.1 RecName: Full=Ribonuclease H; Short=RNase H [Helicobacter... Helicobacter... NA 382638 40.4 40.4 18% 0.007 30.00 143 Q17XJ7.1 RecName: Full=Ribonuclease H; Short=RNase H [Sulfurovum sp.... Sulfurovum s... NA 387093 40.4 40.4 16% 0.008 32.73 147 A6QCI9.1 RecName: Full=Ribonuclease H; Short=RNase H [Erythrobacter... Erythrobacte... NA 314225 40.4 40.4 19% 0.008 28.03 144 Q2ND39.1 RecName: Full=Ribonuclease H; Short=RNase H [Magnetospirillum... Magnetospiri... NA 342108 40.4 40.4 23% 0.008 27.33 154 Q2W9A9.1 RecName: Full=Ribonuclease H; Short=RNase H [Thiobacillus... Thiobacillus... NA 292415 40.4 40.4 22% 0.009 26.71 148 Q3SIB2.1 RecName: Full=Ribonuclease H; Short=RNase H [Anaeromyxobacter... Anaeromyxoba... NA 404589 40.8 40.8 13% 0.009 31.52 175 A7HB50.1 RecName: Full=Ribonuclease H; Short=RNase H [Pseudomonas... Pseudomonas ... NA 379731 40.0 40.0 22% 0.012 27.03 151 A4VLR0.1 RecName: Full=Ribonuclease H; Short=RNase H [Helicobacter pylo... Helicobacter... NA 357544 39.7 39.7 18% 0.013 29.23 143 Q1CTK9.1 RecName: Full=Ribonuclease H; Short=RNase H [Desulfovibrio... Desulfovibri... NA 391774 40.0 40.0 13% 0.013 28.72 156 A1VFS4.1 RecName: Full=Ribonuclease H; Short=RNase H [Thermus... Thermus ther... NA 300852 40.0 40.0 19% 0.013 26.72 166 P29253.2 RecName: Full=Ribonuclease H; Short=RNase H [Pseudoalteromonas... Pseudoaltero... NA 342610 40.0 40.0 21% 0.014 29.17 153 Q15TA7.1 RecName: Full=Ribonuclease H; Short=RNase H [Thermus... Thermus ther... NA 262724 40.0 40.0 19% 0.014 26.72 166 Q72IE1.1 RecName: Full=Ribonuclease H; Short=RNase H [Hahella chejuensi... Hahella chej... NA 349521 39.7 39.7 18% 0.016 34.38 148 Q2SJ45.1 RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pestis... Yersinia pes... NA 386656 39.7 39.7 23% 0.016 27.10 154 A4TL54.1 RecName: Full=Ribonuclease H1; Short=RNase H1 [Rattus norvegicus] Rattus norve... Norway rat 10116 40.8 40.8 21% 0.020 28.97 285 Q5BK46.1 RecName: Full=Ribonuclease H; Short=RNase H [Legionella... Legionella p... NA 297245 39.3 39.3 18% 0.020 28.12 143 Q5WWW5.1 RecName: Full=Ribonuclease H; Short=RNase H [Syntrophotalea... Syntrophotal... NA 338963 39.3 39.3 17% 0.021 30.25 152 Q3A827.1 RecName: Full=Ribonuclease H1; Short=RNase H1; AltName:... Homo sapiens human 9606 40.4 40.4 22% 0.024 30.32 286 O60930.2 RecName: Full=Ribonuclease H; Short=RNase H [Chelativorans sp.... Chelativoran... NA 266779 39.3 39.3 22% 0.026 25.34 162 Q11KC5.1 RecName: Full=Ribonuclease H; Short=RNase H [Legionella... Legionella p... NA 400673 38.9 38.9 18% 0.026 28.12 143 A5IBM5.1 RecName: Full=Ribonuclease H; Short=RNase H [Candidatus Ruthia... Candidatus R... NA 413404 38.9 38.9 18% 0.027 25.78 146 A1AW38.1 RecName: Full=Ribonuclease H; Short=RNase H [Neisseria... Neisseria go... NA 242231 38.9 38.9 19% 0.028 30.30 145 Q5F7K9.1 RecName: Full=Ribonuclease H; Short=RNase H [Neisseria... Neisseria me... NA 272831 38.9 38.9 19% 0.029 30.30 145 A1KV38.1 RecName: Full=Ribonuclease H1; Short=RNase H1 [Mus musculus] Mus musculus house mouse 10090 40.0 40.0 22% 0.031 29.68 285 O70338.1 RecName: Full=Ribonuclease H; Short=RNase H [Bradyrhizobium sp... Bradyrhizobi... NA 114615 38.5 38.5 22% 0.039 26.90 154 A4Z216.1 RecName: Full=Ribonuclease H; Short=RNase H [Brucella anthropi... Brucella ant... NA 439375 38.5 38.5 21% 0.040 29.29 154 A6WWG8.1 RecName: Full=Ribonuclease H; Short=RNase H [Acidovorax sp. JS42] Acidovorax s... NA 232721 38.5 38.5 18% 0.041 30.23 148 A1W6Q8.1 Alignments: >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p16; Contains: RecName: Full=Capsid protein p25; Contains: RecName: Full=Nucleocapsid protein p14; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Visna lentivirus (strain 1514)] Sequence ID: P03370.2 Length: 1506 Range 1: 541 to 1111 Score:1164 bits(3012), Expect:0.0, Method:Compositional matrix adjust., Identities:571/571(100%), Positives:571/571(100%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN Sbjct 541 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 600 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI Sbjct 601 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 660 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI Sbjct 661 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 720 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK Sbjct 721 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 780 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES Sbjct 781 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 840 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN Sbjct 841 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 900 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV Sbjct 901 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 960 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC Sbjct 961 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 1020 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ Sbjct 1021 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 1080 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD Sbjct 1081 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 1111 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p16; Contains: RecName: Full=Capsid protein p25; Contains: RecName: Full=Nucleocapsid protein p14; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Visna lentivirus (strain 1514 / clone LV1-1KS2)] Sequence ID: P23427.2 Length: 1506 Range 1: 541 to 1111 Score:1164 bits(3010), Expect:0.0, Method:Compositional matrix adjust., Identities:570/571(99%), Positives:571/571(100%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN Sbjct 541 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 600 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI Sbjct 601 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 660 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI Sbjct 661 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 720 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK Sbjct 721 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 780 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES Sbjct 781 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 840 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN Sbjct 841 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 900 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV Sbjct 901 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 960 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RWKKRNVIAE+VPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC Sbjct 961 RWKKRNVIAEVVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 1020 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ Sbjct 1021 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 1080 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD Sbjct 1081 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 1111 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p16; Contains: RecName: Full=Capsid protein p25; Contains: RecName: Full=Nucleocapsid protein p14; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Visna/maedi virus EV1 KV1772] Sequence ID: P35956.2 Length: 1506 Range 1: 541 to 1111 Score:1162 bits(3005), Expect:0.0, Method:Compositional matrix adjust., Identities:569/571(99%), Positives:570/571(99%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN Sbjct 541 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 600 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI Sbjct 601 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 660 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI Sbjct 661 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 720 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK Sbjct 721 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 780 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES Sbjct 781 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 840 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN Sbjct 841 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 900 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV Sbjct 901 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 960 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RWKKRNVIAE+VPGPTYYTDGGKKNGRGSLGYI STGEKFRIHEEGTNQQLELRAIEEAC Sbjct 961 RWKKRNVIAEVVPGPTYYTDGGKKNGRGSLGYITSTGEKFRIHEEGTNQQLELRAIEEAC 1020 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ Sbjct 1021 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 1080 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD Sbjct 1081 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 1111 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p16; Contains: RecName: Full=Capsid protein p25; Contains: RecName: Full=Nucleocapsid protein p14; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Visna lentivirus (strain 1514 / clone LV1-1KS1)] Sequence ID: P23426.2 Length: 1506 Range 1: 541 to 1111 Score:1151 bits(2978), Expect:0.0, Method:Compositional matrix adjust., Identities:564/571(99%), Positives:568/571(99%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEKKIP T VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN Sbjct 541 ANLEEKKIPITEVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 600 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI Sbjct 601 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 660 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSP+VYQFTMQKILRGWIEEHPMI Sbjct 661 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPSVYQFTMQKILRGWIEEHPMI 720 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK Sbjct 721 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 780 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES Sbjct 781 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 840 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN Sbjct 841 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 900 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV Sbjct 901 LSQAQQIIKAAQKLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 960 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RWKKRNVIAE+V GPTYYTDGGKKNGRGSLGYIASTGEKFRI+EEGTNQQLELRAIEEAC Sbjct 961 RWKKRNVIAEVVSGPTYYTDGGKKNGRGSLGYIASTGEKFRIYEEGTNQQLELRAIEEAC 1020 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL+HNKEKIGVHWVPGHKGIPQ Sbjct 1021 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELMHNKEKIGVHWVPGHKGIPQ 1080 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD Sbjct 1081 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 1111 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Ovine lentivirus (strain SA-OMVV)] Sequence ID: P16901.1 Length: 1086 Range 1: 121 to 691 Score:1056 bits(2730), Expect:0.0, Method:Compositional matrix adjust., Identities:514/571(90%), Positives:546/571(95%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEKKIP T+V+LKEGCKGPHIAQWPLTQEKLEGLKEIVD+LEKEGKVGRAPPHWTCN Sbjct 121 ANLEEKKIPITQVKLKEGCKGPHIAQWPLTQEKLEGLKEIVDKLEKEGKVGRAPPHWTCN 180 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQ+KKHVTILDIGDAYFTI Sbjct 181 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQKKKHVTILDIGDAYFTI 240 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYR YTCFTMLSPNNLGPC RYYWKVLPQGWKLSP+VYQFTMQ+ILR WI +HPMI Sbjct 241 PLYEPYRPYTCFTMLSPNNLGPCTRYYWKVLPQGWKLSPSVYQFTMQEILRDWIAKHPMI 300 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDL + +HR IV ELASYIAQYGFMLPE+KRQEGYPAKWLGFELHPEK Sbjct 301 QFGIYMDDIYIGSDLDIMKHREIVEELASYIAQYGFMLPEEKRQEGYPAKWLGFELHPEK 360 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 W+FQKHTLPEI EG ITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSER IE Sbjct 361 WRFQKHTLPEIKEGTITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERRIEL 420 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 HV+EWE CR+KL EMEGNYYDEEKD+YGQ+DWG+KAIEYIVFQE+GKPLWVNVVH+IKN Sbjct 421 RHVKEWEECRRKLAEMEGNYYDEEKDVYGQIDWGDKAIEYIVFQERGKPLWVNVVHNIKN 480 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LSQ+QQIIKAAQKLTQEVIIR GKIPWILLPG+EEDWILELQ+GNI WMPSFWSCY+GS+ Sbjct 481 LSQSQQIIKAAQKLTQEVIIRIGKIPWILLPGKEEDWILELQIGNITWMPSFWSCYRGSI 540 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RWKKRNVI E+V GPTYYTDGGKKNG+GSLG+IASTG KFR HEEGTNQQLELRAIEEAC Sbjct 541 RWKKRNVITEVVEGPTYYTDGGKKNGKGSLGFIASTGVKFRKHEEGTNQQLELRAIEEAC 600 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGPEKMNIVTDSRYAYEFM RNWDEEVI+NPIQARIM+LVH+KE+IGVHWVPGHKGIPQ Sbjct 601 KQGPEKMNIVTDSRYAYEFMRRNWDEEVIKNPIQARIMKLVHDKEQIGVHWVPGHKGIPQ 660 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEID+YISEIFLA+EG GIL KRAEDAGYD Sbjct 661 NEEIDKYISEIFLAREGSGILPKRAEDAGYD 691 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Integrase; Short=IN [Caprine arthritis encephalitis virus strain Cork] Sequence ID: P33459.1 Length: 1109 Range 1: 145 to 715 Score:995 bits(2573), Expect:0.0, Method:Compositional matrix adjust., Identities:473/571(83%), Positives:531/571(92%), Gaps:0/571(0%) Query 1 ANLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCN 60 ANLEEK+IP T+V+LKEGC GPH+ QWPLT+EKL+GL EI+D+L +EGK+G+APPHWTCN Sbjct 145 ANLEEKRIPITKVKLKEGCTGPHVPQWPLTEEKLKGLTEIIDKLVEEGKLGKAPPHWTCN 204 Query 61 TPIFCIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 TPIFCIKKKSGKWRMLIDFRELNKQTEDL EAQLGLPHPGGLQ+KKHVTILDIGDAYFTI Sbjct 205 TPIFCIKKKSGKWRMLIDFRELNKQTEDLTEAQLGLPHPGGLQKKKHVTILDIGDAYFTI 264 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYEPYR+YTCFT+LSPNNLGPC RYYWKVLPQGWKLSP+VYQFTMQ+IL WI++HP I Sbjct 265 PLYEPYREYTCFTLLSPNNLGPCKRYYWKVLPQGWKLSPSVYQFTMQEILEDWIQQHPEI 324 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 QFGIYMDDIYIGSDL +++HR IV +LA+YIAQYGF LPE+KRQ+GYPAKWLGFELHP+ Sbjct 325 QFGIYMDDIYIGSDLEIKKHREIVKDLANYIAQYGFTLPEEKRQKGYPAKWLGFELHPQT 384 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIES 300 WKFQKHTLPE+T+G ITLNKLQKLVG+LVWRQS+IGKSIPNILKLMEGDR LQSER IE Sbjct 385 WKFQKHTLPELTKGTITLNKLQKLVGELVWRQSIIGKSIPNILKLMEGDRELQSERKIEE 444 Query 301 IHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN 360 +HV+EWEACR+KL+EMEGNYY+++KD+YGQL WG+KAIEYIV+QEKGKPLWVNVVH+IKN Sbjct 445 VHVKEWEACRKKLEEMEGNYYNKDKDVYGQLAWGDKAIEYIVYQEKGKPLWVNVVHNIKN 504 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSV 420 LS QQ+IKAAQKLTQEVIIRTGKIPWILLPG+EEDW LELQ+GNI WMP FWSCY+G Sbjct 505 LSIPQQVIKAAQKLTQEVIIRTGKIPWILLPGKEEDWRLELQLGNITWMPKFWSCYRGHT 564 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 RW+KRN+I E+V GPTYYTDGGKKN GSLG+I STGEKFR HEEGTNQQLELRAIEEA Sbjct 565 RWRKRNIIEEVVEGPTYYTDGGKKNKVGSLGFIVSTGEKFRKHEEGTNQQLELRAIEEAL 624 Query 481 KQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQ 540 KQGP+ MN+VTDSRYA+EF+LRNWDEEVI+NPIQARIME+ H K++IGVHWVPGHKGIPQ Sbjct 625 KQGPQTMNLVTDSRYAFEFLLRNWDEEVIKNPIQARIMEIAHKKDRIGVHWVPGHKGIPQ 684 Query 541 NEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 NEEID+YISEIFLAKEG GIL KR EDAGYD Sbjct 685 NEEIDKYISEIFLAKEGEGILPKREEDAGYD 715 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Feline immunodeficiency virus (isolate TM2)] Sequence ID: P31822.1 Length: 1124 Range 1: 158 to 728 Score:408 bits(1048), Expect:4e-129, Method:Compositional matrix adjust., Identities:258/586(44%), Positives:346/586(59%), Gaps:35/586(5%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 +KIP +VR+K+ +GP + QWPL+ EK+E L +IV+RLE EGKV RA P+ NTP+F Sbjct 158 EKIPIVKVRMKDPTQGPQVKQWPLSNEKIEALTDIVERLESEGKVKRADPNNPWNTPVFA 217 Query 66 IKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEP 125 IKKKSGKWRMLIDFR LNK T+ AE QLGLPHP GLQ KK VT+LDIGDAYFTIPL Sbjct 218 IKKKSGKWRMLIDFRVLNKLTDKGAEVQLGLPHPAGLQMKKQVTVLDIGDAYFTIPLDPD 277 Query 126 YRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIY 185 Y YT FT+ NN GP RY W LPQGW LSP +YQ T+ IL+ +I+++ + Y Sbjct 278 YAPYTAFTLPRKNNAGPGRRYVWCSLPQGWVLSPLIYQSTLNNILQPFIKQNSELDIYQY 337 Query 186 MDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQK 245 MDDIYIGS+L +EH+ V EL + +GF PEDK QE P KW+G+ELHP W Q+ Sbjct 338 MDDIYIGSNLNKKEHKQKVEELRKLLLWWGFETPEDKLQEEPPYKWMGYELHPLTWSIQQ 397 Query 246 HTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIESIHVR 304 L EI E P TLN+LQKL G + W Q++ SI + +M GD+ L S +R Sbjct 398 KQL-EIPERP-TLNELQKLAGKINWASQTIPDLSIKELTNMMRGDQKLDS--------IR 447 Query 305 EW--EACRQKLKEMEG-------NYYDEEKDIYGQLDW-GNKAIEYIVFQEKGKP-LWVN 353 EW EA R+ K E NYYD + +Y +L G I Y V+Q+ + LW Sbjct 448 EWTVEAKREVQKAKEAIETQAQLNYYDPNRGLYAKLSLVGPHQICYQVYQKNPEHILWYG 507 Query 354 VVHSIKNLSqaqqiikaaq--kLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPS 411 ++ K ++ I K+ +E IIR GK P +P E W L P Sbjct 508 KINRQKKKAENTCDIALRACYKIREESIIRIGKEPVYEIPASREAWESNLIRSPYLKAPP 567 Query 412 FWSCYKGSVRWKKRNVI----AELVPGPTYYTDGGKKNGRGS-LGYIASTGEKFRIHE-E 465 + + KR + A ++ T+Y DG +K G+ + Y +TG K++I E E Sbjct 568 PEVEFIHAALSIKRALSMIQDAPIIGAETWYIDGSRKQGKAARAAYWTNTG-KWQIMEIE 626 Query 466 GTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE 525 G+NQ+ E++A+ A K G E+MNI+TDS+Y + + D + + ++E + K Sbjct 627 GSNQKAEVQALLLALKAGSEEMNIITDSQYILNILNQQPD---LMEGLWQEVLEQMEKKI 683 Query 526 KIGVHWVPGHKGIPQNEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 I + WVPGHKGIP NEE+D+ + + + EG GIL+KR+EDAGYD Sbjct 684 AIFIDWVPGHKGIPGNEEVDK-LCQTMMIIEGEGILEKRSEDAGYD 728 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Feline immunodeficiency virus (isolate Petaluma)] Sequence ID: P16088.1 Length: 1124 Range 1: 160 to 729 Score:403 bits(1036), Expect:2e-127, Method:Compositional matrix adjust., Identities:258/579(45%), Positives:348/579(60%), Gaps:23/579(3%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 KIP +V++K+ KGP I QWPLT EK+E L EIV+RLEKEGKV RA + NTP+F I Sbjct 160 KIPVVKVKMKDPNKGPQIKQWPLTNEKIEALTEIVERLEKEGKVKRADSNNPWNTPVFAI 219 Query 67 KKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPY 126 KKKSGKWRMLIDFRELNK TE AE QLGLPHP GLQ KK VT+LDIGDAYFTIPL Y Sbjct 220 KKKSGKWRMLIDFRELNKLTEKGAEVQLGLPHPAGLQIKKQVTVLDIGDAYFTIPLDPDY 279 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 YT FT+ NN GP R+ W LPQGW LSP +YQ T+ I++ +I ++P + YM Sbjct 280 APYTAFTLPRKNNAGPGRRFVWCSLPQGWILSPLIYQSTLDNIIQPFIRQNPQLDIYQYM 339 Query 187 DDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKH 246 DDIYIGS+L +EH+ V EL + +GF PEDK QE P W+G+ELHP W Q+ Sbjct 340 DDIYIGSNLSKKEHKEKVEELRKLLLWWGFETPEDKLQEEPPYTWMGYELHPLTWTIQQK 399 Query 247 TLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIESIHVRE 305 L +I E P TLN+LQKL G + W Q++ SI + +M G++ L S R E Sbjct 400 QL-DIPEQP-TLNELQKLAGKINWASQAIPDLSIKALTNMMRGNQNLNSTRQWTKEARLE 457 Query 306 WEACRQKLKE-MEGNYYDEEKDIYGQLDW-GNKAIEYIVFQ-EKGKPLWVNVVHSIKNLS 362 + ++ ++E ++ YYD K++Y +L G I Y V+Q + K LW + K + Sbjct 458 VQKAKKAIEEQVQLGYYDPSKELYAKLSLVGPHQISYQVYQKDPEKILWYGKMSRQKKKA 517 Query 363 qaqqiikaaq--kLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWM---PSFWSCYK 417 + I K+ +E IIR GK P +P E W E + N ++ P Sbjct 518 ENTCDIALRACYKIREESIIRIGKEPRYEIPTSREAW--ESNLINSPYLKAPPPEVEYIH 575 Query 418 GSVRWKKRNVIAELVPGP---TYYTDGGKKNGRGS-LGYIASTGEKFRIHE-EGTNQQLE 472 ++ K+ + + P P T+Y DGG+K G+ + Y TG K+R+ + EG+NQ+ E Sbjct 576 AALNIKRALSMIKDAPIPGAETWYIDGGRKLGKAAKAAYWTDTG-KWRVMDLEGSNQKAE 634 Query 473 LRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWV 532 ++A+ A K G E+MNI+TDS+Y +L+ D + I ++E + K I + WV Sbjct 635 IQALLLALKAGSEEMNIITDSQYVINIILQQPD---MMEGIWQEVLEELEKKTAIFIDWV 691 Query 533 PGHKGIPQNEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 PGHKGIP NEE+D+ + + + EG GIL KR+EDAGYD Sbjct 692 PGHKGIPGNEEVDK-LCQTMMIIEGDGILDKRSEDAGYD 729 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Deoxyuridine 5'-triphosphate nucleotidohydrolase; Short=dUTPase; Contains: RecName: Full=Integrase; Short=IN [Feline immunodeficiency virus (isolate San Diego)] Sequence ID: P19028.1 Length: 1124 Range 1: 160 to 729 Score:401 bits(1031), Expect:1e-126, Method:Compositional matrix adjust., Identities:254/578(44%), Positives:345/578(59%), Gaps:21/578(3%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 KIP +V++K+ KGP I QWPL+ EK+E L EIV+RLE+EGKV RA P+ NTP+F I Sbjct 160 KIPIVKVKMKDPNKGPQIKQWPLSNEKIEALTEIVERLEREGKVKRADPNNPWNTPVFAI 219 Query 67 KKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPY 126 KKKSGKWRMLIDFRELNK TE AE QLGLPHP GLQ KK +T+LDIGDAYFT PL Y Sbjct 220 KKKSGKWRMLIDFRELNKLTEKGAEVQLGLPHPAGLQMKKQITVLDIGDAYFTNPLDPDY 279 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 YT FT+ NN GP R+ W LPQGW LSP +YQ T+ I++ +I ++P + YM Sbjct 280 APYTAFTLPRKNNAGPGRRFVWCSLPQGWILSPLIYQSTLDNIIQPFIRQNPQLDIYQYM 339 Query 187 DDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKH 246 DDIYIGS+L +EH+ V EL + +GF PEDK QE P KW+G+ELHP W Q+ Sbjct 340 DDIYIGSNLSKKEHKEKVEELRKLLLWWGFETPEDKLQEEPPYKWMGYELHPLTWTIQQK 399 Query 247 TLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSER-YIESIHVR 304 L EI E P TLN+LQKL G + W Q++ SI ++ + G++ L S R + E + Sbjct 400 QL-EIPEKP-TLNELQKLAGKINWASQTIPELSIKSLTNMTRGNQNLNSTREWTEEARLE 457 Query 305 EWEACRQKLKEMEGNYYDEEKDIYGQLDW-GNKAIEYIVFQE-KGKPLWVNVVHSIKNLS 362 +A R ++++ YYD K++Y +L G I Y V+Q+ K LW + K + Sbjct 458 VQKAKRAIEEQVQLGYYDPSKELYAKLSLVGPHQISYQVYQKCPEKILWYGKMSRQKKKA 517 Query 363 qaqqiikaaq--kLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWM---PSFWSCYK 417 + I K+ +E IIR GK P +P E W E + N ++ P Sbjct 518 ENTCDIALRACYKIREESIIRIGKEPRYEIPTSREAW--ESNLINSPYLKAPPPEVDYIH 575 Query 418 GSVRWKKRNVIAELVP---GPTYYTDGGKKNGRGS-LGYIASTGEKFRIHEEGTNQQLEL 473 ++ K+ + + P T+Y DGG+K G+ + Y TG+ + EG+NQ+ E+ Sbjct 576 AALNIKRALSMIKDPPISGAETWYIDGGRKLGKAAKAAYWTDTGKWQVMELEGSNQKAEI 635 Query 474 RAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVP 533 +A+ A K GPE+MNI+TDS+Y + + D+ I ++E + K I + WVP Sbjct 636 QALLLALKAGPEEMNIITDSQYMINILSQQPDK---MEGIWQEVLEELEKKTAIFIDWVP 692 Query 534 GHKGIPQNEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 GHKGIP NEE+D+ + + + EG GIL KR EDAGYD Sbjct 693 GHKGIPGNEEVDK-LCQTMMIIEGDGILDKRTEDAGYD 729 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Integrase; Short=IN [Equine infectious anemia virus (clone CL22)] Sequence ID: P32542.1 Length: 1146 Range 1: 189 to 761 Score:357 bits(917), Expect:5e-110, Method:Compositional matrix adjust., Identities:229/577(40%), Positives:335/577(58%), Gaps:15/577(2%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 K+I ++ LKEG GP I QWPLT+EKLEG KEIV RL EGK+ A + N+PIF Sbjct 189 KEIKFRKIELKEGTMGPKIPQWPLTKEKLEGAKEIVQRLLSEGKISEASDNNPYNSPIFV 248 Query 66 IKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEP 125 IKK+SGKWR+L D RELNK + E GLPHPGGL + KH+T+LDIGDAYFTIPL Sbjct 249 IKKRSGKWRLLQDLRELNKTVQVGTEISRGLPHPGGLIKCKHMTVLDIGDAYFTIPLDPE 308 Query 126 YRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIY 185 +R YT FT+ S N+ P RY W LPQG+ LSP +YQ T+Q+IL+ + E +P +Q Y Sbjct 309 FRPYTAFTIPSINHQEPDKRYVWNCLPQGFVLSPYIYQKTLQEILQPFRERYPEVQLYQY 368 Query 186 MDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQK 245 MDD+++GS+ ++H+ ++ EL + + + GF P+DK QE P WLG++L PE WK QK Sbjct 369 MDDLFVGSNGSKKQHKELIIELRAILLEKGFETPDDKLQEVPPYSWLGYQLCPENWKVQK 428 Query 246 HTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVR 304 L ++ + P TLN +QKL+G++ W S + G ++ +I +G L + + Sbjct 429 MQL-DMVKNP-TLNDVQKLMGNITWMSSGVPGLTVKHIAATTKGCLELNQKVIWTEEAQK 486 Query 305 EWEACRQKLKEMEG-NYYDEEKDIYGQLDWG-NKAIEYIVFQEKGKPLWV--NVVHSIKN 360 E E +K+K +G YY+ E+++ +++ N Y++ Q +G LW ++ + K Sbjct 487 ELEENNEKIKNAQGLQYYNPEEEMLCEVEITKNYEATYVIKQSQGI-LWAGKKIMKANKG 545 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMG-NINWMPSFWSCYKGS 419 S + ++ Q + E I R GK P +P +E + E+Q G +W+P ++ Sbjct 546 WSTVKNLMLLLQHVATESITRVGKCPTFKVPFTKEQVMWEMQKGWYYSWLPEIVYTHQVV 605 Query 420 VRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGE-KFRIHEEGTNQQLELRAIEE 478 + ++ E G T YTDGGK+NG G Y+ S G K + T+Q E AI+ Sbjct 606 HDDWRMKLVEEPTSGITIYTDGGKQNGEGIAAYVTSNGRTKQKRLGPVTHQVAERMAIQM 665 Query 479 ACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 A + +K +NIVTDS Y ++ + E ++P I++ + KE + WVPGHKG Sbjct 666 ALEDTRDKQVNIVTDSYYCWKNITEGLGLEGPQSPWWP-IIQNIREKEIVYFAWVPGHKG 724 Query 538 IPQNEEID---RYISEIFLAKEGRGILQKRAEDAGYD 571 I N+ D + EI LA +G I +KR EDAG+D Sbjct 725 ICGNQLADEAAKIKEEIMLAYQGTQIKEKRDEDAGFD 761 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Integrase; Short=IN [Equine infectious anemia virus (CLONE 1369)] Sequence ID: P11204.1 Length: 1146 Range 1: 189 to 761 Score:357 bits(916), Expect:6e-110, Method:Compositional matrix adjust., Identities:229/577(40%), Positives:335/577(58%), Gaps:15/577(2%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 K+I ++ LKEG GP I QWPLT+EKLEG KEIV RL EGK+ A + N+PIF Sbjct 189 KEIKFRKIELKEGTMGPKIPQWPLTKEKLEGAKEIVQRLLSEGKISEASDNNPYNSPIFV 248 Query 66 IKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEP 125 IKK+SGKWR+L D RELNK + E GLPHPGGL + KH+T+LDIGDAYFTIPL Sbjct 249 IKKRSGKWRLLQDLRELNKTVQVGTEISRGLPHPGGLIKCKHMTVLDIGDAYFTIPLDPE 308 Query 126 YRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIY 185 +R YT FT+ S N+ P RY W LPQG+ LSP +YQ T+Q+IL+ + E +P +Q Y Sbjct 309 FRPYTAFTIPSINHQEPDKRYVWNCLPQGFVLSPYIYQKTLQEILQPFRERYPEVQLYQY 368 Query 186 MDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQK 245 MDD+++GS+ ++H+ ++ EL + + + GF P+DK QE P WLG++L PE WK QK Sbjct 369 MDDLFVGSNGSKKQHKELIIELRAILLEEGFETPDDKLQEVPPYSWLGYQLCPENWKVQK 428 Query 246 HTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVR 304 L ++ + P TLN +QKL+G++ W S + G ++ +I +G L + + Sbjct 429 MQL-DMVKNP-TLNDVQKLMGNITWMSSGVPGLTVKHIAATTKGCLELNQKVIWTEEAQK 486 Query 305 EWEACRQKLKEMEG-NYYDEEKDIYGQLDWG-NKAIEYIVFQEKGKPLWV--NVVHSIKN 360 E E +K+K +G YY+ E+++ +++ N Y++ Q +G LW ++ + K Sbjct 487 ELEENNEKIKNAQGLQYYNPEEEMLCEVEITKNYEATYVIKQSQGI-LWAGKKIMKANKG 545 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMG-NINWMPSFWSCYKGS 419 S + ++ Q + E I R GK P +P +E + E+Q G +W+P ++ Sbjct 546 WSTVKNLMLLLQHVATESITRVGKCPTFKVPFTKEQVMWEMQKGWYYSWLPEIVYTHQVV 605 Query 420 VRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGE-KFRIHEEGTNQQLELRAIEE 478 + ++ E G T YTDGGK+NG G Y+ S G K + T+Q E AI+ Sbjct 606 HDDWRMKLVEEPTSGITIYTDGGKQNGEGIAAYVTSNGRTKQKRLGPVTHQVAERMAIQM 665 Query 479 ACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 A + +K +NIVTDS Y ++ + E ++P I++ + KE + WVPGHKG Sbjct 666 ALEDTRDKQVNIVTDSYYCWKNITEGLGLEGPQSPWWP-IIQNIREKEIVYFAWVPGHKG 724 Query 538 IPQN---EEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 I N +E + EI LA +G I +KR EDAG+D Sbjct 725 ICGNQLADEAAKIKEEIMLAYQGTQIKEKRDEDAGFD 761 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Protease; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; Contains: RecName: Full=Integrase; Short=IN [Equine infectious anemia virus (ISOLATE WYOMING)] Sequence ID: P03371.1 Length: 1145 Range 1: 188 to 760 Score:357 bits(915), Expect:8e-110, Method:Compositional matrix adjust., Identities:232/578(40%), Positives:335/578(57%), Gaps:16/578(2%) Query 5 EKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIF 64 K+I ++ LKEG GP I QWPLT+EKLEG KE V RL EGK+ A + N+PIF Sbjct 188 SKEIKFRKIELKEGTMGPKIPQWPLTKEKLEGAKETVQRLLSEGKISEASDNNPYNSPIF 247 Query 65 CIKKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYE 124 IKK+SGKWR+L D RELNK + E GLPHPGGL + KH+T+LDIGDAYFTIPL Sbjct 248 VIKKRSGKWRLLQDLRELNKTVQVGTEISRGLPHPGGLIKCKHMTVLDIGDAYFTIPLDP 307 Query 125 PYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGI 184 +R YT FT+ S N+ P RY WK LPQG+ LSP +YQ T+Q+IL+ + E +P +Q Sbjct 308 EFRPYTAFTIPSINHQEPDKRYVWKCLPQGFVLSPYIYQKTLQEILQPFRERYPEVQLYQ 367 Query 185 YMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQ 244 YMDD+++GS+ ++H+ ++ EL + I Q GF P+DK QE P WLG++L PE WK Q Sbjct 368 YMDDLFVGSNGSKKQHKELIIELRA-ILQKGFETPDDKLQEVPPYSWLGYQLCPENWKVQ 426 Query 245 KHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHV 303 K L ++ + P TLN +QKL+G++ W S + G ++ +I +G L + Sbjct 427 KMQL-DMVKNP-TLNDVQKLMGNITWMSSGVPGLTVKHIAATTKGCLELNQKVIWTEEAQ 484 Query 304 REWEACRQKLKEMEG-NYYDEEKDIYGQLDWG-NKAIEYIVFQEKGKPLWV--NVVHSIK 359 +E E +K+K +G YY+ E+++ +++ N Y++ Q +G LW ++ + K Sbjct 485 KELEENNEKIKNAQGLQYYNPEEEMLCEVEITKNYEATYVIKQSQGI-LWAGKKIMKANK 543 Query 360 NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMG-NINWMPSFWSCYKG 418 S + ++ Q + E I R GK P +P +E + E+Q G +W+P ++ Sbjct 544 GWSTVKNLMLLLQHVATESITRVGKCPTFKVPFTKEQVMWEMQKGWYYSWLPEIVYTHQV 603 Query 419 SVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGE-KFRIHEEGTNQQLELRAIE 477 + ++ E G T YTDGGK+NG G Y+ S G K + T+Q E AI+ Sbjct 604 VHDDWRMKLVEEPTSGITIYTDGGKQNGEGIAAYVTSNGRTKQKRLGPVTHQVAERMAIQ 663 Query 478 EACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 A + +K +NIVTDS Y ++ + E +NP I++ + KE + WVPGHK Sbjct 664 MALEDTRDKQVNIVTDSYYCWKNITEGLGLEGPQNPWWP-IIQNIREKEIVYFAWVPGHK 722 Query 537 GIPQNEEID---RYISEIFLAKEGRGILQKRAEDAGYD 571 GI N+ D + EI LA +G I +KR EDAG+D Sbjct 723 GIYGNQLADEAAKIKEEIMLAYQGTQIKEKRDEDAGFD 760 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:F2_MP257C] Sequence ID: Q9QBZ1.3 Length: 1434 Range 1: 585 to 1139 Score:357 bits(915), Expect:3e-108, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:328/560(58%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 585 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISKIGPENPYNT 644 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++K+ VT+LD+GDAYF++ Sbjct 645 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKRSVTVLDVGDAYFSV 704 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +E+P I Sbjct 705 PLDKEFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMIKILEPFRKENPEI 764 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 765 VIYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 824 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + ++ KL+ G +AL + Sbjct 825 WTVQAIQLPD--KSSWTVNDIQKLVGKLNWASQIYPGIRVKHLCKLLRGTKALTDVVPLT 882 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ LKE + G YYD KD+ ++ G+ Y ++QE K L Sbjct 883 AEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGHDQWTYQIYQEPHKNLKTGKYAR 942 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 K+ + +Q+ + QK+ E I+ GK+P LP ++E W I + W+P F Sbjct 943 RKSAHTNDVKQLTEVVQKVATEGIVIWGKVPKFRLPIQKETWEIWWTEYWQATWIPEWEF 1002 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + ++ T+Y DG ++ G GYI G +K E TNQ Sbjct 1003 VNTPPLVKLWYQLET-EPIIGAETFYVDGAANRETKLGKAGYITDRGRQKVVSLTETTNQ 1061 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI+ A + ++NIVTDS+YA + + D+ + I +I+E + KE++ + Sbjct 1062 KTELQAIQLALQDSGSEVNIVTDSQYALGIIQAHPDKS--ESEIVNQIIEQLIQKERVYL 1119 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1120 SWVPAHKGIGGNEQVDKLVS 1139 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (ISOLATE GB1)] Sequence ID: P22382.2 Length: 1441 Range 1: 600 to 1149 Score:357 bits(915), Expect:4e-108, Method:Compositional matrix adjust., Identities:227/571(40%), Positives:319/571(55%), Gaps:49/571(8%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 KIP T+V+LK G GP I QWPL++EK+ GL++I DRLE+EGK+ R P NTPIF I Sbjct 600 KIPITKVKLKPGVDGPRIKQWPLSKEKIVGLQKICDRLEEEGKISRVDPGNNYNTPIFAI 659 Query 67 KKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEP 125 KKK +WR LIDFRELNK T+D E QLG+PHP G+++ K +T+LDIGDAYF+IPL Sbjct 660 KKKDKNEWRKLIDFRELNKLTQDFHELQLGIPHPAGIKKCKRITVLDIGDAYFSIPLDPD 719 Query 126 YRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIY 185 YR YT FT+ S NN P RY + VLPQGWK SP ++Q T+ +L + + HP +Q Y Sbjct 720 YRPYTAFTVPSVNNQAPGKRYMYNVLPQGWKGSPCIFQGTVASLLEVFRKNHPTVQLYQY 779 Query 186 MDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQK 245 MDD+++GSD EEH + EL + + + PE K Q+ P W+G+ELHP+KWK +K Sbjct 780 MDDLFVGSDYTAEEHEKAIVELRALLMTWNLETPEKKYQKEPPFHWMGYELHPDKWKIEK 839 Query 246 HTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVR 304 LPE+ E P T+N++QKLVG L W L G + KL+ G + + + Sbjct 840 VQLPELAEQP-TVNEIQKLVGKLNWAAQLYPGIKTKQLCKLIRGGLNITEKVTMTEEARL 898 Query 305 EWEACRQKL-KEMEGNYYDEEKDIY--------GQLDW----GNKAIEYIVFQEKGKPLW 351 E+E ++ L +E EG+YYD K++Y G + + GNK + GK Sbjct 899 EYEQNKEILAEEQEGSYYDPNKELYVRFQKTTGGDISFQWKQGNKVL------RAGKYGK 952 Query 352 VNVVHSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPS 411 HS + ++ A QK+ +E I+ G +P + +P E W +W Sbjct 953 QKTAHS----NDLMKLAGATQKVGRESIVIWGFVPKMQIPTTREIW--------EDWWHE 1000 Query 412 FWSC-YKGSVRWKKRNVIA----ELVPGP-----TYYTDGG--KKNGRGSLGYIASTG-E 458 +W C + V + ++ L P P TYY DG + + G GYI G + Sbjct 1001 YWQCTWIPEVEFISTPMLEREWYSLSPEPLEGVETYYVDGAANRDSKMGKAGYITDRGFQ 1060 Query 459 KFRIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIM 518 + + TNQQ EL A++ A + +NIVTDS+Y + E +PI I+ Sbjct 1061 RVEEYLNTTNQQTELHAVKLALEDSGSYVNIVTDSQYVVGILASRPTE--TDHPIVKEII 1118 Query 519 ELVHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 EL+ KEKI + W+P HKGI NE+ID+ +S Sbjct 1119 ELMKGKEKIYLSWLPAHKGIGGNEQIDKLVS 1149 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (NDK ISOLATE)] Sequence ID: P18802.3 Length: 1432 Range 1: 583 to 1138 Score:356 bits(913), Expect:7e-108, Method:Compositional matrix adjust., Identities:223/561(40%), Positives:326/561(58%), Gaps:17/561(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ R P NT Sbjct 583 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISRIGPENPYNT 642 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 643 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 702 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 703 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEI 762 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 763 VIYQYMDDLYVGSDLEIGQHRTKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 822 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 823 WTVQPINLPE--KESWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVVPLT 880 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ +L G+ Y ++QE K L Sbjct 881 EEAELELAENREILKEPVHGVYYDPSKDLIAELQKQGDGQWTYQIYQEPFKNLKTGKYAR 940 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 + + + +Q+ +A QK+ E I+ GK P LP ++E W ++ W+P F Sbjct 941 TRGAHTNDVKQLTEAVQKIATESIVIWGKTPKFKLPIQKETWETWWIEYWQATWIPEWEF 1000 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + ++ T+Y DG ++ G GY+ G +K + TNQ Sbjct 1001 VNTPPLVKLWYQLEK-EPIIGAETFYVDGAANRETKLGKAGYVTDRGRQKVVPFTDTTNQ 1059 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + ++I+E + KEK+ + Sbjct 1060 KTELQAINLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYL 1117 Query 530 HWVPGHKGIPQNEEIDRYISE 550 WVP HKGI NE++D+ +S+ Sbjct 1118 AWVPAHKGIGGNEQVDKLVSQ 1138 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (OYI ISOLATE)] Sequence ID: P20892.3 Length: 1434 Range 1: 585 to 1139 Score:355 bits(912), Expect:8e-108, Method:Compositional matrix adjust., Identities:224/569(39%), Positives:326/569(57%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 585 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKVLIEICTEMEKEGKISKVGPENPYNT 644 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 645 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 704 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 705 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 764 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 765 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 824 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + N+ KL+ G +AL + Sbjct 825 WTVQPIMLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKNLCKLLRGTKALTEVIPLT 882 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ +L G Y ++QE K L Sbjct 883 EEAELELAENREILKEPVHGVYYDPSKDLVAELQKQGQGQWTYQIYQEPFKNLKTGKYAR 942 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 ++ + + +Q+ +A QK+TQE I+ GK P LP ++E WI E + Sbjct 943 MRGAHTNDVKQLTEAVQKITQESIVIWGKTPKFKLPIQKETWEAWWTEYWQATWIPEWEF 1002 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +V T+Y DG ++ G GY+ G +K Sbjct 1003 VNTPPLVKLWYQLEKD----------PIVGAETFYVDGAANRETKLGKAGYVTDRGRQKV 1052 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + ++I+E Sbjct 1053 VSLTDTTNQKTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQ 1110 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D+ +S Sbjct 1111 LIKKEKVYLAWVPAHKGIGGNEQVDKLVS 1139 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [SIVcpz MB66] Sequence ID: Q1A267.4 Length: 1438 Range 1: 585 to 1139 Score:355 bits(911), Expect:1e-107, Method:Compositional matrix adjust., Identities:225/559(40%), Positives:322/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V LK G GP + QWPLT+EK+ L EI +EKEGK+ R P NT Sbjct 585 NFPISPIETVPVSLKPGMDGPRVKQWPLTEEKIRALTEICTEMEKEGKISRVGPENPYNT 644 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF+ Sbjct 645 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKQKKSVTVLDVGDAYFSC 704 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 705 PLDENFRKYTAFTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEI 764 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL +E HR V EL +++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 765 IIYQYMDDLYVGSDLKIELHREKVEELRAHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 824 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWR-QSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKL+G L W Q G + + KL+ G +AL Sbjct 825 WTVQPIQLPE--KESWTVNDIQKLIGKLNWACQIYPGIRVKQLCKLIRGTKALTEVVTFT 882 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ LKE + G YYD K++ ++ G Y +FQE+ K L Sbjct 883 TEAELELAENREILKEPVHGAYYDPSKELIAEIQKQGQGQWTYQIFQEQYKNLKTGKYAR 942 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 +++ + +Q+ + QK+ E I+ GK+P LP ++E W W+P + Sbjct 943 MRSAHTNDVKQLTEVVQKVALESIVIWGKVPRFRLPIQKETWEAWWTDYWQATWIPEWEY 1002 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + + +PG T+Y DG ++ G GY+ G +K E TNQ+ Sbjct 1003 VNTPPLVKLWYQLEQDPIPGAETFYVDGAANRETKLGKAGYVTDKGRQKIISLTETTNQK 1062 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI+ A + ++NIVTDS+YA + D + I +I+E + KEK+ + Sbjct 1063 AELQAIQLALQDSEVEVNIVTDSQYALGIIQGQPDTS--ESEIVNQIIEELIKKEKVYLS 1120 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE+ID+ +S Sbjct 1121 WVPAHKGIGGNEQIDKLVS 1139 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (MAL ISOLATE)] Sequence ID: P04588.3 Length: 1440 Range 1: 591 to 1145 Score:355 bits(910), Expect:2e-107, Method:Compositional matrix adjust., Identities:222/560(40%), Positives:326/560(58%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 591 NFPISPIETVPVKLKPGMDGPRVKQWPLTEEKIKALTEICKDMEKEGKILKIGPENPYNT 650 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L++FRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 651 PVFAIKKKDSTKWRKLVNFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 710 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++P I Sbjct 711 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRTKNPEI 770 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 771 VIYQYMDDLYVGSDLEIGQHRTKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 830 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 831 WTVQPIQLPD--KESWTVNDIQKLVGKLNWASQIYPGIKVKQLCKLLRGAKALTDIVPLT 888 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ LKE + G YYD KD+ ++ G Y ++QE+ K L Sbjct 889 AEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEQYKNLKTGKYAR 948 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 IK+ + +Q+ +A QK+ QE I+ GK P LP ++E W + W+P F Sbjct 949 IKSAHTNDVKQLTEAVQKIAQESIVIWGKTPKFRLPIQKETWEAWWTEYWQATWIPEWEF 1008 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ +G GY+ G +K E TNQ Sbjct 1009 VNTPPLVKLWYQLET-EPIVGAETFYVDGAANRETKKGKAGYVTDRGRQKVVSLTETTNQ 1067 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + I +I+E + K+K+ + Sbjct 1068 KTELQAIHLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESEIVNQIIEQLIQKDKVYL 1125 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1126 SWVPAHKGIGGNEQVDKLVS 1145 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:F2_MP255C] Sequence ID: Q9QBZ5.3 Length: 1430 Range 1: 581 to 1135 Score:354 bits(909), Expect:2e-107, Method:Compositional matrix adjust., Identities:220/559(39%), Positives:325/559(58%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 581 NFPISPIETVPVKLKPGMDGPRVKQWPLTEEKIKALTEICTEMEKEGKISKIGPENPYNT 640 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 641 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 700 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++P I Sbjct 701 PLDKEFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRAKNPEI 760 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 761 VIYQYMDDLYVGSDLEIGQHRTKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 820 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G I ++ +L+ G +AL + Sbjct 821 WTVQPIQLPE--KSSWTVNDIQKLVGKLNWASQIYPGIRIKHLCRLLRGAKALTDVVPLT 878 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ +KE + G YYD KD+ ++ G+ Y ++QE K L Sbjct 879 AEAELELAENREIIKEPVHGVYYDPSKDLIAEIQKQGHDQWTYQIYQEPYKNLKTGKYAK 938 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 K+ + +Q+ + QK+ E I+ GKIP LP ++E W I + W+P + Sbjct 939 RKSAHTNDVKQLTEVVQKIATESIVIWGKIPKFRLPIQKETWEIWWTEYWQATWIPEWEF 998 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G T+Y DG ++ G GY+ G +K E TNQ+ Sbjct 999 VNTPPLVKLWYQLETEPIAGAETFYVDGAANRETKLGKAGYVTDRGRQKVVPLTETTNQK 1058 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + Sbjct 1059 TELQAIHLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIQKEKVYLS 1116 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1117 WVPAHKGIGGNEQVDKLVS 1135 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (Z2/CDC-Z34 ISOLATE)] Sequence ID: P12499.3 Length: 1436 Range 1: 587 to 1142 Score:353 bits(907), Expect:5e-107, Method:Compositional matrix adjust., Identities:221/570(39%), Positives:327/570(57%), Gaps:35/570(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ R P NT Sbjct 587 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISRVGPENPYNT 646 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 647 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 706 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 707 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEI 766 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 767 VIYQYMDDLYVGSDLEIGQHRTKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 826 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 827 WTVQSIKLPE--KESWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 884 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G+ Y ++QE K L Sbjct 885 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGHGQWTYQIYQEPFKNLKTGKYAR 944 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 ++ + + +Q+ + QK++ E I+ GK P LP ++E WI E + Sbjct 945 MRGAHTNDVKQLAEVVQKISTESIVIWGKTPKFRLPIQKETWETWWVEYWQATWIPEWEF 1004 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K +I T+Y DG ++ G GY+ G +K Sbjct 1005 VNTPPLVKLW------YQLEKEPIIG----AETFYVDGAANRETKLGKAGYVTDRGRQKV 1054 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + ++I+E Sbjct 1055 VPFTDTTNQKTELQAINLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQ 1112 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYISE 550 + KEK+ + WVP HKGI NE++D+ +S+ Sbjct 1113 LIKKEKVYLAWVPAHKGIGGNEQVDKLVSQ 1142 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE SBLISY)] Sequence ID: P12451.3 Length: 1462 Range 1: 609 to 1163 Score:353 bits(907), Expect:6e-107, Method:Compositional matrix adjust., Identities:235/565(42%), Positives:325/565(57%), Gaps:26/565(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI +V LK G GP QWPLT+EK+E L+EI +++E+EG++ APP NT Sbjct 609 NLPVAKIEPVKVTLKPGKDGPKQRQWPLTREKIEALREICEKMEREGQLEEAPPTNPYNT 668 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +K+ +T+LD+GDAYF+I Sbjct 669 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEVQLGIPHPAGLAKKRRITVLDVGDAYFSI 728 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYE +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM+++L + + +P + Sbjct 729 PLYEDFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRQVLEPFRKANPDV 788 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 789 IIVQYMDDILIASDRTDLEHDKVVLQLKELLNGLGFSTPDEKFQKDPPYQWMGYELWPTK 848 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ KL+ G E ++ Sbjct 849 WKLQKIQLPQ--KEVWTVNDIQKLVGVLNWAAQIYPGIKTKHLCKLIRGKMTPTEE--VQ 904 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYG--QLDWGNKAIEYIVFQEKGKPLWVNV 354 + E E K+ +E EG+YY EEK++ Q D N+ Y V Q + K L V Sbjct 905 WTELAEAELEENKIILSQEQEGHYYQEEKELEATVQKDQDNQWT-YKVHQGE-KILKVGK 962 Query 355 VHSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWM 409 IKN + + + + QK+ +E ++ G+IP LP E W E N + W+ Sbjct 963 YAKIKNTHTNGVRLLAQVVQKIGKEALVIWGRIPKFHLPVERETW--EQWWDNYWQVTWI 1020 Query 410 PSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEE 465 P + + N++ + +PG T+YTDG +++ G GYI G +K RI E+ Sbjct 1021 PDWDFVSTPPLVRLAFNLVKDPIPGAETFYTDGSCNRQSKEGKAGYITDRGKDKVRILEQ 1080 Query 466 GTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE 525 TNQQ EL A A K+NIV DS+Y + E R I +I+E + KE Sbjct 1081 TTNQQAELEAFAMAVTDSGPKVNIVVDSQYVMGIVTGQPAESESR--IVNKIIEEMIKKE 1138 Query 526 KIGVHWVPGHKGIPQNEEIDRYISE 550 I V WVP HKGI N+EID +S+ Sbjct 1139 AIYVAWVPAHKGIGGNQEIDHLVSQ 1163 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_MN] Sequence ID: P05961.3 Length: 1441 Range 1: 592 to 1146 Score:353 bits(905), Expect:8e-107, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:326/560(58%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 592 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALIEICTEMEKEGKISKIGPENPYNT 651 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 652 PVFAIKKKDSTKWRKLVDFRELNKKTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 711 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 712 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 771 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 772 VIYQYMDDLYVGSDLEIGQHRAKIEELRRHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 831 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 832 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVIPLT 889 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 890 EEAELELAENREILKEPVHGVYYDPSKDLIAEVQKQGQGQWTYQIYQEPFKNLKTGKYAR 949 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 ++ + + +Q+ +A QK+ E I+ GK P LP ++E W + W+P W Sbjct 950 MRGAHTNDVKQLTEAVQKIATESIVIWGKTPKFRLPIQKETWETWWTEYTXATWIPE-WE 1008 Query 415 CYKGS--VRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 V+ + +V T+Y DG ++ +G GY+ + G +K + TNQ Sbjct 1009 VVNTPPLVKLWYQLEKEPIVGAETFYVDGAANRETKKGKAGYVTNRGRQKVVSLTDTTNQ 1068 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + ++I+E + KEK+ + Sbjct 1069 KTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYL 1126 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1127 AWVPAHKGIGGNEQVDKLVS 1146 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:G_SE6165] Sequence ID: O89940.3 Length: 1433 Range 1: 584 to 1138 Score:353 bits(905), Expect:8e-107, Method:Compositional matrix adjust., Identities:219/559(39%), Positives:320/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +E+EGK+ + P NT Sbjct 584 NFPISPIETVPVKLKPGMDGPRVKQWPLTEEKIKALTEICKEMEEEGKISKIGPENPYNT 643 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 644 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 703 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P VRY + VLPQGWK SPA++Q +M +IL + +P + Sbjct 704 PLDEDFRKYTAFTIPSINNETPGVRYQYNVLPQGWKGSPAIFQSSMTRILEPFRANNPEM 763 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 764 VIYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 823 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + ++ KL+ G +AL + Sbjct 824 WTVQPIQLPD--KESWTVNDIQKLVGKLNWASQIYPGIKVTHLCKLLRGAKALTDIVSLT 881 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 + E R+ L+E + G YYD K++ ++ G Y ++QE K L Sbjct 882 AEAEMELAENREILREPVHGVYYDPSKELIAEVQKQGLDQWTYQIYQEPYKNLKTGKYAK 941 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 + + +Q+ + QK+ E I+ GK P LP R+E W I W+P + Sbjct 942 RGSAHTNDVKQLTEVVQKIATESIVIWGKTPKFKLPIRKETWEIWWTDYWQATWIPEWEF 1001 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E +PG TYY DG ++ G GY+ G +K E TNQ+ Sbjct 1002 VNTPPLVKLWYRLETEPIPGAETYYVDGAANRETKLGKAGYVTDKGKQKIITLTETTNQK 1061 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI+ A + ++NIVTDS+YA + D + +I+E + KEK+ + Sbjct 1062 AELQAIQLALQDSRSEVNIVTDSQYALGIIQAQPDRS--EAELVNQIIEQLIKKEKVYLS 1119 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1120 WVPAHKGIGGNEQVDKLVS 1138 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [SIVcpz TAN1] Sequence ID: Q8AII1.4 Length: 1462 Range 1: 613 to 1168 Score:353 bits(905), Expect:1e-106, Method:Compositional matrix adjust., Identities:223/562(40%), Positives:317/562(56%), Gaps:19/562(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I +V+LKEG GP + QWPL++EK+E L EI LEKEGK+ P NT Sbjct 613 NFPISPIEVVKVQLKEGMDGPKVKQWPLSKEKIEALTEICKTLEKEGKISAVGPENPYNT 672 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK + KWR L+DFRELNK+T+D E QLG+PHP GL+++ VT+LD+GDAYF+I Sbjct 673 PIFAIKKKDTSKWRKLVDFRELNKRTQDFWELQLGIPHPAGLRKRNMVTVLDVGDAYFSI 732 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +R+YT FT+ S NN P R+ + VLPQGWK SPA++Q +M KIL + +EHP + Sbjct 733 PLDPDFRKYTAFTIPSLNNNTPGKRFQYNVLPQGWKGSPAIFQSSMTKILDPFRKEHPDV 792 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+YIGSDL EEHR ++ +L ++ +G P+ K QE P W+G+ELHP K Sbjct 793 DIYQYMDDLYIGSDLNEEEHRKLIKKLRQHLLTWGLETPDKKYQEKPPFMWMGYELHPNK 852 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q TLPE + T+N +QKLVG L W + G + KL+ G + L + Sbjct 853 WTVQNITLPEPEQ--WTVNHIQKLVGKLNWASQIYHGIKTKELCKLIRGVKGLTEPVEMT 910 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E E +Q LKE ++G YYD + + + G Y ++QE+GK L Sbjct 911 REAELELEENKQILKEKVQGAYYDPKLPLQAAIQKQGQGQWTYQIYQEEGKNL--KTGKY 968 Query 358 IKNLSqaqqiikaaqkLTQ----EVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSF 412 K+ I+ L Q E II G +P LLP +E W + W+P + Sbjct 969 AKSPGTHTNEIRQLAGLIQKIGNESIIIWGIVPKFLLPVSKETWSQWWTDYWQVTWVPEW 1028 Query 413 WSCYKGSVRWKKRNVIAELVP-GPTYYTDGG--KKNGRGSLGYIASTGE-KFRIHEEGTN 468 + N++++ +P T+Y DG + + +G GY+ + G + + E TN Sbjct 1029 EFINTPPLIRLWYNLLSDPIPEAETFYVDGAANRDSKKGRAGYVTNRGRYRSKDLENTTN 1088 Query 469 QQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIG 528 QQ EL A++ A K ++NIVTDS+Y + D+ +PI +I++ + K I Sbjct 1089 QQAELWAVDLALKDSGAQVNIVTDSQYVMGVLQGLPDQS--DSPIVEQIIQKLTQKTAIY 1146 Query 529 VHWVPGHKGIPQNEEIDRYISE 550 + WVP HKGI NEE+D+ +S+ Sbjct 1147 LAWVPAHKGIGGNEEVDKLVSK 1168 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_89.6] Sequence ID: Q73368.3 Length: 1435 Range 1: 586 to 1140 Score:352 bits(903), Expect:1e-106, Method:Compositional matrix adjust., Identities:221/569(39%), Positives:322/569(56%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 706 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + +L ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRAKIEDLRQHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 826 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVVPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ +L G Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPTKDLIAELQKQGQGQWTYQIYQEPYKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 ++ + + +Q+ +A QK+ E I+ GK P LP ++E WI E + Sbjct 944 MRGAHTNDVKQLTEAVQKIATESIVIWGKTPKFKLPIQKETWEAWWTDYWQATWIPEWEF 1003 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +V T+Y DG + G GY+ G +K Sbjct 1004 VNTPPLVKLWYQLEKE----------PIVGAETFYVDGAANRDTKSGKAGYVTDRGRQKV 1053 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + ++I+E Sbjct 1054 VSLADTTNQKTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQ 1111 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D+ +S Sbjct 1112 LIKKEKVYLAWVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (JRCSF ISOLATE)] Sequence ID: P20875.3 Length: 1439 Range 1: 590 to 1144 Score:352 bits(903), Expect:1e-106, Method:Compositional matrix adjust., Identities:222/560(40%), Positives:325/560(58%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 590 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 649 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELN++T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 650 PVFAIKKKDSTKWRKLVDFRELNRRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 709 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 710 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 769 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 770 IIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 829 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 830 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVIPLT 887 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y +FQE K L Sbjct 888 KEAELELAENREILKEPVHGVYYDPSKDLIVEIQKQGQGQWTYQIFQEPFKNLKTGKYAR 947 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 + + + +Q+ +A QK+ E I+ GKIP LP ++E W + W+P F Sbjct 948 TRGAHTNDVKQLTEAVQKIANESIVIWGKIPKFKLPIQKETWETWWTEYWQATWIPEWEF 1007 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ S G +K + TNQ Sbjct 1008 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAANRETKLGKAGYVTSRGRQKVVSLTDTTNQ 1066 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + ++I+E + KEK+ + Sbjct 1067 KTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYL 1124 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1125 AWVPAHKGIGGNEQVDKLVS 1144 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 O_MVP5180] Sequence ID: Q79666.3 Length: 1446 Range 1: 587 to 1142 Score:352 bits(902), Expect:3e-106, Method:Compositional matrix adjust., Identities:222/561(40%), Positives:321/561(57%), Gaps:17/561(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I V+LK G GP + QWPL++EK+E L I +E+EGK+ R P NT Sbjct 587 NFPISPIAPVPVKLKPGMDGPKVKQWPLSREKIEALTAICQEMEQEGKISRIGPENPYNT 646 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHPGGL++++ VT+LD+GDAYF+ Sbjct 647 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPGGLKQRQSVTVLDVGDAYFSC 706 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +R+YT FT+ S NN P VRY + VLPQGWK SPA++Q +M KIL + + +P + Sbjct 707 PLDPDFRKYTAFTIPSVNNETPGVRYQYNVLPQGWKGSPAIFQSSMTKILDPFRKSNPEV 766 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 + Y+DD+Y+GSDL L EHR V L ++ Q+GF P+ K Q+ P W+G+ELHP+K Sbjct 767 EIYQYIDDLYVGSDLPLAEHRKRVELLREHLYQWGFTTPDKKHQKEPPFLWMGYELHPDK 826 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + + KL+ G ++L + Sbjct 827 WTVQPIQLPD--KEVWTVNDIQKLVGKLNWASQIYQGIRVKELCKLIRGTKSLTEVVPLS 884 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 E E R+KLKE + G YY +KD++ + G Y V+Q++ K L Sbjct 885 KEAELELEENREKLKEPVHGVYYQPDKDLWVSIQKHGEGQWTYQVYQDEHKNLKTGKYAR 944 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 + + +Q+ + QK++QE I+ GK+P LP E W + W+P F Sbjct 945 QKASHTNDIRQLAEVVQKVSQEAIVIWGKLPKFRLPVTRETWETWWAEYWQATWIPEWEF 1004 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTGEKFRIH-EEGTNQ 469 S W + +V T+Y DG + G GY+ G++ I EE TNQ Sbjct 1005 VSTPPLIKLWYQLET-EPIVGAETFYVDGAANRNTKLGKAGYVTEQGKQNIIKLEETTNQ 1063 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL A+ A + E++NIVTDS+Y + + +PI +I+E + KE++ + Sbjct 1064 KAELMAVLIALQDSKEQVNIVTDSQYVLGIISSQPTQS--DSPIVQQIIEELTKKERVYL 1121 Query 530 HWVPGHKGIPQNEEIDRYISE 550 WVP HKGI NE+ID+ +S+ Sbjct 1122 TWVPAHKGIGGNEKIDKLVSK 1142 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (RF/HAT ISOLATE)] Sequence ID: P05959.3 Length: 1436 Range 1: 587 to 1141 Score:351 bits(900), Expect:4e-106, Method:Compositional matrix adjust., Identities:222/569(39%), Positives:325/569(57%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 587 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 646 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 647 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 706 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 707 PLDKEFRKYTAFTIPSINNETPRIRYQYNVLPQGWKGSPAIFQSSMTKILEPFKKQNPEI 766 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 767 VIYQYMDDLYVGSDLEIGQHRIKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 826 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 827 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVVQLT 884 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 885 KEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 944 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 ++ + + +Q+ +A QK+ E I+ GK P LP ++E WI E + Sbjct 945 MRGAHTNDVKQLTEAVQKVATESIVIWGKTPKFKLPIQKETWEAWWTEYWQATWIPEWEF 1004 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K +I T+Y DG ++ G GY+ G +K Sbjct 1005 VNTPPLVKLW------YQLEKEPIIG----AETFYVDGAANRETKLGKAGYVTDRGRQKV 1054 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + ++I+E Sbjct 1055 VSLTDTTNQKTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQ 1112 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++DR +S Sbjct 1113 LIKKEKVYLAWVPAHKGIGGNEQVDRLVS 1141 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 N_YBF30] Sequence ID: O91080.3 Length: 1449 Range 1: 596 to 1150 Score:351 bits(900), Expect:4e-106, Method:Compositional matrix adjust., Identities:222/570(39%), Positives:322/570(56%), Gaps:37/570(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT EK+E L+EI +EKEGK+ R P NT Sbjct 596 NFPISPIETVPVKLKPGMDGPKVKQWPLTTEKIEALREICTEMEKEGKISRIGPENPYNT 655 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF+ Sbjct 656 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKQKKSVTVLDVGDAYFSC 715 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q TM KIL + E+HP I Sbjct 716 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSTMTKILEPFREKHPEI 775 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL L +HR V +L ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 776 IIYQYMDDLYVGSDLELAQHREAVEDLRDHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 835 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL Sbjct 836 WTVQPIKLPE--KDVWTVNDIQKLVGKLNWASQIYPGIRVKQLCKLIRGTKALTEVVNFT 893 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD K++ ++ G Y ++QE K L Sbjct 894 EEAELELAENREILKEPLHGVYYDPGKELVAEIQKQGQGQWTYQIYQELHKNLKTGKYAK 953 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 +++ + +Q+++ +K+ E I+ GK P LP ++E WI E + Sbjct 954 MRSAHTNDIKQLVEVVRKVATESIVIWGKTPKFRLPVQKEVWEAWWTDHWQATWIPEWEF 1013 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EK 459 N + W + E + G T+Y DG ++ G G++ G +K Sbjct 1014 VNTPPLVKLWY-----------QLETEPISGAETFYVDGAANRETKLGKAGFVTDRGRQK 1062 Query 460 FRIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIME 519 + TNQ+ EL+AI A ++ +NIVTDS+YA + D+ + + ++I+E Sbjct 1063 VVSIADTTNQKAELQAILMALQESGRDVNIVTDSQYAMGIIHSQPDKS--ESELVSQIIE 1120 Query 520 LVHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KE++ + WVP HKGI NE++D+ +S Sbjct 1121 ELIKKERVYLSWVPAHKGIGGNEQVDKLVS 1150 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:H_90CF056] Sequence ID: O93215.4 Length: 1435 Range 1: 586 to 1140 Score:351 bits(900), Expect:5e-106, Method:Compositional matrix adjust., Identities:222/565(39%), Positives:325/565(57%), Gaps:27/565(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ R P +T Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISRIGPENPYST 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK V++LD+GDAYF++ Sbjct 646 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVSVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + E++P + Sbjct 706 PLDKEFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILAPFREQNPEM 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL +++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRAKIEELRAHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLIGK-SIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + + + KL+ G +AL + Sbjct 826 WTVQTVKLPE--KDSWTVNDIQKLVGKLNWASQIYPNIKVKQLCKLLRGAKALTDIIPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQL-DWGNKAIEYIVFQE------KGKPLW 351 E R+ L+E + G YYD KD+ ++ G Y ++QE GK Sbjct 884 KEAELELAENREILREPIHGVYYDPSKDLIAEIRKQGQGQWTYQIYQEPFKNLKTGKYAK 943 Query 352 VNVVHS--IKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINW 408 + H+ IK L+ +A QK++ E I+ GKIP LP ++E W + W Sbjct 944 MRTAHTNDIKQLT------EAVQKISTESIVIWGKIPKFRLPIQKETWETWWTEYWQATW 997 Query 409 MPSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHE 464 +P + + + E + G TYY DG ++ G GY+ G +K Sbjct 998 IPEWEFVNTPHLVKLWYQLETEPIAGAETYYIDGAANRETKLGKAGYVTDRGKQKVVSLT 1057 Query 465 EGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNK 524 E TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + +I+E + K Sbjct 1058 ETTNQKTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEELIKK 1115 Query 525 EKIGVHWVPGHKGIPQNEEIDRYIS 549 EK+ + WVP HKGI NE++D+ +S Sbjct 1116 EKVYLSWVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:J_SE9280] Sequence ID: Q9WC54.3 Length: 1432 Range 1: 583 to 1137 Score:351 bits(900), Expect:5e-106, Method:Compositional matrix adjust., Identities:220/569(39%), Positives:322/569(56%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP I QWPLT+EK++ L +I +E+EGK+ R P NT Sbjct 583 NFPISPIETVPVKLKPGMDGPKIKQWPLTEEKIKALTQICAEMEEEGKISRVGPENPYNT 642 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 643 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 702 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYE +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL+ + E +P I Sbjct 703 PLYEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILKPFRERNPEI 762 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL +E+HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 763 VIYQYMDDLYVGSDLEIEQHRRKIKELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 822 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL++G +AL + Sbjct 823 WTVQPIQLPEKEDW--TVNDIQKLVGKLNWASQIYPGIKVKQLCKLLKGAKALTDIVPLT 880 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 E ++ LKE + G YYD K++ ++ G Y ++QE K L Sbjct 881 REAELELAENKEILKEPVHGVYYDSAKELIAEVQKQGLDQWTYQIYQEPFKNLKTGKYAK 940 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 + + +Q+ + QK+ E I+ GK P LP + E WI E + Sbjct 941 RRSAHTNDVKQLAEVVQKIALEAIVIWGKTPKFRLPIQRETWETWWTDYWQATWIPEWEF 1000 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K ++ T+Y DG ++ G GY+ G +K Sbjct 1001 VNTPPLVKLW------YQLEKEPIMG----AETFYVDGASNRETKTGKAGYVTDKGRQKV 1050 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL AI A + ++NIVTDS+YA + D+ + + +I+E Sbjct 1051 VTLTDTTNQKTELHAIYLALRDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEE 1108 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D+ +S Sbjct 1109 LIKKEKVYLSWVPAHKGIGGNEQVDKLVS 1137 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (STRAIN UGANDAN / ISOLATE U455)] Sequence ID: P24740.3 Length: 1428 Range 1: 579 to 1133 Score:350 bits(899), Expect:5e-106, Method:Compositional matrix adjust., Identities:222/562(40%), Positives:322/562(57%), Gaps:21/562(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK GP + QWPLT+EK++ L EI + +EKEGK+ + P NT Sbjct 579 NFPISPIETVPVKLKPEMDGPKVKQWPLTEEKIKALTEICNEMEKEGKISKIGPENPYNT 638 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PH GL++KK VT+LD+GDAYF++ Sbjct 639 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHTAGLKKKKSVTVLDVGDAYFSV 698 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P VRY + VLPQGWK SP+++Q +M KIL + +HP I Sbjct 699 PLDESFRKYTAFTIPSINNETPGVRYQYNVLPQGWKGSPSIFQSSMTKILEPFRSQHPDI 758 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL +++ +GF+ P+ K Q+ P W+G+ELHP+K Sbjct 759 VIYQYMDDLYVGSDLEIGQHRAKIEELRAHLLSWGFITPDKKHQKEPPFLWMGYELHPDK 818 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 819 WTVQPIQLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGAKALTDIVTLT 876 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 E R+ LK+ + G YYD KD+ ++ G Y ++QE K L Sbjct 877 EEAELELAENREILKDPVHGVYYDPSKDLVAEIQKQGQDQWTYQIYQEPFKNLKTGKYAR 936 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 + + +Q+ + QK++ E I+ GKIP LP ++E W ++ W+P F Sbjct 937 KRSAHTNDVKQLTEVVQKVSTESIVIWGKIPKFRLPIQKETWEAWWMEYWQATWIPEWEF 996 Query 413 WSCYKGSVRWKK--RNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGT 467 + W + ++ IA T+Y DG ++ G GY+ G +K E T Sbjct 997 VNTPPLVKLWYQLEKDPIAG---AETFYVDGAANRETKLGKAGYVTDRGRQKVVSLTETT 1053 Query 468 NQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKI 527 NQ+ EL AI A + ++NIVTDS+YA + D + I +I+E + KEK+ Sbjct 1054 NQKTELHAIHLALQDSGSEVNIVTDSQYALGIIQAQPDRS--ESEIVNQIIEKLIEKEKV 1111 Query 528 GVHWVPGHKGIPQNEEIDRYIS 549 + WVP HKGI NE++D+ +S Sbjct 1112 YLSWVPAHKGIGGNEQVDKLVS 1133 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (BRU ISOLATE)] Sequence ID: P03367.3 Length: 1447 Range 1: 598 to 1152 Score:350 bits(897), Expect:1e-105, Method:Compositional matrix adjust., Identities:220/569(39%), Positives:322/569(56%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 598 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 657 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 658 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 717 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 718 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 777 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+K Sbjct 778 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDK 837 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 838 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 895 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 896 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 955 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 + + + +Q+ +A QK+T E I+ GK P LP ++E WI E + Sbjct 956 TRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1015 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +V T+Y DG ++ G GY+ + G +K Sbjct 1016 VNTPPLVKLWYQLEKE----------PIVGAETFYVDGAASRETKLGKAGYVTNRGRQKV 1065 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + +I+E Sbjct 1066 VTLTDTTNQKTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQ 1123 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D+ +S Sbjct 1124 LIKKEKVYLAWVPAHKGIGGNEQVDKLVS 1152 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_HXB2R] Sequence ID: P04585.4 Length: 1435 Range 1: 586 to 1140 Score:349 bits(896), Expect:2e-105, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:324/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 706 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 826 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 ++ + + +Q+ +A QK+T E I+ GK P LP ++E W + W+P F Sbjct 944 MRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1003 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ + G +K + TNQ Sbjct 1004 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAANRETKLGKAGYVTNRGRQKVVTLTDTTNQ 1062 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + Sbjct 1063 KTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDQS--ESELVNQIIEQLIKKEKVYL 1120 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1121 AWVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:F1_93BR020] Sequence ID: O89290.3 Length: 1430 Range 1: 587 to 1135 Score:349 bits(896), Expect:2e-105, Method:Compositional matrix adjust., Identities:216/554(39%), Positives:323/554(58%), Gaps:17/554(3%) Query 8 IPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIK 67 I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NTP+F IK Sbjct 587 IETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICMEMEKEGKISKIGPENPYNTPVFAIK 646 Query 68 KK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPY 126 KK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++PL + + Sbjct 647 KKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSVPLDKDF 706 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 R+YT T+ S NN P VRY + VLPQGWK SPA++Q++M KIL + ++P I YM Sbjct 707 RKYTASTIPSTNNETPGVRYQYNVLPQGWKGSPAIFQYSMTKILDPFRAKNPDIVIYQYM 766 Query 187 DDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKH 246 DD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+KW Q Sbjct 767 DDLYVGSDLEIGQHRTKIEELREHLLKWGLTTPDKKHQKEPPFLWMGYELHPDKWTVQPI 826 Query 247 TLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVRE 305 LP+ + T+N +QKLVG L W + G + + KL+ G +AL + + E Sbjct 827 QLPD--KDSWTVNDIQKLVGKLNWASQIYPGIKVKQLCKLLRGAKALTDIVPLTTEAELE 884 Query 306 WEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHSIKN--L 361 R+ LKE + G YYD KD+ ++ G Y ++QE K L +++ Sbjct 885 LAENREILKEPVHGAYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAKMRSAHT 944 Query 362 SqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SFWSCYKG 418 + +Q+ +A QK++ E I+ GK P LP +E W + W+P F + Sbjct 945 NDVKQLTEAVQKISLESIVIWGKTPKFRLPILKETWDTWWTEYWQATWIPEWEFVNTPPL 1004 Query 419 SVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTGEKFRIH-EEGTNQQLELRA 475 W + +V T+Y DG ++ +G GY+ G + + E TNQ+ EL+A Sbjct 1005 VKLWYQLET-EPIVGAETFYVDGASNRETKKGKAGYVTDRGRQKAVSLTETTNQKAELQA 1063 Query 476 IEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGH 535 I+ A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + WVP H Sbjct 1064 IQLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKEKVYLSWVPAH 1121 Query 536 KGIPQNEEIDRYIS 549 KGI NE++D+ +S Sbjct 1122 KGIGGNEQVDKLVS 1135 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:J_SE9173] Sequence ID: Q9WC63.3 Length: 1432 Range 1: 583 to 1137 Score:349 bits(895), Expect:2e-105, Method:Compositional matrix adjust., Identities:223/575(39%), Positives:323/575(56%), Gaps:47/575(8%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP I QWPLT+EK++ L +I LE+EGK+ R P NT Sbjct 583 NFPISPIETVPVKLKPGMDGPKIKQWPLTEEKIKALTQICAELEEEGKISRIGPENPYNT 642 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 643 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 702 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PLYE +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL+ + E +P I Sbjct 703 PLYEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILKPFRERNPEI 762 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL +E+HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 763 VIYQYMDDLYVGSDLEIEQHRRKIKELREHLLKWGFYTPDKKHQKEPPFLWMGYELHPDK 822 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G I + KL+ G +AL + Sbjct 823 WTVQPIQLPEKEDW--TVNDIQKLVGKLNWASQIYPGIKIKELCKLIRGAKALTDIVPLT 880 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNV--- 354 E ++ LKE + G YYD +++ ++ G Y ++QE K L Sbjct 881 REAELELAENKEILKEPVHGVYYDPARELIAEVQKQGLDQWTYQIYQEPFKNLKTGKYAK 940 Query 355 -----VHSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DW 397 + +K LS + QK+ E I+ GK P LP ++E W Sbjct 941 RRSAHTNDVKQLS------QVVQKIALEAIVIWGKTPKFRLPIQKETWETWWTDYWQATW 994 Query 398 ILELQMGNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIAS 455 I E + N + W + +K ++ T+Y DG ++ G GY+ Sbjct 995 IPEWEFVNTPPLVKLW------YQLEKEPIMG----AETFYVDGASNRETKVGKAGYVTD 1044 Query 456 TG-EKFRIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQ 514 G +K + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + Sbjct 1045 KGRQKVITLTDTTNQKTELQAIYLALQDSGIEVNIVTDSQYALGIIQAQPDKS--ESELV 1102 Query 515 ARIMELVHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 +I+E + KEK+ + WVP HKGI NE++D+ +S Sbjct 1103 NQIIEELIKKEKVYLSWVPAHKGIGGNEQVDKLVS 1137 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:B_ARV2/SF2] Sequence ID: P03369.3 Length: 1437 Range 1: 588 to 1142 Score:349 bits(895), Expect:2e-105, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:326/560(58%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 588 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 647 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 648 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 707 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 708 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 767 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 768 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 827 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 828 WTVQPIMLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVKQLCKLLRGTKALTEVIPLT 885 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + YYD KD+ ++ G Y ++QE K L Sbjct 886 EEAELELAENREILKEPVHEVYYDPSKDLVAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 945 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 ++ + + +Q+ +A QK++ E I+ GKIP LP ++E W ++ W+P F Sbjct 946 MRGAHTNDVKQLTEAVQKVSTESIVIWGKIPKFKLPIQKETWEAWWMEYWQATWIPEWEF 1005 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ G +K + TNQ Sbjct 1006 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAANRETKLGKAGYVTDRGRQKVVSIADTTNQ 1064 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + ++I+E + KEK+ + Sbjct 1065 KTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQLIKKEKVYL 1122 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1123 AWVPAHKGIGGNEQVDKLVS 1142 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (NEW YORK-5 ISOLATE)] Sequence ID: P12497.4 Length: 1435 Range 1: 586 to 1140 Score:348 bits(893), Expect:3e-105, Method:Compositional matrix adjust., Identities:222/569(39%), Positives:324/569(56%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKQKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 706 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRKQNPDI 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 826 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVVPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 +K + + +Q+ +A QK+ E I+ GK P LP ++E WI E + Sbjct 944 MKGAHTNDVKQLTEAVQKIATESIVIWGKTPKFKLPIQKETWEAWWTEYWQATWIPEWEF 1003 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K +I T+Y DG ++ G GY+ G +K Sbjct 1004 VNTPPLVKLW------YQLEKEPIIG----AETFYVDGAANRETKLGKAGYVTDRGRQKV 1053 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + ++I+E Sbjct 1054 VPLTDTTNQKTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVSQIIEQ 1111 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D +S Sbjct 1112 LIKKEKVYLAWVPAHKGIGGNEQVDGLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (isolate YU2)] Sequence ID: P35963.3 Length: 1435 Range 1: 586 to 1140 Score:348 bits(893), Expect:4e-105, Method:Compositional matrix adjust., Identities:220/569(39%), Positives:323/569(56%), Gaps:35/569(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +R+YT FT+ S NN P RY + VLPQGWK SPA++Q +M IL + +++P + Sbjct 706 PLHEDFRKYTAFTIPSINNETPGTRYQYNVLPQGWKGSPAIFQSSMTTILEPFRKQNPDL 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W Q G + + KL+ G +AL + Sbjct 826 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYAGIKVRQLCKLLRGTKALTEVIPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 + + + +Q+ +A QK+ E I+ GK P LP ++E WI E + Sbjct 944 TRGAHTNDVKQLTEAVQKIATESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1003 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K +I T+Y DG ++ G GY+ + G +K Sbjct 1004 VNTPPLVKLW------YQLEKEPIIG----AETFYVDGAANRETKLGKAGYVTNKGRQKV 1053 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D + + ++I+E Sbjct 1054 VSLTDTTNQKTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDRS--ESELVSQIIEQ 1111 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYIS 549 + KEK+ + WVP HKGI NE++D+ +S Sbjct 1112 LIKKEKVYLAWVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:K_96CM-MP535] Sequence ID: Q9QBY3.3 Length: 1430 Range 1: 581 to 1135 Score:348 bits(892), Expect:6e-105, Method:Compositional matrix adjust., Identities:219/560(39%), Positives:324/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 581 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTEMEKEGKISKIGPENPYNT 640 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 641 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 700 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P VRY + VLPQGWK SPA++Q +M KIL + ++P + Sbjct 701 PLDKDFRKYTAFTIPSINNETPGVRYQYNVLPQGWKGSPAIFQHSMTKILEPFRIKNPEM 760 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + + R + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 761 VIYQYMDDLYVGSDLEIGQPRTKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 820 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 821 WTVQPIQLPD--KDSWTVNDIQKLVGKLNWASQIYPGIKVKQLCKLLRGVKALTDIVPLT 878 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ LKE + G YYD KD+ ++ GN Y ++QE K L Sbjct 879 AEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGNDQWTYQIYQEPHKNLKTGKYAR 938 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 +++ + +Q+ +A QK+ E I+ GK P LP ++E W + W+P F Sbjct 939 MRSAHTNDVKQLTEAVQKIATEGIVIWGKTPKFRLPIQKETWETWWTEYWQATWIPEWEF 998 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ +G GY+ G +K E TNQ Sbjct 999 VNTPPLVKLWYQLET-EPIVGAETFYVDGAAHRETKKGRAGYVTDRGRQKVVSITETTNQ 1057 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KE+I + Sbjct 1058 KAELQAICLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESDLVNQIIEQLIKKERIYL 1115 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1116 SWVPAHKGIGGNEQVDKLVS 1135 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 BH10] Sequence ID: P03366.3 Length: 1447 Range 1: 598 to 1152 Score:347 bits(891), Expect:8e-105, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:323/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 598 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 657 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 658 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 717 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 718 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFKKQNPDI 777 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+K Sbjct 778 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDK 837 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 838 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 895 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 896 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 955 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 ++ + + +Q+ +A QK+T E I+ GK P LP ++E W + W+P F Sbjct 956 MRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1015 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ + G +K TNQ Sbjct 1016 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAANRETKLGKAGYVTNKGRQKVVPLTNTTNQ 1074 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + Sbjct 1075 KTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKEKVYL 1132 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1133 AWVPAHKGIGGNEQVDKLVS 1152 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE BEN)] Sequence ID: P18096.4 Length: 1550 Range 1: 610 to 1174 Score:348 bits(893), Expect:8e-105, Method:Compositional matrix adjust., Identities:230/574(40%), Positives:327/574(56%), Gaps:29/574(5%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI +V LK G GP + QWPLT+EK+E LKEI +++EKEG++ APP NT Sbjct 610 NLPVAKIEPIKVTLKPGKDGPRLKQWPLTKEKIEALKEICEKMEKEGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +KK ++ILD+GDAYF+I Sbjct 670 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKKRISILDVGDAYFSI 729 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ + NN+ P RY +KVLPQGWK SPA++Q+TM+++L + + +P + Sbjct 730 PLHEDFRQYTAFTLPAVNNMEPGKRYIYKVLPQGWKGSPAIFQYTMRQVLEPFRKANPDV 789 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G EL P K Sbjct 790 ILIQYMDDILIASDRTGLEHDKVVLQLKELLNGLGFSTPDEKFQKDPPFQWMGCELWPTK 849 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVW-RQSLIGKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W Q G ++ +L+ G L E ++ Sbjct 850 WKLQKLQLPQ--KDIWTVNDIQKLVGVLNWAAQIYSGIKTKHLCRLIRGKMTLTEE--VQ 905 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYGQLDWGN-KAIEYIVFQEKGKPLWVNVV 355 + E E K+ +E EG YY EEK++ + Y + QE+ K L V Sbjct 906 WTELAEAELEENKIILSQEQEGYYYQEEKELEATIQKSQGHQWTYKIHQEE-KILKVGKY 964 Query 356 HSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWMP 410 IKN + + + + QK+ +E ++ G+IP LP E W E N + W+P Sbjct 965 AKIKNTHTNGVRLLAQVVQKIGKEALVIWGRIPKFHLPVERETW--EQWWDNYWQVTWIP 1022 Query 411 SFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEG 466 + + N++ + +PG T+YTDG +++ G GY+ G +K ++ E+ Sbjct 1023 EWDFVSTPPLVRLTFNLVGDPIPGAETFYTDGSCNRQSKEGKAGYVTDRGKDKVKVLEQT 1082 Query 467 TNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK 526 TNQQ EL A K+NI+ DS+Y + E N I +I+E + KE Sbjct 1083 TNQQAELEVFRMALADSGPKVNIIVDSQYVMGIVAGQPTES--ENRIVNQIIEEMIKKEA 1140 Query 527 IGVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 + V WVP HKGI N+E+D +S+ +FL K Sbjct 1141 VYVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1174 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [SIVcpz EK505] Sequence ID: Q1A249.3 Length: 1448 Range 1: 595 to 1149 Score:347 bits(891), Expect:8e-105, Method:Compositional matrix adjust., Identities:218/559(39%), Positives:320/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ R P NT Sbjct 595 NFPISPIETIPVKLKPGMDGPRVKQWPLTEEKIKALTEICTEMEKEGKISRIGPENPYNT 654 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF+ Sbjct 655 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSC 714 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q TM KIL + + +P + Sbjct 715 PLDENFRKYTAFTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQSTMTKILEPFRKNNPEL 774 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR V L S++ +GF P+ K Q+ P W+G+ELHP+K Sbjct 775 VIYQYMDDLYVGSDLEITQHREAVERLRSHLLTWGFTTPDKKHQKEPPFLWMGYELHPDK 834 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +Q+LVG L W + G + + KL+ G +AL + Sbjct 835 WTVQTIQLPE--KDTWTVNDIQQLVGKLNWASQIYPGIKVKQLCKLIRGAKALTEVVTLT 892 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YY+ +K++ ++ G Y ++Q+ K L Sbjct 893 REAELELAENREILKEPVHGAYYNPDKELIAEIQKQGQGQWTYQIYQDLHKNLKTGKYAK 952 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 +++ + +Q+ + QK+ E I+ GK P LP ++E W + W+P + Sbjct 953 MRSTHTNDIRQLTEVVQKVALESIVIWGKTPKFRLPVQKEVWETWWTEYWQATWIPDWEF 1012 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G TYY DG ++ G G++ G +K E TNQQ Sbjct 1013 VNTPPLVKLWYQLETEPISGAETYYVDGAANRETKLGKAGFVTDRGRQKVTSISETTNQQ 1072 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+A+ A + +++NIVTDS+Y + D+ + + +I+E + KE+I + Sbjct 1073 AELQAVLMALQDAGQEVNIVTDSQYVLGIIHSQPDKS--ESELVNQIIEELIKKERIYLS 1130 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE+ID+ +S Sbjct 1131 WVPAHKGIGGNEQIDKLVS 1149 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 lw12.3 isolate] Sequence ID: P0C6F2.1 Length: 1435 Range 1: 586 to 1140 Score:347 bits(891), Expect:9e-105, Method:Compositional matrix adjust., Identities:220/560(39%), Positives:323/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 706 PLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGLTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 826 WTVQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 ++ + + +Q+ +A QK+T E I+ GK P LP ++E W + W+P F Sbjct 944 MRGTHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1003 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ + G +K TNQ Sbjct 1004 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAANRETKLGKAGYVTNKGRQKVVPLTNTTNQ 1062 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + Sbjct 1063 KTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKEKVYL 1120 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1121 AWVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE CAM2)] Sequence ID: P24107.3 Length: 1462 Range 1: 610 to 1173 Score:347 bits(891), Expect:1e-104, Method:Compositional matrix adjust., Identities:232/573(40%), Positives:328/573(57%), Gaps:28/573(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI ++ LK G GP + QWPLT+EK+E LKEI +++EKEG++ APP NT Sbjct 610 NLPVAKIEPIKIMLKPGKDGPRLRQWPLTKEKIEALKEICEKMEKEGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F I+KK KWRMLIDFRELNK T+D E QLG+PHP GL +K+ +T+LD+GDAYF+I Sbjct 670 PTFAIRKKDKNKWRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKRRITVLDVGDAYFSI 729 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM+++L + + + + Sbjct 730 PLHEDFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRQVLEPFRKANSDV 789 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 790 IIIQYMDDILIASDRTDLEHDKVVLQLKELLNNLGFSTPDEKFQKDPPYRWMGYELWPTK 849 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ +L+ G L E Sbjct 850 WKLQKIQLPQ--KEVWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEEVQWT 907 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYG--QLDWGNKAIEYIVFQEKGKPLWVNVVH 356 + E E R L +E EG+YY EEK++ Q D N+ Y + QE+ K L V Sbjct 908 ELAEAELEENRIILSQEQEGHYYQEEKELEATVQKDQDNQWT-YKIHQEE-KILKVGKYA 965 Query 357 SIK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWMPS 411 IK + + + + + QK+ +E ++ G+IP LP E W E N + W+P Sbjct 966 KIKHTHTNGVKLLAQVVQKIGKEALV-IGRIPKFHLPVEREVW--EQWWDNYWQVTWIPD 1022 Query 412 FWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGT 467 + + N++ + +PG T+YTDG +++ G GY+ G +K +I E+ T Sbjct 1023 WDFVSTPPLVRLAFNLVGDPIPGTETFYTDGSCNRQSKEGKAGYVTDRGRDKVKILEQTT 1082 Query 468 NQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKI 527 NQQ EL A A K NI+ DS+Y + E N I +I+E + KE I Sbjct 1083 NQQAELEAFAMALTDSGPKANIIVDSQYVMGIVAGQPTES--ENRIVNQIIEEMIKKEAI 1140 Query 528 GVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 V WVP HKGI N+E+D +S+ +FL K Sbjct 1141 YVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1173 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr170Gag-Pol; Contains: RecName: Full=Matrix protein p16; Short=MA; Contains: RecName: Full=Capsid protein p26; Short=CA; Contains: RecName: Full=Transframe peptide; AltName: Full=p11; Contains: RecName: Full=Protease; AltName: Full=P119; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; AltName: Full=P72; Contains: RecName: Full=Integrase; Short=IN [Jembrana disease virus] Sequence ID: Q82851.1 Length: 1432 Range 1: 553 to 1078 Score:347 bits(890), Expect:1e-104, Method:Compositional matrix adjust., Identities:215/537(40%), Positives:317/537(59%), Gaps:16/537(2%) Query 21 GPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSG-KWRMLIDF 79 GP + QWPLT EK + LKEIV+ L K+GK+ R P NTP+F IKKK G KWRML+DF Sbjct 553 GPRVPQWPLTLEKYKALKEIVEELLKDGKISRTPWDNPFNTPVFVIKKKGGSKWRMLMDF 612 Query 80 RELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNN 139 R LNK T E Q+GLP+P G+Q+ +H+T +DI DAYFTIPL E +RQYT F+++ N Sbjct 613 RALNKVTNKGQEFQIGLPYPPGIQQCEHITAIDIKDAYFTIPLDENFRQYTAFSVVPVNR 672 Query 140 LGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 GP RY+W VLPQGW SPA+YQ T Q+I+ + P I YMDD+ IGSD + Sbjct 673 EGPLERYHWNVLPQGWVCSPAIYQTTTQEIIAEIKDRFPDIVLYQYMDDLLIGSD--RPD 730 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 H+ +V+E+ + YGF PE+K QE +WLG+EL P++W+FQ + + +T+N Sbjct 731 HKRVVSEIREELGAYGFKTPEEKIQEEQ-VQWLGYELTPKRWRFQPRQIK--IKKVVTVN 787 Query 260 KLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEME-G 318 +LQ+++G+ VW Q + + + L++G L+ + + ++ E ++LK+ E Sbjct 788 ELQQMIGNCVWVQPEVKIPLSPLSDLLKGKTDLKDKIKLTEEAIQCLETVNKRLKDPEWK 847 Query 319 NYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVH-SIKNLSqaqqiikaaqkLTQE 377 E ++ ++ + + Y + Q+ G P+W V + ++ ++++ +KL++ Sbjct 848 ERIKEGTELVVKIQLIPEGVVYDLLQD-GNPIWGGVKGWDYNHANKIKKMLSIMKKLSRI 906 Query 378 VIIRTGKIPWILLPGREEDWILELQ-MGNINWMPSFWSCYKGSVRWKKRNVIAELVPG-P 435 V+I TG+ L+PG EDW LQ + + +P YK + RW +V ++ P Sbjct 907 VMIMTGREVSFLIPGDSEDWESALQRINTLTEIPEV-KFYKHACRWT--SVCGPVIERYP 963 Query 436 TYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRY 495 TYYTDGGKK + + Y G+ R GTNQQ EL+A+ A + GP KMNI+TDSRY Sbjct 964 TYYTDGGKKGSKAAAAYWRE-GKIRREVFPGTNQQAELKAVLMALQDGPAKMNIITDSRY 1022 Query 496 AYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQNEEIDRYISEIF 552 A+E M R E R + I E + KE +GV WVPGHKGI N E+D+ + + Sbjct 1023 AFEGM-REEPETWGREGLWKEIGEELRRKEYVGVSWVPGHKGIGGNTEVDQEVQKAL 1078 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:C_ETH2220] Sequence ID: Q75002.3 Length: 1439 Range 1: 590 to 1144 Score:347 bits(890), Expect:1e-104, Method:Compositional matrix adjust., Identities:216/559(39%), Positives:318/559(56%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L I + +E+EGK+ R P NT Sbjct 590 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTAICEEMEQEGKISRIGPENPYNT 649 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 650 PVFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 709 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SP ++Q +M +IL + +P I Sbjct 710 PLDEGFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPPIFQSSMPQILEPFRAPNPEI 769 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 770 VIYQYMDDLYVGSDLEIGQHRAPIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 829 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 830 WTVQPIQLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGAKALTDIVTLT 887 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 E R+ LKE + G +YD KD+ ++ GN + +QE K L Sbjct 888 EEAELELAENREILKEPVHGVFYDPSKDLIAEIQKQGNDQWTFQFYQEPFKNLKTGKFAK 947 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 + + +Q+ QK+ E I+ GK P LP ++E W W+P + Sbjct 948 RGTAHTNDVKQLTAVVQKIALESIVIWGKTPKFRLPIQKETWEAWWTDYWQATWIPEWEF 1007 Query 415 CYKGSVRWKKRNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G T+Y DG ++ G GY+ G +K E TNQ+ Sbjct 1008 VNTPPLVKLWYQLEKEPIAGVETFYVDGAANRETKIGKAGYVTDRGRQKIVSLTETTNQK 1067 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI+ A + ++NIVTDS+YA +L D+ + I +I+E + +KE++ + Sbjct 1068 TELQAIQLALQDSGSEVNIVTDSQYALGIILAQPDKS--ESEIVNQIIEQLISKERVYLS 1125 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1126 WVPAHKGIGGNEQVDKLVS 1144 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:H_VI991] Sequence ID: Q9Q720.3 Length: 1436 Range 1: 587 to 1141 Score:347 bits(890), Expect:1e-104, Method:Compositional matrix adjust., Identities:217/559(39%), Positives:323/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL I + V LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 587 NLPISPIETVPVTLKPGMDGPKVKQWPLTEEKIKALTEICLEMEKEGKISKIGPENPYNT 646 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S +WR L+DFRELNK+T+D E QLG+PHP GL++KK V++LD+G AYF++ Sbjct 647 PIFAIKKKNSTRWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVSVLDVGGAYFSV 706 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P + Sbjct 707 PLHEDFRKYTAFTIPSTNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEV 766 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL +++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 767 IIYQYMDDLYVGSDLEIGQHREKIEELRAHLLRWGFTTPDQKHQKEPPFLWMGYELHPDK 826 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + L+ G +AL + Sbjct 827 WTVQPVKLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVKQLCXLLRGAKALTEIVPLT 884 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD K++ ++ G Y ++QE K L Sbjct 885 KEAELELAENREILKEPVHGAYYDPSKELIAEIQKQGPDQWTYQIYQEPFKNLKTGKYAK 944 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 +++ + +Q+ + QK+ E I+ GKIP LP ++E W + W+P + Sbjct 945 MRSAHTNDVKQLTEVVQKIATESIVIWGKIPKFRLPIQKETWETWWTEHWQATWIPEWEF 1004 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G TYY DG ++ G GY+ G +K E TNQ+ Sbjct 1005 VNTPHLVKLWYQLETEPIEGAETYYVDGAANRETKMGKAGYVTDRGKQKIVSLTETTNQK 1064 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI A ++ ++NIVTDS+YA + D+ + + +I+E + KEK + Sbjct 1065 TELQAIYLALQESGPEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEELIKKEKFYLS 1122 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1123 WVPAHKGIGGNEQVDKLVS 1141 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (BH5 ISOLATE)] Sequence ID: P04587.3 Length: 1447 Range 1: 598 to 1152 Score:347 bits(889), Expect:2e-104, Method:Compositional matrix adjust., Identities:219/560(39%), Positives:322/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 598 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 657 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELN++T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 658 PVFAIKKKDSTKWRKLVDFRELNRRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 717 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P Y + VLPQGWK SPA++Q +M KIL + +++P I Sbjct 718 PLDEDFRKYTAFTIPSINNETPGSGYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPDI 777 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 778 VIYQYMDDLYVGSDLEIGQHRTKIEELRQHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 837 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 838 WTIQPIVLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 895 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 896 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQGQWTYQIYQEPFKNLKTGKYAR 955 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 ++ + + +Q+ +A QK+T E I+ GK P LP ++E W + W+P F Sbjct 956 MRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWETWWTEYWQATWIPEWEF 1015 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQ 469 + W + +V T+Y DG ++ G GY+ + G +K TNQ Sbjct 1016 VNTPPLVKLWYQLEK-EPIVGAETFYVDGAASRETKLGKAGYVTNRGRQKVVTLTHTTNQ 1074 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + KEK+ + Sbjct 1075 KTELQAIHLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKEKVYL 1132 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1133 AWVPAHKGIGGNEQVDKLVS 1152 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:K_97ZR-EQTB11] Sequence ID: Q9QBZ9.2 Length: 1429 Range 1: 581 to 1135 Score:346 bits(888), Expect:2e-104, Method:Compositional matrix adjust., Identities:215/560(38%), Positives:322/560(57%), Gaps:17/560(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 581 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALVEICTEMEKEGKISKIGPENPYNT 640 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KW L+DFRELNK+T D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 641 PVFAIKKKDSTKWIKLVDFRELNKRTPDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 700 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++P + Sbjct 701 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRRKNPDM 760 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 761 VLYQYMDDLYVGSDLEIGQHRAKIEELREHLLRWGFTTPDKKHQKEPPFLWMGYELHPDK 820 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP+ + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 821 WTVQPIQLPD--KDSWTVNDIQKLVGKLNWASQIFPGIKVKQLCKLLRGVKALTDIVPLT 878 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 + E R+ LKE + G YYD KD+ ++ G+ Y ++QE K L Sbjct 879 AEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGHGQWTYQIYQEPYKNLKTGKYAR 938 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SF 412 I++ + +Q+ + QK+ E I+ GK P LP ++E W + W+P F Sbjct 939 IRSAHTNDVKQLTEVVQKVAMESIVIWGKTPKFRLPIQKETWGTWWTEYWQATWIPEWEF 998 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTGEKFRIH-EEGTNQ 469 + W + +V T+Y DG ++ +G GY+ G + I E TNQ Sbjct 999 VNTPPLVKLWYQLET-EPIVGAETFYVDGAANRETKQGKAGYVTDKGRQKVISITETTNQ 1057 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGV 529 + EL+AI A + ++NIVTDS+YA + D+ + + +I+E + K+++ + Sbjct 1058 KTELQAIHLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKDRVYL 1115 Query 530 HWVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1116 SWVPAHKGIGGNEQVDKLVS 1135 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 N_YBF106] Sequence ID: Q9IDV9.3 Length: 1449 Range 1: 596 to 1150 Score:345 bits(886), Expect:4e-104, Method:Compositional matrix adjust., Identities:219/559(39%), Positives:317/559(56%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT EK+E L+EI +EKEGK+ R P NT Sbjct 596 NFPISPIETVPVKLKPGMDGPRVKQWPLTAEKIEALREICTEMEKEGKISRIGPENPYNT 655 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T++ E QLG+PHP GL++KK VT+ D+GDAYF+ Sbjct 656 PIFAIKKKDSTKWRKLVDFRELNKRTQEFWEVQLGIPHPAGLKQKKSVTVXDVGDAYFSC 715 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++HP I Sbjct 716 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKKHPEI 775 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR V EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 776 IIYQYMDDLYVGSDLEIAQHRETVEELRGHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 835 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL Sbjct 836 WTVQPIKLPE--KEVWTVNDIQKLVGKLNWASQIYPGIKVKQLCKLIRGTKALTEVVTFT 893 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD K++ ++ G Y ++QE K L Sbjct 894 QEAELELAENREILKEPLHGVYYDPGKELIAEIQKQGQGQWTYQIYQEPYKNLKTGKYAK 953 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 ++ + +++ QK+ E I+ GK P LP ++E W + W+P + Sbjct 954 XRSAHTNDIKELAAVVQKVATESIVIWGKTPKFKLPVQKEVWETWWTEHWQATWIPEWEF 1013 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G TYY DG K+ G G++ G +K E TNQ+ Sbjct 1014 VNTPPLVKLWYQLETEPISGAETYYVDGAANKETKLGKAGFVTDRGRQKVVSIENTTNQK 1073 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI A ++ ++ NIVTDS+YA + D+ + + +I+E + KE++ + Sbjct 1074 AELQAILLALQESGQEANIVTDSQYAMGIIHSQPDKS--ESDLVGQIIEELIKKERVYLS 1131 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D +S Sbjct 1132 WVPAHKGIGGNEQVDXLVS 1150 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE GHANA-1)] Sequence ID: P18042.4 Length: 1464 Range 1: 611 to 1165 Score:346 bits(887), Expect:4e-104, Method:Compositional matrix adjust., Identities:225/564(40%), Positives:323/564(57%), Gaps:24/564(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI +V LK G GP + QWPLT+EK+E L+EI +++EKEG++ APP NT Sbjct 611 NLPIAKIEPIKVTLKPGKDGPRLRQWPLTKEKIEALREICEKMEKEGQLEEAPPTNPYNT 670 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELN+ T+D E QLG+PHP GL +KK +T+LD+GDAYF+I Sbjct 671 PTFAIKKKDKNKWRMLIDFRELNRVTQDFTEIQLGIPHPAGLAKKKRITVLDVGDAYFSI 730 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q TM+++L + + +P + Sbjct 731 PLHEDFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQHTMRQVLEPFRKANPDV 790 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 791 ILIQYMDDILIASDRTGLEHDKVVLQLKELLNGLGFSTPDEKFQKDPPLQWMGYELWPTK 850 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ +L++G L E ++ Sbjct 851 WKLQKLQLPQ--KEIWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIKGKMTLTEE--VQ 906 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYGQLDWG-NKAIEYIVFQEKGKPLWVNVV 355 + E E K+ +E EG YY EEK++ + + Y + QE+ K L V Sbjct 907 WTELAEAELEENKIILSQEQEGYYYQEEKELEATIQKNQDNQWTYKIHQEE-KILKVGKY 965 Query 356 HSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWMP 410 IKN + + + + QK+ +E ++ G+IP LP E W E N + W+P Sbjct 966 AKIKNTHTNGVRLLAQVVQKIGKEALVIWGRIPKFHLPVERETW--EQWWDNYWQVTWIP 1023 Query 411 SFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEG 466 + + N++ + +PG T+YTDG +++ G Y+ G +K R+ E Sbjct 1024 EWDFVSTPPLVRLTFNLVGDPIPGAETFYTDGSCNRQSKEGKARYVTDRGRDKVRVLERT 1083 Query 467 TNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK 526 TNQQ EL A K+NI+ DS+Y ++ E R I +I+E + KE Sbjct 1084 TNQQAELEAFAMTLTDSGPKVNIIVDSQYVMGIVVGQPTESESR--IVNQIIEDMIKKEA 1141 Query 527 IGVHWVPGHKGIPQNEEIDRYISE 550 + V WVP HKGI N+E+D +S+ Sbjct 1142 VYVAWVPAHKGIGGNQEVDHLVSQ 1165 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE ST)] Sequence ID: P20876.3 Length: 1463 Range 1: 610 to 1174 Score:345 bits(886), Expect:4e-104, Method:Compositional matrix adjust., Identities:231/575(40%), Positives:326/575(56%), Gaps:31/575(5%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI ++ LK G GP + QWPLT+EK+E LKEI +++E+EG++ APP NT Sbjct 610 NLPVAKIEPIKIMLKPGKDGPKLRQWPLTKEKIEALKEICEKMEREGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +KK +T+LD+GDAYF+I Sbjct 670 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKKRITVLDVGDAYFSI 729 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ S NN P RY +KV PQGWK SPA++Q+TM+++L + + +P I Sbjct 730 PLHEDFRQYTAFTLPSINNAEPGKRYIYKVSPQGWKGSPAIFQYTMRQVLEPFRKANPDI 789 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 790 ILIQYMDDILIASDRTDLEHDRVVLQLKELLNGLGFSTPDEKFQKDPPYQWMGYELWPTK 849 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK Q+ LP+ + T+N +QKLVG L W + G N+ +L+ G L E ++ Sbjct 850 WKLQRIQLPQ--KEVWTVNDIQKLVGVLNWAAQIYPGIKTRNLCRLIRGKMTLTEE--VQ 905 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYG--QLDWGNKAIEYIVFQEKGKPLWVNV 354 + E E K+ +E EG YY EEK++ Q D N+ I + GK L V Sbjct 906 WTELAEAELEENKIILSQEQEGCYYQEEKELEATVQKDQDNQWTYKI--HQGGKILKVGK 963 Query 355 VHSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWM 409 +KN + + + + QK+ +E ++ G+IP LP + W E N + W+ Sbjct 964 YAKVKNTHTNGVRLLAQVVQKIGKEALVIWGRIPKFHLPVERDTW--EQWWDNYWQVTWI 1021 Query 410 PSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEE 465 P + + N++ + + G T+YTDG K++ G GYI G +K R+ E+ Sbjct 1022 PDWDFISTPPLVRLVFNLVKDPILGAETFYTDGSCNKQSREGKAGYITDRGRDKVRLLEQ 1081 Query 466 GTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE 525 TNQQ EL A A K NI+ DS+Y + E + I +I+E + KE Sbjct 1082 TTNQQAELEAFAMAVTDSGPKANIIVDSQYVMGIVAGQPTES--ESKIVNQIIEEMIKKE 1139 Query 526 KIGVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 I V WVP HKGI N+E+D +S+ +FL K Sbjct 1140 AIYVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1174 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:G_92NG083] Sequence ID: O41798.3 Length: 1435 Range 1: 586 to 1140 Score:345 bits(885), Expect:5e-104, Method:Compositional matrix adjust., Identities:216/559(39%), Positives:321/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPRVKQWPLTEEKIKALTEICKDMEKEGKISKIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++K+ VT+LD+GDAYF++ Sbjct 646 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKRSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL ++P + Sbjct 706 PLDKDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPSRTKNPEM 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++G P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGLTTPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + ++ +L+ G +AL + Sbjct 826 WTVQPIQLPEKEDW--TVNDIQKLVGKLNWASQIYPGIKVKHLCRLLRGAKALTDIVPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 + E R+ LKE + G Y+D K++ ++ G Y ++QE K L Sbjct 884 AEAEMELAENREILKEPVHGVYHDPSKELIAEVQKQGPDQWTYQIYQEPYKNLKTGKYAK 943 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 + + +Q+ + QK+ E I+ GKIP LP R+E W + + W+P + Sbjct 944 RGSAHTNDVKQLTEVVQKIATEGIVIWGKIPKFKLPIRKETWEVWWTEYWQAAWIPEWEF 1003 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E +PG TYY DG ++ G G++ G +K E TNQ+ Sbjct 1004 VNTPPLVKLWYQLETEPIPGAETYYVDGAANRETKLGKAGHVTDKGKQKIITLTETTNQK 1063 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL AI+ A + ++NIVTDS+YA + D + + +I+E + KEK+ + Sbjct 1064 AELHAIQLALQDSRPEVNIVTDSQYALGIIQAQPDRS--GSELVNQIIEQLIKKEKVYLS 1121 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1122 WVPAHKGIGGNEQVDKLVS 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE D194)] Sequence ID: P17757.3 Length: 1462 Range 1: 609 to 1173 Score:345 bits(885), Expect:6e-104, Method:Compositional matrix adjust., Identities:222/573(39%), Positives:324/573(56%), Gaps:27/573(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ +V LK G GP + QWPLT+EK+E LKEI +++E+EG++ APP NT Sbjct 609 NLPVAKLDPIKVTLKPGKDGPRLKQWPLTKEKIEALKEICEKMEREGQLEEAPPTNPYNT 668 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELN+ T+D E QLG+PHP GL +KK +T+LD+GDAYF+I Sbjct 669 PTFAIKKKDKNKWRMLIDFRELNRVTQDFTEIQLGIPHPAGLAKKKRITVLDVGDAYFSI 728 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ S NN P RY +KVLPQGWK SPA++QF M++IL + + +P + Sbjct 729 PLHEDFRQYTAFTLPSVNNAEPEKRYVYKVLPQGWKGSPAIFQFMMRQILEPFRKANPDV 788 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 789 ILIQYMDDILIASDRTGLEHDKVVLQLKELLNGLGFSTPDEKFQKDPPFQWMGYELWPTK 848 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ KL+ G L E ++ Sbjct 849 WKLQKIQLPQ--KEIWTVNDIQKLVGVLNWAAQIYPGIKTKHLCKLIRGKMTLTEE--VQ 904 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVH 356 + E E K+ +E EG+YY EE+++ + + + L V Sbjct 905 WTELAEAELEENKIILSQEQEGSYYQEEEELEATVIKSQDNQWAYKIHQGERVLKVGKYA 964 Query 357 SIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWMPS 411 IKN + + + + QK+ +E ++ G++P LP + W E N + W+P Sbjct 965 KIKNTHTNGVRLLAQVVQKIGKEALVIWGRVPKFHLPVERDTW--EQWWDNYWQVTWVPE 1022 Query 412 FWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGT 467 + + N++ + +PG T+YTDG +++ G GY+ G ++ R+ E+ + Sbjct 1023 WDFVSTPPLVRLTFNLVGDPIPGTETFYTDGSCNRQSKEGKAGYVTDRGRDRVRVLEQTS 1082 Query 468 NQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKI 527 NQQ EL A A K+NI+ DS+Y + E N I +I+E + KE + Sbjct 1083 NQQAELEAFAMALADSGPKVNIIVDSQYVMGIVAGQPTES--ENRIVNQIIEDMIKKEAV 1140 Query 528 GVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 V WVP HKGI N+E+D +S+ +FL K Sbjct 1141 YVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1173 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 1 (ELI ISOLATE)] Sequence ID: P04589.3 Length: 1435 Range 1: 586 to 1141 Score:343 bits(880), Expect:2e-103, Method:Compositional matrix adjust., Identities:217/570(38%), Positives:325/570(57%), Gaps:35/570(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ R P NT Sbjct 586 NFPISPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICTDMEKEGKISRIGPENPYNT 645 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 646 PIFAIKKKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 705 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + +++P + Sbjct 706 PLDEDFRKYTAFTISSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILEPFRKQNPEM 765 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + +L ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 766 VIYQYMDDLYVGSDLEIGQHRTKIEKLREHLLRWGFTRPDKKHQKEPPFLWMGYELHPDK 825 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +Q LV L W + G + + KL+ G +AL + Sbjct 826 WTVQSIKLPE--KESWTVNDIQNLVERLNWASQIYPGIKVRQLCKLLRGTKALTEVIPLT 883 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G+ Y ++QE K L Sbjct 884 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGHGQWTYQIYQEPFKNLKTGKYAR 943 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREE------------DWILELQM 403 ++ + + +Q+ +A Q+++ E I+ G+ P LP ++E WI E + Sbjct 944 MRGAHTNDVKQLAEAVQRISTESIVIWGRTPKFRLPIQKETWETWWAEYWQATWIPEWEF 1003 Query 404 GNINWMPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKF 460 N + W + +K +I T+Y DG ++ G GY+ G +K Sbjct 1004 VNTPPLVKLW------YQLEKEPIIG----AETFYVDGAANRETKLGKAGYVTDRGRQKV 1053 Query 461 RIHEEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMEL 520 + TNQ+ EL+AI A + ++NIVTDS+YA + D+ + + +I+E Sbjct 1054 VPLTDTTNQKTELQAINLALQDSGLEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQ 1111 Query 521 VHNKEKIGVHWVPGHKGIPQNEEIDRYISE 550 + KEK+ + WVP HKGI NE++D+ +S+ Sbjct 1112 LIKKEKVYLAWVPAHKGIGGNEQVDKLVSQ 1141 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (isolate KR)] Sequence ID: Q74120.3 Length: 1463 Range 1: 610 to 1174 Score:343 bits(881), Expect:3e-103, Method:Compositional matrix adjust., Identities:231/575(40%), Positives:326/575(56%), Gaps:31/575(5%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ +V LK G GP + QWPLT+EK+E LKEI +++E+EG++ APP NT Sbjct 610 NLPVAKVDPIKVILKPGKDGPKVRQWPLTKEKIEALKEICEKMEREGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T++ E QLG+PHP GL +K+ +T+LDIGDAYF+I Sbjct 670 PTFAIKKKDKNKWRMLIDFRELNKVTQEFTEIQLGIPHPAGLAKKRRITVLDIGDAYFSI 729 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ + NN P RY +KVLPQGWK SPA++Q TM+++L + + +P + Sbjct 730 PLHEDFRQYTAFTLPTVNNAEPGKRYIYKVLPQGWKGSPAIFQHTMRQVLEPFRKANPDV 789 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH V +L + GF P++K Q+ P KW+G+EL P K Sbjct 790 ILVQYMDDILIASDRTDLEHDRTVLQLKELLNGLGFSTPDEKFQKDPPYKWMGYELWPTK 849 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ +L+ G L E ++ Sbjct 850 WKLQKIQLPQ--KEVWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEE--VQ 905 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYG--QLDWGNKAIEYIVFQEKGKPLWVNV 354 + E E K+ +E EG YY EEK++ Q D N+ Y + Q + K L V Sbjct 906 WTELAEAELEENKIILSQEQEGCYYQEEKELEATVQKDQDNQWT-YKIHQGE-KILKVGK 963 Query 355 VHSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWM 409 IKN + + + QK+ +E ++ G+IP LP E W E N + W+ Sbjct 964 YAKIKNTHTNGVRLLAHVVQKIGKEALVIWGRIPKFHLPVERETW--EQWWDNYWQVTWI 1021 Query 410 PSFWSCYKGSVRWKKRNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEE 465 P + + N++ + +PG T+YTDG +++ G GYI G +K RI E+ Sbjct 1022 PDWDFVSTPPLVRLAFNLVKDPIPGEETFYTDGSCNRQSKEGKAGYITDRGRDKVRILEQ 1081 Query 466 GTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE 525 TNQQ EL A A K NI+ DS+Y + E + + +I+E + KE Sbjct 1082 TTNQQAELEAFAMALTDSGPKANIIVDSQYVMGIVAGQPTES--ESKLVNQIIEEMIKKE 1139 Query 526 KIGVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 + V WVP HKGI N+E+D +S+ +FL K Sbjct 1140 TLYVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1174 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 O_ANT70] Sequence ID: Q77373.3 Length: 1435 Range 1: 585 to 1140 Score:342 bits(878), Expect:5e-103, Method:Compositional matrix adjust., Identities:218/560(39%), Positives:322/560(57%), Gaps:15/560(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I V+LK G GP + QWPL++EK+E L I +E+EGK+ R P NT Sbjct 585 NFPISPIAPVPVKLKPGMDGPKVKQWPLSKEKIEALTAICQEMEQEGKISRIGPENPYNT 644 Query 62 PIFCIKKKSG-KWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 PIF IKKK G KWR L+DFRELNK+T++ E QLG+PHPGGL++K+ VT+LD+GDAYF+ Sbjct 645 PIFAIKKKDGTKWRKLVDFRELNKRTQEFWEVQLGIPHPGGLKQKQSVTVLDVGDAYFSC 704 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +R+YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++P + Sbjct 705 PLDPDFRKYTAFTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQSSMTKILDPFRRDNPEL 764 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 + YMDD+Y+GSDL L EHR + L ++ Q+GF P+ K Q+ P W+G+ELHP+K Sbjct 765 EICQYMDDLYVGSDLPLTEHRKRIELLREHLYQWGFTTPDKKHQKEPPFLWMGYELHPDK 824 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP + T+N +QKL+G L W + G + + KL+ G ++L + Sbjct 825 WTVQSIQLP--NKDVWTVNDIQKLIGKLNWASQIYQGIRVRELCKLIRGTKSLTEVVPLS 882 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VV 355 E E R++LK+ + G YY +KD++ + G + Y ++QE+ K L Sbjct 883 REAELELEENRERLKQPVHGVYYQPDKDLWVNIQKQGGEQWTYQIYQEEHKNLKTGKYTR 942 Query 356 HSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSF-W 413 + + +Q+ + QK++QE II GK+P LP E W W+P + + Sbjct 943 QKASHTNDIRQLAEVIQKVSQESIIIWGKLPKFKLPVTRETWETWWADYWQATWIPEWEF 1002 Query 414 SCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTGEKFRIH-EEGTNQQ 470 ++ R ++ TYY DG ++ G GY+ G++ I +E TNQ+ Sbjct 1003 VSTPPLIKLWYRLESEPIMGAETYYVDGAANRETKLGKAGYVTEQGKQKIIKLDETTNQK 1062 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL AI A + E +NIVTDS+YA + + +PI +I+E + KE++ + Sbjct 1063 AELMAILLALQDSKETVNIVTDSQYALGVISSQPTQS--ESPIVQQIIEELTKKEQVYLT 1120 Query 531 WVPGHKGIPQNEEIDRYISE 550 WVP HKGI NE+ID+ +S+ Sbjct 1121 WVPAHKGIGGNEKIDKLVSK 1140 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE NIH-Z)] Sequence ID: P05962.3 Length: 1461 Range 1: 608 to 1172 Score:342 bits(878), Expect:7e-103, Method:Compositional matrix adjust., Identities:230/575(40%), Positives:323/575(56%), Gaps:31/575(5%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL KI ++ LK G GP + QWPLT+EK+E LKEI +++EKEG++ APP NT Sbjct 608 NLPVAKIEPIKIMLKPGKDGPRLKQWPLTKEKIEALKEICEKMEKEGQLEEAPPTNPYNT 667 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +K+ +T+LD+GDAYF+I Sbjct 668 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKRRITVLDVGDAYFSI 727 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM++IL + + + + Sbjct 728 PLHEDFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRQILEPFRKANEDV 787 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P +W+G+EL P K Sbjct 788 IIIQYMDDILIASDRTDLEHDKVVLQLKELLNGLGFSTPDEKFQKDPPYRWMGYELWPTK 847 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W + G ++ +L+ G L E Sbjct 848 WKLQKIQLPQ--KEVWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEEVQWT 905 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYG--QLDWGNKAIEYIVFQE----KGKPLWV 352 + E E R L + EG+YY EEK + Q D N+ Y V Q KG + Sbjct 906 ELAEAELEENRIILSQKQEGHYYQEEKKLEATVQKDQDNQWT-YKVHQGEKILKGGKICK 964 Query 353 NVVHSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWM 409 + + + + + + QK+ +E ++ G+IP LP + W E N + W+ Sbjct 965 DKKYPYQRVR---LLAQVVQKIGKEALVIWGRIPKFHLPVERDTW--EQWWDNYWQVTWI 1019 Query 410 PSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEE 465 P + + N++ E VPG T+YTDG +++ G GYI G ++ ++ E+ Sbjct 1020 PDWDFVSTPPLVRLAFNLVGEPVPGAETFYTDGSCNRQSKEGKAGYITDRGRDRVKVLEQ 1079 Query 466 GTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE 525 TNQQ EL A A K NI+ DS+Y + E N I +I+E + KE Sbjct 1080 TTNQQAELEAFAMALTDSGPKANIIVDSQYVMGIVAGQPTES--ENRIVNQIIEEMIKKE 1137 Query 526 KIGVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 I V WVP HKGI N+E+D +S+ +FL K Sbjct 1138 AIYVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1172 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE ROD)] Sequence ID: P04584.3 Length: 1464 Range 1: 611 to 1175 Score:341 bits(874), Expect:2e-102, Method:Compositional matrix adjust., Identities:230/573(40%), Positives:324/573(56%), Gaps:27/573(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ ++ LK G GP + QWPLT+EK+E LKEI +++EKEG++ APP NT Sbjct 611 NLPVAKVEPIKIMLKPGKDGPKLRQWPLTKEKIEALKEICEKMEKEGQLEEAPPTNPYNT 670 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +K+ +T+LD+GDAYF+I Sbjct 671 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEIQLGIPHPAGLAKKRRITVLDVGDAYFSI 730 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL+E +R YT FT+ S NN P RY +KVLPQGWK SPA++Q TM+++L + + + + Sbjct 731 PLHEDFRPYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQHTMRQVLEPFRKANKDV 790 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF P++K Q+ P W+G+EL P K Sbjct 791 IIIQYMDDILIASDRTDLEHDRVVLQLKELLNGLGFSTPDEKFQKDPPYHWMGYELWPTK 850 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ + T+N +QKLVG L W L G ++ +L+ G L E Sbjct 851 WKLQKIQLPQ--KEIWTVNDIQKLVGVLNWAAQLYPGIKTKHLCRLIRGKMTLTEEVQWT 908 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYG--QLDWGNKAIEYIVFQEKGKPLWVNVVH 356 + E E R L +E EG+YY EEK++ Q D N+ Y + QE+ K L V Sbjct 909 ELAEAELEENRIILSQEQEGHYYQEEKELEATVQKDQENQWT-YKIHQEE-KILKVGKYA 966 Query 357 SIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQMGN---INWMPS 411 +KN + + + + QK+ +E ++ G+IP LP E W E N + W+P Sbjct 967 KVKNTHTNGIRLLAQVVQKIGKEALVIWGRIPKFHLPVEREIW--EQWWDNYWQVTWIPD 1024 Query 412 FWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGT 467 + + N++ + +PG T+YTDG +++ G GY+ G +K + E+ T Sbjct 1025 WDFVSTPPLVRLAFNLVGDPIPGAETFYTDGSCNRQSKEGKAGYVTDRGKDKVKKLEQTT 1084 Query 468 NQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKI 527 NQQ EL A A K+NI+ DS+Y E + I +I+E + KE I Sbjct 1085 NQQAELEAFAMALTDSGPKVNIIVDSQYVMGISASQPTES--ESKIVNQIIEEMIKKEAI 1142 Query 528 GVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 V WVP HKGI N+E+D +S+ +FL K Sbjct 1143 YVAWVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1175 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:C_92BR025] Sequence ID: O12158.2 Length: 1431 Range 1: 582 to 1136 Score:340 bits(872), Expect:3e-102, Method:Compositional matrix adjust., Identities:216/559(39%), Positives:319/559(57%), Gaps:15/559(2%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QW LT+EK++ L I D +E+EGK+ + P NT Sbjct 582 NFPISPIETVPVKLKPGMDGPKVKQWLLTEEKIKALTAICDEMEREGKITKIGPENPYNT 641 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DFRELNK+T D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 642 PVFAIKKKDSTKWRKLVDFRELNKRTWDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 701 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +R+YT FT+ S NN P +RY + VLPQGWK SP+++Q + KIL + ++P I Sbjct 702 PLDEGFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPSIFQSSTTKILEPFRAQNPEI 761 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G+ELHP+K Sbjct 762 IIYQYMDDLYVGSDLEIGQHRAKIEELREHLLKWGFTTPDKKHQKEPPFLWMGYELHPDK 821 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LPE + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 822 WTVQPIQLPE--KDSWTVNDIQKLVGKLNWASQIYPGIKVRQLCKLLRGAKALTDIVPLT 879 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVNVVHS 357 E R+ LKE + G YYD KD+ ++ G Y ++QE K L Sbjct 880 EEAELELAENREILKEPVHGVYYDPSKDLIAEIQKQGQNQWTYQIYQEPFKNLKTGKYAK 939 Query 358 IK--NLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWS 414 ++ + + +Q+ +A QK+ E II GK P LP ++E W W+P + Sbjct 940 MRTAHTNDVRQLTEAVQKIALESIIIWGKTPKFRLPIQKETWEAWWTDYWQATWIPEWEF 999 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + + E + G T+Y DG ++ G GY+ G +K E TNQ+ Sbjct 1000 VNTPPLVKLWYQLEKEPIAGAETFYVDGAANREIKMGKAGYVTDRGRQKIVSITETTNQK 1059 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL+AI+ A + ++NIVTDS+YA + D+ + + +I+E + KE++ + Sbjct 1060 TELQAIQLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESELVNQIIEQLIKKERVYLS 1117 Query 531 WVPGHKGIPQNEEIDRYIS 549 WVP HKGI NE++D+ +S Sbjct 1118 WVPAHKGIGGNEQVDKLVS 1136 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr170Gag-Pol; Contains: RecName: Full=Matrix protein p16; Short=MA; Contains: RecName: Full=p2L; Contains: RecName: Full=Capsid protein p26; Short=CA; Contains: RecName: Full=p3; Contains: RecName: Full=Transframe peptide; AltName: Full=p11; Contains: RecName: Full=Protease; AltName: Full=P119; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=Exoribonuclease H; AltName: Full=P72; Contains: RecName: Full=Integrase; Short=IN [Bovine immunodeficiency virus R29] Sequence ID: P19560.2 Length: 1475 Range 1: 593 to 1111 Score:338 bits(868), Expect:2e-101, Method:Compositional matrix adjust., Identities:218/538(41%), Positives:301/538(55%), Gaps:32/538(5%) Query 21 GPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDF 79 GP + QWPLT+EK + LKEIV L EGK+ A NTP+F IKKK +G+WRML+DF Sbjct 593 GPKVPQWPLTKEKYQALKEIVKDLLAEGKISEAAWDNPYNTPVFVIKKKGTGRWRMLMDF 652 Query 80 RELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNN 139 RELNK T E GLP+P G++ +H+T +DI DAYFTIPL+E +R +T F+++ N Sbjct 653 RELNKITVKGQEFSTGLPYPPGIKECEHLTAIDIKDAYFTIPLHEDFRPFTAFSVVPVNR 712 Query 140 LGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 GP R+ W VLPQGW SPA+YQ T QKI+ + HP + YMDD+ IGS+ ++ Sbjct 713 EGPIERFQWNVLPQGWVCSPAIYQTTTQKIIENIKKSHPDVMLYQYMDDLLIGSN--RDD 770 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 H+ IV E+ + YGF P++K QE KW+GFEL P+KW+FQ L + P+T+N Sbjct 771 HKQIVQEIRDKLGSYGFKTPDEKVQEER-VKWIGFELTPKKWRFQPRQLK--IKNPLTVN 827 Query 260 KLQKLVGDLVWRQSLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEME-G 318 +LQ+LVG+ VW Q + + + L+ LQ + + ++ E KLK+ E Sbjct 828 ELQQLVGNCVWVQPEVKIPLYPLTDLLRDKTNLQEKIQLTPEAIKCVEEFNLKLKDPEWK 887 Query 319 NYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLW-----VNVVHSIKNLSqaqqiikaaqk 373 + E ++ ++ + I + + Q+ G P+W +N HS K ++I++ + Sbjct 888 DRIREGAELVIKIQMVPRGIVFDLLQD-GNPIWGGVKGLNYDHSNK----IKKILRTMNE 942 Query 374 LTQEVIIRTGKIPWILLPGREEDWILELQMGNINWMPSFWSCYKGSVRWKKRNVIAELVP 433 L + V+I TG+ LLPG EDW LQ Y+ S RW + Sbjct 943 LNRTVVIMTGREASFLLPGSSEDWEAALQKEESLTQIFPVKFYRHSCRWTS-------IC 995 Query 434 GP------TYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEACKQGPEKM 487 GP TYYTDGGKK + Y K ++ GTNQQ EL+AI A GP KM Sbjct 996 GPVRENLTTYYTDGGKKGKTAAAVYWCEGRTKSKVF-PGTNQQAELKAICMALLDGPPKM 1054 Query 488 NIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQNEEID 545 NI+TDSRYAYE M R E R I I +++ K+ +GV WVP HKGI N E D Sbjct 1055 NIITDSRYAYEGM-REEPETWAREGIWLEIAKILPFKQYVGVGWVPAHKGIGGNTEAD 1111 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [SIVcpz GAB1] Sequence ID: P17283.2 Length: 1384 Range 1: 541 to 1089 Score:337 bits(865), Expect:2e-101, Method:Compositional matrix adjust., Identities:218/553(39%), Positives:319/553(57%), Gaps:15/553(2%) Query 8 IPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIK 67 I + V+LK G GP + QWPL+ EK++ L EI +EKEGK+ + P NTPIF IK Sbjct 541 IETVPVKLKPGMDGPKVKQWPLSAEKIKALTEICQEMEKEGKISKIGPENPYNTPIFAIK 600 Query 68 KK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPY 126 KK S KWR L+DFRELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF+ PL + + Sbjct 601 KKDSTKWRKLVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSCPLDKDF 660 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 R+YT FT+ S NN P VRY + VLPQGWK SP+++Q +M KIL + E++P I YM Sbjct 661 RKYTAFTIPSINNETPGVRYQYNVLPQGWKGSPSIFQSSMTKILEPFREKNPDITIYQYM 720 Query 187 DDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKH 246 DD+Y+GSDL +++HR V EL ++ ++GF P+ K Q+ P W+G+ELHP+KW Q Sbjct 721 DDLYVGSDLEIDQHRKKVEELRQHLLKWGFTTPDKKHQKEPPFLWMGYELHPDKWTVQPI 780 Query 247 TLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVRE 305 LPE + T+N +QKL+G L W + G I + KL+ G + L + E Sbjct 781 QLPE--KEVWTVNDIQKLIGKLNWASQIYPGIKIKQLCKLIRGTKKLTDVVPLTPEAELE 838 Query 306 WEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQEKGKPLWVN--VVHSIKNL 361 R+ + + G YYD +K++ ++ GN Y +FQE K L + Sbjct 839 LAENREIVSTPVHGVYYDPDKELIAEIQKQGNCQWTYQIFQEPHKNLKTGKYARQRSAHT 898 Query 362 SqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPSFWSCYKGSV 420 + +Q+ +A QK+ E I+ GK P LP ++E W + W+P + + Sbjct 899 NDIRQLAEAVQKIATESIVIWGKTPKFRLPVQKESWEAWWAEYWQATWIPEWEFINTPPL 958 Query 421 RWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTGEKFRIH-EEGTNQQLELRAI 476 ++ E +P TYY DG ++ G GY+ G++ I E TNQQ EL+A+ Sbjct 959 VKLWYSLETEPIPTTDTYYVDGAANRETKTGKAGYVTDKGKQKIISLENTTNQQAELKAL 1018 Query 477 EEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 A + +++NIVTDS+Y + D + + +I+E + KEKI + WVP HK Sbjct 1019 LLALQDSDQQVNIVTDSQYVLGIIQSQPDHS--ESELVNQIIEELIKKEKIYLSWVPAHK 1076 Query 537 GIPQNEEIDRYIS 549 GI NE++D+ +S Sbjct 1077 GIGGNEQVDKLVS 1089 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-2 B_UC1] Sequence ID: Q76634.3 Length: 1471 Range 1: 610 to 1174 Score:337 bits(864), Expect:5e-101, Method:Compositional matrix adjust., Identities:224/577(39%), Positives:319/577(55%), Gaps:35/577(6%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N KI +V+LK G GP I QWPL++EK+ LKEI +++EKEG++ APP NT Sbjct 610 NFPVAKIEPVKVKLKPGKDGPKIRQWPLSKEKILALKEICEKMEKEGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKK+ KWRMLIDFRELNK T+D E QLG+PHP GL K+ +T+LD+GDAYF+I Sbjct 670 PTFAIKKRDKNKWRMLIDFRELNKVTQDFTEVQLGIPHPAGLAEKRRITVLDVGDAYFSI 729 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q++M+K+L + + + + Sbjct 730 PLDPNFRQYTAFTLPSINNAEPGKRYIYKVLPQGWKGSPAIFQYSMRKVLDPFRKANSDV 789 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V++L + GF PE+K Q+ P KW+G+EL P++ Sbjct 790 IIIQYMDDILIASDRSDLEHDRVVSQLKELLNDMGFSTPEEKFQKDPPFKWMGYELWPKR 849 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LPE + T+N +QKLVG L W L G +I KL+ G L E ++ Sbjct 850 WKLQKIQLPE--KEVWTVNDIQKLVGVLNWAAQLFPGIKTRHICKLIRGKMTLTEE--VQ 905 Query 300 SIHVREWEACRQKL---KEMEGNYYDE--------EKDIYGQLDWGNKAIEYIVFQEKGK 348 + E E K+ +E EG+YY E +K++ Q W K + + GK Sbjct 906 WTELAEAELQENKIILEQEQEGSYYKEGVPLEATVQKNLANQ--WTYKIHQGNRILKVGK 963 Query 349 PLWVNVVHSIKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNIN 407 V H+ + + + QK+ +E ++ G+IP LP E W + Sbjct 964 YAKVKNTHT----NGVRLLAHVVQKIGKEALVIWGEIPVFHLPVERETWDQWWTDYWQVT 1019 Query 408 WMPSF-WSCYKGSVRWKKRNVIAELVPGPTYYTDGG--KKNGRGSLGYIASTG-EKFRIH 463 W+P + + VR V L TYYTDG + + G GY+ G +K ++ Sbjct 1020 WIPEWDFVSTPPLVRLAYNLVKDPLEKVETYYTDGSCNRASKEGKAGYVTDRGKDKVKVL 1079 Query 464 EEGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHN 523 E+ TNQQ EL A A + ++NI+ DS+Y + E +P+ +I+E + Sbjct 1080 EQTTNQQAELEAFALALQDSGPQVNIIVDSQYVMGIVAGQPTE--TESPLVNQIIEEMIK 1137 Query 524 KEKIGVHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 KE I V WVP H+G+ N+E+D +S+ +FL K Sbjct 1138 KEAIYVGWVPAHRGLGGNQEVDHLVSQGIRQVLFLEK 1174 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-1 M:F1_VI850] Sequence ID: Q9QSR3.3 Length: 1430 Range 1: 580 to 1135 Score:337 bits(863), Expect:6e-101, Method:Compositional matrix adjust., Identities:214/565(38%), Positives:320/565(56%), Gaps:26/565(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N I + V+LK G GP + QWPLT+EK++ L EI +EKEGK+ + P NT Sbjct 580 NFPVSPIETVPVKLKPGMDGPKVKQWPLTEEKIKALTEICLEMEKEGKISKIGPENPYNT 639 Query 62 PIFCIKKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P+F IKKK S KWR L+DF+ELNK+T+D E QLG+PHP GL++KK VT+LD+GDAYF++ Sbjct 640 PVFAIKKKDSSKWRKLVDFKELNKRTQDFWEVQLGIPHPAGLKKKKSVTVLDVGDAYFSV 699 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL + +++YT FT+ S NN P +RY + VLPQGWK SPA++Q +M KIL + ++P I Sbjct 700 PLDKDFKKYTAFTIPSVNNETPGIRYQYNVLPQGWKGSPAIFQCSMTKILEPFRMKNPDI 759 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDD+Y+GSDL + +HR + EL ++ ++GF P+ K Q+ P W+G ELHP+K Sbjct 760 VIYQYMDDLYVGSDLEIGQHRTKIEELREHLLRWGFTTPDKKHQKEPPFLWMGHELHPDK 819 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 W Q LP + T+N +QKLVG L W + G + + KL+ G +AL + Sbjct 820 WTVQPIQLP--NKDSWTVNDIQKLVGKLNWASQIYPGIKVRPLCKLLRGAKALTDIVPLT 877 Query 300 SIHVREWEACRQKLKE-MEGNYYDEEKDIYGQLD-WGNKAIEYIVFQ------EKGKPLW 351 + E R+ L+E + G YYD KD+ ++ G+ Y ++Q + GK Sbjct 878 AEAELELAKNREILREPVHGVYYDPSKDLIAEIQKQGDGQWTYQIYQNPFKNLKTGKYAK 937 Query 352 VNVVHS--IKNLSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINW 408 V H+ +K L++A Q I + ++I + P LP +E W W Sbjct 938 VRSAHTNDVKQLTEAVQKIAL-----ESIVIWGKRSPKFKLPILKETWDTWWTDYWQATW 992 Query 409 MPSFWSCYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHE 464 +P + + + E + G T+Y DG ++ +G GY+ G +K Sbjct 993 IPEWEFVNTPPLVKLWYQLETEPIAGADTFYVDGASNRETKKGKAGYVTDKGKQKVVSLT 1052 Query 465 EGTNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNK 524 E TNQ+ EL+AI A + ++NIVTDS+YA + D+ + I +I+E + K Sbjct 1053 ETTNQKAELQAIYLALQDSGSEVNIVTDSQYALGIIQAQPDKS--ESEIVNQIIEQLIQK 1110 Query 525 EKIGVHWVPGHKGIPQNEEIDRYIS 549 E++ + WVP HKGI NE++D+ +S Sbjct 1111 ERVYLSWVPAHKGIGGNEQVDKLVS 1135 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [HIV-2 B_EHO] Sequence ID: Q89928.3 Length: 1464 Range 1: 608 to 1172 Score:335 bits(860), Expect:2e-100, Method:Compositional matrix adjust., Identities:222/570(39%), Positives:317/570(55%), Gaps:21/570(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N +I +V+LK GP I QWPL++EK+ LKEI +++EKEG++ APP N+ Sbjct 608 NFPVARIEPVKVQLKPEKDGPKIRQWPLSKEKILALKEICEKMEKEGQLEEAPPTNPYNS 667 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T++ E QLG+PHP GL KK +T+LD+GDAYF++ Sbjct 668 PTFAIKKKDKNKWRMLIDFRELNKVTQEFTEVQLGIPHPAGLASKKRITVLDVGDAYFSV 727 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +RQYT FT+ + NN P RY +KVLPQGWK SPA++Q+TM K+L + + + + Sbjct 728 PLDPDFRQYTAFTLPAVNNAEPGKRYLYKVLPQGWKGSPAIFQYTMAKVLDPFRKANNDV 787 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI + SD EH +V++L + GF PE+K Q+ P KW+G+EL P+K Sbjct 788 TIIQYMDDILVASDRSDLEHDRVVSQLKELLNNMGFSTPEEKFQKDPPFKWMGYELWPKK 847 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LPE + T+N +QKLVG L W L G +I KL+ G L E Sbjct 848 WKLQKIQLPE--KEVWTVNDIQKLVGVLNWAAQLFPGIKTRHICKLIRGKMTLTEEVQWT 905 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKG-KPLWVNVVHS 357 + E++ + L +E EG+YY E + + N A ++ +G K L V Sbjct 906 ELAEAEFQENKIILEQEQEGSYYKEGVPLEATVQ-KNLANQWTYKIHQGDKILKVGKYAK 964 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSFWS 414 +KN + + + QK+ +E ++ G+IP LP E W + W+P + Sbjct 965 VKNTHTNGVRLLAHVVQKIGKEALVIWGEIPMFHLPVERETWDQWWTDYWQVTWIPEWDF 1024 Query 415 CYKGSVRWKKRNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + N++ + + G TYYTDG K + G GY+ G +K + E+ TNQQ Sbjct 1025 VSTPPLIRLAYNLVKDPLEGVETYYTDGSCNKASKEGKAGYVTDRGKDKVKPLEQTTNQQ 1084 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL A A + ++NI+ DS+Y + E +PI I+E + KEKI V Sbjct 1085 AELEAFALALQDSGPQVNIIVDSQYVMGIVAAQPTE--TESPIVREIIEEMIKKEKIYVG 1142 Query 531 WVPGHKGIPQNEEIDRYISE-----IFLAK 555 WVP HKG+ N+E+D +S+ +FL K Sbjct 1143 WVPAHKGLGGNQEVDHLVSQGIRQILFLEK 1172 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (F236/SMH4 ISOLATE) (SOOTY MANGABEY)] Sequence ID: P12502.2 Length: 1449 Range 1: 596 to 1160 Score:334 bits(857), Expect:4e-100, Method:Compositional matrix adjust., Identities:220/570(39%), Positives:316/570(55%), Gaps:21/570(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ +V LK G +GP + QWPL++EK+ L+EI +++EK+G++ APP NT Sbjct 596 NLPIAKVEPIKVTLKPGKEGPKLRQWPLSKEKIIALREICEKMEKDGQLEEAPPTNPYNT 655 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +++ +T+LD+GDAYF+I Sbjct 656 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEVQLGIPHPAGLAKRRRITVLDVGDAYFSI 715 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM+ +L + + +P + Sbjct 716 PLDEEFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRNVLEPFRKANPDV 775 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF PE+K Q+ P +W+G+EL P K Sbjct 776 TLIQYMDDILIASDRTDLEHDRVVLQLKELLNGIGFSTPEEKFQKDPPFQWMGYELWPTK 835 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ T+N +QKLVG L W + G ++ +L+ G L E Sbjct 836 WKLQKIELPQ--RETWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEEVQWT 893 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYGQ-LDWGNKAIEYIVFQEKGKPLWVNVVHS 357 + E+E + L +E EG YY E K I + + Y + QE K L V Sbjct 894 EMAEAEYEENKIILSQEQEGCYYQEGKPIEATVIKSQDNQWSYKIHQED-KVLKVGKFAK 952 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSFWS 414 +KN + + + QK+ +E ++ G++P LP E W + W+P + Sbjct 953 VKNTHTNGVRLLAHVVQKIGKEALVIWGEVPKFHLPVEREIWEQWWTDYWQVTWIPDWDF 1012 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + N++ E + G T+Y DG +++ G GY+ G +K ++ E+ TNQQ Sbjct 1013 VSTPPLVRLVFNLVKEPIQGAETFYVDGSCNRQSREGKAGYVTDRGRDKAKLLEQTTNQQ 1072 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL A A K NI+ DS+Y + E R + +I+E + KE I V Sbjct 1073 AELEAFYLALADSGPKANIIVDSQYVMGIIAGQPTESESR--LVNQIIEEMIKKEAIYVA 1130 Query 531 WVPGHKGIPQNEEIDRYISE-----IFLAK 555 WVP HKGI N+E+D +S+ +FL K Sbjct 1131 WVPAHKGIGGNQEVDHLVSQGIRQVLFLKK 1160 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (MM142-83 ISOLATE)] Sequence ID: P05896.2 Length: 1448 Range 1: 595 to 1159 Score:333 bits(855), Expect:8e-100, Method:Compositional matrix adjust., Identities:222/570(39%), Positives:315/570(55%), Gaps:21/570(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ + LK G GP + QWPL++EK+ L+EI +++EK+G++ APP NT Sbjct 595 NLPIAKVEPVKSPLKPGKDGPKLKQWPLSKEKIVALREICEKMEKDGQLEEAPPTNPYNT 654 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELN+ T+D E QLG+PHP GL ++K +T+LDIGDAYF+I Sbjct 655 PTFAIKKKDKNKWRMLIDFRELNRVTQDFTEVQLGIPHPAGLAKRKRITVLDIGDAYFSI 714 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM+ +L + + +P + Sbjct 715 PLDEEFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRHVLEPFRKANPDV 774 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF PE+K Q+ P +W+G+EL P K Sbjct 775 TLVQYMDDILIASDRTDLEHDRVVLQLKELLNSIGFSSPEEKFQKDPPFQWMGYELWPTK 834 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ T+N +QKLVG L W + G ++ +L+ G L E Sbjct 835 WKLQKIELPQ--RETWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEEVQWT 892 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYGQ-LDWGNKAIEYIVFQEKGKPLWVNVVHS 357 + E+E + L +E EG YY E K + + + Y + QE K L V Sbjct 893 EMAEAEYEENKIILSQEQEGCYYQESKPLEATVIKSQDNQWSYKIHQED-KILKVGKFAK 951 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSFWS 414 IKN + + + QK+ +E I+ G++P LP ++ W + W+P + Sbjct 952 IKNTHTNGVRLLAHVIQKIGKEAIVIWGQVPKFHLPVEKDVWEQWWTDYWQVTWIPEWDF 1011 Query 415 CYKGSVRWKKRNVIAELVPG-PTYYTDG--GKKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + N++ + + G TYY DG K++ G GYI G +K ++ E+ TNQQ Sbjct 1012 ISTPPLVRLVFNLVKDPIEGEETYYVDGSCSKQSKEGKAGYITDRGKDKVKVLEQTTNQQ 1071 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL A A K NI+ DS+Y + E R + +I+E + K +I V Sbjct 1072 AELEAFLMALTDSGPKANIIVDSQYVMGIITGCPTESESR--LVNQIIEEMIKKTEIYVA 1129 Query 531 WVPGHKGIPQNEEIDRYISE-----IFLAK 555 WVP HKGI N+EID +S+ +FL K Sbjct 1130 WVPAHKGIGGNQEIDHLVSQGIRQVLFLEK 1159 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (PBJ/BC13 ISOLATE) (SOOTY MANGABEY)] Sequence ID: P19505.2 Length: 1449 Range 1: 596 to 1160 Score:333 bits(855), Expect:9e-100, Method:Compositional matrix adjust., Identities:221/570(39%), Positives:314/570(55%), Gaps:21/570(3%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ +V LK G GP + QWPL++EK+ L+EI +++EK+G++ APP NT Sbjct 596 NLPIAKVEPIKVTLKPGKDGPKLRQWPLSKEKIIALREICEKMEKDGQLEEAPPTNPYNT 655 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E QLG+PHP GL +++ +T+LD+GDAYF+I Sbjct 656 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEVQLGIPHPAGLAKRRRITVLDVGDAYFSI 715 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q TM+ +L + + +P + Sbjct 716 PLDEEFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQHTMRNVLEPFRKANPDV 775 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF PE+K Q+ P +W+G+EL P K Sbjct 776 TLIQYMDDILIASDRTDLEHDRVVLQLKELLNSIGFSTPEEKFQKDPPFQWMGYELWPTK 835 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ T+N +QKLVG L W + G ++ +L+ G L E Sbjct 836 WKLQKIELPQ--RETWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEEVQWT 893 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYGQ-LDWGNKAIEYIVFQEKGKPLWVNVVHS 357 + E+E + L +E EG YY E K + + + Y + QE K L V Sbjct 894 EMAEAEYEENKIILSQEQEGCYYQEGKPLEATVIKSQDNQWSYKIHQED-KILKVGKFAK 952 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSFWS 414 IKN + + + QK+ +E I+ G++P LP E W + W+P + Sbjct 953 IKNTHTNGVRLLAHVVQKIGKEAIVIWGQVPRFHLPVEREIWEQWWTDYWQVTWIPEWDF 1012 Query 415 CYKGSVRWKKRNVIAELVPGP-TYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQ 470 + N++ E + G T+Y DG +++ G GY+ G +K ++ E+ TNQQ Sbjct 1013 VSTPPLVRLVFNLVKEPIQGAETFYVDGSCNRQSREGKAGYVTDRGRDKAKLLEQTTNQQ 1072 Query 471 LELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVH 530 EL A A K NI+ DS+Y + E R + +I+E + KE I V Sbjct 1073 AELEAFYLALADSGPKANIIVDSQYVMGIVAGQPTESESR--LVNQIIEEMIKKEAIYVA 1130 Query 531 WVPGHKGIPQNEEIDRYISE-----IFLAK 555 WVP HKGI N+E+D +S+ +FL K Sbjct 1131 WVPAHKGIGGNQEVDHLVSQGIRQVLFLEK 1160 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (AGM155 ISOLATE)] Sequence ID: P27973.2 Length: 1470 Range 1: 617 to 1167 Score:325 bits(833), Expect:9e-97, Method:Compositional matrix adjust., Identities:213/557(38%), Positives:309/557(55%), Gaps:18/557(3%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 + IP T VRLKEG +GP + QWPL++EK+ L+EI LE+EGK+ R NTP+FC Sbjct 617 QTIPITPVRLKEGARGPRLKQWPLSKEKIIALQEICKTLEEEGKLSRVGGDNAYNTPVFC 676 Query 66 IKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYE 124 I+KK +WRML+DFRELNK T+D E QLG+PHP GL++ K +TI+D+GDAY++IPL Sbjct 677 IRKKDKSQWRMLVDFRELNKATQDFFEVQLGIPHPAGLKKMKQITIIDVGDAYYSIPLDP 736 Query 125 PYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGI 184 +R+YT FT+ + NN GP +RY + LPQGWK SP ++Q T KIL +E + Sbjct 737 EFRKYTAFTIPTVNNEGPGIRYQFNCLPQGWKGSPTIFQNTASKILEEIKKELKQLTIVQ 796 Query 185 YMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQ 244 YMDD+++GS +H +V L + + ++G PE K Q P +W+G++L P KWK Q Sbjct 797 YMDDLWVGSQEEGPKHDQLVQTLRNRLQEWGLETPEKKVQREPPFEWMGYKLWPHKWKLQ 856 Query 245 KHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHV 303 L + + T+N LQKLVG L W L G NI KL+ G + L Sbjct 857 SIELEKKEQ--WTVNDLQKLVGKLNWAAQLYPGLRTKNICKLLRGKKNLLDVVEWTPEAE 914 Query 304 REWEACRQKLK-EMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIK--N 360 E+E ++ LK E EG YY EK + + F+++GK L V K + Sbjct 915 AEYEENKEILKTEQEGTYYAPEKPLRAAVQKLGDGQWSYQFKQEGKILKVGKFAKQKATH 974 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SFWSCYK 417 ++ + + QK+ +E ++ G++P LP + W ++W+P F S Sbjct 975 TNELRVLAGVVQKIGKEALVIWGQLPTFELPVERDTWEQWWADYWQVSWIPEWDFVSVPP 1034 Query 418 GSVRWKKRNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTNQQLEL 473 W + E +PG YY DG +++ G GYI G ++ + E TNQQ EL Sbjct 1035 LVTLW--YTLTKEPIPGEDVYYVDGACNRQSKEGKAGYITQQGKQRVQQLENTTNQQAEL 1092 Query 474 RAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVP 533 AI+ A + K+NIVTDS+YA + + +P+ +I+ + KE I + WVP Sbjct 1093 TAIKMALEDSGPKVNIVTDSQYAMGILTAQPTQS--DSPLVEQIIAQMVQKEAIYLQWVP 1150 Query 534 GHKGIPQNEEIDRYISE 550 HKGI NEEID+ +S+ Sbjct 1151 AHKGIGGNEEIDKLVSK 1167 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (isolate AGM / clone GRI-1)] Sequence ID: Q02836.2 Length: 1472 Range 1: 613 to 1163 Score:315 bits(808), Expect:3e-93, Method:Compositional matrix adjust., Identities:200/555(36%), Positives:307/555(55%), Gaps:15/555(2%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 +I T+V+LKEG GP + QWPL++EK+E L EI ++E+EGK+ R NTP+F I Sbjct 613 QIEETKVQLKEGKDGPKLKQWPLSREKIEALTEICKQMEEEGKLSRIGGENPYNTPVFAI 672 Query 67 KKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEP 125 KKK +WRML+DFRELNK T+D E QLG+PHP GLQ+KK +T++DIGDAY++IPL + Sbjct 673 KKKDKTQWRMLVDFRELNKATQDFFEVQLGIPHPAGLQKKKQITVIDIGDAYYSIPLCKE 732 Query 126 YRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIY 185 +R+YT FT+ S NN GP +RY + LPQGWK SP ++Q T IL P ++ Y Sbjct 733 FRKYTAFTIPSVNNTGPGIRYQFNCLPQGWKGSPTIFQNTAANILEEIKRHTPGLEIVQY 792 Query 186 MDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQK 245 MDD+++ SD H V+ + + + G P+ K Q P +W+G++LHP KW K Sbjct 793 MDDLWLASDHDETRHNQQVDIVRKMLLEKGLETPDKKVQREPPWEWMGYKLHPNKWTINK 852 Query 246 HTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHVR 304 LP + EG T+NK+QK+VG L W + G + ++ G + L E Sbjct 853 IELPPL-EGEWTVNKIQKVVGVLNWASQIYPGIKTKHTCAMLRGKKNLLEEIVWTEEAEA 911 Query 305 EWEACRQKLKEM-EGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVN--VVHSIKNL 361 E++ + ++E EG YYD K++ + + F ++G L V + Sbjct 912 EYKNNQGIVQETQEGTYYDPLKELIATVQKQGEGQWTYQFTQEGAVLKVGRYAKQRETHT 971 Query 362 SqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQ-MGNINWMPSFWSCYKGSV 420 + + + QK+ +E + G++P + LP ++ W + Q ++W+P W + Sbjct 972 NDLRTLAHLVQKICKEALTIWGRLPRVQLPVDKKTWDMWWQDYWQVSWIPE-WEFVSTPL 1030 Query 421 RWKK-RNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTGE-KFRIHEEGTNQQLELRA 475 K +++ E + G YY DG K G GY++ G+ + R E TNQQ EL A Sbjct 1031 LVKLWYSLVKEPIKGEDVYYVDGAASKVTKLGKAGYLSERGKSRIRELENTTNQQAELTA 1090 Query 476 IEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGH 535 ++ A + E +NIVTDS+Y + E +P+ +I++ + K ++ + WVP H Sbjct 1091 VKMALEDSGENVNIVTDSQYVMNILTACPQES--NSPLVEQIIQALMKKRQVYLQWVPAH 1148 Query 536 KGIPQNEEIDRYISE 550 KGI N EID+ +S+ Sbjct 1149 KGIGGNTEIDKLVSK 1163 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Spacer peptide 1; Short=SP1; AltName: Full=p2; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=Transframe peptide; Short=TF; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Human immunodeficiency virus type 2 (ISOLATE D205,7)] Sequence ID: P15833.3 Length: 1465 Range 1: 610 to 1173 Score:313 bits(801), Expect:2e-92, Method:Compositional matrix adjust., Identities:215/572(38%), Positives:311/572(54%), Gaps:26/572(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 N K+ +V LK G GP I QWPL++EK+ LKEI +++EKEG++ APP NT Sbjct 610 NFPVAKVEPVKVELKPGKDGPKIRQWPLSREKILALKEICEKMEKEGQLEEAPPTNPYNT 669 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLIDFRELNK T+D E P + K+ +T++D+GDAYF+I Sbjct 670 PTFAIKKKDKNKWRMLIDFRELNKVTQDFTEVNWVFPT-RQVAEKRRITVIDVGDAYFSI 728 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL +RQYT FT+ S NN P RY +KVLPQGWK S ++ Q++M+K+L + + + + Sbjct 729 PLDPNFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSQSICQYSMRKVLDPFRKANSDV 788 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V++L + GF PE+K Q+ P KW+G+EL P+K Sbjct 789 IIIQYMDDILIASDRSDLEHDRVVSQLKELLNDMGFSTPEEKFQKDPPFKWMGYELWPKK 848 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LPE + T+N +QKLVG L W L G +I KL+ G L E ++ Sbjct 849 WKLQKIQLPE--KEVWTVNAIQKLVGVLNWAAQLFPGIKTRHICKLIRGKMTLTEE--VQ 904 Query 300 SIHVREWEACRQKL---KEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKG-KPLWVNVV 355 + E E K+ +E EG+YY E + + N A ++ +G K L V Sbjct 905 WTELAEAELQENKIILEQEQEGSYYKERVPLEATVQ-KNLANQWTYKIHQGNKVLKVGKY 963 Query 356 HSIKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSF 412 +KN + + + QK+ +E ++ G+IP LP E W + W+P + Sbjct 964 AKVKNTHTNGVRLLAHVVQKIGKEALVIWGEIPVFHLPVERETWDQWWTDYWQVTWIPEW 1023 Query 413 WSCYKGSVRWKKRNVIAELVPG-PTYYTDGG--KKNGRGSLGYIASTG-EKFRIHEEGTN 468 + N++ + + G TYYTDG + + G GY+ G +K ++ E+ TN Sbjct 1024 DFVSTPPLIRLAYNLVKDPLEGRETYYTDGSCNRTSKEGKAGYVTDRGKDKVKVLEQTTN 1083 Query 469 QQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIG 528 QQ EL A A ++NI+ DS+Y + E +PI A+I+E + KE + Sbjct 1084 QQAELEAFALALTDSEPQVNIIVDSQYVMGIIAAQPTE--TESPIVAKIIEEMIKKEAVY 1141 Query 529 VHWVPGHKGIPQNEEIDRYISE-----IFLAK 555 V WVP HKG+ N+E+D +S+ +FL K Sbjct 1142 VGWVPAHKGLGGNQEVDHLVSQGIRQVLFLEK 1173 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (TYO-1 ISOLATE)] Sequence ID: P05895.2 Length: 1467 Range 1: 621 to 1180 Score:310 bits(794), Expect:2e-91, Method:Compositional matrix adjust., Identities:207/566(37%), Positives:304/566(53%), Gaps:22/566(3%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 +KIP T V+LKEG +GP + QWPL++EK+E L+EI +LE+EGK+ R NTPIFC Sbjct 621 EKIPVTPVKLKEGARGPCVRQWPLSKEKIEALQEICSQLEQEGKISRVGGENAYNTPIFC 680 Query 66 IKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYE 124 IKKK +WRML+DFRELNK T+D E QLG+PHP GL++ + +T+LD+GDAY++IPL Sbjct 681 IKKKDKSQWRMLVDFRELNKATQDFFEVQLGIPHPAGLRKMRQITVLDVGDAYYSIPLDP 740 Query 125 PYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGI 184 +R+YT FT+ + NN GP +RY + LPQGWK SP ++Q T IL P + Sbjct 741 NFRKYTAFTIPTVNNQGPGIRYQFNCLPQGWKGSPTIFQNTAASILEEIKRNLPALTIVQ 800 Query 185 YMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQ 244 YMDD+++GS H +V +L + + +G PE K Q+ P +W+G++L P KW+ Sbjct 801 YMDDLWVGSQENEHTHDKLVEQLRTKLQAWGLETPEKKMQKEPPYEWMGYKLWPHKWELS 860 Query 245 KHTLPEITEGPITLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDRA--LQSERYIESIH 302 + L E E T+N +QKLVG L W L I KL+ G + L+ + Sbjct 861 RIQLEEKDE--WTVNDIQKLVGKLNWAAQLYPGLKTRICKLITGGKKNLLELVAWTPEAE 918 Query 303 VREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN-- 360 E E EG YY I + F+++G+ L V KN Sbjct 919 AEYAENAEILKTEQEGTYYKPGIPIRAAVQKLEGGQWSYQFKQEGQVLKVGKYTKQKNTH 978 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMP--SFWSCYK 417 ++ + + QK+ +E ++ G +P + LP E W ++W+P F S Sbjct 979 TNELRTLAGLVQKICKEALVIWGILPVLELPIEREVWEQWWADYWQVSWIPEWDFVSTPP 1038 Query 418 GSVRWKKRNVIAELVPG-PTYYTDGGKKNGR-GSLGYIASTG-EKFRIHEEGTNQQLELR 474 W + E +P YY +N + G GYI+ G ++ E TNQQ +L Sbjct 1039 LLKLW--YTLTKEPIPKEDVYYVGACNRNSKEGKAGYISQYGKQRVETLENTTNQQAKLT 1096 Query 475 AIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPG 534 AI+ A + +NIVTDS+YA + + +P+ +I+ L+ K++I + WVP Sbjct 1097 AIKMALEDSGPNVNIVTDSQYAMGILTAQPTQS--DSPLVEQIIALMIQKQQIYLQWVPA 1154 Query 535 HKGIPQNEEIDRYISE-----IFLAK 555 HKGI NEEID+ +S+ +FL K Sbjct 1155 HKGIGGNEEIDKLVSKGIRRVLFLEK 1180 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus (AGM3 ISOLATE)] Sequence ID: P27980.2 Length: 1465 Range 1: 618 to 1178 Score:306 bits(784), Expect:4e-90, Method:Compositional matrix adjust., Identities:210/567(37%), Positives:308/567(54%), Gaps:23/567(4%) Query 6 KKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFC 65 ++IP T V+LKEG +GP + QWPL++EK++ L+EI D+LEKEGK+ + NTP+FC Sbjct 618 EQIPITPVKLKEGARGPFLKQWPLSKEKIKALQEICDQLEKEGKISKIGGENAYNTPVFC 677 Query 66 IKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYE 124 IKKK +WRML+DFRELNK T+D E QLG+PHP G ++ +T+LDIGDAY++IPL Sbjct 678 IKKKDKSQWRMLVDFRELNKATQDFFEVQLGIPHPSGFEKMTEITVLDIGDAYYSIPLDP 737 Query 125 PYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGI 184 +R+YT FT+ S NN GP RY + LPQGWK SP ++Q T IL +E + Sbjct 738 EFRKYTAFTIPSVNNQGPGTRYQFNCLPQGWKGSPTIFQNTAASILEEIKKELKPLTIVQ 797 Query 185 YMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQ 244 YMDD+++GS H +V +L ++ +G P+ K Q+ P +W+G++L P KW+ Sbjct 798 YMDDLWVGSQEDEYTHDRLVEQLRMKLSAWGLETPDKKVQKKPPYEWMGYKLWPHKWQIS 857 Query 245 KHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIESIHV 303 L + E T+N +Q+LVG L W L G N+ KL+ G + L Sbjct 858 SIELEDKEE--WTVNDIQRLVGKLNWAAQLYPGLRTKNLCKLIRGKKNLLETVTWTEEAE 915 Query 304 REWEACRQKLK-EMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKN-- 360 E+ ++ LK E EG YY + I + F+++G+ L V KN Sbjct 916 AEYAENKEILKTEQEGTYYKPGRPIRAAVQKLEGGQWSYQFKQEGQVLKVGKYTKQKNTH 975 Query 361 LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDW-ILELQMGNINWMPS--FWSCYK 417 ++ + + QKL +E ++ G++P + LP E W ++W+P F S Sbjct 976 TNEFRVLAGLVQKLCKESLVIWGELPVLELPIEREVWEQWWADYWQVSWIPDWEFVSTPP 1035 Query 418 GSVRWKKRNVIAELVPG-PTYYTDGG-KKNGR-GSLGYIASTG-EKFRIHEEGTNQQLEL 473 W + E +P YY DG +N R G GYI G ++ E TNQQ EL Sbjct 1036 LVKLW--YTLTKEPIPKEDVYYVDGACNRNSREGKAGYITQYGKQRVEKLENTTNQQAEL 1093 Query 474 RAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIGVHWVP 533 AI+ A + +NIVTDS+YA + + +P+ +I+ L+ K +I + WVP Sbjct 1094 MAIKMALEDSGPNVNIVTDSQYAMGILTAQPTQS--DSPLIEQIIALMVQKHQIYLQWVP 1151 Query 534 GHKGIPQNEEIDRYISE-----IFLAK 555 KGI NEEID+ +S+ +FL K Sbjct 1152 ADKGIGGNEEIDKLVSQGMRKILFLEK 1178 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr160Gag-Pol; Contains: RecName: Full=Matrix protein p17; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p7; Short=NC; Contains: RecName: Full=p6-pol; Short=p6*; Contains: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=Exoribonuclease H; AltName: Full=p66 RT; Contains: RecName: Full=p51 RT; Contains: RecName: Full=p15; Contains: RecName: Full=Integrase; Short=IN [Simian immunodeficiency virus - mac K6W] Sequence ID: P05897.2 Length: 1446 Range 1: 595 to 1149 Score:300 bits(769), Expect:4e-88, Method:Compositional matrix adjust., Identities:212/566(37%), Positives:302/566(53%), Gaps:26/566(4%) Query 2 NLEEKKIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNT 61 NL K+ +V LK G GP + QWPL++EK+ L+EI +++EK+G++ APP NT Sbjct 595 NLPIAKVEPVKVALKPGKVGPKLKQWPLSKEKIVALREICEKMEKDGQLEEAPPTNPYNT 654 Query 62 PIFCIKKKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTI 120 P F IKKK KWRMLI FRELN+ T++L + + +PHP GL ++K +T+LDIGDAYF+I Sbjct 655 PTFAIKKKDKNKWRMLIHFRELNRVTQELYRSPIRIPHPAGLAKRKRITVLDIGDAYFSI 714 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 PL E +RQYT FT+ S NN P RY +KVLPQGWK SPA++Q+TM+ +L + + +P + Sbjct 715 PLDEEFRQYTAFTLPSVNNAEPGKRYIYKVLPQGWKGSPAIFQYTMRHVLEPFRKANPDV 774 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEK 240 YMDDI I SD EH +V +L + GF PE+K Q+ P +W+G+EL P K Sbjct 775 TLVQYMDDILIASDRTDLEHDRVVLQLKELLNSIGFSTPEEKFQKDPPFQWMGYELWPTK 834 Query 241 WKFQKHTLPEITEGPITLNKLQKLVGDLVWRQSLI-GKSIPNILKLMEGDRALQSERYIE 299 WK QK LP+ T+N +QKLVG L W + G ++ +L+ G L Sbjct 835 WKLQKIELPQ--RETWTVNDIQKLVGVLNWAAQIYPGIKTKHLCRLIRGKMTLTEAVQWT 892 Query 300 SIHVREWEACRQKL-KEMEGNYYDEEKDIYGQ-LDWGNKAIEYIVFQEKGKPLWVNVVHS 357 + E+E L +E EG YY E K + + + Y + QE K L V Sbjct 893 EMAEAEYEENNIILSQEQEGCYYQEGKPLEATVIKSQDNQWTYKIHQED-KILKVRKFAK 951 Query 358 IKN--LSqaqqiikaaqkLTQEVIIRTGKIPWILLPGREEDWIL-ELQMGNINWMPSF-W 413 IKN + + + QK+ +E I+ G++P LP + W + W+P + + Sbjct 952 IKNTHTNGVRLLAHVIQKIGKEAIVIVGQVPKFHLPVERDVWEQWWTDYWQVTWIPEWDF 1011 Query 414 SCYKGSVRWKKRNVIAELVPGP-----TYYTDGG--KKNGRGSLGYIASTGEKFRIHEEG 466 VR ++ LV P TYYTDG K++ G GYI G+ Sbjct 1012 ISTPPLVR-----LVFNLVKDPIEVEETYYTDGSCNKQSKEGKAGYITDRGKDIVKVLTT 1066 Query 467 TNQQLELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK 526 TNQQ EL AI + K NI+ + + Y R E R + I E++ K + Sbjct 1067 TNQQAELEAIYHGIEDSGPKRNIIVELQVCYGNNNRFPTESESR-LVNQIIEEMI--KVR 1123 Query 527 IGVHWVPGHKGIPQNEEIDRYISEIF 552 + V WVP +GI N+EI +S+ F Sbjct 1124 VYVAWVPALEGIGGNQEIGPLVSQGF 1149 >RecName: Full=Intracisternal A-particle Pol-related polyprotein; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Mouse intracisternal A-particle MIA14] Sequence ID: P11368.1 Length: 867 Range 1: 31 to 566 Score:160 bits(406), Expect:2e-40, Method:Compositional matrix adjust., Identities:149/547(27%), Positives:239/547(43%), Gaps:40/547(7%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QW L+ EKLE + ++V+ K G + + W NTPIF IKKKSGKWR+L D R +N Sbjct 31 VPQWHLSSEKLEAVIQLVEEQLKLGHIDPSTSPW--NTPIFVIKKKSGKWRLLHDLRPIN 88 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 +Q Q GLP L R ++ I+DI D +F+IPL R FT+ S N+ P Sbjct 89 EQMNLFGPVQRGLPVLSALPRGWNLIIIDIKDCFFSIPLCPRDRPRFAFTIPSINSDEPD 148 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 RY WKVLPQG SP + Q +Q+ L E+ P + +YMDDI + L + Sbjct 149 NRYQWKVLPQGMSNSPTMCQLYVQEALLPVREQFPSLILLLYMDDILLCHK-ELTMLQKA 207 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPI-TLNKLQ 262 L ++Q+G + +K Q ++LG + P+K QK EI + TLN Q Sbjct 208 YPFLLKTLSQWGLQIATEKVQISDTGQFLGSVVSPDKIVPQKV---EIRRDHLHTLNNFQ 264 Query 263 KLVGDLVWRQSLIGKSIPNILKL---MEGDRALQSERYIESIHVREWEACRQKLKEMEGN 319 KL+GD+ W + + + L +EGD + S R + + + + L+ + Sbjct 265 KLLGDINWLRPFLKIPSAELRPLFWYLEGDPHISSPRTLTLAANQALQKVEKALQNAQLQ 324 Query 320 YYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKNLSqaqqiikaaqkL--TQE 377 E+ + + + V + G LW++ S + A L + Sbjct 325 AI-EDSQPFSLCVFKTAQLPTAVLWQNGPLLWIHPNVSPAKIIDWYPDAIAQLALKGLKA 383 Query 378 VIIRTGKIPWILLPGREEDWILELQMGNINWM-----------------PSFWSCYKGSV 420 I G+ P++L+ + L + +W P SV Sbjct 384 AITHFGRSPYLLIVPYTAAQVQTLAATSNDWAVLVTSFSGKIDNHYPKHPILQFAQNQSV 443 Query 421 RWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEAC 480 + + V L G YTDG K G Y+A+ + + E + + +E + E Sbjct 444 VFPQITVRNPLKNGIVVYTDGSKT---GIGAYVANGKVVSKQYNENSPRMVECLVVLEVL 500 Query 481 KQGPEKMNIVTDSRY---AYEFMLRNWDEEV---IRNPIQARIMELVHNKEKIGVHWVPG 534 K E +NIV+DS Y A + W ++ + N Q +I ++ ++ + + V Sbjct 501 KTFLEPLNIVSDSCYVVNAVNLLEGGWSDKPSSRVANIFQ-QIQLVLLSRSPVYITHVRA 559 Query 535 HKGIPQN 541 H G+P + Sbjct 560 HSGLPTS 566 >RecName: Full=Endogenous retrovirus group K member 18 Pol protein; AltName: Full=HERV-K(C1a) Pol protein; AltName: Full=HERV-K110 Pol protein; AltName: Full=HERV-K18 Pol protein; AltName: Full=HERV-K_1q23.3 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Ribonuclease H; Short=RNase H [Homo sapiens] Sequence ID: Q9QC07.2 Length: 812 Range 1: 36 to 593 Score:159 bits(402), Expect:3e-40, Method:Compositional matrix adjust., Identities:144/575(25%), Positives:258/575(44%), Gaps:59/575(10%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKS KWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSSKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ ++ Y DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVRDKFSDCYIIHYFDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGPITL 258 L + +A G + DK Q P +LG ++ P+K + +K TL TL Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK-------TL 265 Query 259 NKLQKLVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 N QKL+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ Sbjct 266 NDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRMLTPEATKEIKLVEEKIQS 325 Query 316 MEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkL 374 + N D + + + I+ Q W + HS +K + I Sbjct 326 AQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGP 385 Query 375 TQEVIIR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRN 426 T+ II+ G P I++P +E Q+G N++ + Y + ++ Sbjct 386 TRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQFLK 445 Query 427 VIAELVP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEE--GTNQQLELR 474 + ++P T +TDG S G +A TG K R+ + + Q+ EL Sbjct 446 LTTWILPKITRREPLENALTVFTDG------SSNGKVAYTGPKERVIKTPYQSAQRAELV 499 Query 475 AIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEKIG------ 528 A+ + + +NI++DS Y + R+ + +I+ + ++ +L + ++ Sbjct 500 AVITVLQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFNLLQQTVRKRNFP 558 Query 529 --VHWVPGHKGIP-----QNEEIDRYISEIFLAKE 556 + + H +P NE+ D +S F+ + Sbjct 559 FYITHIRAHTNLPGPLTKANEQADLLVSSAFIKAQ 593 >RecName: Full=Endogenous retrovirus group K member 11 Pol protein; AltName: Full=HERV-K_3q27.2 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: Q9UQG0.2 Length: 969 Range 1: 36 to 593 Score:159 bits(403), Expect:4e-40, Method:Compositional matrix adjust., Identities:146/570(26%), Positives:258/570(45%), Gaps:49/570(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ E+ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLNKLQK 263 L + +A G + DK Q P +LG ++ K K QK + + T TLN QK Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK--TLNDFQK 270 Query 264 LVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEMEGNY 320 L+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ + N Sbjct 271 LLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRILTPEATKEIKLVEEKIQSAQINR 330 Query 321 YDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkLTQEVI 379 D + + + I+ Q W + HS +K + I T+ I Sbjct 331 IDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRLRI 390 Query 380 IR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRNVIAEL 431 I+ G P I++P +E Q+G N++ + Y + ++ + + Sbjct 391 IKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQFLKMTTWI 450 Query 432 VP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEG--TNQQLELRAIEEA 479 +P T +TDG NG+ A TG K R+ + + Q+ EL A+ Sbjct 451 LPKITRREPLENALTVFTDGS-SNGKA-----AYTGPKERVIKTPYQSAQRAELVAVITV 504 Query 480 CKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK--------IGVHW 531 + + +NI++DS Y + R+ + +I+ + ++ +L + ++ + Sbjct 505 LQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFNLLQQTVRKRNFPFYITH 563 Query 532 VPGHKGIP-----QNEEIDRYISEIFLAKE 556 + H +P NEE D +S + + Sbjct 564 IRAHTNLPGPLTKANEEADLLVSSALIKAQ 593 >RecName: Full=Endogenous retrovirus group K member 8 Pol protein; AltName: Full=HERV-K115 Pol protein; AltName: Full=HERV-K_8p23.1 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: P63133.1 Length: 956 Range 1: 36 to 523 Score:158 bits(399), Expect:1e-39, Method:Compositional matrix adjust., Identities:136/504(27%), Positives:232/504(46%), Gaps:45/504(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ ++ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVRKKFSDCYIIHYIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGPITL 258 L + +A G + DK Q P +LG ++ P+K + +K TL TL Sbjct 213 YTFLQAEVASAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK-------TL 265 Query 259 NKLQKLVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 N QKL+GD+ W Q +G ++ N+ ++ GD L S+R + +E + +K++ Sbjct 266 NDFQKLLGDINWIQPTLGIPTYAMSNLFSILRGDSDLNSKRILTPEATKEIKLVEEKIQS 325 Query 316 MEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkL 374 + N D + + + I+ Q W + HS +K + I Sbjct 326 AQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQ 385 Query 375 TQEVIIR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRN 426 T+ II+ G P I++P +E Q+G N++ + Y + ++ Sbjct 386 TRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQFLK 445 Query 427 VIAELVP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEE--GTNQQLELR 474 + ++P T +TDG NG+ A TG K R+ + + Q+ EL Sbjct 446 LTTWILPKITRREPLENALTVFTDGS-SNGKA-----AYTGPKERVIKTPYQSAQRAELV 499 Query 475 AIEEACKQGPEKMNIVTDSRYAYE 498 A+ + + +NI++DS Y + Sbjct 500 AVITVLQDFDQPINIISDSAYVVQ 523 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein; Contains: RecName: Full=p12; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease 17 kDa; Contains: RecName: Full=Protease 13 kDa; Contains: RecName: Full=G-patch peptide; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Jaagsiekte sheep retrovirus] Sequence ID: P31623.2 Length: 1726 Range 1: 890 to 1375 Score:158 bits(400), Expect:1e-39, Method:Compositional matrix adjust., Identities:135/507(27%), Positives:232/507(45%), Gaps:55/507(10%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPLTQEKL +++V + G + + W N+PIF IKKKSGKWR+L D R++N Sbjct 890 VDQWPLTQEKLSAAQQLVQEQLRLGHIEPSTSAW--NSPIFVIKKKSGKWRLLQDLRKVN 947 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + K ++ ++D+ D ++TIPL + F++ S N P Sbjct 948 ETMMHMGALQPGLPTPSAIPDKSYIIVIDLKDCFYTIPLAPQDCKRFAFSLPSVNFKEPM 1007 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH--R 201 RY W+VLPQG SP + Q + + + P + YMDDI + +EH Sbjct 1008 QRYQWRVLPQGMTNSPTLCQKFVATAIAPVRQRFPQLYLVHYMDDILLAHT---DEHLLY 1064 Query 202 GIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLNKL 261 + L +++ G ++ ++K Q +P +LGF L+P + Q L T+ TLN Sbjct 1065 QAFSILKQHLSLNGLVIADEKIQTHFPYNYLGFSLYPRVYNTQLVKLQ--TDHLKTLNDF 1122 Query 262 QKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEMEG 318 QKL+GD+ W + L ++ + +++GD S R + ++ + +++ + Sbjct 1123 QKLLGDINWIRPYLKLPTYTLQPLFDILKGDSDPASPRTLSLEGRTALQSIEEAIRQQQI 1182 Query 319 NYYDEEKDIYGQLDWG------NKAIEYIVFQEKGKPL---WVNVVHSIKNLSqaqqiik 369 Y D ++ WG +A +++Q+ KPL +++ + L + + K Sbjct 1183 TYCDYQR------SWGLYILPTPRAPTGVLYQD--KPLRWIYLSATPTKHLLPYYELVAK 1234 Query 370 aaqkLTQEVIIRTG-KIPWILLPG--REEDWILELQMGNINWMPSFWSCYKGSVRWK--- 423 K E I G + P+I +P ++DW+ + + NW +F + Y G + Sbjct 1235 IIAKGRHEAIQYFGMEPPFICVPYALEQQDWLFQF---SDNWSIAF-ANYPGQITHHYPS 1290 Query 424 --------------KRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQ 469 + V + +P T G NG +L I + + + Q Sbjct 1291 DKLLQFASSHAFIFPKIVRRQPIPEATLIFTDGSSNGTAAL--IINHQTYYAQTSFSSAQ 1348 Query 470 QLELRAIEEACKQGPEKMNIVTDSRYA 496 +EL A+ +A P N+ TDS Y Sbjct 1349 VVELFAVHQALLTVPTSFNLFTDSSYV 1375 >RecName: Full=Endogenous retrovirus group K member 7 Pol protein; AltName: Full=HERV-K(III) Pol protein; AltName: Full=HERV-K102 Pol protein; AltName: Full=HERV-K_1q22 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: P63135.1 Length: 1459 Range 1: 36 to 546 Score:158 bits(400), Expect:1e-39, Method:Compositional matrix adjust., Identities:139/523(27%), Positives:243/523(46%), Gaps:36/523(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ E+ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDILCAAETR-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLNKLQK 263 L + +A G + DK Q P +LG ++ K K QK + + T TLN QK Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKQQKIEIRKDTLK--TLNDFQK 270 Query 264 LVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEMEGNY 320 L+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ + N Sbjct 271 LLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRILTPEATKEIKLVEEKIQSAQINR 330 Query 321 YDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkLTQEVI 379 D + + + I+ Q W + HS +K + I T+ I Sbjct 331 IDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRLRI 390 Query 380 IR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRNVIAEL 431 I+ G P I++P +E Q+G N++ + Y + ++ + + Sbjct 391 IKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQFLKLTTWI 450 Query 432 VP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEE--GTNQQLELRAIEEA 479 +P T +TDG NG+ A TG K R+ + + Q+ EL A+ Sbjct 451 LPKITRREPLENALTVFTDGS-SNGKA-----AYTGPKERVIKTPYQSAQRAELVAVITV 504 Query 480 CKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH 522 + + +NI++DS Y + R+ + +I+ + ++ +L + Sbjct 505 LQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFN 546 >RecName: Full=Intracisternal A-particle Pol-related polyprotein; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Golden hamster intracisternal A-particle H18] Sequence ID: P04026.1 Length: 863 Range 1: 20 to 519 Score:156 bits(395), Expect:3e-39, Method:Compositional matrix adjust., Identities:145/532(27%), Positives:229/532(43%), Gaps:48/532(9%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 ++QWPL+ EKLE + +V E+ G + + W NTPIF IKKKSGKWR+L D R +N Sbjct 20 VSQWPLSSEKLEAVTRLVQEQERLGHLEPSTSPW--NTPIFVIKKKSGKWRLLHDLRAIN 77 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 Q Q GLP L + + I+DI D +F+IPLY R FT+ S N++ P Sbjct 78 NQMHLFGPVQRGLPLLSALPQDWKLIIIDIKDCFFSIPLYPRDRPRFAFTIPSLNHMEPD 137 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q +Q+ L ++ + YMDDI I L+ + Sbjct 138 KRFQWKVLPQGMANSPTICQLYVQEALEPIRKQFTSLIVIHYMDDILICHK-ELDVLQKA 196 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPI-TLNKLQ 262 L + + Q+G + +K Q +LG ++ P+ QK EI + + TLN Q Sbjct 197 FPMLVAELKQWGLEIASEKVQIADTGLFLGSKITPKNIVPQKI---EIRKDHLQTLNDFQ 253 Query 263 KLVGDLVWRQSLI---GKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEMEGN 319 KL+GD+ W + + + + L+EG+ + S R R + + L+E + Sbjct 254 KLLGDINWLRPFLKIPSADLKPLFDLLEGEPHISSPRKFTPAAHRALQMVEEALQEAQIT 313 Query 320 YYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKNLSqaqqiikaaqkLTQEVI 379 K I DW A+ Q + + + V H ++ +I Sbjct 314 TNSPAKII----DWYPDAVA----QPRSR-IKAAVTHFGRD--------------PDSLI 350 Query 380 IRTGKIPWILLPGREEDWILEL-----QMGN-INWMPSFWSCYKGSVRWKKRNVIAELVP 433 + L DW + + Q+ N P ++ + + L Sbjct 351 VPYTAAQVQTLAATSSDWAVLVTSFSGQIDNHFPKHPILQFALNQAIVFPQVTAKDPLPD 410 Query 434 GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEEACKQGPEKMNIVTDS 493 G YTDG K G G+ Y+ + + E + Q +E + E + P +NIV+DS Sbjct 411 GTVVYTDGS-KTGLGA--YVVKDRVISKQYNETSPQVVECLIVLEVLEAFPGPLNIVSDS 467 Query 494 RYAYEFMLRNWDEEVIR------NPIQARIMELVHNKEKIGVHWVPGHKGIP 539 Y + +IR N Q L++ + + + V H G+P Sbjct 468 SYVVNAVNLLEIAGIIRSSSRVANIFQKIQAALLNRRFPVFITHVRAHSGLP 519 >RecName: Full=Endogenous retrovirus group K member 113 Pol protein; AltName: Full=HERV-K113 Pol protein; AltName: Full=HERV-K_19p13.11 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: P63132.1 Length: 956 Range 1: 36 to 546 Score:155 bits(392), Expect:1e-38, Method:Compositional matrix adjust., Identities:140/530(26%), Positives:243/530(45%), Gaps:50/530(9%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ ++ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVRDKFSDCYIIHYIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGPITL 258 L + +A G + DK Q P +LG ++ P+K + +K TL TL Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK-------TL 265 Query 259 NKLQKLVGDLVWRQSLIGKSIP-----NILKLMEGDRALQSERYIESIHVREWEACRQKL 313 N QKL+GD+ W + +G IP N+ ++ GD L S+R + +E + +K+ Sbjct 266 NDFQKLLGDINWIRPTLG--IPTYVMSNLFSILRGDSDLNSKRMLTPETTKEIKLVEEKI 323 Query 314 KEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaq 372 + + N D + + + I+ Q W + HS +K + I Sbjct 324 QSAQINRIDPLAPLRLLIFATAHSPIGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLI 383 Query 373 kLTQEVIIR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKK 424 T+ II+ G P I++P +E Q+G N++ + Y + ++ Sbjct 384 GQTRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQF 443 Query 425 RNVIAELVP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEE--GTNQQLE 472 + ++P T +TDG S G A TG K R+ + + Q+ E Sbjct 444 LKLTTWILPKITRREPLENALTVFTDG------SSNGKAAYTGLKERVIKTPYQSAQRAE 497 Query 473 LRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH 522 L A+ + + +NI++DS Y + R+ + +I+ + ++ +L + Sbjct 498 LVAVITVLQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFN 546 >RecName: Full=Endogenous retrovirus group K member 6 Pol protein; AltName: Full=HERV-K(C7) Pol protein; AltName: Full=HERV-K(HML-2.HOM) Pol protein; AltName: Full=HERV-K108 Pol protein; AltName: Full=HERV-K_7p22.1 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: Q9BXR3.2 Length: 956 Range 1: 36 to 546 Score:154 bits(388), Expect:3e-38, Method:Compositional matrix adjust., Identities:137/528(26%), Positives:242/528(45%), Gaps:46/528(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ E+ +DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHCIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGPITL 258 L + +A G + DK Q P +LG ++ P+K + +K TL TL Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK-------TL 265 Query 259 NKLQKLVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 N QKL+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ Sbjct 266 NDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRMLTPEATKEIKLVEEKIQS 325 Query 316 MEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkL 374 + N D + + + I+ Q W + HS +K + I Sbjct 326 AQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQ 385 Query 375 TQEVIIR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRN 426 T+ II+ G P I++P +E ++G N++ + Y + ++ Sbjct 386 TRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAWKIGLANFVGIIDNHYPKTKIFQFLK 445 Query 427 VIAELVP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEE--GTNQQLELR 474 + ++P T +TDG S G A TG K R+ + + Q+ EL Sbjct 446 LTTWILPKITRREPLENALTVFTDG------SSNGKAAYTGPKERVIKTPYQSAQRAELV 499 Query 475 AIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH 522 A+ + + +NI++DS Y + R+ + +I+ + ++ +L + Sbjct 500 AVITVLQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFN 546 >RecName: Full=Endogenous retrovirus group K member 10 Pol protein; AltName: Full=HERV-K10 Pol protein; AltName: Full=HERV-K107 Pol protein; AltName: Full=HERV-K_5q33.3 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: P10266.2 Length: 1014 Range 1: 36 to 593 Score:153 bits(387), Expect:5e-38, Method:Compositional matrix adjust., Identities:142/570(25%), Positives:256/570(44%), Gaps:49/570(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKW L D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWHTLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ E+ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLNKLQK 263 L + +A G + DK Q P +LG ++ K K QK + + T TLN QK Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK--TLNDFQK 270 Query 264 LVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEMEGNY 320 L+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ + N Sbjct 271 LLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSQRILTPEATKEIKLVEEKIQSAQINR 330 Query 321 YDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaaqkLTQEVI 379 D + + + I+ Q W + HS +K + I T+ I Sbjct 331 IDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRLRI 390 Query 380 IR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWKKRNVIAEL 431 + G P I++P +E Q+G N++ + Y + ++ + + Sbjct 391 TKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGLIDNHYPKTKIFQFLKLTTWI 450 Query 432 VP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEG--TNQQLELRAIEEA 479 +P T +TDG NG+ A TG K R+ + + Q+ EL A+ Sbjct 451 LPKITRREPLENALTVFTDGS-SNGKA-----AYTGPKERVIKTPYQSAQRDELVAVITV 504 Query 480 CKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK--------IGVHW 531 + + +NI++DS Y + R+ + +I+ + ++ +L + ++ + + Sbjct 505 LQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFNLLQQTVRKRNFPFYITY 563 Query 532 VPGHKGIP-----QNEEIDRYISEIFLAKE 556 + H +P NE+ D +S + + Sbjct 564 IRAHTNLPGPLTKANEQADLLVSSALIKAQ 593 >RecName: Full=Endogenous retrovirus group K member 25 Pol protein; AltName: Full=HERV-K_11q22.1 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: P63136.1 Length: 954 Range 1: 36 to 332 Score:153 bits(386), Expect:6e-38, Method:Compositional matrix adjust., Identities:95/307(31%), Positives:156/307(50%), Gaps:18/307(5%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 94 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 153 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG SP + Q + + L+ E+ Y+DDI ++ ++ Sbjct 154 TRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDILCAAETK-DKLIDC 212 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGPITL 258 L + +A G + DK Q P +LG ++ P+K + +K TL L Sbjct 213 YTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLK-------AL 265 Query 259 NKLQKLVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 N QKL+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K++ Sbjct 266 NDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRMLTPEATKEIKLVEEKIQS 325 Query 316 MEGNYYD 322 + N D Sbjct 326 AQINRID 332 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=pp32 [Avian leukosis virus] Sequence ID: Q7SQ98.1 Length: 895 Range 1: 21 to 553 Score:152 bits(385), Expect:8e-38, Method:Compositional matrix adjust., Identities:159/558(28%), Positives:241/558(43%), Gaps:67/558(12%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTC-NTPIFCIKKKSGKWRMLIDFREL 82 I QWPL + KL L ++V EKE ++G P +C NTP+F I+K SG +R+L D R + Sbjct 21 IDQWPLPEGKLVALTQLV---EKELQLGHIEPSLSCWNTPVFVIRKASGSYRLLHDLRAV 77 Query 83 NKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 N + Q G P L R + +LD+ D +F+IPL E R+ FT+ S NN P Sbjct 78 NAKLVPFGAVQQGAPVLSALPRGWPLMVLDLKDCFFSIPLAEQDREAFAFTLPSVNNQAP 137 Query 143 CVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRG 202 R+ WKVLPQG SP + Q + ++L +HP ++ YMDD+ + + H G Sbjct 138 ARRFQWKVLPQGMTCSPTICQLVVGQVLEPLRLKHPSLRMLHYMDDLLLAAS----SHDG 193 Query 203 IV---NELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPE--ITEGPI- 256 + E+ S + + GF + DK Q ++LG++L + P + E I Sbjct 194 LEAAGEEVISTLERAGFTISPDKIQREPGVQYLGYKLG------STYVAPVGLVAEPRIA 247 Query 257 TLNKLQKLVGDLVWRQSLIG--------------KSIPNILKLMEGDRALQSERYIESIH 302 TL +QKLVG L W + +G S PN + D + ++ Sbjct 248 TLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQLRGSDPNEAREWNLDMKMAWREIVQLST 307 Query 303 VREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKP-LWVNVVHSIKNL 361 E L +EG E+ G L G +P LW+ K Sbjct 308 TAALERWDPALP-LEGAVARCEQGAIGVLGQG--------LSTHPRPCLWLFSTQPTKAF 358 Query 362 SqaqqiikaaqkLTQEVIIRT-GK-IPWILLPG--RE-----EDWILELQ--MGNINW-- 408 + +++ + +RT GK + +LLP RE E +L L+ G I Sbjct 359 TAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFREDLPLPEGILLALKGFAGKIRSSD 418 Query 409 MPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHE---- 464 PS + + K V VPGPT +TD +G + + G ++ I E Sbjct 419 TPSIFDIARPLHVSLKVRVTDHPVPGPTVFTDASSSTHKGVVVW--REGPRWEIKEIADS 476 Query 465 EGTNQQLELRAIEEACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIME--LV 521 + QQLE RA+ A P N+VTDS + + +L+ +E + + A I+E L Sbjct 477 GASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAKMLLKM-GQEGVPSTAAAFILEDALS 535 Query 522 HNKEKIGVHWVPGHKGIP 539 V V H +P Sbjct 536 QRSAMAAVLHVRSHSEVP 553 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein pp21; Contains: RecName: Full=Protein p3; Contains: RecName: Full=Protein p8; Contains: RecName: Full=Protein n; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Mouse mammary tumor virus (STRAIN BR6)] Sequence ID: P03365.3 Length: 1755 Range 1: 884 to 1452 Score:152 bits(384), Expect:1e-37, Method:Compositional matrix adjust., Identities:150/588(26%), Positives:254/588(43%), Gaps:65/588(11%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL QEKL+ L+++V + G + + W NTP+F IKKKSGKWR+L D R +N Sbjct 884 LNQWPLKQEKLQALQQLVTEQLQLGHLEESNSPW--NTPVFVIKKKSGKWRLLQDLRAVN 941 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 D+ Q GLP P + + + I+D+ D +F I L+ + F++ SPN P Sbjct 942 ATMHDMGALQPGLPSPVAVPKGWEIIIIDLQDCFFNIKLHPEDCKRFAFSVPSPNFKRPY 1001 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG K SP + Q + K + +++ YMDDI + R I Sbjct 1002 QRFQWKVLPQGMKNSPTLCQKFVDKAILTVRDKYQDSYIVHYMDDILLA-----HPSRSI 1056 Query 204 VNELASYIAQ----YGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 V+E+ + + Q +G ++ +K Q+ K+LG + + +QK L T+ TLN Sbjct 1057 VDEILTSMIQALNKHGLVVSTEKIQKYDNLKYLGTHIQGDSVSYQK--LQIRTDKLRTLN 1114 Query 260 KLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACR--QKLK 314 QKL+G++ W + L + + +++ GD S R + EAC+ Q + Sbjct 1115 DFQKLLGNINWIRPFLKLTTGELKPLFEILNGDSNPISTRKLTP------EACKALQLMN 1168 Query 315 EMEGNYYDEEKDIYGQLDWGNKAIEYI---VFQEKGKPLWVNVVHSIKNLSqaqqiikaa 371 E + D+ EY + G W+++ H + I Sbjct 1169 ERLSTARVKRLDLSQPWSLCILKTEYTPTACLWQDGVVEWIHLPHISPKVITPYDIFCTQ 1228 Query 372 qkLTQE-------------VIIRTGKIPWILLPGREEDWILELQ--MGNINWM----PSF 412 + +++ K+ + LL +EDW + L +G +++ P Sbjct 1229 LIIKGRHRSKELFSKDPDYIVVPYTKVQFDLLLQEKEDWPISLLGFLGEVHFHLPKDPLL 1288 Query 413 WSCYKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLE 472 + ++ + L G +TDG NGR S+ YI + + + T QQ E Sbjct 1289 TFTLQTAIIFPHMTSTTPLEKGIVIFTDGS-ANGR-SVTYIQGREPIIKENTQNTAQQAE 1346 Query 473 LRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH-------NKE 525 + A+ A ++ + N+ TDS+Y E +P EL H +E Sbjct 1347 IVAVITAFEEVSQPFNLYTDSKYVTGLFPEI--ETATLSPRTKIYTELKHLQRLIHKRQE 1404 Query 526 KIGVHWVPGHKGIP--------QNEEIDRYISEIFLAKEGRGILQKRA 565 K + + GH G+P + + R ++ + A+E + + A Sbjct 1405 KFYIGHIRGHTGLPGPLAQGNAYADSLTRILTALESAQESHALHHQNA 1452 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p19; Contains: RecName: Full=p2A; Contains: RecName: Full=p2B; Contains: RecName: Full=p10; Contains: RecName: Full=p3; Contains: RecName: Full=Capsid protein p27, alternate cleaved 1; Contains: RecName: Full=Capsid protein p27, alternate cleaved 2; Contains: RecName: Full=Nucleocapsid protein p12; Contains: RecName: Full=Protease p15; Contains: RecName: Full=Reverse transcriptase beta-subunit; Short=RT-beta; Contains: RecName: Full=Reverse transcriptase alpha-subunit; Short=RT-alpha; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=pp32; Contains: RecName: Full=p4 [Rous sarcoma virus (strain Schmidt-Ruppin A)] Sequence ID: Q04095.2 Length: 1603 Range 1: 729 to 1261 Score:151 bits(382), Expect:2e-37, Method:Compositional matrix adjust., Identities:159/558(28%), Positives:241/558(43%), Gaps:67/558(12%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTC-NTPIFCIKKKSGKWRMLIDFREL 82 I QWPL + KL L ++V EKE ++G P +C NTP+F I+K SG +R+L D R + Sbjct 729 IDQWPLPEGKLVALTQLV---EKELQLGHIEPSLSCWNTPVFVIRKASGSYRLLHDLRAV 785 Query 83 NKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 N + Q G P L R + +LD+ D +F+IPL E R+ FT+ S NN P Sbjct 786 NAKLVPFGAVQQGAPVLSALPRGWPLMVLDLKDCFFSIPLAEQDREAFAFTLPSVNNQAP 845 Query 143 CVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRG 202 R+ WKVLPQG SP + Q + ++L +HP ++ YMDD+ + + H G Sbjct 846 ARRFQWKVLPQGMTCSPTICQLVVGQVLEPLRLKHPSLRMLHYMDDLLLAAS----SHDG 901 Query 203 IV---NELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPE--ITEGPI- 256 + E+ S + + GF + DK Q ++LG++L + P + E I Sbjct 902 LEAAGEEVISTLERAGFTISPDKIQREPGVQYLGYKLG------STYVAPVGLVAEPRIA 955 Query 257 TLNKLQKLVGDLVWRQSLIG--------------KSIPNILKLMEGDRALQSERYIESIH 302 TL +QKLVG L W + +G S PN + D + ++ Sbjct 956 TLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQLRGSDPNEAREWNLDMKMAWREIVQLST 1015 Query 303 VREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKP-LWVNVVHSIKNL 361 E L +EG E+ G L G +P LW+ K Sbjct 1016 TAALERWDPALP-LEGAVARCEQGAIGVLGQG--------LSTHPRPCLWLFSTQPTKAF 1066 Query 362 SqaqqiikaaqkLTQEVIIRT-GK-IPWILLPG--RE-----EDWILELQ--MGNINW-- 408 + +++ + +RT GK + +LLP RE E +L L+ G I Sbjct 1067 TAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFREDLPLPEGILLALRGFAGKIRSSD 1126 Query 409 MPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHE---- 464 PS + + K V VPGPT +TD +G + + G ++ I E Sbjct 1127 TPSIFDIARPLHVSLKVRVTDHPVPGPTAFTDASSSTHKGVVVW--REGPRWEIKEIADL 1184 Query 465 EGTNQQLELRAIEEACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIME--LV 521 + QQLE RA+ A P N+VTDS + + +L+ +E + + A I+E L Sbjct 1185 GASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAKMLLK-MGQEGVPSTAAAFILEDALS 1243 Query 522 HNKEKIGVHWVPGHKGIP 539 V V H +P Sbjct 1244 QRSAMAAVLHVRSHSEVP 1261 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p19; Contains: RecName: Full=p2A; Contains: RecName: Full=p2B; Contains: RecName: Full=p10; Contains: RecName: Full=Capsid protein p27, alternate cleaved 1; Contains: RecName: Full=Capsid protein p27, alternate cleaved 2; Contains: RecName: Full=Spacer peptide; Short=SP; AltName: Full=p3; Contains: RecName: Full=Nucleocapsid protein p12; Contains: RecName: Full=Protease p15; Contains: RecName: Full=Reverse transcriptase beta-subunit; Short=RT-beta; Contains: RecName: Full=Reverse transcriptase alpha-subunit; Short=RT-alpha; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=pp32; Contains: RecName: Full=p4 [Rous sarcoma virus - Prague C] Sequence ID: P03354.2 Length: 1603 Range 1: 729 to 1261 Score:151 bits(381), Expect:3e-37, Method:Compositional matrix adjust., Identities:159/561(28%), Positives:242/561(43%), Gaps:73/561(13%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTC-NTPIFCIKKKSGKWRMLIDFREL 82 I QWPL + KL L ++V EKE ++G P +C NTP+F I+K SG +R+L D R + Sbjct 729 IDQWPLPEGKLVALTQLV---EKELQLGHIEPSLSCWNTPVFVIRKASGSYRLLHDLRAV 785 Query 83 NKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 N + Q G P L R + +LD+ D +F+IPL E R+ FT+ S NN P Sbjct 786 NAKLVPFGAVQQGAPVLSALPRGWPLMVLDLKDCFFSIPLAEQDREAFAFTLPSVNNQAP 845 Query 143 CVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRG 202 R+ WKVLPQG SP + Q + ++L +HP + YMDD+ + + H G Sbjct 846 ARRFQWKVLPQGMTCSPTICQLVVGQVLEPLRLKHPSLCMLHYMDDLLLAAS----SHDG 901 Query 203 IV---NELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPE--ITEGPI- 256 + E+ S + + GF + DK Q ++LG++L + P + E I Sbjct 902 LEAAGEEVISTLERAGFTISPDKVQREPGVQYLGYKLG------STYVAPVGLVAEPRIA 955 Query 257 TLNKLQKLVGDLVWRQSLIG--------------KSIPNILKLMEGDRAL---QSERYIE 299 TL +QKLVG L W + +G S PN + D + + R Sbjct 956 TLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQLRGSDPNEAREWNLDMKMAWREIVRLST 1015 Query 300 SIHVREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKP-LWVNVVHSI 358 + + W+ +EG E+ G L G +P LW+ Sbjct 1016 TAALERWDPAL----PLEGAVARCEQGAIGVLGQG--------LSTHPRPCLWLFSTQPT 1063 Query 359 KNLSqaqqiikaaqkLTQEVIIRT-GK-IPWILLPG--RE-----EDWILELQ--MGNIN 407 K + +++ + +RT GK + +LLP RE E +L L+ G I Sbjct 1064 KAFTAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFREDLPLPEGILLALKGFAGKIR 1123 Query 408 W--MPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHE- 464 PS + + K V VPGPT +TD +G + + G ++ I E Sbjct 1124 SSDTPSIFDIARPLHVSLKVRVTDHPVPGPTVFTDASSSTHKGVVVW--REGPRWEIKEI 1181 Query 465 ---EGTNQQLELRAIEEACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIME- 519 + QQLE RA+ A P N+VTDS + + +L+ +E + + A I+E Sbjct 1182 ADLGASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAKMLLK-MGQEGVPSTAAAFILED 1240 Query 520 -LVHNKEKIGVHWVPGHKGIP 539 L V V H +P Sbjct 1241 ALSQRSAMAAVLHVRSHSEVP 1261 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein pp21; Contains: RecName: Full=Protein p3; Contains: RecName: Full=Protein p8; Contains: RecName: Full=Protein n; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Mouse mammary tumor virus (STRAIN C3H)] Sequence ID: P11283.2 Length: 1755 Range 1: 884 to 1452 Score:151 bits(381), Expect:4e-37, Method:Compositional matrix adjust., Identities:145/582(25%), Positives:253/582(43%), Gaps:53/582(9%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL QEKL+ L+++V + G + + W NTP+F IKKKSGKWR+L D R +N Sbjct 884 LNQWPLKQEKLQALQQLVTEQLQLGHLEESNSPW--NTPVFVIKKKSGKWRLLQDLRAVN 941 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 D+ Q GLP P + + + I+D+ D +F I L+ + F++ SPN P Sbjct 942 ATMHDMGALQPGLPSPVAVPKGWEIIIIDLQDCFFNIKLHPEDCKRFAFSVPSPNFKRPY 1001 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 R+ WKVLPQG K SP + Q + K + +++ YMDDI + R I Sbjct 1002 QRFQWKVLPQGMKNSPTLCQKFVDKAILTVRDKYQDSYIVHYMDDILLA-----HPSRSI 1056 Query 204 VNELASYIAQ----YGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 V+E+ + + Q +G ++ +K Q+ K+LG + + +QK L T+ TLN Sbjct 1057 VDEILTSMIQALNKHGLVVSTEKIQKYDNLKYLGTHIQGDAVSYQK--LQIRTDKLRTLN 1114 Query 260 KLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEM 316 QKL+G++ W + L + + +++ GD S R + + + ++L Sbjct 1115 DFQKLLGNINWIRPFLKLTTGELKPLFEILNGDSNPISIRKLTPEACKALQLVNERLSIA 1174 Query 317 EGNYYDEEKD----------IYGQLDWGNKAIEYIVFQEKGKPLWV--NVVHSIKNLSqa 364 D + W N +E+I + ++ + + Sbjct 1175 RVKRLDLSRPWSLCILKTEYTPTACLWQNGVLEWIHLPHISPKVITPYDIFCTQLIIKGR 1234 Query 365 qqiikaaqkLTQEVIIRTGKIPWILLPGREEDWILELQ--MGNINWM----PSFWSCYKG 418 + + K +++ K+ + LL +EDW + L +G +++ P + Sbjct 1235 HRSKELFSKDPDYIVVPYTKVQFDLLLQEKEDWPISLLGFLGEVHFHLPKDPLLTFTLQT 1294 Query 419 SVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRAIEE 478 ++ + L G +TDG NGR S+ YI + + + T QQ E+ A+ Sbjct 1295 AIIFPHMTSTTPLEKGIVIFTDGS-ANGR-SVTYIQGREPIIKENTQNTAQQAEIVAVIT 1352 Query 479 ACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH-------NKEKIGVHW 531 A ++ + N+ TDS+Y E +P EL H +EK + Sbjct 1353 AFEEVSQSFNLYTDSKYVTGLFPEI--ETATLSPRTKIYTELRHLQRLIHKRQEKFYIGH 1410 Query 532 VPGHKGIP--------QNEEIDRYISEIFLAKEGRGILQKRA 565 + GH G+P + + R ++ + A+E + + A Sbjct 1411 IRGHTGLPGPLAQGNAYADSLTRILTALESAQESHALHHQNA 1452 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p19; Contains: RecName: Full=p2A; Contains: RecName: Full=p2B; Contains: RecName: Full=p10; Contains: RecName: Full=Capsid protein p27, alternate cleaved 1; Contains: RecName: Full=Capsid protein p27, alternate cleaved 2; Contains: RecName: Full=p3; Contains: RecName: Full=Nucleocapsid protein p12; Contains: RecName: Full=Protease p15; Contains: RecName: Full=Reverse transcriptase beta-subunit; Short=RT-beta; Contains: RecName: Full=Reverse transcriptase alpha-subunit; Short=RT-alpha; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=pp32; Contains: RecName: Full=p4 [Rous sarcoma virus - Schmidt-Ruppin B] Sequence ID: O92956.2 Length: 1603 Range 1: 729 to 1261 Score:150 bits(380), Expect:4e-37, Method:Compositional matrix adjust., Identities:158/558(28%), Positives:241/558(43%), Gaps:67/558(12%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTC-NTPIFCIKKKSGKWRMLIDFREL 82 I QWPL + KL L ++V EKE ++G P +C NTP+F I+K SG +R+L D R + Sbjct 729 IDQWPLPEGKLVALTQLV---EKELQLGHIEPSLSCWNTPVFVIRKASGSYRLLHDLRAV 785 Query 83 NKQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 N + Q G P L R + +LD+ D +F+IPL E R+ FT+ S NN P Sbjct 786 NAKLVPFGAVQQGAPVLSALPRGWPLMVLDLKDCFFSIPLAEQDREAFAFTLPSVNNQAP 845 Query 143 CVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRG 202 R+ WKVLPQG SP + Q + ++L +HP ++ YMDD+ + + H G Sbjct 846 ARRFQWKVLPQGMTCSPTICQLVVGQVLEPLRLKHPSLRMLHYMDDLLLAAS----SHDG 901 Query 203 IV---NELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPE--ITEGPI- 256 + E+ + + + GF + DK Q ++LG++L + P + E I Sbjct 902 LEAAGEEVINTLERAGFTISPDKIQREPGVQYLGYKLG------STYVAPVGLVAEPRIA 955 Query 257 TLNKLQKLVGDLVWRQSLIG--------------KSIPNILKLMEGDRALQSERYIESIH 302 TL +QKLVG L W + +G S PN + D + ++ Sbjct 956 TLWDVQKLVGSLQWLRPALGIPPRLMGPFYEQLRGSDPNEAREWNLDMKMAWREIVQLST 1015 Query 303 VREWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKP-LWVNVVHSIKNL 361 E L +EG E+ G L G +P LW+ K Sbjct 1016 TAALERWDPALP-LEGAVVRCEQGAIGVLGQG--------LSTHPRPCLWLFSTQPTKAF 1066 Query 362 SqaqqiikaaqkLTQEVIIRT-GK-IPWILLPG--RE-----EDWILELQ--MGNINW-- 408 + +++ + +RT GK + +LLP RE E +L L+ G I Sbjct 1067 TAWLEVLTLLITKLRASAVRTFGKEVDILLLPACFREDLPLPEGILLALRGFAGKIRSSD 1126 Query 409 MPSFWSCYKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHE---- 464 PS + + K V VPGPT +TD +G + + G ++ I E Sbjct 1127 TPSIFDIARPLHVSLKVRVTDHPVPGPTVFTDASSSTHKGVVVW--REGPRWEIKEIADL 1184 Query 465 EGTNQQLELRAIEEACKQGPEK-MNIVTDSRYAYEFMLRNWDEEVIRNPIQARIME--LV 521 + QQLE RA+ A P N+VTDS + + +L+ +E + + A I+E L Sbjct 1185 GASVQQLEARAVAMALLLWPTTPTNVVTDSAFVAKMLLK-MGQEGVPSTAAAFILEDALS 1243 Query 522 HNKEKIGVHWVPGHKGIP 539 V V H +P Sbjct 1244 QRSAMAAVLHVRSHSEVP 1261 >RecName: Full=Endogenous retrovirus group K member 19 Pol protein; AltName: Full=HERV-K(C19) Pol protein; AltName: Full=HERV-K_19q11 provirus ancestral Pol protein; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H; Short=RNase H; Includes: RecName: Full=Integrase; Short=IN [Homo sapiens] Sequence ID: Q9WJR5.2 Length: 959 Range 1: 36 to 549 Score:147 bits(371), Expect:5e-36, Method:Compositional matrix adjust., Identities:136/531(26%), Positives:241/531(45%), Gaps:49/531(9%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 36 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 93 Query 84 KQT---EDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 + + Q GLP + + + I+D+ D +FTIPL E + FT+ + NN Sbjct 94 AVNAVIQPMGPLQPGLPSLAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNK 153 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 P R+ WKVLPQG SP + Q + + L+ E+ Y+DDI +++ ++ Sbjct 154 EPATRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYIDDILCAAEMK-DKL 212 Query 201 RGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELH-----PEKWKFQKHTLPEITEGP 255 L + +A G + DK Q P +L ++ P K + +K TL Sbjct 213 IDCYTFLQAEVANAGLAIASDKIQTSTPFHYLEMQIENRKIKPPKIEIRKDTLK------ 266 Query 256 ITLNKLQKLVGDLVWRQSLIG---KSIPNILKLMEGDRALQSERYIESIHVREWEACRQK 312 TLN QKL+GD+ W + +G ++ N+ ++ GD L S+R + +E + +K Sbjct 267 -TLNDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDLNSKRMLTPEATKEIKLVEEK 325 Query 313 LKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHS-IKNLSqaqqiikaa 371 ++ + N D + + + I+ Q W + HS +K + + Sbjct 326 IQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQMATL 385 Query 372 qkLTQEVIIR-TGKIP-WILLPGREEDWILEL------QMGNINWMPSFWSCYKGSVRWK 423 T+ II+ G P I++P +E Q+G N++ + Y + ++ Sbjct 386 IGQTRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAWQIGLANFVGIIDNHYPKTKIFQ 445 Query 424 KRNVIAELVP----------GPTYYTDGGKKNGRGSLGYIASTGEKFRI--HEEGTNQQL 471 + ++P T +TDG S G A TG K R+ + + Q+ Sbjct 446 FLKMTTWILPKITRREPLENALTVFTDG------SSNGKAAYTGPKERVIKTQYQSAQRA 499 Query 472 ELRAIEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVH 522 EL A+ + + +NI++DS Y + R+ + +I+ + ++ +L + Sbjct 500 ELVAVITVLQDFDQPINIISDSAYVVQ-ATRDVETALIKYSMDDQLNQLFN 549 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p19; Contains: RecName: Full=Core protein p16; Contains: RecName: Full=Capsid protein p35; AltName: Full=Capsid protein p34; Contains: RecName: Full=Probable nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease 17 kDa; Contains: RecName: Full=Protease 13 kDa; Contains: RecName: Full=G-patch peptide; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Squirrel monkey retrovirus] Sequence ID: P03364.3 Length: 1880 Range 1: 1023 to 1264 Score:144 bits(362), Expect:9e-35, Method:Composition-based stats., Identities:90/247(36%), Positives:120/247(48%), Gaps:5/247(2%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPLT EK +V G + W NTPIF IKKKSG WR+L D R +N Sbjct 1023 VDQWPLTYEKTLAAIALVQEQLAAGHIEPTNSPW--NTPIFIIKKKSGSWRLLQDLRAVN 1080 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 K + Q GLP P + H ++D+ D +FTIPL+ R Y F++ N P Sbjct 1081 KVMVPMGALQPGLPSPVAIPLNYHKIVIDLKDCFFTIPLHPEDRPYFAFSVPQINFQSPM 1140 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGI 203 RY WKVLPQG SP + Q + + + P YMDDI + D E + Sbjct 1141 PRYQWKVLPQGMANSPTLCQKFVAAAIAPVRSQWPEAYILHYMDDILLACD-SAEAAKAC 1199 Query 204 VNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLNKLQK 263 + S + YG + DK Q P +LGFELH ++ + L T+ TLN QK Sbjct 1200 YAHIISCLTSYGLKIAPDKVQVSEPFSYLGFELHHQQVFTPRVCLK--TDHLKTLNDFQK 1257 Query 264 LVGDLVW 270 L+GD+ W Sbjct 1258 LLGDIQW 1264 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein pp24; Contains: RecName: Full=Phosphorylated protein pp18; Contains: RecName: Full=p12; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease 17 kDa; Contains: RecName: Full=Protease 13 kDa; Contains: RecName: Full=G-patch peptide; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Simian retrovirus 2] Sequence ID: P51517.2 Length: 1768 Range 1: 935 to 1247 Score:142 bits(357), Expect:4e-34, Method:Compositional matrix adjust., Identities:102/333(31%), Positives:162/333(48%), Gaps:27/333(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPLTQEKL +++V + G + + W NTPIF IKKKSGKWR+L D R +N Sbjct 935 VDQWPLTQEKLAAAQQLVQEQLQAGHIIESNSPW--NTPIFVIKKKSGKWRLLQDLRAVN 992 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + Q GLP P + + ++D+ D +FTIPL ++ F++ S N P Sbjct 993 ATMVLMGALQPGLPSPVAIPQGYFKIVIDLKDCFFTIPLQPVDQKRFAFSLPSTNFKQPM 1052 Query 144 VRYYWKVLPQGWKLSPAVYQ----FTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 RY WKVLPQG SP + Q ++ + + W + + +I YMDDI I LG E+ Sbjct 1053 KRYQWKVLPQGMANSPTLCQKYVAAAIEPVRKSWAQMY-IIH---YMDDILIAGKLG-EQ 1107 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 +L + G + +K Q P +LGF+++ K QK + + TLN Sbjct 1108 VLQCFAQLKQALTTTGLQIAPEKVQLQDPYTYLGFQINGPKITNQKAVIRR--DKLQTLN 1165 Query 260 KLQKLVGDLVWRQS---LIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKEM 316 QKL+GD+ W + L + + +++GD S R + EA L+++ Sbjct 1166 DFQKLLGDINWLRPYLHLTTGDLKPLFDILKGDSNPNSPRSLS-------EAALASLQKV 1218 Query 317 EGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKP 349 E ++ Q+D+ + + +++F P Sbjct 1219 ETAIAEQ---FVTQIDY-TQPLTFLIFNTTLTP 1247 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Human T-cell leukemia virus 3 (strain Pyl43)] Sequence ID: Q4U0X6.4 Length: 1440 Range 1: 570 to 912 Score:140 bits(354), Expect:9e-34, Method:Compositional matrix adjust., Identities:103/349(30%), Positives:168/349(48%), Gaps:15/349(4%) Query 22 PHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRE 81 P ++Q+PL E+L+ L ++V R + + P N PIF +KK +GKWR + D R Sbjct 570 PEVSQFPLNPERLQALTDLVSRALEAKHI--EPYQGPGNNPIFPVKKPNGKWRFIHDLRA 627 Query 82 LNKQTEDLAEAQLGLPHPGGL-QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 N T DLA G P L Q H+ +D+ DA+F IPL ++ Y FT+ PNN Sbjct 628 TNSLTRDLASPSPGPPDLTSLPQDLPHLRTIDLTDAFFQIPLPAVFQPYFAFTLPQPNNH 687 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 GP RY W+VLPQG+K SP +++ + IL + P YMDDI + S L E Sbjct 688 GPGTRYSWRVLPQGFKNSPTLFEQQLSHILAPVRKAFPNSLIIQYMDDILLASP-ALREL 746 Query 201 RGIVNELASYIAQYGFMLPEDKRQEGYPAK--WLGFELHPEKWKFQKHTLPEITEGPI-T 257 + +++ + + + G + +K Q P +LG + P+ ++ TLP I I + Sbjct 747 TALTDKVTNALTKEGLPMSLEKTQ-ATPGSIHFLGQVISPDCITYE--TLPSIHVKSIWS 803 Query 258 LNKLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLK 314 L +LQ ++G+L W ++ S+ + + G R + + S V+ + ++ L Sbjct 804 LAELQSMLGELQWVSKGTPVLRSSLHQLYLALRGHRDPRDTIELTSTQVQALKTIQKALA 863 Query 315 EMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGK-PL-WVNVVHSIKNL 361 + + I + ++FQ K K PL W++ H +L Sbjct 864 LNCRSRLVSQLPILALIILRPTGTTAVLFQTKQKWPLVWLHTPHPATSL 912 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [HTLV-3 strain 2026ND] Sequence ID: Q0R5R2.3 Length: 1440 Range 1: 570 to 912 Score:138 bits(348), Expect:5e-33, Method:Compositional matrix adjust., Identities:102/348(29%), Positives:166/348(47%), Gaps:13/348(3%) Query 22 PHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRE 81 P ++Q+PL E+L+ L ++V R + + P N PIF +KK +GKWR + D R Sbjct 570 PEVSQFPLNPERLQALTDLVSRALEAKHI--EPYQGPGNNPIFPVKKPNGKWRFIHDLRA 627 Query 82 LNKQTEDLAEAQLGLPHPGGL-QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 N T DLA G P L Q H+ +D+ DA+F IPL ++ Y FT+ PNN Sbjct 628 TNSVTRDLASPSPGPPDLTSLPQGLPHLRTIDLTDAFFQIPLPTIFQPYFAFTLPQPNNY 687 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 GP RY W+VLPQG+K SP +++ + IL + P YMDDI + S E Sbjct 688 GPGTRYSWRVLPQGFKNSPTLFEQQLSHILTPVRKTFPNSLIIQYMDDILLASP-APGEL 746 Query 201 RGIVNELASYIAQYGFML-PEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEI-TEGPITL 258 + +++ + + + G L PE + P +LG + + ++ TLP I + +L Sbjct 747 AALTDKVTNALTKEGLPLSPEKTQATPGPIHFLGQVISQDCITYE--TLPSINVKSTWSL 804 Query 259 NKLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 +LQ ++G+L W ++ S+ + + G R + + SI V+ ++ L Sbjct 805 AELQSMLGELQWVSKGTPVLRSSLHQLYLALRGHRDPRDTIKLTSIQVQALRTIQKALTL 864 Query 316 MEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGK-PL-WVNVVHSIKNL 361 + + I + ++FQ K K PL W++ H +L Sbjct 865 NCRSRLVNQLPILALIMLRPTGTTAVLFQTKQKWPLVWLHTPHPATSL 912 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr180; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein pp24; Contains: RecName: Full=Phosphorylated protein pp18; Contains: RecName: Full=p12; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease 17 kDa; Contains: RecName: Full=Protease 13 kDa; Contains: RecName: Full=G-patch peptide; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Mason-Pfizer monkey virus] Sequence ID: P07572.2 Length: 1771 Range 1: 938 to 1227 Score:136 bits(342), Expect:3e-32, Method:Compositional matrix adjust., Identities:94/299(31%), Positives:145/299(48%), Gaps:16/299(5%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPLT +KL +++V + G + + W NTPIF IKKKSGKWR+L D R +N Sbjct 938 VDQWPLTNDKLAAAQQLVQEQLEAGHITESSSPW--NTPIFVIKKKSGKWRLLQDLRAVN 995 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + Q GLP P + + I+D+ D +F+IPL+ ++ F++ S N P Sbjct 996 ATMVLMGALQPGLPSPVAIPQGYLKIIIDLKDCFFSIPLHPSDQKRFAFSLPSTNFKEPM 1055 Query 144 VRYYWKVLPQGWKLSPAVYQ----FTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 R+ WKVLPQG SP + Q + K+ W + + +I YMDDI I G ++ Sbjct 1056 QRFQWKVLPQGMANSPTLCQKYVATAIHKVRHAWKQMY-IIH---YMDDILIAGKDG-QQ 1110 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 ++L + G + +K Q P +LGFEL+ K QK + + + TLN Sbjct 1111 VLQCFDQLKQELTAAGLHIAPEKVQLQDPYTYLGFELNGPKITNQKAVIRK--DKLQTLN 1168 Query 260 KLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACRQKLKE 315 QKL+GD+ W + L + + ++GD S R + + E + E Sbjct 1169 DFQKLLGDINWLRPYLKLTTGDLKPLFDTLKGDSDPNSHRSLSKEALASLEKVETAIAE 1227 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Contains: RecName: Full=Phosphorylated protein pp24; Contains: RecName: Full=Phosphorylated protein pp18; Contains: RecName: Full=p12; Contains: RecName: Full=Capsid protein p27; Contains: RecName: Full=Nucleocapsid protein-dUTPase; Short=NC-dUTPase; Contains: RecName: Full=Protease 17 kDa; Contains: RecName: Full=Protease 13 kDa; Contains: RecName: Full=G-patch peptide; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Simian retrovirus 1] Sequence ID: P04025.2 Length: 1772 Range 1: 939 to 1212 Score:132 bits(331), Expect:8e-31, Method:Compositional matrix adjust., Identities:92/283(33%), Positives:140/283(49%), Gaps:16/283(5%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPLT EKL +++V + G + + W NTPIF IKKKSGKWR+L D R +N Sbjct 939 VDQWPLTSEKLAAAQQLVQEQLEAGHITESNSPW--NTPIFVIKKKSGKWRLLQDLRAVN 996 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + Q GLP P + + I+D+ D +F+IPL+ ++ F++ S N P Sbjct 997 ATMVLMGALQPGLPSPVAIPQGYLKIIIDLKDCFFSIPLHPSDQKRFAFSLPSTNFKEPM 1056 Query 144 VRYYWKVLPQGWKLSPAVYQ----FTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 R+ WKVLPQ SP + Q + K+ W + + +I YMDDI I G ++ Sbjct 1057 QRFQWKVLPQRMANSPTLCQKYVATAIHKVRHAWKQMY-IIH---YMDDILIAGKDG-QQ 1111 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPITLN 259 ++L + G + +K Q P +LGFEL+ K QK + + + TLN Sbjct 1112 VLQCFDQLKQELTIAGLHIAPEKIQLQDPYTYLGFELNGPKITNQKAVIRK--DKLQTLN 1169 Query 260 KLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIE 299 QKL+GD+ W + L + + ++GD S R + Sbjct 1170 DFQKLLGDINWLRPYLKLTTADLKPLFDTLKGDSNPNSHRSLS 1212 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p49 subunit; Short=p49 RT; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p62 subunit; Short=p62 RT; Contains: RecName: Full=Integrase; Short=IN [HTLV-1 isolate Mel 15] Sequence ID: P0C211.2 Length: 1462 Range 1: 589 to 916 Score:126 bits(317), Expect:4e-29, Method:Compositional matrix adjust., Identities:94/340(28%), Positives:158/340(46%), Gaps:25/340(7%) Query 20 KGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDF 79 + P I+Q+PL E+L+ L+ +V + + G + P N P+F +KK +G WR + D Sbjct 589 RPPEISQFPLNPERLQALQHLVRKALEAGHI--EPYTGPGNNPVFPVKKANGTWRFIHDL 646 Query 80 RELNKQTEDLAEAQLGLPHPGGLQRK-KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPN 138 R N T DL+ + G P L H+ +D+ DA+F IPL + ++ Y FT+ Sbjct 647 RATNSLTVDLSSSSPGPPDLSSLPTTLAHLQTIDLKDAFFQIPLPKQFQPYFAFTVPQQC 706 Query 139 NLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLE 198 N GP RY WKVLPQG+K SP +++ + IL+ + P YMDDI + S + Sbjct 707 NYGPGTRYAWKVLPQGFKNSPTLFEMQLASILQPIRQAFPQCVILQYMDDILLASPSPED 766 Query 199 EHRGIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQKHTLPEITEGPI- 256 + +AS I+ +G + +DK Q+ K+LG + P + + PI Sbjct 767 LQQLSEATMASLIS-HGLPVSQDKTQQTPGTIKFLGQIISPNHITYD-----AVPTVPIR 820 Query 257 ---TLNKLQKLVGDLVWRQSLIGKSIPNILK-------LMEGDRALQSERYIESIHVREW 306 L +LQ L+G++ W + K P + + ++G + + Y+ V+ Sbjct 821 SRWALPELQALLGEIQW----VSKGTPTLRQPLHSLYCALQGHTDPRDQIYLNPSQVQSL 876 Query 307 EACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEK 346 +Q L + + + + G + +VFQ K Sbjct 877 MQLQQALSQNCRSRLAQTLPLLGAIMLTLTGTTTVVFQSK 916 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p12-pro; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Bovine leukemia virus (AUSTRALIAN ISOLATE)] Sequence ID: P25059.2 Length: 1416 Range 1: 570 to 890 Score:126 bits(317), Expect:4e-29, Method:Compositional matrix adjust., Identities:91/331(27%), Positives:157/331(47%), Gaps:21/331(6%) Query 29 LTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQTED 88 L E+L+ L+++V R + G + +P N P+F ++K +G WR + D R N T+ Sbjct 570 LNLERLQALQDLVHRSLEAGYI--SPWDGPGNNPVFPVRKPNGAWRFVHDLRVTNALTKP 627 Query 89 LAEAQLGLPHPGGLQRK-KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYY 147 + G P + H+ LD+ DA+F IP+ + +R Y FT+ +P L P R+ Sbjct 628 IPALSPGPPDLTAIPTHLPHIICLDLKDAFFQIPVEDRFRSYFAFTLPTPGGLQPHRRFA 687 Query 148 WKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR-GIVNE 206 W+VLPQG+ SPA+++ +Q+ LR YMDDI S EE R Sbjct 688 WRVLPQGFINSPALFERALQEPLRQVSAAFSQSLLVSYMDDILYVSP--TEEQRLQCYQT 745 Query 207 LASYIAQYGFMLPEDK-RQEGYPAKWLGFELHPEKWKFQKHTLPEI-TEGPITLNKLQKL 264 +A+++ GF + +K RQ P +LG +H +Q +LP + PI+L++LQ + Sbjct 746 MAAHLRDLGFQVASEKTRQTPSPVPFLGQMVHERMVTYQ--SLPTLQISSPISLHQLQTV 803 Query 265 VGDLVWRQSLIGKSIPNILK----LMEGDRALQSERYIESIHVREWEA---CRQKLKEME 317 +GDL W + + P + L + + R I + + + RQ L Sbjct 804 LGDLQW----VSRGTPTTRRPLQLLYSSLKGIDDPRAIIHLSPEQQQGIAELRQALSHNA 859 Query 318 GNYYDEEKDIYGQLDWGNKAIEYIVFQEKGK 348 + Y+E++ + + ++FQ+ + Sbjct 860 RSRYNEQEPLLAYVHLTRAGSTLVLFQKGAQ 890 >RecName: Full=Gag-Pro-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p12-pro; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Bovine leukemia virus (JAPANESE ISOLATE BLV-1)] Sequence ID: P03361.2 Length: 1416 Range 1: 570 to 890 Score:125 bits(315), Expect:7e-29, Method:Compositional matrix adjust., Identities:92/331(28%), Positives:155/331(46%), Gaps:21/331(6%) Query 29 LTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQTED 88 L E+L+ L+++V R + G + +P N P+F ++K +G WR + D R N T+ Sbjct 570 LNLERLQALQDLVHRSLEAGYI--SPWDGPGNNPVFPVRKPNGAWRFVHDLRATNALTKP 627 Query 89 LAEAQLGLPHPGGL-QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYY 147 + G P + H+ LD+ DA+F IP+ + +R Y FT+ SP L P R+ Sbjct 628 IPALSPGPPDLTAIPTHPPHIICLDLKDAFFQIPVEDRFRFYLSFTLPSPGGLQPHRRFA 687 Query 148 WKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRG-IVNE 206 W+VLPQG+ SPA+++ +Q+ LR YMDDI S EE R Sbjct 688 WRVLPQGFINSPALFERALQEPLRQVSAAFSQSLLVSYMDDILYASP--TEEQRSQCYQA 745 Query 207 LASYIAQYGFMLPEDK-RQEGYPAKWLGFELHPEKWKFQKHTLPEI-TEGPITLNKLQKL 264 LA+ + GF + +K Q P +LG +H + +Q +LP + PI+L++LQ + Sbjct 746 LAARLRDLGFQVASEKTSQTPSPVPFLGQMVHEQIVTYQ--SLPTLQISSPISLHQLQAV 803 Query 265 VGDLVWRQSLIGKSIPNILK----LMEGDRALQSERYIESIHVREWEA---CRQKLKEME 317 +GDL W + + P + L + R I + + + RQ L Sbjct 804 LGDLQW----VSRGTPTTRRPLQLLYSSLKRHHDPRAIIQLSPEQLQGIAELRQALSHNA 859 Query 318 GNYYDEEKDIYGQLDWGNKAIEYIVFQEKGK 348 + Y+E++ + + ++FQ+ + Sbjct 860 RSRYNEQEPLLAYVHLTRAGSTLVLFQKGAQ 890 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p49 subunit; Short=p49 RT; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p62 subunit; Short=p62 RT; Contains: RecName: Full=Integrase; Short=IN [Human T-cell lymphotrophic virus type 1 (Caribbean isolate)] Sequence ID: P14078.3 Length: 1462 Range 1: 589 to 916 Score:124 bits(312), Expect:2e-28, Method:Compositional matrix adjust., Identities:96/340(28%), Positives:158/340(46%), Gaps:25/340(7%) Query 20 KGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDF 79 + P I+Q+PL E+L+ L+ +V + + G + P N P+F +KK +G WR + D Sbjct 589 RPPEISQFPLNPERLQALQHLVRKALEAGHI--EPYTGPGNNPVFPVKKANGTWRFIHDL 646 Query 80 RELNKQTEDLAEAQLGLPHPGGLQRK-KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPN 138 R N T DL+ + G P L H+ +D+ DA+F IPL + ++ Y FT+ Sbjct 647 RATNSLTIDLSSSSPGPPDLSSLPTTLAHLQTIDLKDAFFQIPLPKQFQPYFAFTVPQQC 706 Query 139 NLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLE 198 N GP RY W+VLPQG+K SP +++ + IL+ + P YMDDI + S + Sbjct 707 NYGPGTRYAWRVLPQGFKNSPTLFEMQLAHILQPIRQAFPQCTILQYMDDILLASPSHAD 766 Query 199 EHRGIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQKHTLPEITEGPI- 256 +AS I+ +G + E+K Q+ K+LG + P + + + PI Sbjct 767 LQLLSEATMASLIS-HGLPVSENKTQQTPGTIKFLGQIISPNHLTYD-----AVPKVPIR 820 Query 257 ---TLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDR-ALQ------SERYIESIHVREW 306 L +LQ L+G++ W + K P + + + ALQ + Y+ V+ Sbjct 821 SRWALPELQALLGEIQW----VSKGTPTLRQPLHSLYCALQRHTDPRDQIYLNPSQVQSL 876 Query 307 EACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEK 346 RQ L + + + + G + +VFQ K Sbjct 877 VQLRQALSQNCRSRLVQTLPLLGAIMLTLTGTTTVVFQSK 916 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Human T-lymphotropic virus 2] Sequence ID: P03363.4 Length: 1461 Range 1: 588 to 931 Score:124 bits(312), Expect:2e-28, Method:Compositional matrix adjust., Identities:96/353(27%), Positives:163/353(46%), Gaps:22/353(6%) Query 22 PHIAQWPLT-QEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFR 80 P + Q+PL E+L+ L ++V + + G + P N P+F +KK +GKWR + D R Sbjct 588 PQVDQFPLNLPERLQALNDLVSKALEAGHI--EPYSGPGNNPVFPVKKPNGKWRFIHDLR 645 Query 81 ELNKQTEDLAEAQLGLPHPGGLQRK-KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNN 139 N T L G P L H+ +D+ DA+F IPL + Y+ Y FT+ P N Sbjct 646 ATNAITTTLTSPSPGPPDLTSLPTALPHLQTIDLTDAFFQIPLPKQYQPYFAFTIPQPCN 705 Query 140 LGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEE 199 GP RY W VLPQG+K SP +++ + +L + P YMDDI + S EE Sbjct 706 YGPGTRYAWTVLPQGFKNSPTLFEQQLAAVLNPMRKMFPTSTIVQYMDDILLASPTN-EE 764 Query 200 HRGIVNELASYIAQYGFMLPEDKRQEGYPA--KWLGFELHPEKWKFQKHTLPEITEGPI- 256 + + + +G + ++K Q+ P ++LG + P ++ + P I PI Sbjct 765 LQQLSQLTLQALTTHGLPISQEKTQQT-PGQIRFLGQVISPNHITYE--STPTI---PIK 818 Query 257 ---TLNKLQKLVGDLVWRQ---SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACR 310 TL +LQ ++G++ W ++ K + ++ + G R ++ + + A + Sbjct 819 SQWTLTELQVILGEIQWVSKGTPILRKHLQSLYSALHGYRDPRACITLTPQQLHALHAIQ 878 Query 311 QKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGK-PL-WVNVVHSIKNL 361 Q L+ + + G + ++FQ K PL W++ H +L Sbjct 879 QALQHNCRGRLNPALPLLGLISLSTSGTTSVIFQPKQNWPLAWLHTPHPPTSL 931 >RecName: Full=Gag-Pro-Pol polyprotein; AltName: Full=Pr160Gag-Pro-Pol; Contains: RecName: Full=Matrix protein p19; Short=MA; Contains: RecName: Full=Capsid protein p24; Short=CA; Contains: RecName: Full=Nucleocapsid protein p15-pro; Short=NC'; Short=NC-pro; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=p1; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p49 subunit; Short=p49 RT; Contains: RecName: Full=Reverse transcriptase/ribonuclease H, p62 subunit; Short=p62 RT; Contains: RecName: Full=Integrase; Short=IN [Human T-cell lymphotrophic virus type 1 (strain ATK)] Sequence ID: P03362.3 Length: 1462 Range 1: 589 to 916 Score:124 bits(310), Expect:3e-28, Method:Compositional matrix adjust., Identities:99/342(29%), Positives:161/342(47%), Gaps:29/342(8%) Query 20 KGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDF 79 + P I+Q+PL E+L+ L+ +V + + G + P N P+F +KK +G WR + D Sbjct 589 RPPQISQFPLNPERLQALQHLVRKALEAGHI--EPYTGPGNNPVFPVKKANGTWRFIHDL 646 Query 80 RELNKQTEDLAEAQLGLPHPGGLQRK-KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPN 138 R N T DL+ + G P L H+ +D+ DA+F IPL + ++ Y FT+ Sbjct 647 RATNSLTIDLSSSSPGPPDLSSLPTTLAHLQTIDLRDAFFQIPLPKQFQPYFAFTVPQQC 706 Query 139 NLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLE 198 N GP RY WKVLPQG+K SP +++ + IL+ + P YMDDI + S Sbjct 707 NYGPGTRYAWKVLPQGFKNSPTLFEMQLAHILQPIRQAFPQCTILQYMDDILLASP--SH 764 Query 199 EHRGIVNE--LASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQKHTLPEITEGP 255 E +++E +AS I+ +G + E+K Q+ K+LG + P + + P Sbjct 765 EDLLLLSEATMASLIS-HGLPVSENKTQQTPGTIKFLGQIISPNHLTYD-----AVPTVP 818 Query 256 I----TLNKLQKLVGDLVWRQSLIGKSIPNILKLMEGDR-ALQ------SERYIESIHVR 304 I L +LQ L+G++ W + K P + + + ALQ + Y+ V+ Sbjct 819 IRSRWALPELQALLGEIQW----VSKGTPTLRQPLHSLYCALQRHTDPRDQIYLNPSQVQ 874 Query 305 EWEACRQKLKEMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEK 346 RQ L + + + + G + +VFQ K Sbjct 875 SLVQLRQALSQNCRSRLVQTLPLLGAIMLTLTGTTTVVFQSK 916 >RecName: Full=Endogenous retrovirus group K member 9 Pol protein; AltName: Full=HERV-K(C6) Gag-Pol protein; AltName: Full=HERV-K109 Gag-Pol protein; AltName: Full=HERV-K_6q14.1 provirus ancestral Gag-Pol polyprotein; Includes: RecName: Full=Protease; AltName: Full=PR; AltName: Full=Retropepsin; Includes: RecName: Full=Reverse transcriptase/ribonuclease H; AltName: Full=p66 RT [Homo sapiens] Sequence ID: P63128.3 Length: 1117 Range 1: 959 to 1104 Score:109 bits(272), Expect:1e-23, Method:Compositional matrix adjust., Identities:57/148(39%), Positives:86/148(58%), Gaps:2/148(1%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 + QWPL ++KLE L + + ++G + + W N+P+F I+KKSGKWRML D R +N Sbjct 959 VNQWPLPKQKLEALHLLANEQLEKGHIEPSFSPW--NSPVFVIQKKSGKWRMLTDLRAVN 1016 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPC 143 + + Q GLP P + + + I+D+ D +FTIPL E + FT+ + NN P Sbjct 1017 AVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPA 1076 Query 144 VRYYWKVLPQGWKLSPAVYQFTMQKILR 171 R+ WKVLPQG SP + Q + + L+ Sbjct 1077 TRFQWKVLPQGMLNSPTICQTFVGRALQ 1104 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p10; Short=MA; Contains: RecName: Full=p20; Contains: RecName: Full=Capsid protein p25; Short=CA; Contains: RecName: Full=Nucleocapsid protein p14; Short=NC-pol; Contains: RecName: Full=Protease p15; Short=PR; Contains: RecName: Full=Reverse transcriptase/ribonuclease H p90; Short=RT; Contains: RecName: Full=Integrase p46; Short=IN [Walleye dermal sarcoma virus] Sequence ID: O92815.2 Length: 1752 Range 1: 756 to 953 Score:95.1 bits(235), Expect:5e-19, Method:Compositional matrix adjust., Identities:64/209(31%), Positives:107/209(51%), Gaps:18/209(8%) Query 8 IPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIK 67 +P +++K+ P I Q+PL ++K EGL+ ++ LE +G + + H CNTPIF IK Sbjct 756 VPPITIKIKDNASLPSIRQYPLPKDKTEGLRPLISSLENQGILIKC--HSPCNTPIFPIK 813 Query 68 KKS-GKWRMLIDFRELNKQTEDLAEAQLGLPHP--GGLQRKKH-VTILDIGDAYFTIPLY 123 K ++RM+ D R +N L A + P L H T++D+ +A+F++P++ Sbjct 814 KAGRDEYRMIHDLRAINNIVAPLT-AVVASPTTVLSNLAPSLHWFTVIDLSNAFFSVPIH 872 Query 124 EPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFG 183 + + FT +Y W VLPQG+ SP ++ + + L I+ + Sbjct 873 KDSQYLFAFTFEG-------HQYTWTVLPQGFIHSPTLFSQALYQSLHK-IKFKISSEIC 924 Query 184 IYMDDIYIGS---DLGLEEHRGIVNELAS 209 IYMDD+ I S D L++ ++ LAS Sbjct 925 IYMDDVLIASKDRDTNLKDTAVMLQHLAS 953 >RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87Pro-RT-RNaseH; Contains: RecName: Full=Protease/Reverse transcriptase; AltName: Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H; Short=RNase H; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42In [Human spumaretrovirus] Sequence ID: P14350.2 Length: 1143 Range 1: 167 to 412 Score:89.7 bits(221), Expect:2e-17, Method:Compositional matrix adjust., Identities:79/270(29%), Positives:137/270(50%), Gaps:32/270(11%) Query 26 QWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 Q+P+ + ++ ++D L K+G + P + T NTP++ + K G+WRM++D+RE+NK Sbjct 167 QYPINPKAKPSIQIVIDDLLKQGVL--TPQNSTMNTPVYPVPKPDGRWRMVLDYREVNK- 223 Query 86 TEDLAEAQLGLPHPGG----LQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T L AQ H G + R+K+ T LD+ + ++ P+ T FT Sbjct 224 TIPLTAAQNQ--HSAGILATIVRQKYKTTLDLANGFWAHPITPESYWLTAFTWQGK---- 277 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 +Y W LPQG+ SPA++ + +L+ E P +Q +Y+DDIY+ D +EH Sbjct 278 ---QYCWTRLPQGFLNSPALFTADVVDLLK----EIPNVQ--VYVDDIYLSHD-DPKEHV 327 Query 202 GIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQ---KHTLPEITEGPIT 257 + ++ + Q G+++ K + G ++LGF + E K L IT P Sbjct 328 QQLEKVFQILLQAGYVVSLKKSEIGQKTVEFLGFNITKEGRGLTDTFKTKLLNITP-PKD 386 Query 258 LNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 L +LQ ++G L + ++ IPN +L++ Sbjct 387 LKQLQSILGLLNFARNF----IPNFAELVQ 412 >RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87Pro-RT-RNaseH; Contains: RecName: Full=Protease/Reverse transcriptase; AltName: Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H; Short=RNase H; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42In [Pan troglodytes foamy virus] Sequence ID: Q87040.1 Length: 1146 Range 1: 167 to 412 Score:89.0 bits(219), Expect:4e-17, Method:Compositional matrix adjust., Identities:78/270(29%), Positives:135/270(50%), Gaps:32/270(11%) Query 26 QWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 Q+P+ + ++ ++D L K+G + P + T NTP++ + K G+WRM++D+RE+NK Sbjct 167 QYPINPKAKPSIQIVIDDLLKQGVL--TPQNSTMNTPVYPVPKPDGRWRMVLDYREVNK- 223 Query 86 TEDLAEAQLGLPHPGG----LQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T L AQ H G + R+K+ T LD+ + ++ P+ T FT Sbjct 224 TIPLTAAQNQ--HSAGILATIVRQKYKTTLDLANGFWAHPITPDSYWLTAFTWQGK---- 277 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 +Y W LPQG+ SPA++ +L+ E P +Q +Y+DDIY+ D EH Sbjct 278 ---QYCWTRLPQGFLNSPALFTADAVDLLK----EVPNVQ--VYVDDIYLSHD-NPHEHI 327 Query 202 GIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQ---KHTLPEITEGPIT 257 + ++ + Q G+++ K + G ++LGF + E K L +T P Sbjct 328 QQLEKVFQILLQAGYVVSLKKSEIGQRTVEFLGFNITKEGRGLTDTFKTKLLNVTP-PKD 386 Query 258 LNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 L +LQ ++G L + ++ IPN +L++ Sbjct 387 LKQLQSILGLLNFARNF----IPNFAELVQ 412 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Reticuloendotheliosis virus] Sequence ID: P03360.2 Length: 1152 Range 1: 139 to 377 Score:84.3 bits(207), Expect:1e-15, Method:Compositional matrix adjust., Identities:65/246(26%), Positives:113/246(45%), Gaps:14/246(5%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 + Q+P+T E L+E + + G + P H NTP+ ++K + ++RM+ D RE+ Sbjct 139 VRQYPITLEAKRSLRETIRKFRAAGIL--RPVHSPWNTPLLPVRKSGTSEYRMVQDLREV 196 Query 83 NKQTEDLAEAQLGLPHPGGLQR-----KKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ E + +P+P L + ++LD+ DA+F IPL P Q + Sbjct 197 NKRVETIHPT---VPNPYTLLSLLPPDRIWYSVLDLKDAFFCIPL-APESQLIFAFEWAD 252 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 G + W LPQG+K SP ++ + + L+G+ +HP + Y+DD+ I +D Sbjct 253 AEEGESGQLTWTRLPQGFKNSPTLFDEALNRDLQGFRLDHPSVSLLQYVDDLLIAADTQ- 311 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQKHTLPEITEGPI 256 +L +A+ G+ + K Q +LGF++H I + P+ Sbjct 312 AACLSATRDLLMTLAELGYRVSGKKAQLCQEEVTYLGFKIHKGSRSLSNSRTQAILQIPV 371 Query 257 TLNKLQ 262 K Q Sbjct 372 PKTKRQ 377 >RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87Pro-RT-RNaseH; Contains: RecName: Full=Protease/Reverse transcriptase; AltName: Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H; Short=RNase H; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42In [Macaque simian foamy virus] Sequence ID: P23074.3 Length: 1149 Range 1: 167 to 404 Score:82.4 bits(202), Expect:5e-15, Method:Compositional matrix adjust., Identities:76/258(29%), Positives:128/258(49%), Gaps:28/258(10%) Query 26 QWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 Q+P+ + ++ ++D L K+G + + + T NTP++ + K GKWRM++D+RE+NK Sbjct 167 QYPINPKAKPSIQIVIDDLLKQGVLIQQ--NSTMNTPVYPVPKPDGKWRMVLDYREVNK- 223 Query 86 TEDLAEAQLGLPHPGG----LQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T L AQ H G + R K+ T LD+ + ++ P+ T FT Sbjct 224 TIPLIAAQNQ--HSAGILSSIYRGKYKTTLDLTNGFWAHPITPESYWLTAFTWQGK---- 277 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 +Y W LPQG+ SPA++ + +L+ E P +Q Y+DDIYI D +EH Sbjct 278 ---QYCWTRLPQGFLNSPALFTADVVDLLK----EIPNVQ--AYVDDIYISHD-DPQEHL 327 Query 202 GIVNELASYIAQYGFMLPEDKRQEGY-PAKWLGFELHPEKWKFQ---KHTLPEITEGPIT 257 + ++ S + G+++ K + ++LGF + E K L IT P Sbjct 328 EQLEKIFSILLNAGYVVSLKKSEIAQREVEFLGFNITKEGRGLTDTFKQKLLNITP-PKD 386 Query 258 LNKLQKLVGDLVWRQSLI 275 L +LQ ++G L + ++ I Sbjct 387 LKQLQSILGLLNFARNFI 404 >RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87Pro-RT-RNaseH; Contains: RecName: Full=Protease/Reverse transcriptase; AltName: Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H; Short=RNase H; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42In [Simian foamy virus (TYPE 3 / STRAIN LK3)] Sequence ID: P27401.2 Length: 1143 Range 1: 167 to 412 Score:82.0 bits(201), Expect:5e-15, Method:Compositional matrix adjust., Identities:77/270(29%), Positives:134/270(49%), Gaps:32/270(11%) Query 26 QWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 Q+P+ + ++ +++ L K+G + + + NTP++ + K GKWRM++D+RE+NK Sbjct 167 QYPINPKAKASIQTVINDLLKQGVLIQQ--NSIMNTPVYPVPKPDGKWRMVLDYREVNK- 223 Query 86 TEDLAEAQLGLPHPGGLQ----RKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T L AQ H G+ R K+ T LD+ + ++ + T FT L Sbjct 224 TIPLIAAQNQ--HSAGILSSIFRGKYKTTLDLSNGFWAHSITPESYWLTAFTWLGQ---- 277 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 +Y W LPQG+ SPA++ + +L+ E P +Q +Y+DDIYI D EH Sbjct 278 ---QYCWTRLPQGFLNSPALFTADVVDLLK----EVPNVQ--VYVDDIYISHD-DPREHL 327 Query 202 GIVNELASYIAQYGFMLPEDKRQEG-YPAKWLGFELHPEKWKFQ---KHTLPEITEGPIT 257 + ++ S + G+++ K + + ++LGF + E K L IT P Sbjct 328 EQLEKVFSLLLNAGYVVSLKKSEIAQHEVEFLGFNITKEGRGLTETFKQKLLNITP-PRD 386 Query 258 LNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 L +LQ ++G L + ++ IPN +L++ Sbjct 387 LKQLQSILGLLNFARNF----IPNFSELVK 412 >RecName: Full=Pro-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87Pro-RT-RNaseH; Contains: RecName: Full=Protease/Reverse transcriptase; AltName: Full=p65Pro-RT; Contains: RecName: Full=Ribonuclease H; Short=RNase H; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42In [Feline foamy virus] Sequence ID: O93209.1 Length: 1156 Range 1: 194 to 401 Score:80.5 bits(197), Expect:2e-14, Method:Compositional matrix adjust., Identities:62/223(28%), Positives:111/223(49%), Gaps:20/223(8%) Query 58 TCNTPIFCIKKKSGKWRMLIDFRELNKQTEDLA-EAQLGLPHPGGLQRKKHVTILDIGDA 116 T NTP++ + K +G+WRM++D+R +NK T +A + Q G L + ++ T +D+ + Sbjct 194 TMNTPVYPVPKPNGRWRMVLDYRAVNKVTPLIAVQNQHSYGILGSLFKGRYKTTIDLSNG 253 Query 117 YFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEE 176 ++ P+ T FT +Y W VLPQG+ SP ++ + +L+G Sbjct 254 FWAHPIVPEDYWITAFTWQGK-------QYCWTVLPQGFLNSPGLFTGDVVDLLQG---- 302 Query 177 HPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQ-EGYPAKWLGFE 235 + +Y+DD+YI D +EH ++ L + + + G+++ K +LGF+ Sbjct 303 --IPNVEVYVDDVYISHD-SEKEHLEYLDILFNRLKEAGYIISLKKSNIANSIVDFLGFQ 359 Query 236 LHPEKWKFQ---KHTLPEITEGPITLNKLQKLVGDLVWRQSLI 275 + E K L IT P TL +LQ ++G L + ++ I Sbjct 360 ITNEGRGLTDTFKEKLENIT-APTTLKQLQSILGLLNFARNFI 401 >RecName: Full=Intracisternal A-particle Pol-related polyprotein; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Mouse intracisternal A-particle MIAIL3] Sequence ID: P12894.1 Length: 814 Range 1: 66 to 484 Score:78.2 bits(191), Expect:8e-14, Method:Compositional matrix adjust., Identities:103/430(24%), Positives:172/430(40%), Gaps:43/430(10%) Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIG-SDLGLEEH 200 P RY WKVLPQG SP + Q +QK L E+ P + +YMDDI + DL + + Sbjct 66 PDKRYQWKVLPQGMSNSPTMCQLYVQKALLPVREQFPSLILLLYMDDILLCHKDLTMLQK 125 Query 201 RGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPI-TLN 259 L ++Q+G + +K Q ++LG + P+K QK EI + TLN Sbjct 126 AYPF--LLKTLSQWGLQIATEKVQISDTGQFLGSVVSPDKIVPQKV---EIRRDHLHTLN 180 Query 260 KLQKLVGDLVWRQSLIGKSIPN-----ILKLMEGDRALQSERYIESIHVREWEACRQKLK 314 QKL+GD+ W + + IP+ + ++EGD + S R + + + + L+ Sbjct 181 DFQKLLGDINWLRPFL--KIPSAELRPLFSILEGDPHISSPRTLTLAANQALQKVEKALQ 238 Query 315 EMEGNYYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWVNVVHSIKNLSqaqqiikaaqkL 374 + E+ + + + V + G LW++ S + A L Sbjct 239 NAQLQRI-EDSQPFSLCVFKTAQLPTAVLWQNGPLLWIHPNVSPAKIIDWYPDAIAQLAL 297 Query 375 --TQEVIIRTGKIPWILLPGREEDWILELQMGNINWM-----------------PSFWSC 415 + I G+ P++L+ + L + +W P Sbjct 298 KGLKAAITHFGQSPYLLIVPYTAAQVQTLAAASNDWAVLVTSFSGKIDNHYPKHPILQFA 357 Query 416 YKGSVRWKKRNVIAELVPGPTYYTDGGKKNGRGSLGYIASTGEKFRIHEEGTNQQLELRA 475 SV + + V L G YTDG K G Y+A+ + + E + Q +E Sbjct 358 QNQSVVFPQITVRNPLKNGIVVYTDGSKT---GIGAYVANGKVVSKQYNENSPQVVECLV 414 Query 476 IEEACKQGPEKMNIVTDSRYAYEFMLRNWDEEVIR------NPIQARIMELVHNKEKIGV 529 + E K + +NIV+DS Y + VI+ N Q + L+ + + + Sbjct 415 VLEVLKTFLKPLNIVSDSYYVVNAVNLLEVAGVIKPSSRVANIFQQIQLVLLSRRSPVYI 474 Query 530 HWVPGHKGIP 539 V H G+P Sbjct 475 THVRAHSGLP 484 >RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Cas-Br-E murine leukemia virus] Sequence ID: P08361.2 Length: 1733 Range 1: 718 to 936 Score:76.6 bits(187), Expect:3e-13, Method:Composition-based stats., Identities:63/226(28%), Positives:109/226(48%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 718 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 775 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 776 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 832 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L G+ +HP + Y+DD+ + + L Sbjct 833 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLAGFRIQHPDLILLQYVDDLLLAATSEL 891 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 892 DCQQG-TRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRW 936 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Friend murine leukemia virus (ISOLATE 57)] Sequence ID: P26810.2 Length: 1739 Range 1: 710 to 939 Score:75.1 bits(183), Expect:9e-13, Method:Composition-based stats., Identities:64/237(27%), Positives:111/237(46%), Gaps:15/237(6%) Query 13 VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SG 71 + LK I Q+P++QE G+K + RL +G + W NTP+ +KK + Sbjct 710 ISLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTN 767 Query 72 KWRMLIDFRELNKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPY 126 +R + D RE+NK+ ED+ +P+P L + T+LD+ DA+F + L+ Sbjct 768 DYRPVQDLREVNKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTS 824 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 + F P +G + W LPQG+K SP ++ + + L + +HP + Y+ Sbjct 825 QSLFAFEWKDPE-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYV 883 Query 187 DDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 DD+ + + L+ +G L + G+ K Q K+LG+ L ++W Sbjct 884 DDLLLAATSELDCQQG-TRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRW 939 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Friend murine leukemia virus (ISOLATE FB29)] Sequence ID: P26809.2 Length: 1738 Range 1: 720 to 938 Score:74.7 bits(182), Expect:1e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:108/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 720 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 777 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 778 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDP 834 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + L Sbjct 835 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEL 893 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 894 DCQQG-TRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRW 938 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [AKR (endogenous) murine leukemia virus] Sequence ID: P03356.3 Length: 1734 Range 1: 719 to 897 Score:74.7 bits(182), Expect:1e-12, Method:Composition-based stats., Identities:54/185(29%), Positives:93/185(50%), Gaps:12/185(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 719 IKQYPMSQEAKLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 776 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 777 NKRVEDIHPT---VPNPYNLLSGLPPSHRWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 833 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DDI + + L Sbjct 834 G-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDILLAATSEL 892 Query 198 EEHRG 202 + +G Sbjct 893 DCQQG 897 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=p80; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p46 [Woolly monkey sarcoma virus] Sequence ID: P03359.2 Length: 1687 Range 1: 683 to 899 Score:74.3 bits(181), Expect:2e-12, Method:Compositional matrix adjust., Identities:55/224(25%), Positives:105/224(46%), Gaps:13/224(5%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 ++P V L+ G + Q+P+++E EG++ + R G + W NTP+ + Sbjct 683 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQRFLDLGVLVPCQSPW--NTPLLPV 740 Query 67 KKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK---KHV--TILDIGDAYFTI 120 KK + +R + D RE+NK+ +D+ +P+P L H ++LD+ DA+F + Sbjct 741 KKPGTNDYRPVQDLREINKRVQDIHPT---VPNPYNLLSSLPPSHTWYSVLDLKDAFFCL 797 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 L+ + F P G + W LPQG+K SP ++ + + L + +P + Sbjct 798 KLHPNSQPLFAFEWRDPEK-GNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQV 856 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQ 224 Y+DD+ + + + G +L +++ G+ + K Q Sbjct 857 VLLQYVDDLLVAAPTYRDCKEG-TQKLLQELSKLGYRVSAKKAQ 899 >RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Short=PR; AltName: Full=p14; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; AltName: Full=p80; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p46 [Moloney murine leukemia virus isolate Shinnick] Sequence ID: P03355.5 Length: 1738 Range 1: 720 to 938 Score:74.3 bits(181), Expect:2e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:108/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 720 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 777 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 778 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 834 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + L Sbjct 835 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEL 893 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 894 DCQQG-TRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRW 938 >RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10; Short=NC-pol; Contains: RecName: Full=Protease p14; Short=PR; Contains: RecName: Full=Reverse transcriptase/ribonuclease H p80; Short=RT; Contains: RecName: Full=Integrase p46; Short=IN [Xenotropic MuLV-related virus VP35] Sequence ID: Q2F7J3.1 Length: 1733 Range 1: 718 to 936 Score:73.6 bits(179), Expect:3e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:107/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 718 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 775 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 776 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 832 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + Sbjct 833 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEQ 891 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + RG L + G+ K Q K+LG+ L ++W Sbjct 892 DCQRG-TRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRW 936 >RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10; Short=NC-pol; Contains: RecName: Full=Protease p14; Short=PR; Contains: RecName: Full=Reverse transcriptase/ribonuclease H p80; Short=RT; Contains: RecName: Full=Integrase p46; Short=IN [Xenotropic MuLV-related virus VP62] Sequence ID: A1Z651.1 Length: 1733 Range 1: 718 to 936 Score:73.6 bits(179), Expect:3e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:107/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 718 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 775 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 776 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 832 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + Sbjct 833 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEQ 891 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + RG L + G+ K Q K+LG+ L ++W Sbjct 892 DCQRG-TRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRW 936 >RecName: Full=Gag-Pol polyprotein; Short=Pr180gag-pol; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10; Short=NC-pol; Contains: RecName: Full=Protease p14; Short=PR; Contains: RecName: Full=Reverse transcriptase/ribonuclease H p80; Short=RT; Contains: RecName: Full=Integrase p46; Short=IN [Xenotropic MuLV-related virus VP42] Sequence ID: Q2F7J0.1 Length: 1733 Range 1: 718 to 936 Score:73.6 bits(179), Expect:3e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:107/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 718 IKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 775 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 776 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 832 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + Sbjct 833 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEQ 891 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + RG L + G+ K Q K+LG+ L ++W Sbjct 892 DCQRG-TRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRW 936 >RecName: Full=Pol polyprotein; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Feline endogenous virus ECE1] Sequence ID: P31792.1 Length: 1046 Range 1: 25 to 205 Score:73.2 bits(178), Expect:3e-12, Method:Compositional matrix adjust., Identities:51/187(27%), Positives:91/187(48%), Gaps:12/187(6%) Query 13 VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGK 72 + LK I Q+P+++E G++ + R + G + W NTP+ +KK + Sbjct 25 IDLKPTAMPVSIRQYPMSKEAHMGIQPHITRFLELGVLRPCRSPW--NTPLLPVKKPGTR 82 Query 73 -WRMLIDFRELNKQTEDLAEAQLGLPHPGGLQR-----KKHVTILDIGDAYFTIPLYEPY 126 +R + D RE+NK+T D+ +P+P L + T+LD+ DA+F +PL Sbjct 83 DYRPVQDLREVNKRTMDIHPT---VPNPYNLLSTLSPDRTWYTVLDLKDAFFCLPLAPQS 139 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 ++ F P G + W LPQG+K SP ++ + + L + +HP + Y+ Sbjct 140 QELFAFEWRDPER-GISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYV 198 Query 187 DDIYIGS 193 DD+ + + Sbjct 199 DDLLLAA 205 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Friend murine leukemia virus (ISOLATE PVC-211)] Sequence ID: P26808.2 Length: 1738 Range 1: 720 to 938 Score:73.2 bits(178), Expect:3e-12, Method:Composition-based stats., Identities:61/226(27%), Positives:108/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P+++E G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 720 IKQYPMSREARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 777 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 778 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQSLFAFEWRDP 834 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + L Sbjct 835 E-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSEL 893 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 894 DCQQG-TRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLKEGQRW 938 >RecName: Full=Gag-pol polyprotein; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Murine leukemia virus (strain BM5 ECO)] Sequence ID: Q7SVK7.2 Length: 1734 Range 1: 719 to 937 Score:73.2 bits(178), Expect:4e-12, Method:Composition-based stats., Identities:62/226(27%), Positives:107/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++ E G+K + RL +G + W NTP+ +KK + +R + D RE+ Sbjct 719 IQQYPMSHEARLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQDLREV 776 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ + F P Sbjct 777 NKRVEDIHPT---VPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDP 833 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DDI + + L Sbjct 834 G-MGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDILLAATSEL 892 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 893 DCQQG-TRALLQTLGDLGYRASAKKAQICQKQVKYLGYLLREGQRW 937 >RecName: Full=Gag-Pol polyprotein; AltName: Full=Pr125Pol; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease/Reverse transcriptase/ribonuclease H; AltName: Full=p87; Contains: RecName: Full=Integrase; Short=IN; AltName: Full=p42 [Koala retrovirus] Sequence ID: Q9TTC1.2 Length: 1687 Range 1: 683 to 899 Score:72.4 bits(176), Expect:6e-12, Method:Compositional matrix adjust., Identities:55/224(25%), Positives:103/224(45%), Gaps:13/224(5%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 ++P V LK + Q+P+++E EG++ + R G + W NTP+ + Sbjct 683 QVPPVVVELKSDASPVAVRQYPMSKEAREGIRPHIQRFLDLGILVPCQSPW--NTPLLPV 740 Query 67 KKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK---KHV--TILDIGDAYFTI 120 KK + +R + D RE+NK+ +D+ +P+P L H ++LD+ DA+F + Sbjct 741 KKPGTNDYRPVQDLREVNKRVQDIHPT---VPNPYNLLSSLPPSHTWYSVLDLKDAFFCL 797 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 L+ + F P G + W LPQG+K SP ++ + + L + +P + Sbjct 798 KLHPNSQPLFAFEWRDPEK-GNTGQLTWTRLPQGFKNSPTLFDEALHRDLASFRALNPQV 856 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQ 224 Y+DD+ + + + G L +++ G+ + K Q Sbjct 857 VMLQYVDDLLVAAPTYRDCKEG-TRRLLQELSKLGYRVSAKKAQ 899 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Gibbon ape leukemia virus] Sequence ID: P21414.2 Length: 1686 Range 1: 682 to 898 Score:71.6 bits(174), Expect:1e-11, Method:Compositional matrix adjust., Identities:53/224(24%), Positives:105/224(46%), Gaps:13/224(5%) Query 7 KIPSTRVRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCI 66 ++P V L+ G + Q+P+++E EG++ + + G + W NTP+ + Sbjct 682 QVPPVVVELRSGASPVAVRQYPMSKEAREGIRPHIQKFLDLGVLVPCRSPW--NTPLLPV 739 Query 67 KKK-SGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKH-----VTILDIGDAYFTI 120 KK + +R + D RE+NK+ +D+ +P+P L ++LD+ DA+F + Sbjct 740 KKPGTNDYRPVQDLREINKRVQDIHPT---VPNPYNLLSSLPPSYTWYSVLDLKDAFFCL 796 Query 121 PLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMI 180 L+ + F P G + W LPQG+K SP ++ + + L + +P + Sbjct 797 RLHPNSQPLFAFEWKDPEK-GNTGQLTWTRLPQGFKNSPTLFDEALHRDLAPFRALNPQV 855 Query 181 QFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQ 224 Y+DD+ + + E+ + +L +++ G+ + K Q Sbjct 856 VLLQYVDDLLVAAPT-YEDCKKGTQKLLQELSKLGYRVSAKKAQ 898 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Feline leukemia virus] Sequence ID: P10273.2 Length: 1712 Range 1: 676 to 856 Score:70.1 bits(170), Expect:3e-11, Method:Compositional matrix adjust., Identities:51/187(27%), Positives:90/187(48%), Gaps:12/187(6%) Query 13 VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SG 71 ++LK I Q+P+ E +G+K + R+ +G + W NTP+ +KK + Sbjct 676 IQLKATATPISIRQYPMPHEAYQGIKPHIRRMLDQGILKPCQSPW--NTPLLPVKKPGTE 733 Query 72 KWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK---KH--VTILDIGDAYFTIPLYEPY 126 +R + D RE+NK+ ED+ +P+P L H T+LD+ DA+F + L+ Sbjct 734 DYRPVQDLREVNKRVEDIHPT---VPNPYNLLSTLPPSHPWYTVLDLKDAFFCLRLHSES 790 Query 127 RQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYM 186 + F P +G + W LPQG+K SP ++ + L + +P + Y+ Sbjct 791 QLLFAFEWRDPE-IGLSGQLTWTRLPQGFKNSPTLFDEALHSDLADFRVRYPALVLLQYV 849 Query 187 DDIYIGS 193 DD+ + + Sbjct 850 DDLLLAA 856 >RecName: Full=Transposon Ty3-G Gag-Pol polyprotein; AltName: Full=Gag3-Pol3; AltName: Full=Transposon Ty3-1 TYA-TYB polyprotein; Contains: RecName: Full=Capsid protein; Short=CA; AltName: Full=p24; Contains: RecName: Full=Spacer peptide p3; Contains: RecName: Full=Nucleocapsid protein p11; Short=NC; Contains: RecName: Full=Ty3 protease; Short=PR; AltName: Full=p16; Contains: RecName: Full=Spacer peptide J; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Short=RT-RH; AltName: Full=p55; Contains: RecName: Full=Integrase p61; Short=IN; Contains: RecName: Full=Integrase p58; Short=IN [Saccharomyces cerevisiae S288C] Sequence ID: Q99315.3 Length: 1547 Range 1: 588 to 846 Score:69.7 bits(169), Expect:5e-11, Method:Compositional matrix adjust., Identities:64/283(23%), Positives:125/283(44%), Gaps:32/283(11%) Query 13 VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGK 72 + +K G + P + + +T++ + + +IV +L + P C++P+ + KK G Sbjct 588 IEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFI--VPSKSPCSSPVVLVPKKDGT 645 Query 73 WRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK----KHVTILDIGDAYFTIPLYEPYRQ 128 +R+ +D+R LNK T LP L + + T LD+ Y IP+ R Sbjct 646 FRLCVDYRTLNKAT---ISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRY 702 Query 129 YTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQF-GIYMD 187 T F + P +Y + V+P G +P+ + M R ++F +Y+D Sbjct 703 KTAF-------VTPSGKYEYTVMPFGLVNAPSTFARYMADTFRD-------LRFVNVYLD 748 Query 188 DIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDK-RQEGYPAKWLGFELHPEKWKFQKH 246 DI I S+ EEH ++ + + ++ + K + ++LG+ + +K +H Sbjct 749 DILIFSE-SPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQH 807 Query 247 TLPEITEGPI--TLNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 I + P T+ + Q+ +G + + + IPN K+ + Sbjct 808 KCAAIRDFPTPKTVKQAQRFLGMINYYRRF----IPNCSKIAQ 846 >RecName: Full=Putative enzymatic polyprotein; Includes: RecName: Full=Protease; Short=PR; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H [Cassava vein mosaic virus] Sequence ID: Q89703.1 Length: 652 Range 1: 244 to 448 Score:68.9 bits(167), Expect:5e-11, Method:Compositional matrix adjust., Identities:62/223(28%), Positives:109/223(48%), Gaps:32/223(14%) Query 60 NTPIFCIKKKS----GKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK----KHVTIL 111 ++P F + K S GK RM+ID+++LNK+ + + + +P+ L + ++ + Sbjct 244 SSPAFIVNKHSEQKRGKTRMVIDYKDLNKKAKVV---KYPIPNKDTLIHRSIQARYYSKF 300 Query 112 DIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILR 171 D ++ I L E ++YT FT+ P Y WKVLP G+ SP+++Q M +I R Sbjct 301 DCKSGFYHIKLEEDSKKYTAFTV-------PQGYYQWKVLPFGYHNSPSIFQQFMDRIFR 353 Query 172 GWIEEHPMIQFGI-YMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAK 230 P F I Y+DDI + S +EEH+ + + G ++ + K+ E K Sbjct 354 ------PYYDFIIVYIDDILVFSK-TIEEHKIHIAKFRDITLANGLIISK-KKTELCKEK 405 Query 231 --WLGFELHPEKWKFQKHTLPEITEGPITL---NKLQKLVGDL 268 +LG ++ + Q H + +I E + +LQ ++G L Sbjct 406 IDFLGVQIEQGGIELQPHIINKILEKHTKIKNKTELQSILGLL 448 >RecName: Full=Transposon Ty3-I Gag-Pol polyprotein; AltName: Full=Gag3-Pol3; AltName: Full=Transposon Ty3-2 TYA-TYB polyprotein; Contains: RecName: Full=Capsid protein; Short=CA; AltName: Full=p24; Contains: RecName: Full=Spacer peptide p3; Contains: RecName: Full=Nucleocapsid protein p11; Short=NC; Contains: RecName: Full=Ty3 protease; Short=PR; AltName: Full=p16; Contains: RecName: Full=Spacer peptide J; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Short=RT-RH; AltName: Full=p55; Contains: RecName: Full=Integrase p52; Short=IN; Contains: RecName: Full=Integrase p49; Short=IN [Saccharomyces cerevisiae S288C] Sequence ID: Q7LHG5.2 Length: 1498 Range 1: 614 to 872 Score:69.3 bits(168), Expect:6e-11, Method:Compositional matrix adjust., Identities:64/283(23%), Positives:125/283(44%), Gaps:32/283(11%) Query 13 VRLKEGCKGPHIAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGK 72 + +K G + P + + +T++ + + +IV +L + P C++P+ + KK G Sbjct 614 IEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFI--VPSKSPCSSPVVLVPKKDGT 671 Query 73 WRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRK----KHVTILDIGDAYFTIPLYEPYRQ 128 +R+ +D+R LNK T LP L + + T LD+ Y IP+ R Sbjct 672 FRLCVDYRTLNKAT---ISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRY 728 Query 129 YTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQF-GIYMD 187 T F + P +Y + V+P G +P+ + M R ++F +Y+D Sbjct 729 KTAF-------VTPSGKYEYTVMPFGLVNAPSTFARYMADTFRD-------LRFVNVYLD 774 Query 188 DIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDK-RQEGYPAKWLGFELHPEKWKFQKH 246 DI I S+ EEH ++ + + ++ + K + ++LG+ + +K +H Sbjct 775 DILIFSE-SPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQH 833 Query 247 TLPEITEGPI--TLNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 I + P T+ + Q+ +G + + + IPN K+ + Sbjct 834 KCAAIRDFPTPKTVKQAQRFLGMINYYRRF----IPNCSKIAQ 872 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Radiation murine leukemia virus] Sequence ID: P11227.2 Length: 1734 Range 1: 719 to 937 Score:69.3 bits(168), Expect:6e-11, Method:Composition-based stats., Identities:61/226(27%), Positives:107/226(47%), Gaps:15/226(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-SGKWRMLIDFREL 82 I Q+P++QE G+K + RL +G + W NTP+ +KK + +R + RE+ Sbjct 719 IKQYPMSQEAKLGIKPHIQRLLDQGILVPCQSPW--NTPLLPVKKPGTNDYRPVQGLREV 776 Query 83 NKQTEDLAEAQLGLPHPGGL-----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+ ED+ +P+P L + T+LD+ DA+F + L+ P Q + Sbjct 777 NKRVEDIHPT---VPNPYNLLSGLPTSHRWYTVLDLKDAFFCLRLH-PTSQPLFASEWRD 832 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGL 197 +G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + L Sbjct 833 PGMGISGQLTWTRLPQGFKNSPTLFDEALHRGLADFRIQHPDLILLQYVDDLLLAATSEL 892 Query 198 EEHRGIVNELASYIAQYGFMLPEDKRQE-GYPAKWLGFELHP-EKW 241 + +G L + G+ K Q K+LG+ L ++W Sbjct 893 DCQQG-TRALLKTLGNLGYRASAKKAQICQKQVKYLGYLLREGQRW 937 >RecName: Full=Gag-Pol polyprotein; Contains: RecName: Full=Matrix protein p15; Short=MA; Contains: RecName: Full=RNA-binding phosphoprotein p12; AltName: Full=pp12; Contains: RecName: Full=Capsid protein p30; Short=CA; Contains: RecName: Full=Nucleocapsid protein p10-Pol; Short=NC-pol; Contains: RecName: Full=Protease; Contains: RecName: Full=Reverse transcriptase/ribonuclease H; Short=RT; Contains: RecName: Full=Integrase; Short=IN [Baboon endogenous virus strain M7] Sequence ID: P10272.2 Length: 1727 Range 1: 717 to 886 Score:68.6 bits(166), Expect:1e-10, Method:Composition-based stats., Identities:48/176(27%), Positives:87/176(49%), Gaps:12/176(6%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGK-WRMLIDFREL 82 I Q+P++ E G+++ + + + G + W NTP+ +KK + +R + D RE+ Sbjct 717 IKQYPMSLEAHMGIRQHIIKFLELGVLRPCRSPW--NTPLLPVKKPGTQDYRPVQDLREI 774 Query 83 NKQTEDLAEAQLGLPHPGGLQRK-----KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSP 137 NK+T D+ +P+P L T+LD+ DA+F +PL ++ F P Sbjct 775 NKRTVDIHPT---VPNPYNLLSTLKPDYSWYTVLDLKDAFFCLPLAPQSQELFAFEWKDP 831 Query 138 NNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGS 193 G + W LPQG+K SP ++ + + L + +HP + Y+DD+ + + Sbjct 832 ER-GISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQHPEVTLLQYVDDLLLAA 886 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cauliflower mosaic virus (STRAIN STRASBOURG)] Sequence ID: P03554.1 Length: 679 Range 1: 256 to 496 Score:63.9 bits(154), Expect:2e-09, Method:Compositional matrix adjust., Identities:70/260(27%), Positives:119/260(45%), Gaps:29/260(11%) Query 28 PLTQEKLEG-LKEIVD-RLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 P+ +E+ + +KE++D ++ K K P + N +K+ GK RM+++++ +NK Sbjct 256 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA---EKRRGKKRMVVNYKAMNKA 312 Query 86 TEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T A LP+ L + KK + D ++ + L + R T FT Sbjct 313 TVGDA---YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC------- 362 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 P Y W V+P G K +P+++Q M + R + + +Y+DDI + S+ E+H Sbjct 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRK-----FCCVYVDDILVFSN-NEEDHL 416 Query 202 GIVNELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKWKFQKHTLPEITEGPITL-- 258 V + Q+G +L + K Q +LG E+ K Q H L I + P TL Sbjct 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476 Query 259 -NKLQKLVGDLVWRQSLIGK 277 +LQ+ +G L + I K Sbjct 477 KKQLQRFLGILTYASDYIPK 496 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cauliflower mosaic virus (STRAIN CM-1841)] Sequence ID: P03555.1 Length: 679 Range 1: 256 to 496 Score:63.9 bits(154), Expect:2e-09, Method:Compositional matrix adjust., Identities:70/260(27%), Positives:121/260(46%), Gaps:29/260(11%) Query 28 PLTQEKLEG-LKEIVD-RLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 P+ +E+ + +KE++D ++ K K P + N +K+ GK RM+++++ +NK Sbjct 256 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA---EKRRGKKRMVVNYKAMNKA 312 Query 86 TEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T + +A LP+ L + KK + D ++ + L + R T FT Sbjct 313 T--IGDA-YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC------- 362 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 P Y W V+P G K +P+++Q M + R + + +Y+DDI + S+ E+H Sbjct 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRK-----FCCVYVDDILVFSN-NEEDHL 416 Query 202 GIVNELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKWKFQKHTLPEITEGPITL-- 258 V + Q+G +L + K Q +LG E+ K Q H L I + P TL Sbjct 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476 Query 259 -NKLQKLVGDLVWRQSLIGK 277 +LQ+ +G L + I K Sbjct 477 KKQLQRFLGILTYASDYIPK 496 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cauliflower mosaic virus (STRAIN BBC)] Sequence ID: Q02964.1 Length: 679 Range 1: 256 to 496 Score:63.9 bits(154), Expect:2e-09, Method:Compositional matrix adjust., Identities:70/260(27%), Positives:121/260(46%), Gaps:29/260(11%) Query 28 PLTQEKLEG-LKEIVD-RLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 P+ +E+ + +KE++D ++ K K P + N +K+ GK RM+++++ +NK Sbjct 256 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA---EKRRGKKRMVVNYKAMNKA 312 Query 86 TEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T + +A LP+ L + KK + D ++ + L + R T FT Sbjct 313 T--IGDA-YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC------- 362 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 P Y W V+P G K +P+++Q M + R + + +Y+DDI + S+ E+H Sbjct 363 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRK-----FCCVYVDDILVFSN-NEEDHL 416 Query 202 GIVNELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKWKFQKHTLPEITEGPITL-- 258 V + Q+G +L + K Q +LG E+ K Q H L I + P TL Sbjct 417 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 476 Query 259 -NKLQKLVGDLVWRQSLIGK 277 +LQ+ +G L + I K Sbjct 477 KKQLQRFLGILTYASDYIPK 496 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Figwort mosaic virus (STRAIN DXS)] Sequence ID: P09523.1 Length: 666 Range 1: 277 to 489 Score:62.8 bits(151), Expect:5e-09, Method:Compositional matrix adjust., Identities:59/227(26%), Positives:108/227(47%), Gaps:24/227(10%) Query 61 TPIFCIK----KKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKHV-TILDIGD 115 +P F ++ ++ GK RM+++++ +N+ T + + L R K + + D Sbjct 277 SPAFLVENEAERRRGKKRMVVNYKAINQATIGDSHNLPNMQELLTLLRGKSIFSSFDCKS 336 Query 116 AYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIE 175 ++ + L E ++ T FT P + WKV+P G K +P+++Q MQ L G + Sbjct 337 GFWQVVLDEESQKLTAFTC-------PQGHFQWKVVPFGLKQAPSIFQRHMQTALNG-AD 388 Query 176 EHPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPAK--WLG 233 + M +Y+DDI + S+ L +H V + + +YG +L + K+ + K +LG Sbjct 389 KFCM----VYVDDIIVFSNSEL-DHYNHVYAVLKIVEKYGIILSK-KKANLFKEKINFLG 442 Query 234 FELHPEKWKFQKHTLPEITEGPITL---NKLQKLVGDLVWRQSLIGK 277 E+ Q H L I + P L LQ+ +G L + ++ I K Sbjct 443 LEIDKGTHCPQNHILENIHKFPDRLEDKKHLQRFLGVLTYAETYIPK 489 >RecName: Full=Polyprotein P3; Includes: RecName: Full=Putative movement protein; Short=MP; Includes: RecName: Full=Capsid protein; AltName: Full=Coat protein; Short=CP; Includes: RecName: Full=Protease; Short=PR; Includes: RecName: Full=Reverse transcriptase; Short=RT; Includes: RecName: Full=Ribonuclease H [Commelina yellow mottle virus] Sequence ID: P19199.2 Length: 1886 Range 1: 1457 to 1632 Score:63.2 bits(152), Expect:5e-09, Method:Compositional matrix adjust., Identities:54/192(28%), Positives:90/192(46%), Gaps:21/192(10%) Query 67 KKKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPL 122 K+K GK RM+ +++ LN+ TE Q LP + R K + D+ ++ + + Sbjct 1457 KEKKGKERMVFNYKLLNENTES---DQYSLPGINTIISKVGRSKIYSKFDLKSGFWQVAM 1513 Query 123 YEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQF 182 E +T F L+ N L Y W V+P G K +PA++Q M + +G E+ Sbjct 1514 EEESVPWTAF--LAGNKL-----YEWLVMPFGLKNAPAIFQRKMDNVFKG-TEKF----I 1561 Query 183 GIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKW 241 +Y+DDI + S+ E+H + + + G +L K + G P +LG L K Sbjct 1562 AVYIDDILVFSETA-EQHSQHLYTMLQLCKENGLILSPTKMKIGTPEIDFLGASLGCTKI 1620 Query 242 KFQKHTLPEITE 253 K Q H + +I + Sbjct 1621 KLQPHIISKICD 1632 >RecName: Full=Genome polyprotein; Includes: RecName: Full=Aspartic protease; Short=PR; Includes: RecName: Full=Reverse transcriptase; Short=RT [Petunia vein clearing virus isolate Hohn] Sequence ID: Q6XKE6.1 Length: 2180 Range 1: 1402 to 1646 Score:62.4 bits(150), Expect:1e-08, Method:Compositional matrix adjust., Identities:60/264(23%), Positives:122/264(46%), Gaps:31/264(11%) Query 36 GLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKS----GKWRMLIDFRELNKQTEDLAE 91 LKE D L++ + + W C F + K+S GK R++I+++ LN +D Sbjct 1402 ALKE-CDELQQFDLIEPSDSQWACEA--FYVNKRSEQVRGKLRLVINYQPLNHFLQD--- 1455 Query 92 AQLGLPHP----GGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYY 147 + +P+ L + K + D+ ++ + ++ R T F + P + Sbjct 1456 DKFPIPNKLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNERPKTGFCI-------PDRHFQ 1508 Query 148 WKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIVNEL 207 WKV+P G K +P+++Q M KI + + +Y+DDI + S+ LE+H ++N+ Sbjct 1509 WKVMPFGLKTAPSLFQKAMIKIFQPI-----LFSALVYIDDILLFSE-TLEDHIKLLNQF 1562 Query 208 ASYIAQYGFMLPEDKR-QEGYPAKWLGFELHPEKWKFQKHTLPEITEGP---ITLNKLQK 263 S + ++G ML K ++LG + + H E+ + P +++ ++Q+ Sbjct 1563 ISLVKKFGVMLSAKKMILAQNKIQFLGMDFADGTFSPAGHISLELQKFPDTNLSVKQIQQ 1622 Query 264 LVGDLVWRQSLIGKSIPNILKLME 287 +G + + + I + +I L + Sbjct 1623 FLGIVNYIRDFIPEVTEHISPLSD 1646 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cauliflower mosaic virus (STRAIN D/H)] Sequence ID: P03556.1 Length: 674 Range 1: 251 to 491 Score:61.6 bits(148), Expect:1e-08, Method:Compositional matrix adjust., Identities:69/257(27%), Positives:116/257(45%), Gaps:23/257(8%) Query 28 PLTQEKLEG-LKEIVD-RLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 P+ +E+ + +KE++D ++ K K P + N +K+ GK RM+++++ +NK Sbjct 251 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA---EKRRGKKRMVVNYKAMNKA 307 Query 86 TEDLAEAQLGLPHPGGLQR-KKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCV 144 T A L R KK + D ++ + L + R T FT P Sbjct 308 TVGDAYNPPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC-------PQG 360 Query 145 RYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIV 204 Y W V+P G K +P+++Q M + R + + +Y+DDI + S+ E+H V Sbjct 361 HYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKF-----CCVYVDDILVFSN-NEEDHLLHV 414 Query 205 NELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKWKFQKHTLPEITEGPITL---NK 260 + Q+G +L + K Q +LG E+ K Q H L I + P TL + Sbjct 415 AMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLEDKKQ 474 Query 261 LQKLVGDLVWRQSLIGK 277 LQ+ +G L + I K Sbjct 475 LQRFLGILTYASDYIPK 491 >RecName: Full=Genome polyprotein; Includes: RecName: Full=Aspartic protease; Short=PR; Includes: RecName: Full=Reverse transcriptase; Short=RT [Petunia vein clearing virus isolate Shepherd] Sequence ID: Q91DM0.1 Length: 2179 Range 1: 1393 to 1645 Score:60.8 bits(146), Expect:2e-08, Method:Compositional matrix adjust., Identities:59/271(22%), Positives:124/271(45%), Gaps:30/271(11%) Query 29 LTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKS----GKWRMLIDFRELNK 84 + E L+ + D L++ + + W C F + K+S GK R++I+++ LN Sbjct 1393 MNPEHLQLAIKECDELQQFDLIEPSDSQWACEA--FYVNKRSEQVRGKLRLVINYQPLNH 1450 Query 85 QTEDLAEAQLGLPHP----GGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 +D + +P+ L + K + D+ ++ + ++ R T F + Sbjct 1451 FLQD---DKFPIPNKLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNERPKTGFCI------ 1501 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 P + WKV+P G K +P+++Q M KI + + +Y+DDI + S+ LE+H Sbjct 1502 -PDRHFQWKVMPFGLKTAPSLFQKAMIKIFQPI-----LFSALVYIDDILLFSE-TLEDH 1554 Query 201 RGIVNELASYIAQYGFMLPEDKR-QEGYPAKWLGFELHPEKWKFQKHTLPEITEGP---I 256 ++N+ S + ++G ML K ++LG + + H E+ + P + Sbjct 1555 IKLLNQFISLVKKFGVMLSAKKMILAQNKIQFLGMDFADGTFSPAGHISLELQKFPDTNL 1614 Query 257 TLNKLQKLVGDLVWRQSLIGKSIPNILKLME 287 ++ ++Q+ +G + + + I + +I L + Sbjct 1615 SVKQIQQFLGIVNYIRDFIPEVTEHISPLSD 1645 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cauliflower mosaic virus (STRAIN NY8153)] Sequence ID: Q00962.1 Length: 680 Range 1: 257 to 501 Score:58.9 bits(141), Expect:8e-08, Method:Compositional matrix adjust., Identities:69/268(26%), Positives:120/268(44%), Gaps:33/268(12%) Query 28 PLTQEKLEG-LKEIVD-RLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELNKQ 85 P+ +E+ + +KE++D ++ K K P + N + G RM+++++ +NK Sbjct 257 PMDREEFDKQIKELLDLKVIKPSKSPHMAPAFLVNNEA---ENGRGNKRMVVNYKAMNKA 313 Query 86 TEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLG 141 T A LP+ L + KK + D ++ + L + R T FT Sbjct 314 TVGDA---YNLPNKDELLTLIRGKKIFSSFDCKSGFWQVLLDQESRPLTAFTC------- 363 Query 142 PCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHR 201 P Y W V+P G K +P+++Q M + R + + +Y+DDI + S+ E+H Sbjct 364 PQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKF-----CCVYVDDIVVFSN-NEEDHL 417 Query 202 GIVNELASYIAQYGFMLPEDKRQEGYPA-KWLGFELHPEKWKFQKHTLPEITEGPITL-- 258 V + Q+G +L + K Q +LG E+ K Q H L I + P TL Sbjct 418 LHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGHILEHINKFPDTLED 477 Query 259 -NKLQKLVGDLVWRQSLIGKSIPNILKL 285 +LQ+ +G L + IPN+ ++ Sbjct 478 KKQLQRFLGILTYASDY----IPNLAQM 501 >RecName: Full=Retrovirus-related Pol polyprotein from transposon 297; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Drosophila melanogaster] Sequence ID: P20825.1 Length: 1059 Range 1: 211 to 434 Score:58.9 bits(141), Expect:9e-08, Method:Compositional matrix adjust., Identities:61/244(25%), Positives:111/244(45%), Gaps:33/244(13%) Query 26 QWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKK-----SGKWRMLIDFR 80 Q+PL Q ++ V + +G + + + N+P + + KK + K+R++ID+R Sbjct 211 QYPLAQTHEIEVENQVQEMLNQGLIRESNSPY--NSPTWVVPKKPDASGANKYRVVIDYR 268 Query 81 ELNKQTEDLAEAQLGLPHP----GGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLS 136 +LN+ T + +P+ G L + ++ T +D+ + I + E T F+ S Sbjct 269 KLNEIT---IPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKS 325 Query 137 PNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLG 196 + Y + +P G + +PA +Q M ILR + +H + +Y+DDI I S Sbjct 326 GH-------YEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCL----VYLDDIIIFST-S 373 Query 197 LEEHRGIVNELASYIAQYGFMLPEDK----RQEGYPAKWLGFELHPEKWKFQKHTLPEIT 252 L EH + + + +A L DK ++E A +LG + P+ K + I Sbjct 374 LTEHLNSIQLVFTKLADANLKLQLDKCEFLKKE---ANFLGHIVTPDGIKPNPIKVKAIV 430 Query 253 EGPI 256 PI Sbjct 431 SYPI 434 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Soybean chlorotic mottle virus] Sequence ID: P15629.2 Length: 692 Range 1: 194 to 398 Score:58.5 bits(140), Expect:1e-07, Method:Compositional matrix adjust., Identities:60/226(27%), Positives:105/226(46%), Gaps:35/226(15%) Query 13 VRLKEGCKGPHIA-QWPLTQEKLEGLKEIVDRLEKEGKVGRA-PPHWTCNTPIFCIKK-- 68 +RLK+ + ++ + P T ++ KE + L K+G + + PH + P F ++ Sbjct 194 IRLKDPLQEINVTNRIPYTIRDVQEFKEECEDLLKKGLIRESQSPH---SAPAFYVENHN 250 Query 69 --KSGKWRMLIDFRELNKQTEDLAEAQLG----LPHPGGLQRKKHVTI----LDIGDAYF 118 K GK RM+I+++++N EA +G LP + K ++ LD Y+ Sbjct 251 EIKRGKRRMVINYKKMN-------EATIGDSYKLPRKDFILEKIKGSLWFSSLDAKSGYY 303 Query 119 TIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHP 178 + L+E + T F+ P Y W VL G K +P++YQ M + L+G EH Sbjct 304 QLRLHENTKPLTAFSC------PPQKHYEWNVLSFGLKQAPSIYQRFMDQSLKGL--EHI 355 Query 179 MIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQ 224 + Y+DDI I + E+H V + I + G ++ + K + Sbjct 356 CLA---YIDDILIFTKGSKEQHVNDVRIVLQRIKEKGIIISKKKSK 398 >RecName: Full=Retrovirus-related Pol polyprotein from transposon 17.6; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Drosophila melanogaster] Sequence ID: P04323.1 Length: 1058 Range 1: 244 to 435 Score:57.8 bits(138), Expect:2e-07, Method:Compositional matrix adjust., Identities:60/214(28%), Positives:96/214(44%), Gaps:39/214(18%) Query 60 NTPIFCIKKK---SGK--WRMLIDFRELNKQTEDLAEAQLGLPHP--------GGLQRKK 106 N+PI+ + KK SGK +R++ID+R+LN E +G HP G L R Sbjct 244 NSPIWVVPKKQDASGKQKFRIVIDYRKLN-------EITVGDRHPIPNMDEILGKLGRCN 296 Query 107 HVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTM 166 + T +D+ + I + T F+ + Y + +P G K +PA +Q M Sbjct 297 YFTTIDLAKGFHQIEMDPESVSKTAFSTKHGH-------YEYLRMPFGLKNAPATFQRCM 349 Query 167 QKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDK---- 222 ILR + +H + +Y+DDI + S L+EH + + +A+ L DK Sbjct 350 NDILRPLLNKHCL----VYLDDIIVFS-TSLDEHLQSLGLVFEKLAKANLKLQLDKCEFL 404 Query 223 RQEGYPAKWLGFELHPEKWKFQKHTLPEITEGPI 256 +QE +LG L P+ K + I + PI Sbjct 405 KQE---TTFLGHVLTPDGIKPNPEKIEAIQKYPI 435 >RecName: Full=Retrovirus-related Pol polyprotein from transposon 412; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Drosophila melanogaster] Sequence ID: P10394.1 Length: 1237 Range 1: 324 to 514 Score:55.1 bits(131), Expect:1e-06, Method:Compositional matrix adjust., Identities:58/226(26%), Positives:97/226(42%), Gaps:45/226(19%) Query 31 QEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSG------KWRMLIDFRELNK 84 ++E ++ V +L K+ V + + N+P+ + KKS KWR++ID+R++NK Sbjct 324 HSQVEEIQAQVQKLIKDKIVEPSVSQY--NSPLLLVPKKSSPNSDKKKWRLVIDYRQINK 381 Query 85 QTEDLAEAQLGLPHPGG----LQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 + L + LP L R K+ + LD+ + I L E R T F+ + + Sbjct 382 K---LLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGS-- 436 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 Y + LP G K++P +Q M G IE Q +YMDD+ + +G E Sbjct 437 -----YRFTRLPFGLKIAPNSFQRMMTIAFSG-IEPS---QAFLYMDDLIV---IGCSE- 483 Query 201 RGIVNELASYIAQYGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKH 246 + ++ L + + +LHPEK F H Sbjct 484 KHMLKNLTEVFGK---------------CREYNLKLHPEKCSFFMH 514 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Carnation etched ring virus] Sequence ID: P05400.1 Length: 659 Range 1: 259 to 480 Score:54.3 bits(129), Expect:2e-06, Method:Compositional matrix adjust., Identities:58/236(25%), Positives:101/236(42%), Gaps:26/236(11%) Query 54 PPHWTCNTPIFCIK----KKSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGL----QRK 105 P T +P F ++ ++ GK RM+++++ +NK T+ A LP+ L + K Sbjct 259 PSKSTHMSPAFLVENEAERRRGKKRMVVNYKAMNKATKGDAH---NLPNKDELLTLVRGK 315 Query 106 KHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFT 165 K + D + + L + + T FT P Y W V+P G K +P+++ T Sbjct 316 KIYSSFDCKSGLWQVLLDKESQLLTAFTC-------PQGHYQWNVVPFGLKQAPSIFPKT 368 Query 166 MQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDKRQE 225 ++ + Y+DDI + S+ G +EH V + + G +L + K Q Sbjct 369 YANSHSNQYSKYCCV----YVDDILVFSNTGRKEHYIHVLNILRRCEKLGIILSKKKAQL 424 Query 226 -GYPAKWLGFELHPEKWKFQKHTLPEITEGPITL---NKLQKLVGDLVWRQSLIGK 277 +LG E+ Q H L I + P + +LQ+ +G L + I K Sbjct 425 FKEKINFLGLEIDQGTHCPQNHILEHIHKFPDRIEDKKQLQRFLGILTYASDYIPK 480 >RecName: Full=Pol polyprotein [Simian immunodeficiency virus (AGM266 ISOLATE)] Sequence ID: P12500.1 Length: 100 Range 1: 30 to 100 Score:47.4 bits(111), Expect:1e-05, Method:Composition-based stats., Identities:32/71(45%), Positives:41/71(57%), Gaps:4/71(5%) Query 430 ELVPGP-TYYTDGG-KKNGR-GSLGYIASTG-EKFRIHEEGTNQQLELRAIEEACKQGPE 485 E +PG YY DG +N R G GYI G ++ E TNQQ EL AI+ A + Sbjct 30 EPIPGEDVYYVDGACNRNSREGKAGYITQQGKQRVEKLENTTNQQAELTAIKMALEDSGP 89 Query 486 KMNIVTDSRYA 496 ++NIVTDS+YA Sbjct 90 RVNIVTDSQYA 100 >RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus necator H16] Sequence ID: Q0K8W6.1 Length: 145 Range 1: 5 to 133 Score:48.1 bits(113), Expect:2e-05, Method:Compositional matrix adjust., Identities:39/129(30%), Positives:61/129(47%), Gaps:19/129(14%) Query 436 TYYTDGGKKN--GRGSLGYI----ASTGEKFRIHEEGTNQQLELRAIEEACK--QGPEKM 487 T Y+DG K GRG G + AS E F TN ++E+ A+ EA + + P + Sbjct 5 TIYSDGACKGNPGRGGWGAVLVAGASEKEMFGGEPNTTNNRMEMTAVIEALRALKRPCVV 64 Query 488 NIVTDSRYAYE--------FMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 + TDS+Y + + R W D++ ++N + ++ + +I HWV GH Sbjct 65 RVYTDSQYVQKGISEWLPGWKARGWKTADKKPVKNADLWQELDTLAQPHQISWHWVRGHN 124 Query 537 GIPQNEEID 545 G P NE D Sbjct 125 GHPGNERAD 133 >RecName: Full=Transposon Tf2-1 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT34.1 Length: 1333 >RecName: Full=Transposon Tf2-2 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT35.1 Length: 1333 >RecName: Full=Transposon Tf2-4 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT37.1 Length: 1333 >RecName: Full=Transposon Tf2-9 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT40.1 Length: 1333 Range 1: 415 to 659 Score:51.2 bits(121), Expect:2e-05, Method:Compositional matrix adjust., Identities:56/261(21%), Positives:113/261(43%), Gaps:22/261(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 I +PL K++ + + +++ K G + + C P+ + KK G RM++D++ LN Sbjct 415 IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINAC--PVMFVPKKEGTLRMVVDYKPLN 472 Query 84 KQTE-DLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 K + ++ L +Q T LD+ AY I + + F Sbjct 473 KYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFR--------- 523 Query 143 CVR--YYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 C R + + V+P G +PA +Q+ + IL E H + YMDDI I S EH Sbjct 524 CPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVV----CYMDDILIHSK-SESEH 578 Query 201 RGIVNELASYIAQYGFMLPEDKRQ-EGYPAKWLGFELHPEKWKFQKHTLPEITE--GPIT 257 V ++ + ++ + K + K++G+ + + + + + ++ + P Sbjct 579 VKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKN 638 Query 258 LNKLQKLVGDLVWRQSLIGKS 278 +L++ +G + + + I K+ Sbjct 639 RKELRQFLGSVNYLRKFIPKT 659 >RecName: Full=Uncharacterized protein K02A2.6 [Caenorhabditis elegans] Sequence ID: Q09575.1 Length: 1268 Range 1: 447 to 679 Score:51.2 bits(121), Expect:2e-05, Method:Compositional matrix adjust., Identities:65/252(26%), Positives:109/252(43%), Gaps:28/252(11%) Query 28 PLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTC-NTPIFCIKKK-SGKWRMLIDFR--ELN 83 P+ LE ++ ++RL++ G + P + PI IKKK +GK R+ DF+ LN Sbjct 447 PVPYGSLEAVETELNRLQEMGVI--VPITYAKWAAPIVVIKKKGTGKIRVCADFKCSGLN 504 Query 84 KQTEDLAEAQLGLPHPGGLQRKKHVTI---LDIGDAYFTIPLYEPYRQYTCFTMLSPNNL 140 +D LP + + T+ +D+ DAY + L E ++ L+ N Sbjct 505 AALKDEFHP---LPTSEDIFSRLKGTVYSQIDLKDAYLQVELDEEAQK------LAVINT 555 Query 141 GPCVRYYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 + Y + + G K +PA +Q M K++ G + +Y DDI I + +EEH Sbjct 556 HRGIFKYLR-MTFGLKPAPASFQKIMDKMVSG------LTGVAVYWDDIIISAS-SIEEH 607 Query 201 RGIVNELASYIAQYGFMLPEDKRQEGYP-AKWLGF-ELHPEKWKFQKHTLPEITEGPITL 258 I+ EL +YGF + +K +LGF + H + +K + P Sbjct 608 EKILRELFERFKEYGFRVSAEKCAFAQKQVTFLGFVDEHGRRPDSKKTEAIRSMKAPTDQ 667 Query 259 NKLQKLVGDLVW 270 +L +G W Sbjct 668 KQLASFLGAADW 679 >RecName: Full=Transposon Tf2-3 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT36.1 Length: 1333 >RecName: Full=Transposon Tf2-5 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT38.1 Length: 1333 >RecName: Full=Transposon Tf2-6 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT39.1 Length: 1333 >RecName: Full=Transposon Tf2-12 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT41.1 Length: 1333 Range 1: 415 to 659 Score:51.2 bits(121), Expect:2e-05, Method:Compositional matrix adjust., Identities:56/261(21%), Positives:113/261(43%), Gaps:22/261(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 I +PL K++ + + +++ K G + + C P+ + KK G RM++D++ LN Sbjct 415 IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINAC--PVMFVPKKEGTLRMVVDYKPLN 472 Query 84 KQTE-DLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 K + ++ L +Q T LD+ AY I + + F Sbjct 473 KYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFR--------- 523 Query 143 CVR--YYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 C R + + V+P G +PA +Q+ + IL E H + YMDDI I S EH Sbjct 524 CPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESHVV----CYMDDILIHSK-SESEH 578 Query 201 RGIVNELASYIAQYGFMLPEDKRQ-EGYPAKWLGFELHPEKWKFQKHTLPEITE--GPIT 257 V ++ + ++ + K + K++G+ + + + + + ++ + P Sbjct 579 VKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKN 638 Query 258 LNKLQKLVGDLVWRQSLIGKS 278 +L++ +G + + + I K+ Sbjct 639 RKELRQFLGSVNYLRKFIPKT 659 >RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus pinatubonensis JMP134] Sequence ID: Q46Z81.1 Length: 145 Range 1: 7 to 133 Score:47.8 bits(112), Expect:3e-05, Method:Compositional matrix adjust., Identities:40/127(31%), Positives:61/127(48%), Gaps:19/127(14%) Query 438 YTDGGKKN--GRGSLG--YIASTGEK--FRIHEEGTNQQLELRAIEEACK--QGPEKMNI 489 Y+DG K GRG G +A T EK F TN ++E+ A+ EA + + P + + Sbjct 7 YSDGACKGNPGRGGWGAVLVAGTNEKELFGGEANTTNNRMEMTAVIEALRALKRPCTVQV 66 Query 490 VTDSRYAYE--------FMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + + R W D++ ++N + ++ + KI HWV GH G Sbjct 67 YTDSQYVQKGISEWLPGWKARGWKTADKKPVKNADLWQELDTLVQPHKITWHWVRGHNGH 126 Query 539 PQNEEID 545 P NE D Sbjct 127 PGNERAD 133 >RecName: Full=Retrovirus-related Pol polyprotein from transposon gypsy; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Drosophila melanogaster] Sequence ID: P10401.1 Length: 1035 Range 1: 201 to 519 Score:50.8 bits(120), Expect:3e-05, Method:Compositional matrix adjust., Identities:83/336(25%), Positives:144/336(42%), Gaps:41/336(12%) Query 41 VDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKW------RMLIDFRELNKQTEDLAEAQL 94 V +L K+G + P N+P + + KK R++IDFR+LN++T Sbjct 201 VKQLLKDGIIR--PSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLNEKTIPDRYPMP 258 Query 95 GLPHP-GGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQ 153 +P L + K T LD+ Y I L E R+ T F++ N G +Y + LP Sbjct 259 SIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSV----NGG---KYEFCRLPF 311 Query 154 GWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQ 213 G + + +++Q + +LR I + I + +Y+DD+ I S+ + R I L I Sbjct 312 GLRNASSIFQRALDDVLREQIGK---ICY-VYVDDVIIFSENESDHVRHIDTVLKCLIDA 367 Query 214 YGFMLPEDKRQEGYPAKWLGFELHPEKWKFQKHTLPEITE--GPITLNKLQKLVGDLVWR 271 + E R ++LGF + + K + I E P + K++ +G + Sbjct 368 NMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLASYY 427 Query 272 Q------SLIGKSIPNILKLMEGDRALQSERYIESIHVREWEACR---QKLKEMEGN--- 319 + + I + I +ILK G+ S+ + I V E R Q+L+ + + Sbjct 428 RVFIKDFAAIARPITDILK---GENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDV 484 Query 320 ---YYDEEKDIYGQLDWGNKAIEYIVFQEKGKPLWV 352 Y D +K D I ++ QE G+P+ + Sbjct 485 ILKYPDFKKPFDLTTDASASGIGAVLSQE-GRPITM 519 >RecName: Full=Ribonuclease H; Short=RNase H [Cupriavidus metallidurans CH34] Sequence ID: Q1LL89.1 Length: 145 Range 1: 5 to 133 Score:47.4 bits(111), Expect:3e-05, Method:Compositional matrix adjust., Identities:40/129(31%), Positives:59/129(45%), Gaps:19/129(14%) Query 436 TYYTDGGKKN--GRGSLGYIASTG----EKFRIHEEGTNQQLELRAIEEACK--QGPEKM 487 T Y+DG K G G G + G E F TN ++EL A+ EA + + P + Sbjct 5 TIYSDGACKGNPGPGGWGAVLVAGGHEKELFGGESPTTNNRMELMAVIEALRALKRPCIV 64 Query 488 NIVTDSRYA--------YEFMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 NI TDS+Y + + R W D++ ++N + ++ +I HWV GH Sbjct 65 NIYTDSQYVQKGISEWIHGWKARGWKTADKKPVKNADLWQALDEAQKPHQITWHWVRGHN 124 Query 537 GIPQNEEID 545 G P NE D Sbjct 125 GHPGNERAD 133 >RecName: Full=Enzymatic polyprotein; Includes: RecName: Full=Aspartic protease; Includes: RecName: Full=Endonuclease; Includes: RecName: Full=Reverse transcriptase [Cestrum yellow leaf curling virus] Sequence ID: Q7TD08.1 Length: 643 Range 1: 237 to 395 Score:50.4 bits(119), Expect:4e-05, Method:Compositional matrix adjust., Identities:49/176(28%), Positives:76/176(43%), Gaps:25/176(14%) Query 55 PHWTCNTPIFCIKK----KSGKWRMLIDFRELNKQTEDLAEAQLGLPHPGGLQRKKH--- 107 PH + P F ++ K K RM+I+++ LNK T A LP + K Sbjct 237 PH---SAPAFYVENHNEIKRKKRRMVINYKALNKATIGNAHK---LPRIDSILTKVKGSN 290 Query 108 -VTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVLPQGWKLSPAVYQFTM 166 + LD Y+ + L+ + T F+ P Y W VLP G K +P +YQ M Sbjct 291 WFSTLDAKSGYWQLRLHPQSKPLTAFSC------PPQKHYQWNVLPFGLKQAPGIYQNFM 344 Query 167 QKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEHRGIVNELASYIAQYGFMLPEDK 222 K L G +E + Y+DDI + ++ EEH + + + G +L + K Sbjct 345 DKNLEG-LENFCL----AYIDDILVFTNSSREEHLSKLLVVLERCKEKGLILSKKK 395 >RecName: Full=Transposon Tf2-7 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT42.1 Length: 1333 >RecName: Full=Transposon Tf2-8 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: P0CT43.1 Length: 1333 Range 1: 415 to 659 Score:50.4 bits(119), Expect:4e-05, Method:Compositional matrix adjust., Identities:55/261(21%), Positives:117/261(44%), Gaps:22/261(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 I +PL K++ + + +++ K G + + C P+ + KK G RM++D++ LN Sbjct 415 IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINAC--PVMFVPKKEGTLRMVVDYKPLN 472 Query 84 KQTE-DLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 K + ++ L +Q T LD+ AY I + + F Sbjct 473 KYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFR--------- 523 Query 143 CVR--YYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 C R + + V+P G ++PA +Q+ + IL G ++E ++ YMD+I I S EH Sbjct 524 CPRGVFEYLVMPYGISIAPAHFQYFINTIL-GEVKESHVV---CYMDNILIHSK-SESEH 578 Query 201 RGIVNELASYIAQYGFMLPEDKRQ-EGYPAKWLGFELHPEKWKFQKHTLPEITE--GPIT 257 V ++ + ++ + K + K++G+ + + + + + ++ + P Sbjct 579 VKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKN 638 Query 258 LNKLQKLVGDLVWRQSLIGKS 278 +L++ +G + + + I K+ Sbjct 639 RKELRQFLGSVNYLRKFIPKT 659 >RecName: Full=Transposon Tf2-11 polyprotein; AltName: Full=Retrotransposable element Tf2 155 kDa protein [Schizosaccharomyces pombe 972h-] Sequence ID: Q9UR07.1 Length: 1333 Range 1: 415 to 659 Score:50.1 bits(118), Expect:5e-05, Method:Compositional matrix adjust., Identities:55/261(21%), Positives:117/261(44%), Gaps:22/261(8%) Query 24 IAQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKSGKWRMLIDFRELN 83 I +PL K++ + + +++ K G + + C P+ + KK G RM++D++ LN Sbjct 415 IRNYPLPPGKMQAMNDEINQGLKSGIIRESKAINAC--PVMFVPKKEGTLRMVVDYKPLN 472 Query 84 KQTE-DLAEAQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGP 142 K + ++ L +Q T LD+ AY I + + F Sbjct 473 KYVKPNIYPLPLIEQLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFR--------- 523 Query 143 CVR--YYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSDLGLEEH 200 C R + + V+P G ++PA +Q+ + IL G ++E ++ YMD+I I S EH Sbjct 524 CPRGVFEYLVMPYGISIAPAHFQYFINTIL-GEVKESHVV---CYMDNILIHSK-SESEH 578 Query 201 RGIVNELASYIAQYGFMLPEDKRQ-EGYPAKWLGFELHPEKWKFQKHTLPEITE--GPIT 257 V ++ + ++ + K + K++G+ + + + + + ++ + P Sbjct 579 VKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGFTPCQENIDKVLQWKQPKN 638 Query 258 LNKLQKLVGDLVWRQSLIGKS 278 +L++ +G + + + I K+ Sbjct 639 RKELRQFLGSVNYLRKFIPKT 659 >RecName: Full=Retrovirus-related Pol polyprotein from transposon opus; Includes: RecName: Full=Protease; Includes: RecName: Full=Reverse transcriptase; Includes: RecName: Full=Endonuclease [Drosophila melanogaster] Sequence ID: Q8I7P9.1 Length: 1003 Range 1: 143 to 289 Score:49.7 bits(117), Expect:6e-05, Method:Compositional matrix adjust., Identities:45/163(28%), Positives:76/163(46%), Gaps:25/163(15%) Query 41 VDRLEKEGKVGRAPPHWTCNTPIFCIKKK-----SGKWRMLIDFRELNK----QTEDLAE 91 +D L ++G + P + N+PI+ + KK ++RM++DF+ LN T + + Sbjct 143 IDELLQDGII--RPSNSPYNSPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPD 200 Query 92 AQLGLPHPGGLQRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLSPNNLGPCVRYYWKVL 151 L L K+ T LD+ + I + E T F+ L+ +Y + L Sbjct 201 INATL---ASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNG-------KYEFLRL 250 Query 152 PQGWKLSPAVYQFTMQKILRGWIEEHPMIQFGIYMDDIYIGSD 194 P G K +PA++Q + ILR EH +Y+DDI + S+ Sbjct 251 PFGLKNAPAIFQRMIDDILR----EHIGKVCYVYIDDIIVFSE 289 >RecName: Full=Polyprotein P3; AltName: Full=P194 protein; Contains: RecName: Full=Putative movement protein; Short=MP; Contains: RecName: Full=Capsid protein; AltName: Full=Coat protein; Short=CP; Contains: RecName: Full=Protease; Short=PR; Contains: RecName: Full=Reverse transcriptase/Ribonuclease H; Short=RT; AltName: Full=p55; Flags: Precursor [Rice tungro bacilliform virus (isolate Philippines)] Sequence ID: P27502.1 Length: 1675 Range 1: 1187 to 1418 Score:48.5 bits(114), Expect:1e-04, Method:Compositional matrix adjust., Identities:57/254(22%), Positives:104/254(40%), Gaps:34/254(13%) Query 25 AQWPLTQEKLEGLKEIVDRLEKEGKVGRAPPHWTCNTPIFCIKKKS----GKWRMLIDFR 80 A P T E ++ + L + +A P T F ++ S K R++ +++ Sbjct 1187 ATIPYTPADKEVFEKQIKELLDNKLIKKADPTCRHRTAAFIVRNHSEEVAQKPRIVYNYK 1246 Query 81 ELNKQTEDLAEAQLGLPHPGGL----QRKKHVTILDIGDAYFTIPLYEPYRQYTCFTMLS 136 LN +++ +PH + Q+ + D+ + + L + ++ +T FT Sbjct 1247 RLN---DNMHTDPFNIPHKISMINLIQKANIFSKFDLKAGFHHMKLKDDFKDWTTFT--- 1300 Query 137 PNNLGPCVR--YYWKVLPQGWKLSPAVYQFTMQKILRGWIEEHPMIQFG-IYMDDIYIGS 193 C Y W V P G +P +Q MQ E ++F +Y+DDI I S Sbjct 1301 ------CSEGLYTWNVCPFGIANAPCAFQRFMQ-------ESFGDLKFALLYIDDILIAS 1347 Query 194 DLGLEEHRGIVNELASYIAQYGFMLPEDKRQEGYP-AKWLGFELHPEKWKFQKHTLPEIT 252 + +EH + + + + G +L + K + ++LG E+ K Q H + +I Sbjct 1348 N-NEKEHIEHLKIFFNRVKEVGCVLSKKKSKMFLKEVEYLGVEIKEGKISLQPHIVDKIK 1406 Query 253 EGPITLNKLQKLVG 266 + NKL L G Sbjct 1407 K--FDKNKLNTLKG 1418 >RecName: Full=Ribonuclease H; Short=RNase H [Rhizobium etli CFN 42] Sequence ID: Q2KBL2.1 Length: 151 Range 1: 7 to 133 Score:45.1 bits(105), Expect:3e-04, Method:Compositional matrix adjust., Identities:36/127(28%), Positives:60/127(47%), Gaps:19/127(14%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFR----IHEEGTNQQLELRAIEEACK--QGPEKMNI 489 +TDG G G G + GE + E TN ++EL A A + + P ++++ Sbjct 7 FTDGACSGNPGPGGWGAVLRYGEVEKELCGGEAETTNNRMELMAAISALQALKSPCEVDL 66 Query 490 VTDSRYAYEFMLR--------NW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS Y + + + W D++ ++N + +E N+ K+ +HWV GH G Sbjct 67 YTDSAYVKDGISKWIFGWKKNGWKTSDKKPVKNAELWQALEEARNRHKVTLHWVKGHAGH 126 Query 539 PQNEEID 545 P+NE D Sbjct 127 PENERAD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Methylococcus capsulatus str. Bath] Sequence ID: Q60AW8.1 Length: 155 Range 1: 1 to 138 Score:45.1 bits(105), Expect:3e-04, Method:Compositional matrix adjust., Identities:41/142(29%), Positives:66/142(46%), Gaps:27/142(19%) Query 428 IAELVPGPTYYTDGGKKNGRGSLG------YIASTGEKFRIHEEGTNQQLEL----RAIE 477 ++E P YTDG + G G Y + T E + E TN ++EL RA+E Sbjct 1 MSETEPTVYAYTDGACRGNPGPGGWGVLLRYGSKTREIYGGERETTNNRMELMAAIRALE 60 Query 478 EACKQGPEKMNIVTDSRYAYEFML--------RNWDEEVIRNPIQ-----ARIMELVHNK 524 + P K+ IVTDS+Y + + R W + R+P++ R+++ + Sbjct 61 TLSR--PCKVKIVTDSQYVKKGITEWVAQWEKRGW-KTAGRSPVKNIDLWQRLIQ-AEQR 116 Query 525 EKIGVHWVPGHKGIPQNEEIDR 546 ++ W+ GH G P+NE DR Sbjct 117 HQVSWGWIKGHSGHPENEAADR 138 >RecName: Full=Ribonuclease H; Short=RNase H [Ralstonia solanacearum GMI1000] Sequence ID: Q8XZ91.1 Length: 151 Range 1: 5 to 133 Score:44.3 bits(103), Expect:4e-04, Method:Composition-based stats., Identities:38/129(29%), Positives:61/129(47%), Gaps:19/129(14%) Query 436 TYYTDGGKKN--GRGSLGYIASTG----EKFRIHEEGTNQQLELRAIEEACK--QGPEKM 487 T Y+DG K G G G + +G E F TN ++EL A+ EA + + P ++ Sbjct 5 TVYSDGACKGNPGLGGWGTVLVSGGHEKELFGGEAVTTNNRMELMAVIEAFRALKRPCRV 64 Query 488 NIVTDSRYAYE--------FMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 + TDS+Y + + R W D++ ++N R ++ + ++ HWV GH Sbjct 65 KVYTDSQYVQKGISEWLAGWKARGWKTADKKPVKNDDLWRTLDELVVTHEVSWHWVKGHA 124 Query 537 GIPQNEEID 545 G P NE D Sbjct 125 GHPGNERAD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Bradyrhizobium diazoefficiens USDA 110] Sequence ID: Q89UU3.1 Length: 154 Range 1: 4 to 154 Score:44.3 bits(103), Expect:4e-04, Method:Compositional matrix adjust., Identities:44/157(28%), Positives:72/157(45%), Gaps:27/157(17%) Query 432 VPGPTYYTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QG 483 +P T YTDG G G G I G+K + G TN Q+EL A A + + Sbjct 4 LPVVTIYTDGACSGNPGPGGWGAILKFGDKEKELNGGERHTTNNQMELMAAISALEALKK 63 Query 484 PEKMNIVTDSRYAYEFML-----------RNWDEEVIRNPIQARIMELVHNKEKIGVHWV 532 P +++ TDS+Y + + R D++ ++N + ++ ++ HWV Sbjct 64 PCTVDLYTDSQYVRQGITGWIHGWKRNGWRTADKKPVKNVELWQRLDAALKAHQVRWHWV 123 Query 533 PGHKGIPQNEEIDRYISEIFLAKEG--RGILQKRAED 567 GH G P+NE D+ LA++G + LQ+R + Sbjct 124 KGHAGHPENERADQ------LARDGIVKARLQQRVAE 154 >RecName: Full=Ribonuclease H; Short=RNase H [Bordetella pertussis Tohama I] Sequence ID: Q7VRX8.1 Length: 155 Range 1: 16 to 155 Score:43.9 bits(102), Expect:6e-04, Method:Compositional matrix adjust., Identities:39/140(28%), Positives:64/140(45%), Gaps:19/140(13%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QGPEKMNI 489 +TDG K G G G + G+ + G TN ++EL A+ E + + P ++ I Sbjct 16 WTDGACKGNPGPGGWGVLMRAGQHEKTMHGGERQTTNNRMELMAVIEGLRALKRPCRVTI 75 Query 490 VTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + M L NW D++ ++N + ++ + ++ WV GH G Sbjct 76 HTDSQYVMKGMTEWLANWKRRGWRTADKKPVKNVELWQALDEQVGRHQVQWRWVRGHAGD 135 Query 539 PQNEEIDRYISEIFLAKEGR 558 P NE D ++ A GR Sbjct 136 PGNERADALANQGMEAARGR 155 >RecName: Full=Ribonuclease H; Short=RNase H [Bordetella bronchiseptica RB50] Sequence ID: Q7WCJ8.1 Length: 155 Range 1: 16 to 155 Score:43.5 bits(101), Expect:7e-04, Method:Compositional matrix adjust., Identities:39/140(28%), Positives:64/140(45%), Gaps:19/140(13%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QGPEKMNI 489 +TDG K G G G + G+ + G TN ++EL A+ E + + P ++ I Sbjct 16 WTDGACKGNPGPGGWGVLMRAGQHEKTMHGGERQTTNNRMELMAVIEGLRALKRPCRVTI 75 Query 490 VTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + M L NW D++ ++N + ++ + ++ WV GH G Sbjct 76 HTDSQYVMKGMTEWLANWKRRGWRTADKKPVKNVELWQALDEQVGRHQVQWRWVRGHAGD 135 Query 539 PQNEEIDRYISEIFLAKEGR 558 P NE D ++ A GR Sbjct 136 PGNERADALANQGVEAARGR 155 >RecName: Full=Ribonuclease H; Short=RNase H [Novosphingobium aromaticivorans DSM 12444] Sequence ID: Q2G9E3.1 Length: 143 Range 1: 7 to 142 Score:42.7 bits(99), Expect:0.001, Method:Composition-based stats., Identities:41/137(30%), Positives:64/137(46%), Gaps:21/137(15%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFR----IHEEGTNQQLELRAI---EEACKQGPEKMN 488 +TDG K G+G G + GE + +E TN ++EL A EA KQ P ++ Sbjct 7 FTDGACKGNPGKGGWGALLRMGEHEKEMAGSEKETTNNRMELMAAIRALEALKQ-PCRVT 65 Query 489 IVTDSRYA--------YEFMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + + + W D + ++N R + K+ WV GH G Sbjct 66 LHTDSKYVLDGITKWIFGWQKKGWKTADNKPVKNEDLWRALVDAVRPHKVEWVWVKGHDG 125 Query 538 IPQNEEIDRYISEIFLA 554 P+NE +D+ S+ LA Sbjct 126 HPENERVDKLASDAALA 142 >RecName: Full=Ribonuclease H; Short=RNase H [Agrobacterium fabrum str. C58] Sequence ID: Q8UHA7.1 Length: 146 Range 1: 7 to 146 Score:42.7 bits(99), Expect:0.001, Method:Compositional matrix adjust., Identities:39/146(27%), Positives:65/146(44%), Gaps:25/146(17%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QGPEKMNI 489 +TDG G G G + GE + G TN ++EL A A + P ++++ Sbjct 7 FTDGACSGNPGPGGWGAVLRYGETEKELSGGEADTTNNRMELLAAISALNALKSPCEVDL 66 Query 490 VTDSRYA--------YEFMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS Y + + + W D + ++N + +E + K+ +HWV GH G Sbjct 67 YTDSAYVKDGITKWIFGWKKKGWKTADNKPVKNVELWQALEAAQERHKVTLHWVKGHAGH 126 Query 539 PQNEEIDRYISEIFLAKEGRGILQKR 564 P+NE D LA++G ++R Sbjct 127 PENERADE------LARKGMEPFKRR 146 >RecName: Full=Ribonuclease H; Short=RNase H [Psychromonas ingrahamii 37] Sequence ID: A1SS86.2 Length: 153 Range 1: 7 to 133 Score:42.7 bits(99), Expect:0.001, Method:Compositional matrix adjust., Identities:34/127(27%), Positives:56/127(44%), Gaps:19/127(14%) Query 438 YTDGG--KKNGRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACKQ--GPEKMNI 489 +TDG G G G + E + EG TN ++E+ A +A + P ++ + Sbjct 7 FTDGSCLGNPGPGGYGAVMIYNEHCKELSEGFLLTTNNRMEMLACIKALQSLTEPCEVEL 66 Query 490 VTDSRYAYE---FMLRNWDEEVIRNPIQA--------RIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + + NW + + +A + ++ K K+ HWV GH G Sbjct 67 TTDSQYVRQGITLWIHNWKKRGWKTAAKAPVKNVDLWKALDAAQEKHKVAWHWVKGHSGH 126 Query 539 PQNEEID 545 P+NE D Sbjct 127 PENERCD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Oleidesulfovibrio alaskensis G20] Sequence ID: Q30X61.1 Length: 154 Range 1: 42 to 134 Score:42.7 bits(99), Expect:0.002, Method:Composition-based stats., Identities:28/93(30%), Positives:44/93(47%), Gaps:14/93(15%) Query 467 TNQQLELRAIEEACK--QGPEKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNK 524 TN ++E+ A+ E + Q P +N+ TDS+Y + + W + RN + + V NK Sbjct 42 TNNRMEILAVIEGLEALQEPCTVNLYTDSQYVRNAVEKKWLDSWQRNGWKTAARKPVKNK 101 Query 525 EK------------IGVHWVPGHKGIPQNEEID 545 + + HWV GH G P+NE D Sbjct 102 DLWLRLLPLLARHTVKFHWVRGHSGHPENELCD 134 >RecName: Full=Ribonuclease H; Short=RNase H [Campylobacter concisus 13826] Sequence ID: A8Z6F7.1 Length: 144 Range 1: 5 to 129 Score:42.4 bits(98), Expect:0.002, Method:Compositional matrix adjust., Identities:38/130(29%), Positives:56/130(43%), Gaps:24/130(18%) Query 436 TYYTDGG--KKNGRGSLGYI---------ASTGEKFRIHEEGTNQQLELRAIEEACK--Q 482 T ++DG G G YI AS GE + TN Q+EL+A K + Sbjct 5 TLFSDGSCLGNPGAGGWAYILRYNEAQKKASGGEAYT-----TNNQMELKAAIMGLKALK 59 Query 483 GPEKMNIVTDSRYAYEFM---LRNWDEEVIRNPIQARIMEL---VHNKEKIGVHWVPGHK 536 P ++ + TDS Y + L NW + +N + + + K+ WV GH Sbjct 60 EPCEVRLFTDSSYVANSINEWLANWQKRNFKNVKNVELWQEYLEISKPHKVVASWVKGHA 119 Query 537 GIPQNEEIDR 546 G P+NEE D+ Sbjct 120 GHPENEECDQ 129 >RecName: Full=Ribonuclease H; Short=RNase H [Syntrophobacter fumaroxidans MPOB] Sequence ID: A0LGJ7.1 Length: 164 Range 1: 20 to 146 Score:42.4 bits(98), Expect:0.002, Method:Compositional matrix adjust., Identities:39/132(30%), Positives:57/132(43%), Gaps:29/132(21%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIH----------EEGTNQQLELRAIEEACK--QGPE 485 + DG + G G+ G R H E TN Q+EL A+ +A + + P Sbjct 20 FADGACRGNPGPGGW----GAVLRYHGKEKELSGYAEYTTNNQMELAAVIQALRALKEPC 75 Query 486 KMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE------------KIGVHWVP 533 ++ I TDSRY + + W + +N + R+ V NKE +I WV Sbjct 76 RVTITTDSRYLRDG-ISLWIHKWKQNGWKTRVKTDVRNKELWIALDEACLPHEIDWQWVK 134 Query 534 GHKGIPQNEEID 545 GH G P+NE D Sbjct 135 GHSGHPENERCD 146 >RecName: Full=Ribonuclease H; Short=RNase H [Trichodesmium erythraeum IMS101] Sequence ID: Q115G0.1 Length: 157 Range 1: 9 to 139 Score:42.0 bits(97), Expect:0.003, Method:Compositional matrix adjust., Identities:36/132(27%), Positives:59/132(44%), Gaps:22/132(16%) Query 436 TYYTDGGKKNGRGSLGY-IASTGEKFRIHEEG-----TNQQLELRAIEEACKQG--PEKM 487 T YTDG G GY I EK R G TN ++EL A+ +Q P + Sbjct 9 TIYTDGACSGNPGPGGYGIIILSEKKRQELSGGYKLTTNNRMELMAVIVGLEQLEIPSIV 68 Query 488 NIVTDSRYAYEFMLRNW-------------DEEVIRNPIQARIMELVHNKEKIGVHWVPG 534 N+ TDS+Y + + + W ++ + + ++++L +K ++ WV G Sbjct 69 NLYTDSKYIVDAVTKGWAKRWRANSWKRNKKDKAMNPDLWGKLLDLC-SKHQVEFSWVRG 127 Query 535 HKGIPQNEEIDR 546 H G +NE D+ Sbjct 128 HSGNIENERCDK 139 >RecName: Full=Ribonuclease H; Short=RNase H [Caulobacter vibrioides NA1000] Sequence ID: B8H4W7.1 Length: 149 >RecName: Full=Ribonuclease HI; Short=RNase HI [Caulobacter vibrioides CB15] Sequence ID: Q9A341.1 Length: 149 Range 1: 1 to 139 Score:41.6 bits(96), Expect:0.003, Method:Compositional matrix adjust., Identities:39/139(28%), Positives:61/139(43%), Gaps:19/139(13%) Query 431 LVPGPTYYTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--Q 482 + P T YTDG K G G G I G+K + G TN ++EL A +A + Sbjct 1 MTPKVTIYTDGACKGNPGPGGWGAILFYGDKKKEICGGEPGTTNNRMELMAAIQALELLN 60 Query 483 GPEKMNIVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHW 531 P + + TDS+Y + + +R W D+ ++N + ++ + + W Sbjct 61 RPCVVELHTDSQYVMKGIQEWIRGWKARGWKTADKSPVKNDDLWKRLDAARARHDVDWRW 120 Query 532 VPGHKGIPQNEEIDRYISE 550 V GH G P NE D +E Sbjct 121 VKGHAGHPLNERADALANE 139 >RecName: Full=Ribonuclease H; Short=RNase H [Zymomonas mobilis subsp. mobilis ZM4 = ATCC 31821] Sequence ID: O69014.1 Length: 156 Range 1: 15 to 149 Score:41.6 bits(96), Expect:0.004, Method:Composition-based stats., Identities:37/137(27%), Positives:64/137(46%), Gaps:23/137(16%) Query 439 TDGGKKNGRGSLGYIASTGEKFRIHEEG--------TNQQLELRAIEEA--CKQGPEKMN 488 TDG K G G+ A +++ HE+ TN ++EL+A+ EA C + P ++ Sbjct 15 TDGACKGNPGFGGWGALL--RYQGHEKAISGSENPTTNNRMELQAVIEALSCLKKPCQIE 72 Query 489 IVTDSRYAYEFMLR---NWDE--------EVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + + R W + + ++N + + + + I WV GH G Sbjct 73 LSTDSKYVMDGLTRWIHGWQKNGWLTAAKKPVKNADLWKQLLALTRQHDIAWKWVKGHAG 132 Query 538 IPQNEEIDRYISEIFLA 554 P NE D+ S+ +A Sbjct 133 HPDNERADQLASDAAIA 149 >RecName: Full=Ribonuclease H; Short=RNase H [Lachnoclostridium phytofermentans ISDg] Sequence ID: A9KLJ9.1 Length: 158 Range 1: 4 to 139 Score:41.6 bits(96), Expect:0.004, Method:Composition-based stats., Identities:42/138(30%), Positives:59/138(42%), Gaps:30/138(21%) Query 436 TYYTDG---GKKNGRGSLG----YIASTG-EKFRIHEEG----TNQQLELRA----IEEA 479 T YTDG G +G G G YI STG E R + G TN ++EL A +E Sbjct 4 TIYTDGAARGNPDGPGGYGTILSYIDSTGVEHIREYSGGYKKTTNNRMELMAAIVGLEAL 63 Query 480 CKQGPEKMNIVTDSRYAYEFMLRNW------------DEEVIRNPIQARIMELVHNKEKI 527 K P + + +DS+Y + +W E ++N + + N+ + Sbjct 64 TK--PCVVTLYSDSQYVVKAFNEHWLDGWIKKGWKRGKNEPVKNVDLWKRLLAAKNQHDV 121 Query 528 GVHWVPGHKGIPQNEEID 545 WV GH G PQNE D Sbjct 122 TFCWVKGHDGHPQNERCD 139 >RecName: Full=Ribonuclease HI; Short=RNase HI [Synechocystis sp. PCC 6803 substr. Kazusa] Sequence ID: Q55801.1 Length: 160 Range 1: 9 to 140 Score:41.6 bits(96), Expect:0.004, Method:Composition-based stats., Identities:36/132(27%), Positives:54/132(40%), Gaps:21/132(15%) Query 436 TYYTDGG--KKNGRGSLGYIASTGEKFRI-----HEEGTNQQLELRAIEEACK--QGPEK 486 T YTDG G G G + G+ R ++ TN ++E+ A Q P + Sbjct 9 TLYTDGACSMNPGPGGYGAVILYGDGRREELSAGYKMTTNNRMEIMGAIAALSHLQEPSQ 68 Query 487 MNIVTDSRYAYEFMLRNWDE------------EVIRNPIQARIMELVHNKEKIGVHWVPG 534 + + TDSRY + M + W + E +NP M + K ++ WV Sbjct 69 VLLYTDSRYMVDAMSKGWAKKWKANGWQRNAKEKAKNPDLWETMLTLCEKHQVTFQWVKA 128 Query 535 HKGIPQNEEIDR 546 H G +NE DR Sbjct 129 HAGNKENERCDR 140 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia enterocolitica subsp. enterocolitica 8081] Sequence ID: A1JKB1.1 Length: 154 Range 1: 8 to 152 Score:41.2 bits(95), Expect:0.005, Method:Compositional matrix adjust., Identities:44/155(28%), Positives:67/155(43%), Gaps:31/155(20%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIHEEG--------TNQQLELRAIEEACK--QGPEKM 487 +TDG G GY A +++ HE+ TN ++EL A A + P ++ Sbjct 8 FTDGSCLGNPGPGGYGAIL--RYKQHEKTFSAGYFLTTNNRMELMAAIVALEALTSPCEV 65 Query 488 NIVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 + TDS+Y + + + NW D + +RN + ++L I WV GH Sbjct 66 TLSTDSQYVRQGITQWIHNWKKRGWKTTDRKPVRNVDLWQRLDLAIQTHVIQWEWVKGHA 125 Query 537 GIPQNEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 G P+NE D LA+EG ED GY+ Sbjct 126 GHPENERCDE------LAREGAN--SPTLEDTGYN 152 >RecName: Full=Ribonuclease HI; Short=RNase HI [Helicobacter pylori 26695] Sequence ID: P56120.1 Length: 143 Range 1: 7 to 128 Score:40.8 bits(94), Expect:0.005, Method:Compositional matrix adjust., Identities:38/130(29%), Positives:57/130(43%), Gaps:30/130(23%) Query 438 YTDGGKKNGRGSLGYIA-----------STGEKFRIHEEGTNQQLELRAIEEACK--QGP 484 + DG G GY A S GE+F TN ++ELRA+ EA K + P Sbjct 7 FCDGSSLGNPGPGGYAAILRYKDKEKTISGGEEFT-----TNNRMELRALNEALKILKRP 61 Query 485 EKMNIVTDSRY---AYEFMLRNWDEEVIRNPIQARIMEL------VHNKEKIGVHWVPGH 535 ++ + +DS+Y A L NW + +N + + ++L V I W+ GH Sbjct 62 CRITLYSDSQYVCQAINVWLANWQK---KNFSKVKNVDLWKEFLEVSKGHSIVAVWIKGH 118 Query 536 KGIPQNEEID 545 G +NE D Sbjct 119 NGHAENERCD 128 >RecName: Full=Ribonuclease H; Short=RNase H [Campylobacter curvus 525.92] Sequence ID: A7H185.1 Length: 149 Range 1: 5 to 138 Score:40.8 bits(94), Expect:0.006, Method:Composition-based stats., Identities:39/134(29%), Positives:55/134(41%), Gaps:19/134(14%) Query 436 TYYTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QGPEKM 487 T ++DG N G G YI + G TN Q+EL A+ E K + P ++ Sbjct 5 TLFSDGSCLNNPGAGGWAYILEFNGAVKKDSGGAAMTTNNQMELTAVIEGLKALKEPCEV 64 Query 488 NIVTDSRY---AYEFMLRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 + TDS Y A L W D++ ++N + V K+ W+ H Sbjct 65 RLFTDSSYVANAVNSWLDGWVKKNFIGSDKKPVKNIELWQEYLRVSRPHKVTASWIKAHN 124 Query 537 GIPQNEEIDRYISE 550 G PQNEE D E Sbjct 125 GHPQNEECDTMARE 138 >RecName: Full=Ribonuclease H; Short=RNase H [Rhizobium leguminosarum bv. viciae 3841] Sequence ID: Q1MKH6.1 Length: 151 Range 1: 42 to 133 Score:40.8 bits(94), Expect:0.006, Method:Composition-based stats., Identities:26/92(28%), Positives:46/92(50%), Gaps:13/92(14%) Query 467 TNQQLELRAIEEACK--QGPEKMNIVTDSRYA--------YEFMLRNW---DEEVIRNPI 513 TN ++EL A A + P ++++ TDS Y + + W D++ ++N Sbjct 42 TNNRMELLAAISALSALKSPCEVDLYTDSAYVKDGISKWIFGWKKNGWKTADKKPVKNAE 101 Query 514 QARIMELVHNKEKIGVHWVPGHKGIPQNEEID 545 + +E ++ K+ +HWV GH G P+NE D Sbjct 102 LWQALEAARDRHKVTLHWVKGHAGHPENERAD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Helicobacter acinonychis str. Sheeba] Sequence ID: Q17XJ7.1 Length: 143 Range 1: 7 to 128 Score:40.4 bits(93), Expect:0.007, Method:Compositional matrix adjust., Identities:39/130(30%), Positives:55/130(42%), Gaps:30/130(23%) Query 438 YTDGGKKNGRGSLGYIA-----------STGEKFRIHEEGTNQQLELRAIEEACK--QGP 484 + DG G GY A S GEKF TN ++ELRA+ EA K + P Sbjct 7 FCDGSSLGNPGPGGYAAILRYKDKEKIISGGEKFT-----TNNRMELRALNEALKILKRP 61 Query 485 EKMNIVTDSRY---AYEFMLRNWDEEVIRNPIQARIMEL------VHNKEKIGVHWVPGH 535 + + +DS+Y A L W + +N + + ++L V I W+ GH Sbjct 62 CHITLYSDSQYVCQAINVWLIGWQK---KNFAKVKNVDLWKEFLEVSKGHSIVAIWIKGH 118 Query 536 KGIPQNEEID 545 G QNE D Sbjct 119 NGHAQNERCD 128 >RecName: Full=Ribonuclease H; Short=RNase H [Sulfurovum sp. NBC37-1] Sequence ID: A6QCI9.1 Length: 147 Range 1: 30 to 133 Score:40.4 bits(93), Expect:0.008, Method:Compositional matrix adjust., Identities:36/110(33%), Positives:51/110(46%), Gaps:20/110(18%) Query 450 LGYIASTGEKFRIHEEGTNQQLELRAIEEACK--QGPEKMNIVTDSRYAYEFM---LRNW 504 L Y S E F EE TN ++ELR + E K + P + +V+DS Y + + L +W Sbjct 30 LEYKGSRKEYFGGEEETTNNRMELRGVIEGLKLLKEPCDVEVVSDSSYVVKAINEWLESW 89 Query 505 ---DEEVIRNP------IQARIMELVHNKEKIGVHWVPGHKGIPQNEEID 545 D + ++N I+A VH WV GH G P+NE D Sbjct 90 IRRDFKKVKNVDLWKAYIEAAAPHHVHGT------WVRGHDGHPENERCD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Erythrobacter litoralis HTCC2594] Sequence ID: Q2ND39.1 Length: 144 Range 1: 7 to 138 Score:40.4 bits(93), Expect:0.008, Method:Compositional matrix adjust., Identities:37/132(28%), Positives:57/132(43%), Gaps:19/132(14%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACKQ--GPEKMNI 489 +TDG K G G G + G+ + G TN ++ELRA E P ++ + Sbjct 7 FTDGACKGNPGPGGWGVLLRMGKHEKELSGGEPETTNNRMELRAAIEGLNALIEPCEVEL 66 Query 490 VTDSRYAYEFMLR-----------NWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + + + N ++ +RN + + K+ HWV GH G Sbjct 67 YTDSKYVVDGITKWVHGWKKRGWVNASKKPVRNDDLWHDLIEAELRHKVTWHWVKGHNGH 126 Query 539 PQNEEIDRYISE 550 +NE DR SE Sbjct 127 AENERADRLASE 138 >RecName: Full=Ribonuclease H; Short=RNase H [Magnetospirillum magneticum AMB-1] Sequence ID: Q2W9A9.1 Length: 154 Range 1: 1 to 153 Score:40.4 bits(93), Expect:0.008, Method:Composition-based stats., Identities:44/161(27%), Positives:70/161(43%), Gaps:32/161(19%) Query 428 IAELVPGPT---YYTDGGKKNGRGSLGYIASTGEKFRIHEE--------GTNQQLELRAI 476 ++E P P YTDG G G+ A +F+ E+ TN ++E+ A+ Sbjct 1 MSETAPKPETVEIYTDGACSGNPGPGGWGAIL--RFKGIEKELKGGESPTTNNRMEMMAV 58 Query 477 EEACKQGPEK--MNIVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHN 523 A +++ TDS Y + M LR W D++ ++N + ++ Sbjct 59 LVALNTLTRSCAVDVYTDSEYVKKGMTEWLRGWKARGWKTADKKPVKNDDLWKALDEAAA 118 Query 524 KEKIGVHWVPGHKGIPQNEEIDRYISEIFLAKEGRGILQKR 564 + K+ HWV GH G P+NE D LA+EG L+ R Sbjct 119 RHKVSWHWVKGHAGHPENERADA------LAREGIADLRAR 153 >RecName: Full=Ribonuclease H; Short=RNase H [Thiobacillus denitrificans ATCC 25259] Sequence ID: Q3SIB2.1 Length: 148 Range 1: 9 to 146 Score:40.4 bits(93), Expect:0.009, Method:Compositional matrix adjust., Identities:39/146(27%), Positives:68/146(46%), Gaps:27/146(18%) Query 438 YTDGGKKNGRGSLGY---IASTGEKFRIH---EEGTNQQLELRAIEEACK--QGPEKMNI 489 Y+DG K G+ G+ + + G + I TN ++E+ A+ A + + P + + Sbjct 9 YSDGACKGNPGAGGWGALLVAGGHRKEISGGEPNTTNNRMEMTAVIRALELLKRPSTVEV 68 Query 490 VTDSRYAYE--------FMLRNW---DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS+Y + + RNW D + ++N + ++ + + +I WV GH G Sbjct 69 HTDSQYVQKGVSEWLPGWKRRNWRTADGKPVKNQDLWQQLDALSQQHRIVWKWVRGHAGH 128 Query 539 PQNEEIDRYISEIFLAKEGRGILQKR 564 P+NE D LA + G+LQ R Sbjct 129 PENERAD------VLANQ--GVLQAR 146 >RecName: Full=Ribonuclease H; Short=RNase H [Anaeromyxobacter sp. Fw109-5] Sequence ID: A7HB50.1 Length: 175 Range 1: 55 to 146 Score:40.8 bits(94), Expect:0.009, Method:Composition-based stats., Identities:29/92(32%), Positives:49/92(53%), Gaps:13/92(14%) Query 467 TNQQLELRAIEEACKQGPE--KMNIVTDSRYAYEFM---LRNWDEEVIRN----PIQAR- 516 TN ++ELRA+ EA PE +++V+DSRY + + + W ++ R P+ R Sbjct 55 TNNRMELRAVLEALDGLPEGEAVDVVSDSRYVVDALSKWIHGWRKKGWRTAAGEPVLNRD 114 Query 517 IMELVHNKEK---IGVHWVPGHKGIPQNEEID 545 ++E + + + + WV GH G P NE +D Sbjct 115 LIEALDARGRELRVRYSWVRGHDGHPVNEVVD 146 >RecName: Full=Ribonuclease H; Short=RNase H [Pseudomonas stutzeri A1501] Sequence ID: A4VLR0.1 Length: 151 Range 1: 9 to 148 Score:40.0 bits(92), Expect:0.012, Method:Compositional matrix adjust., Identities:40/148(27%), Positives:66/148(44%), Gaps:27/148(18%) Query 438 YTDGGKKNGRGSLG------YIASTGEKFRIHEEGTNQQLELRAIEEACKQ--GPEKMNI 489 YTDG K G G Y E + + TN ++EL A A + P K+ + Sbjct 9 YTDGACKGNPGPGGWGALLIYKGVKRELWGGEPDTTNNRMELMAAIRALAELKRPCKVRL 68 Query 490 VTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 VTDS+Y + + + NW ++ ++N + ++ N+ ++ WV GH G Sbjct 69 VTDSQYVMQGINDWMPNWKKRGWKTASKQPVKNADLWQQLDEQVNRHEVSWQWVRGHTGH 128 Query 539 PQNEEIDRYISEIFLAKEGRGILQKRAE 566 P NE+ D LA RG++Q + + Sbjct 129 PGNEQAD------LLAN--RGVVQAKRQ 148 >RecName: Full=Ribonuclease H; Short=RNase H [Helicobacter pylori HPAG1] Sequence ID: Q1CTK9.1 Length: 143 Range 1: 7 to 128 Score:39.7 bits(91), Expect:0.013, Method:Compositional matrix adjust., Identities:38/130(29%), Positives:57/130(43%), Gaps:30/130(23%) Query 438 YTDGGKKNGRGSLGYIA-----------STGEKFRIHEEGTNQQLELRAIEEACK--QGP 484 + DG G GY A S GE+F TN ++ELRA+ EA K + P Sbjct 7 FCDGSSLGNPGPGGYAAILRYKDKEKTISGGEEFT-----TNNRMELRALNEALKILKRP 61 Query 485 EKMNIVTDSRY---AYEFMLRNWDEEVIRNPIQARIMEL------VHNKEKIGVHWVPGH 535 ++ + +DS+Y A L NW + +N + + ++L V I W+ GH Sbjct 62 CRITLYSDSQYVCQAINVWLVNWQK---KNFSKVKNVDLWKEFLKVSKGHLIMAVWIKGH 118 Query 536 KGIPQNEEID 545 G +NE D Sbjct 119 NGHAENERCD 128 >RecName: Full=Ribonuclease H; Short=RNase H [Desulfovibrio vulgaris DP4] Sequence ID: A1VFS4.1 Length: 156 >RecName: Full=Ribonuclease H; Short=RNase H [Desulfovibrio vulgaris str. Hildenborough] Sequence ID: Q72E89.1 Length: 156 Range 1: 44 to 136 Score:40.0 bits(92), Expect:0.013, Method:Composition-based stats., Identities:27/94(29%), Positives:48/94(51%), Gaps:16/94(17%) Query 467 TNQQLELRAIEEACK--QGPEKMNIVTDSRYAYEFMLRNW------------DEEVIRN- 511 TN ++E+ A+ EA + + P K+ + TDS+Y + + W D++ ++N Sbjct 44 TNNRMEILAVLEALEALRDPCKVTLFTDSQYVRNAVEKKWLAGWQRNGWKTADKKPVKNR 103 Query 512 PIQARIMELVHNKEKIGVHWVPGHKGIPQNEEID 545 + R++ L+ K + WV GH G P+NE D Sbjct 104 DLWERLVPLL-AKHSVSFRWVRGHSGHPENERCD 136 >RecName: Full=Ribonuclease H; Short=RNase H [Thermus thermophilus HB8] Sequence ID: P29253.2 Length: 166 Range 1: 12 to 140 Score:40.0 bits(92), Expect:0.013, Method:Composition-based stats., Identities:35/131(27%), Positives:57/131(43%), Gaps:24/131(18%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIHEE--------GTNQQLELRAIEEACK--QGPEKM 487 +TDG G G+ A +F HE+ TN ++EL+A E K + P ++ Sbjct 12 FTDGACLGNPGPGGWAALL--RFHAHEKLLSGGEACTTNNRMELKAAIEGLKALKEPCEV 69 Query 488 NIVTDSRYAYEFMLRNWDE------------EVIRNPIQARIMELVHNKEKIGVHWVPGH 535 ++ TDS Y + W E + ++N + L ++ H+V GH Sbjct 70 DLYTDSHYLKKAFTEGWLEGWRKRGWRTAEGKPVKNRDLWEALLLAMAPHRVRFHFVKGH 129 Query 536 KGIPQNEEIDR 546 G P+NE +DR Sbjct 130 TGHPENERVDR 140 >RecName: Full=Ribonuclease H; Short=RNase H [Pseudoalteromonas atlantica T6c] Sequence ID: Q15TA7.1 Length: 153 Range 1: 7 to 146 Score:40.0 bits(92), Expect:0.014, Method:Compositional matrix adjust., Identities:42/144(29%), Positives:64/144(44%), Gaps:24/144(16%) Query 438 YTDGGKKNGRGSLGYIA------STGEKFRIHEEGTNQQLELRAIEEACKQGPE--KMNI 489 YTDG G GY A + E + TN ++EL A EA E K+++ Sbjct 7 YTDGSCLGNPGPGGYGAVLLFNQHSKELSQGFVHTTNNRMELLATIEALASLTETCKVDL 66 Query 490 VTDSRYAYEFM---LRNW--------DEEVIRN-PIQARIMELVHNKEKIGVHWVPGHKG 537 TDS+Y + ++NW D++ ++N + R+ E V + + HWV GH G Sbjct 67 TTDSQYVKNGINQWIKNWRKNGWRTSDKKPVKNVDLWKRLDEQV-GRHDVKWHWVKGHSG 125 Query 538 IPQNEEIDRYISEIFLAKEGRGIL 561 P NE D + A G+ +L Sbjct 126 HPMNERCDVLARD---AASGKSLL 146 >RecName: Full=Ribonuclease H; Short=RNase H [Thermus thermophilus HB27] Sequence ID: Q72IE1.1 Length: 166 Range 1: 12 to 140 Score:40.0 bits(92), Expect:0.014, Method:Composition-based stats., Identities:35/131(27%), Positives:57/131(43%), Gaps:24/131(18%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIHEE--------GTNQQLELRAIEEACK--QGPEKM 487 +TDG G G+ A +F HE+ TN ++EL+A E K + P ++ Sbjct 12 FTDGACLGNPGPGGWAALL--RFNAHEKLLSGGEACTTNNRMELKAAIEGLKALKEPCEV 69 Query 488 NIVTDSRYAYEFMLRNWDE------------EVIRNPIQARIMELVHNKEKIGVHWVPGH 535 ++ TDS Y + W E + ++N + L ++ H+V GH Sbjct 70 DLYTDSHYLKKAFTEGWLEGWRKRGWRTAEGKPVKNRDLWEALLLAMAPHRVRFHFVKGH 129 Query 536 KGIPQNEEIDR 546 G P+NE +DR Sbjct 130 TGHPENERVDR 140 >RecName: Full=Ribonuclease H; Short=RNase H [Hahella chejuensis KCTC 2396] Sequence ID: Q2SJ45.1 Length: 148 Range 1: 7 to 133 Score:39.7 bits(91), Expect:0.016, Method:Compositional matrix adjust., Identities:44/128(34%), Positives:63/128(49%), Gaps:21/128(16%) Query 438 YTDGG-KKN-GRGSLGYIASTG----EKFRIHEEGTNQQLELRAIEEAC---KQGPEKMN 488 YTDG KKN G G G I G E + + TN ++EL A EA KQG K+ Sbjct 7 YTDGACKKNPGPGGWGAILIYGKNEKEIYGGELDTTNNRMELMAAIEALRALKQGC-KVE 65 Query 489 IVTDSRYAYEFM---LRNWDEEVIR----NPIQA----RIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + + ++NW ++ R +P++ + ++ NK I WV GH G Sbjct 66 LYTDSQYVRKGITEWMQNWIKKGWRTSGGDPVKNVDLWQALDKERNKHDISWRWVKGHSG 125 Query 538 IPQNEEID 545 P NE D Sbjct 126 HPLNERAD 133 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pestis Pestoides F] Sequence ID: A4TL54.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pseudotuberculosis IP 31758] Sequence ID: A7FFK7.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pestis Angola] Sequence ID: A9R0G0.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pseudotuberculosis YPIII] Sequence ID: B1JR46.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pseudotuberculosis PB1/+] Sequence ID: B2KAC9.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pestis Antiqua] Sequence ID: Q1CAJ5.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pestis Nepal516] Sequence ID: Q1CFI6.1 Length: 154 >RecName: Full=Ribonuclease H; Short=RNase H [Yersinia pseudotuberculosis IP 32953] Sequence ID: Q667M7.1 Length: 154 >RecName: Full=Ribonuclease HI; Short=RNase HI [Yersinia pestis] Sequence ID: Q8ZH30.1 Length: 154 Range 1: 8 to 152 Score:39.7 bits(91), Expect:0.016, Method:Compositional matrix adjust., Identities:42/155(27%), Positives:67/155(43%), Gaps:31/155(20%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIHEEG--------TNQQLELRAIEEACK--QGPEKM 487 +TDG G GY A +++ HE+ TN ++EL A A + P ++ Sbjct 8 FTDGSCLGNPGPGGYGAIL--RYKQHEKTFSAGYYLTTNNRMELMAAIVALEALTSPCEV 65 Query 488 NIVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHK 536 + TDS+Y + + + NW D + +RN + ++L I WV GH Sbjct 66 TLSTDSQYVRQGITQWIHNWKKRGWKTADRKPVRNVDLWQRLDLAIQSHTIQWEWVKGHA 125 Query 537 GIPQNEEIDRYISEIFLAKEGRGILQKRAEDAGYD 571 G P+NE D LA++G +D GY+ Sbjct 126 GHPENERCDE------LARQGAN--SPTLDDTGYN 152 >RecName: Full=Ribonuclease H1; Short=RNase H1 [Rattus norvegicus] Sequence ID: Q5BK46.1 Length: 285 Range 1: 142 to 285 Score:40.8 bits(94), Expect:0.020, Method:Compositional matrix adjust., Identities:42/145(29%), Positives:60/145(41%), Gaps:26/145(17%) Query 438 YTDG-----GKKNGRGSLGYIASTGEKF----RIHEEGTNQQLEL----RAIEEACKQGP 484 YTDG G+K R +G G R+ TNQ+ E+ +AI +A Q Sbjct 142 YTDGCCSSNGRKRARAGIGVYWGPGHPLNVGIRLPGRQTNQRAEIHAACKAITQAKAQNI 201 Query 485 EKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK----------IGVHW--V 532 K+ + TDS + + NW + +N + + V NKE + + W + Sbjct 202 SKLVLYTDSMFTIN-GITNWVQGWKKNGWRTSTGKDVINKEDFMELDELTQGMDIQWMHI 260 Query 533 PGHKGIPQNEEIDRYISEIFLAKEG 557 PGH G NEE DR E EG Sbjct 261 PGHSGFVGNEEADRLAREGAKQSEG 285 >RecName: Full=Ribonuclease H; Short=RNase H [Legionella pneumophila str. Lens] Sequence ID: Q5WWW5.1 Length: 143 Range 1: 6 to 132 Score:39.3 bits(90), Expect:0.020, Method:Compositional matrix adjust., Identities:36/128(28%), Positives:62/128(48%), Gaps:21/128(16%) Query 438 YTDGGKKNGRGSLGY---IASTGEKFRIHE---EGTNQQLELRAI---EEACKQGPEKMN 488 YTDG K G G+ + G + +H + TN ++EL A EA K+ P +++ Sbjct 6 YTDGACKGNPGPGGWGVLLRYNGREKTLHGGEPQTTNNRMELMAAIKGLEALKR-PCEVD 64 Query 489 IVTDSRYAYEFM-----------LRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + M RN +E+++N + ++ + + I HWV GH G Sbjct 65 LYTDSQYLQQGMKEWIKTWKRNGWRNSKKELVKNAELWKSLDNLASIHNINWHWVKGHSG 124 Query 538 IPQNEEID 545 +N+ +D Sbjct 125 HLENDLVD 132 >RecName: Full=Ribonuclease H; Short=RNase H [Syntrophotalea carbinolica DSM 2380] Sequence ID: Q3A827.1 Length: 152 Range 1: 21 to 137 Score:39.3 bits(90), Expect:0.021, Method:Composition-based stats., Identities:36/119(30%), Positives:55/119(46%), Gaps:21/119(17%) Query 446 GRGSLGYIASTGEKFR----IHEEGTNQQLELR---AIEEACKQGPEKMNIVTDSRYAYE 498 G G G + GE+ R E TN ++EL A EA + P ++ + TDS+Y + Sbjct 21 GPGGFGTLLRCGERVRELSGFDPETTNNRMELLGAIAGLEALTR-PCRVRLTTDSQYVCK 79 Query 499 FM---LRNWD---------EEVIRNPIQARIMELVHNKEKIGVHWVPGHKGIPQNEEID 545 M + W E+V + R++ LV +K ++ HWV GH G +NE D Sbjct 80 GMTEWIHGWQKKGWKNSKKEDVANRDLWERLLVLV-SKHEVSWHWVRGHAGHAENERCD 137 >RecName: Full=Ribonuclease H1; Short=RNase H1; AltName: Full=Ribonuclease H type II [Homo sapiens] Sequence ID: O60930.2 Length: 286 Range 1: 143 to 286 Score:40.4 bits(93), Expect:0.024, Method:Compositional matrix adjust., Identities:47/155(30%), Positives:68/155(43%), Gaps:36/155(23%) Query 438 YTDG-----GKKNGRGSLGYIASTGEKF----RIHEEGTNQQLEL----RAIEEACKQGP 484 YTDG G++ R +G G R+ TNQ+ E+ +AIE+A Q Sbjct 143 YTDGCCSSNGRRRPRAGIGVYWGPGHPLNVGIRLPGRQTNQRAEIHAACKAIEQAKTQNI 202 Query 485 EKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK----------IGVHW--V 532 K+ + TDS + + NW + +N + + V NKE + + W V Sbjct 203 NKLVLYTDSMFTIN-GITNWVQGWKKNGWKTSAGKEVINKEDFVALERLTQGMDIQWMHV 261 Query 533 PGHKGIPQNEEIDRYISEIFLAKEGRGILQKRAED 567 PGH G NEE DR LA+EG K++ED Sbjct 262 PGHSGFIGNEEADR------LAREG----AKQSED 286 >RecName: Full=Ribonuclease H; Short=RNase H [Chelativorans sp. BNC1] Sequence ID: Q11KC5.1 Length: 162 Range 1: 7 to 146 Score:39.3 bits(90), Expect:0.026, Method:Compositional matrix adjust., Identities:37/146(25%), Positives:63/146(43%), Gaps:25/146(17%) Query 438 YTDGGKKNGRGSLG------YIASTGEKFRIHEEGTNQQLELRAIEEACK--QGPEKMNI 489 +TDG G G Y + E + + TN ++EL A EA + + P ++++ Sbjct 7 FTDGACSGNPGPGGWGAILRYNGTEKELYGGEADTTNNRMELTAAIEALEALKEPCEVDL 66 Query 490 VTDSRYAYEFML-----------RNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKGI 538 TDS Y + + R D + ++N + ++ + K+ HWV GH G Sbjct 67 HTDSNYLRDGISGWIEGWKRNGWRTADRKPVKNAELWQALDEARRRHKVHWHWVRGHAGH 126 Query 539 PQNEEIDRYISEIFLAKEGRGILQKR 564 P+NE D LA+ G +K+ Sbjct 127 PENERAD------ALARAGMAPFKKK 146 >RecName: Full=Ribonuclease H; Short=RNase H [Legionella pneumophila str. Corby] Sequence ID: A5IBM5.1 Length: 143 >RecName: Full=Ribonuclease H; Short=RNase H [Legionella pneumophila str. Paris] Sequence ID: Q5X5I2.1 Length: 143 >RecName: Full=Ribonuclease H; Short=RNase H [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Sequence ID: Q5ZVQ7.1 Length: 143 Range 1: 6 to 132 Score:38.9 bits(89), Expect:0.026, Method:Compositional matrix adjust., Identities:36/128(28%), Positives:62/128(48%), Gaps:21/128(16%) Query 438 YTDGGKKNGRGSLGY---IASTGEKFRIH---EEGTNQQLELRAI---EEACKQGPEKMN 488 YTDG K G G+ + G + +H + TN ++EL A EA K+ P +++ Sbjct 6 YTDGACKGNPGPGGWGVLLRYNGREKTLHGGEAQTTNNRMELMAAIKGLEALKR-PCEVD 64 Query 489 IVTDSRYAYEFM-----------LRNWDEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + M RN +E+++N + ++ + + I HWV GH G Sbjct 65 LYTDSQYLQQGMKEWIKTWKRNGWRNSKKELVKNAELWKSLDNLASIHNIHWHWVKGHSG 124 Query 538 IPQNEEID 545 +N+ +D Sbjct 125 HLENDLVD 132 >RecName: Full=Ribonuclease H; Short=RNase H [Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica)] Sequence ID: A1AW38.1 Length: 146 Range 1: 7 to 134 Score:38.9 bits(89), Expect:0.027, Method:Composition-based stats., Identities:33/128(26%), Positives:61/128(47%), Gaps:20/128(15%) Query 438 YTDGGKKN--GRGSLGYIASTGEKFR----IHEEGTNQQLELRAI---EEACKQGPEKMN 488 YTDGG + G G G G+ + + ++ TN Q+EL A E K ++ Sbjct 7 YTDGGCRGNPGIGGWGVWLKYGDYDKKLQGVQQDTTNNQMELTATIKALEVIKSNDIAID 66 Query 489 IVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWVPGHKG 537 + TDS+Y + ++NW +++ ++N + +++++N+ + HWV GH G Sbjct 67 LFTDSKYVITGISEWIKNWKAKGWKTANKKPVKNIDLWQRLDVLNNQHNVTWHWVKGHSG 126 Query 538 IPQNEEID 545 N+ D Sbjct 127 DKGNDMAD 134 >RecName: Full=Ribonuclease H; Short=RNase H [Neisseria gonorrhoeae FA 1090] Sequence ID: Q5F7K9.1 Length: 145 Range 1: 4 to 134 Score:38.9 bits(89), Expect:0.028, Method:Composition-based stats., Identities:40/132(30%), Positives:55/132(41%), Gaps:22/132(16%) Query 435 PTY-YTDGGKKNGRGSLG------YIASTGEKFRIHEEGTNQQLELRAIEEACKQGPEKM 487 P Y YTDG K G+ G Y + E F + TN ++EL A+ E K + Sbjct 4 PVYLYTDGACKGNPGAGGWGVLMRYGSREKELFGGEAQTTNNRMELTAVIEGLKSLKRRC 63 Query 488 NIV--TDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK-------IGVH-----WVP 533 ++ TDS+Y M NW RN + + V N + +G H WV Sbjct 64 TVIICTDSQYVKNGM-ENWIHGWKRNGWKTAAKQPVKNDDLWQELDALVGQHQVSWTWVK 122 Query 534 GHKGIPQNEEID 545 GH G +NE D Sbjct 123 GHAGHAENERAD 134 >RecName: Full=Ribonuclease H; Short=RNase H [Neisseria meningitidis FAM18] Sequence ID: A1KV38.1 Length: 145 Range 1: 4 to 134 Score:38.9 bits(89), Expect:0.029, Method:Composition-based stats., Identities:40/132(30%), Positives:55/132(41%), Gaps:22/132(16%) Query 435 PTY-YTDGGKKNGRGSLG------YIASTGEKFRIHEEGTNQQLELRAIEEACKQGPEKM 487 P Y YTDG K G+ G Y + E F + TN ++EL A+ E K + Sbjct 4 PVYLYTDGACKGNPGAGGWGVLMRYGSHEKELFGGEAQTTNNRMELTAVIEGLKSLKRRC 63 Query 488 NIV--TDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK-------IGVH-----WVP 533 ++ TDS+Y M NW RN + + V N + +G H WV Sbjct 64 TVIICTDSQYVKNGM-ENWIHGWKRNGWKTAAKQPVKNDDLWKELDTLVGQHQVSWTWVK 122 Query 534 GHKGIPQNEEID 545 GH G +NE D Sbjct 123 GHAGHAENERAD 134 >RecName: Full=Ribonuclease H1; Short=RNase H1 [Mus musculus] Sequence ID: O70338.1 Length: 285 Range 1: 142 to 285 Score:40.0 bits(92), Expect:0.031, Method:Compositional matrix adjust., Identities:46/155(30%), Positives:67/155(43%), Gaps:36/155(23%) Query 438 YTDG-----GKKNGRGSLGYIASTGEKF----RIHEEGTNQQLEL----RAIEEACKQGP 484 YTDG G+K R +G G R+ TNQ+ E+ +AI +A Q Sbjct 142 YTDGCCSSNGRKRARAGIGVYWGPGHPLNVGIRLPGRQTNQRAEIHAACKAIMQAKAQNI 201 Query 485 EKMNIVTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKEK----------IGVHW--V 532 K+ + TDS + + NW + +N + + V NKE + + W + Sbjct 202 SKLVLYTDSMFTINGIT-NWVQGWKKNGWRTSTGKDVINKEDFMELDELTQGMDIQWMHI 260 Query 533 PGHKGIPQNEEIDRYISEIFLAKEGRGILQKRAED 567 PGH G NEE DR LA+EG K++ED Sbjct 261 PGHSGFVGNEEADR------LAREG----AKQSED 285 >RecName: Full=Ribonuclease H; Short=RNase H [Bradyrhizobium sp. ORS 278] Sequence ID: A4Z216.1 Length: 154 Range 1: 4 to 142 Score:38.5 bits(88), Expect:0.039, Method:Compositional matrix adjust., Identities:39/145(27%), Positives:67/145(46%), Gaps:25/145(17%) Query 432 VPGPTYYTDGGKKN--GRGSLGYIASTGEKFRIHEEG----TNQQLELRAIEEACK--QG 483 +P + +TDG G G G I G+K + + G TN ++EL A A + + Sbjct 4 LPVVSIFTDGACSGNPGPGGWGAILRFGDKEKELKGGEPHTTNNRMELMAAISALEALKK 63 Query 484 PEKMNIVTDSRYAYEFM---LRNW--------DEEVIRNPIQARIMELVHNKEKIGVHWV 532 ++ + TDS+Y + + + W D++ ++N + ++ KI HWV Sbjct 64 SCQVELYTDSQYVRQGITGWIHGWKRNGWKTADKKPVKNAELWQRLDAALKPHKINWHWV 123 Query 533 PGHKGIPQNEEIDRYISEIFLAKEG 557 GH G P+NE D+ LA++G Sbjct 124 KGHAGHPENERADQ------LARDG 142 >RecName: Full=Ribonuclease H; Short=RNase H [Brucella anthropi ATCC 49188] Sequence ID: A6WWG8.1 Length: 154 Range 1: 7 to 139 Score:38.5 bits(88), Expect:0.040, Method:Compositional matrix adjust., Identities:41/140(29%), Positives:58/140(41%), Gaps:27/140(19%) Query 438 YTDGGKKNGRGSLGYIASTGEKFRIHE------EGTNQQLELRAIEEACK--QGPEKMNI 489 YTDG G G+ A + E + TN ++EL A A + P ++++ Sbjct 7 YTDGACSGNPGPGGWGAILRWNDNVKELKGGEADTTNNRMELMAAISALSALKEPCEVDL 66 Query 490 VTDSRYAYEFMLRNWDEEVIRNPIQARIMELVHNKE------------KIGVHWVPGHKG 537 TDS Y + + W E RN + + V N E K+ HWV GH G Sbjct 67 YTDSVYVRDG-ISGWIEGWKRNGWKTAAKKPVKNAELWQALDEARKPHKVNWHWVKGHAG 125 Query 538 IPQNEEIDRYISEIFLAKEG 557 P+NE D LA+EG Sbjct 126 HPENERADE------LAREG 139 >RecName: Full=Ribonuclease H; Short=RNase H [Acidovorax sp. JS42] Sequence ID: A1W6Q8.1 Length: 148 Range 1: 7 to 135 Score:38.5 bits(88), Expect:0.041, Method:Compositional matrix adjust., Identities:39/129(30%), Positives:59/129(45%), Gaps:21/129(16%) Query 438 YTDGGKKN--GRGSLGYIASTG----EKFRIHEEGTNQQLELRAIEEACK--QGPEKMNI 489 YTDG K G G G + +G E F TN ++EL A+ +A + P ++ + Sbjct 7 YTDGACKGNPGPGGWGAVLRSGTLEKELFGGELGTTNNRMELMAVIQALGALKRPCQVAL 66 Query 490 VTDSRYAYEFM---LRNWDEEVIRN----PIQ-----ARIMELVHNK-EKIGVHWVPGHK 536 DS+Y + + + W ++ R P++ R+ EL H +I HWV GH Sbjct 67 YLDSQYVRQGITEWIHGWKKKGWRTAAGQPVKNVELWQRLDELAHQAGHRIEWHWVRGHA 126 Query 537 GIPQNEEID 545 G P NE D Sbjct 127 GDPGNERAD 135