TBLASTN 2.2.28+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Database: X5.fasta 1,868 sequences; 23,962,143 total letters Query= HSP71_YEAST P10591 Heat shock protein SSA1 (Heat shock protein YG100) Length=642 Score E Sequences producing significant alignments: (Bits) Value scaffold-199 920 0.0 scaffold-96 744 0.0 scaffold-423 737 0.0 unplaced-999 540 8e-171 unplaced-980 461 9e-142 scaffold-157 285 1e-81 scaffold-693 281 2e-80 unplaced-804 264 2e-74 scaffold-499 262 6e-74 unplaced-959 231 1e-63 scaffold-469 150 5e-43 scaffold-418 150 5e-43 unplaced-113 122 1e-32 scaffold-138 78.6 3e-17 scaffold-61 78.6 3e-17 unplaced-721 43.9 2e-05 > scaffold-199 Length=1112851 Score = 920 bits (2377), Expect = 0.0, Method: Compositional matrix adjust. Identities = 481/609 (79%), Positives = 550/609 (90%), Gaps = 3/609 (0%) Frame = -2 Query 2 SKAVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTDTERLIGDAAKNQAAMNP 61 SKA+GIDLGTTYSCV + N++V+IIAND GNRTTPS+VAFTD+ERL+GDAAKNQ +NP Sbjct 1109256 SKAIGIDLGTTYSCVGVW*NEKVEIIAND*GNRTTPSYVAFTDSERLLGDAAKNQVGLNP 1109077 Query 62 SNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKNFTPEQISSMV 121 NTVFDAKRLIGR F D EVQ+DMKH+PFK+ID GKP I VE+ GETK FTPE++S+MV Sbjct 1109076 YNTVFDAKRLIGRKFADAEVQSDMKHWPFKVIDKAGKPFI*VEYLGETKTFTPEEVSAMV 1108897 Query 122 LGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAY 181 L KMKETAE++LGAKV +AVVTVPAYFNDSQRQATKDAG+IAGLNV+RIINEPTAAAIAY Sbjct 1108896 LTKMKETAEAFLGAKVTNAVVTVPAYFNDSQRQATKDAGSIAGLNVMRIINEPTAAAIAY 1108717 Query 182 GLDKKGK-EEHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGGEDFDNRLVNHFIQ 240 GLDKK K E++VLIFDLGGGTFDVSLL+IE+GIFEVKATAGDTHLGGEDFDNRLV HF Q Sbjct 1108716 GLDKKTKGEKNVLIFDLGGGTFDVSLLTIEEGIFEVKATAGDTHLGGEDFDNRLVTHFAQ 1108537 Query 241 EFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEGIDFYTSITRARFE 300 EFKRK+KKDLS N R+LRRLRTACERAKRTLSS+ Q S+EIDSLFEG+DFYTSITRARFE Sbjct 1108536 EFKRKHKKDLSGNARSLRRLRTACERAKRTLSSATQASIEIDSLFEGVDFYTSITRARFE 1108357 Query 301 ELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLVTDYFNGKEPNRSI 360 ELC DLFR TLDPVEKVLRD+K+DKSQVDEIVLVGGSTRIPKVQKLV+D+FNGKEPN++I Sbjct 1108356 ELCGDLFRGTLDPVEKVLRDSKIDKSQVDEIVLVGGSTRIPKVQKLVSDFFNGKEPNKTI 1108177 Query 361 NPDEavaygaavqaaILTGDESSKTQDlllldvaplslGIETAGGVMTKLIPRNSTIPTK 420 NPDEAVAYGAAVQA+IL+G+ S KT DLLLLDVAPLSLGIETAGGV T LI RN+TIPTK Sbjct 1108176 NPDEAVAYGAAVQASILSGETSEKT*DLLLLDVAPLSLGIETAGGVFTALIKRNTTIPTK 1107997 Query 421 KSEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSN 480 KSEIFSTYADNQPGVLIQVFEGERA+T DN+ LGKFEL+GIPPAPRGVPQIEVTFD+D+N Sbjct 1107996 KSEIFSTYADNQPGVLIQVFEGERARTADNHQLGKFELTGIPPAPRGVPQIEVTFDIDAN 1107817 Query 481 GILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASKNQLE 540 GILNVSA +K TG+SNKITITNDKGRLS+EDIE+MV+EAEK+K++DE+ + RI +KN LE Sbjct 1107816 GILNVSASDKTTGRSNKITITNDKGRLSQEDIERMVSEAEKYKKQDEEATARIHAKNGLE 1107637 Query 541 SIAYSLKNTISE--AGDKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKELQDIA 598 S AY+L+NT+++ K+++ADK+T+ K ETISWLD N ASKEEF+ K KEL+ A Sbjct 1107636 SYAYNLRNTLNDDNLKGKIDEADKETLEKAITETISWLD*NLEASKEEFESKQKELEGTA 1107457 Query 599 NPIMSKLYQ 607 NPIM+KLYQ Sbjct 1107456 NPIMTKLYQ 1107430 > scaffold-96 Length=364303 Score = 744 bits (1921), Expect = 0.0, Method: Compositional matrix adjust. Identities = 401/607 (66%), Positives = 487/607 (80%), Gaps = 5/607 (1%) Frame = +3 Query 3 KAVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTD-TERLIGDAAKNQAAMNP 61 K +GIDLGTTYSCVA F +V+IIAND G+R TPS+VAFTD ERLIG+AAKNQA NP Sbjct 89928 KVIGIDLGTTYSCVAVFQAGKVEIIANDLGSRITPSWVAFTDDGERLIGEAAKNQAPQNP 90107 Query 62 SNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKNFTPEQISSMV 121 NTVFDAKRLIGR +ND EVQ + K PF + D DG+P IQV KGE K FTPE+IS+MV Sbjct 90108 KNTVFDAKRLIGRKYNDKEVQMEKKSLPFDVTDRDGRPVIQVSVKGEKKTFTPEEISAMV 90287 Query 122 LGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAY 181 LGKMK+ AE YLG +V AVVTVPAYFND+QR ATKDAGTIAGLNVLRIINEPTAAAIAY Sbjct 90288 LGKMKQIAEDYLGKEVTHAVVTVPAYFNDAQR*ATKDAGTIAGLNVLRIINEPTAAAIAY 90467 Query 182 GLDKKGKEEHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGGEDFDNRLVNHFIQE 241 GLDKKG E +L++DLGGGTFDVSLLSI+DG+FEV ATAGDTHLGGEDFDNRL ++ ++ Sbjct 90468 GLDKKGGERTILVYDLGGGTFDVSLLSIDDGVFEVLATAGDTHLGGEDFDNRLRDYLVK* 90647 Query 242 FKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEGIDFYTSITRARFEE 301 FK+KN D S++ +A+ +L+ E+AKR LS+ + VEI++LF+G D ++TRA+FEE Sbjct 90648 FKKKNGSDPSSDLKAMGKLKRESEKAKRALSA*STVRVEIENLFDGKDLSETLTRAKFEE 90827 Query 302 LCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLVTDYFNGKEPNRSIN 361 L D F+ TL PVE+VL+DA L KS + +IVLVGGSTRIPKVQ L+ D+F+GKE N+ +N Sbjct 90828 LNNDYFKKTLKPVERVLKDAGLKKSDIQDIVLVGGSTRIPKVQSLLKDFFDGKELNKGVN 91007 Query 362 PDEavaygaavqaaILTGDESSKTQDlllldvaplslGIETAGGVMTKLIPRNSTIPTKK 421 PDEAVAYGAAVQ IL+ +E + D+LLLDV PL+LGIET GGVMTKLI R + IPTKK Sbjct 91008 PDEAVAYGAAVQGGILSNEE--EV*DVLLLDVNPLTLGIETTGGVMTKLINRGTVIPTKK 91181 Query 422 SEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSNG 481 S+IFST ADN P VLIQ+FEGERA TKDNNLLGKFEL+GIPPAPRGVP IEVTF++D NG Sbjct 91182 SQIFSTAADN*PAVLIQIFEGERAMTKDNNLLGKFELTGIPPAPRGVP*IEVTFELDQNG 91361 Query 482 ILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASKNQLES 541 IL VSAV+KG+GKS ITI NDKGRLS+EDIE+M+ EAE++ EED +RI +KN LE+ Sbjct 91362 ILKVSAVDKGSGKSESITIKNDKGRLSEEDIERMLKEAEEYAEEDRIFKERIEAKNGLET 91541 Query 542 IAYSLKNTIS-EAGDKLEQADKDTVTKKAEETISWLDSNT-TASKEEFDDKLKELQDIAN 599 YS+KN +S E G KLE DKD ++ +E I W++SN TA+KE++++ E++ + N Sbjct 91542 YLYSVKNQLSDEVGKKLEDEDKDAISSIVKEKIEWMESNAETATKEDYEEVKAEVEAVVN 91721 Query 600 PIMSKLY 606 PIMSKLY Sbjct 91722 PIMSKLY 91742 > scaffold-423 Length=1446679 Score = 737 bits (1903), Expect = 0.0, Method: Compositional matrix adjust. Identities = 399/607 (66%), Positives = 485/607 (80%), Gaps = 5/607 (1%) Frame = -3 Query 3 KAVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTD-TERLIGDAAKNQAAMNP 61 K +GIDLGTTYSCVA F +V+IIAND G+R TPS+VAFTD ERLIG+AAKNQA NP Sbjct 1313216 KVIGIDLGTTYSCVAVFQAGKVEIIANDLGSRITPSWVAFTDDGERLIGEAAKNQAPQNP 1313037 Query 62 SNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKNFTPEQISSMV 121 NTVFDAKRLIGR +ND EVQ + K PF + D +G+P IQV KGE K FTPE+IS+MV Sbjct 1313036 KNTVFDAKRLIGRKYNDKEVQMEKKSLPFDVTDREGRPVIQVSVKGEKKTFTPEEISAMV 1312857 Query 122 LGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAY 181 L KMK+ AE YLG +V AVVTVPAYFND+QR ATKDAGTIAGLNVLRIINEPTAAAIAY Sbjct 1312856 LVKMKQIAEDYLGKEVTHAVVTVPAYFNDAQR*ATKDAGTIAGLNVLRIINEPTAAAIAY 1312677 Query 182 GLDKKGKEEHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGGEDFDNRLVNHFIQE 241 GLDKKG E +L++DLGGGTFDVSLLSI+DG+FEV ATAGDTHLGGEDFDNRL ++ ++ Sbjct 1312676 GLDKKGGERTILVYDLGGGTFDVSLLSIDDGVFEVLATAGDTHLGGEDFDNRLRDYLVK* 1312497 Query 242 FKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEGIDFYTSITRARFEE 301 FK+KN D S++ +A+ +L+ E+AKR LS+ + VEI++LF+G D ++TRA+FEE Sbjct 1312496 FKKKNGSDPSSDLKAMGKLKRESEKAKRALSA*STVRVEIENLFDGKDLSETLTRAKFEE 1312317 Query 302 LCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLVTDYFNGKEPNRSIN 361 L D F+ TL PVE+VL+DA L KS + +IVLVGGSTRIPKVQ L+ D+F+GKE N+ +N Sbjct 1312316 LNNDYFKKTLKPVERVLKDAGLKKSDIQDIVLVGGSTRIPKVQSLLKDFFDGKELNKGVN 1312137 Query 362 PDEavaygaavqaaILTGDESSKTQDlllldvaplslGIETAGGVMTKLIPRNSTIPTKK 421 PDEAVAYGAAVQ IL+ +E + D+LLLDV PL+LGIET GGVMTKLI R + IPTKK Sbjct 1312136 PDEAVAYGAAVQGGILSNEE--EV*DVLLLDVNPLTLGIETTGGVMTKLINRGTVIPTKK 1311963 Query 422 SEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSNG 481 S IFST ADN P VLIQ+FEGERA TKDNNLLGKFEL+GIPPAPRGVP IEVTF++D NG Sbjct 1311962 S*IFSTAADN*PAVLIQIFEGERAMTKDNNLLGKFELTGIPPAPRGVP*IEVTFELDQNG 1311783 Query 482 ILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASKNQLES 541 IL VSAV+KG+GKS ITI NDKGRLS+EDIE+M+ EAE++ EED +RI +KN LE+ Sbjct 1311782 ILKVSAVDKGSGKSESITIKNDKGRLSEEDIERMLKEAEEYAEEDRIFKERIEAKNGLET 1311603 Query 542 IAYSLKNTIS-EAGDKLEQADKDTVTKKAEETISWLDSNT-TASKEEFDDKLKELQDIAN 599 YS+KN +S E G KLE DKD ++ +E I W++SN TA+KE++++ E++ + N Sbjct 1311602 YLYSVKNQLSDEVGKKLEDEDKDAISSIVKEKIEWMESNAETATKEDYEEVKAEVEAVVN 1311423 Query 600 PIMSKLY 606 PIMSKLY Sbjct 1311422 PIMSKLY 1311402 Score = 232 bits (592), Expect = 8e-64, Method: Compositional matrix adjust. Identities = 146/391 (37%), Positives = 223/391 (57%), Gaps = 17/391 (4%) Frame = +1 Query 4 AVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTDTERLIGDAAKNQAAMNPSN 63 VG D+G S +A N VD+I N+ NR TPS V+F +R IG++AK+ N N Sbjct 781726 VVGFDIGNMNSIIAVARNRGVDVICNEVSNRATPSLVSFGPKQRHIGESAKSMELGNFKN 781905 Query 64 TVFDAKRLIGRNFNDPEVQ-ADMKHFPFKLIDVDGKPQI--QVEFKGETKNFTPEQISSM 120 TV KRLIGR F++ +V + K+ +L+ ++G +I +V+F E + F+ Q+ +M Sbjct 781906 TVGSLKRLIGRKFSEKDVTDIEAKYLNCELVPINGGNEIGAKVQF*NEERVFSYTQLMAM 782085 Query 121 VLGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIA 180 + K+K E L + V D V++VP YF D+QR+A DA IAG+ LR+I + TA+A+ Sbjct 782086 YINKLK*ITEQELKSVVTDCVISVPLYFTDAQRRALIDAADIAGVKALRLIPDVTASALQ 782265 Query 181 YGLDKK---------GKE----EHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGG 227 +G+ K GK ++V D+G VS++ G VK A D HLGG Sbjct 782266 WGITKTDLPDPEAADGKSGPAIKYVAFVDIG*SDTSVSIVGY*KGKMMVKGVAYDRHLGG 782445 Query 228 EDFDNRLVNHFIQEFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEG 287 +FD LV++F+++ + K K D+ +N +AL RLR+ CE+ K+ LS++ Q + ++ L Sbjct 782446 RNFDEVLVDYFVKDIQTKYKMDVRSNPKALFRLRSTCEKTKKILSANLQAPLSVECLMND 782625 Query 288 IDFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLV 347 D + I R +FE L L L P+ + L A + K +D + +VGGSTRIP V+ V Sbjct 782626 KDVSSMIERPQFEGLIQPLIERVLVPM*RALDIAGVSKEDIDVVEIVGGSTRIPAVKTAV 782805 Query 348 TDYFNGKEPNRSINPDEavaygaavqaaILT 378 + +F GKE + ++N DE VA G AIL+ Sbjct 782806 S*FF-GKELSTTLN*DECVAKGCTFMCAILS 782895 > unplaced-999 Length=6853 Score = 540 bits (1391), Expect = 8e-171, Method: Compositional matrix adjust. Identities = 258/315 (82%), Positives = 288/315 (91%), Gaps = 1/315 (0%) Frame = -2 Query 2 SKAVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTDTERLIGDAAKNQAAMNP 61 SKA+GIDLGTTYSCV + N++V+IIANDQGNRTTPS+VAFTD+ERL+GDAAKNQ +NP Sbjct 945 SKAIGIDLGTTYSCVGVW*NEKVEIIANDQGNRTTPSYVAFTDSERLLGDAAKNQVGLNP 766 Query 62 SNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKNFTPEQISSMV 121 NTVFDAKRLIGR F D EV +DMKH+PFK+ID GKP IQVE+ GETK FTPE++S+MV Sbjct 765 YNTVFDAKRLIGRKFADAEV*SDMKHWPFKVIDKAGKPFIQVEYLGETKTFTPEEVSAMV 586 Query 122 LGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAY 181 L KMKETAE++LGAKV +AVVTVPAYFNDSQRQATKDAG+IAGLNV+RIINEPTAAAIAY Sbjct 585 LTKMKETAEAFLGAKVTNAVVTVPAYFNDSQRQATKDAGSIAGLNVMRIINEPTAAAIAY 406 Query 182 GLDKKGK-EEHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGGEDFDNRLVNHFIQ 240 GLDKK K E++VLIFDLGGGTFDVSLL+IE+GIFEVKATAGDTHLGGEDFDNRLV HF Q Sbjct 405 GLDKKTKGEKNVLIFDLGGGTFDVSLLTIEEGIFEVKATAGDTHLGGEDFDNRLVTHFAQ 226 Query 241 EFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEGIDFYTSITRARFE 300 EFKRK+KKDLS N R+LRRLRTACERAKRTLSS+ Q S+EIDSLFEG+DFYTSITRARFE Sbjct 225 EFKRKHKKDLSGNARSLRRLRTACERAKRTLSSATQASIEIDSLFEGVDFYTSITRARFE 46 Query 301 ELCADLFRSTLDPVE 315 ELC DLFR TLDPVE Sbjct 45 ELCGDLFRGTLDPVE 1 > unplaced-980 Length=19190 Score = 461 bits (1185), Expect = 9e-142, Method: Compositional matrix adjust. Identities = 259/334 (78%), Positives = 299/334 (90%), Gaps = 2/334 (1%) Frame = +1 Query 276 QTSVEIDSLFEGIDFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVG 335 Q S+EIDSLFEG+DFYTSITRARFEELC DLFR TLDPVEKVLRD+K+DKSQVDEIVLVG Sbjct 1 QASIEIDSLFEGVDFYTSITRARFEELCGDLFRGTLDPVEKVLRDSKIDKSQVDEIVLVG 180 Query 336 GSTRIPKVQKLVTDYFNGKEPNRSINPDEavaygaavqaaILTGDESSKTQDlllldvap 395 GSTRIPKVQKLV+D+FNGKEPN++INPDEAVAYGAAVQA+IL+G+ S KT DLLLLDVAP Sbjct 181 GSTRIPKVQKLVSDFFNGKEPNKTINPDEAVAYGAAVQASILSGETSEKT*DLLLLDVAP 360 Query 396 lslGIETAGGVMTKLIPRNSTIPTKKSEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGK 455 LSLGIETAGGV T LI RN+TIPTKKSEIFSTYADNQPGVLIQVFEGERA+T DN+ LGK Sbjct 361 LSLGIETAGGVFTALIKRNTTIPTKKSEIFSTYADNQPGVLIQVFEGERARTADNHQLGK 540 Query 456 FELSGIPPAPRGVPQIEVTFDVDSNGILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKM 515 FEL+GIPPAPRGVPQIEVTFD+D+NGILNVSA +K TG+SNKITITNDKGRLS+EDIE+M Sbjct 541 FELTGIPPAPRGVPQIEVTFDIDANGILNVSASDKTTGRSNKITITNDKGRLSQEDIERM 720 Query 516 VaeaekfkeedekeSQRIASKNQLESIAYSLKNTISEAG--DKLEQADKDTVTKKAEETI 573 V+EAEK+K++DE+ + RI +KN LES AY+L+NT+++ K++ ADK+T+ K ETI Sbjct 721 VSEAEKYKKQDEEATARIHAKNGLESYAYNLRNTLNDDNLKGKIDGADKETLEKAITETI 900 Query 574 SWLDSNTTASKEEFDDKLKELQDIANPIMSKLYQ 607 SWLD N ASKEEF+ K KEL+ ANPIM+KLYQ Sbjct 901 SWLD*NLEASKEEFESKQKELEGTANPIMTKLYQ 1002 > scaffold-157 Length=706690 Score = 285 bits (730), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 188/402 (47%), Positives = 259/402 (64%), Gaps = 18/402 (4%) Frame = +2 Query 216 VKATAGDTHLGGEDFDNRLVNHFIQEFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSA 275 VKAT GDT LGGEDFDNRLV + + EFK++ D+S + AL+R+R + E+AK LSS Sbjct 165338 VKATNGDT*LGGEDFDNRLVQYIVNEFKKEQGVDVSKDMMALQRIRESAEKAKIELSSVP 165517 Query 276 QTSVEIDSLFEGI----DFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEI 331 QT + + + +TR++FE LCADL T++P +K L DA + S +DE+ Sbjct 165518 QTEINLPYITADATGPKHINLRLTRSKFESLCADLMNRTIEPCKKALSDANVKPSDLDEV 165697 Query 332 VLVGGSTRIPKVQKLVTDYFNGKEPNRSINPDEavaygaavqaaILTGDESSKTQDllll 391 +LVGG TR+P+VQ +V D F +EP++++NPDEAVA GAA+Q +L+G+ +S +LLL Sbjct 165698 ILVGGMTRVPRVQDIVKDLFK-REPSKAVNPDEAVAVGAAIQGGVLSGEVNS----VLLL 165862 Query 392 dvaplslGIETAGGVMTKLIPRNSTIPTKKSEIFSTYADNQPGVLIQVFEGERAKTKDNN 451 DV PLSLGIET GGV T+LI RN+TIPTKKS++FST AD Q V I+VF+GER KDN Sbjct 165863 DVTPLSLGIETLGGVFTRLIHRNTTIPTKKSQVFSTAADGQTQVEIRVFQGERELCKDNK 166042 Query 452 LLGKFELSGIPPAPRGVPQIEVTFDVDSNGILNVSAVEKGTGKSNKITITNDKGRLSKED 511 LLG F L+GIPP PRGVPQIEVTFD+D++GI+NVSA +K T ITIT G L + Sbjct 166043 LLGNF*LTGIPPLPRGVPQIEVTFDIDADGIVNVSAKDKATNVD*SITITASSG-LKDTE 166219 Query 512 IEKMVaeaekfkeedekeSQRIASKNQLESIAYSLKNTISEAGDKLEQADKDTVTKKAEE 571 IEKM+ EAE+F D+ I + N + + Y + ++E K+ + + + +K +E Sbjct 166220 IEKMIQEAEQFSSADKDRKDVIEATNMADGLIYETEKNLNEHKSKIGEDEMSKIQEKIQE 166399 Query 572 TISWL------DSNTTASKEEFDDKLKELQDIANPIMSKLYQ 607 S L D N A E+ K +EL+D + +Y+ Sbjct 166400 LRSALTQAQEGDGNVKA--EDLRTKTQELKDAGMKMFEIVYK 166519 Score = 233 bits (593), Expect = 5e-64, Method: Compositional matrix adjust. Identities = 117/215 (54%), Positives = 153/215 (71%), Gaps = 5/215 (2%) Frame = +3 Query 5 VGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFT-DTERLIGDAAKNQAAMNPSN 63 +GIDLGTT SC A +I N +G RTTPS+VAF+ D E L+G AK QA +NP N Sbjct 164418 IGIDLGTTNSCNALMDGKASRVIENAEGTRTTPSYVAFSKDGELLVGVPAKRQAVINP*N 164597 Query 64 TVFDAKRLIGRNFNDPEVQADMKHFPFKLI-DVDGKPQIQVEFKGETKNFTPEQISSMVL 122 T F KRLIGR F+DPE+ K P+K++ +G ++ K ++P QI VL Sbjct 164598 TFFATKRLIGRKFDDPEI*VMKKTVPYKIVKHSNGDAWVE---DTNGKRYSPSQIGGYVL 164768 Query 123 GKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAYG 182 GK+K AES+LG V +AVVTVPAYFND+QRQATKDAG IAGL+VLR+INEPTAAA+AYG Sbjct 164769 GKLKSDAESFLGTPVKNAVVTVPAYFNDAQRQATKDAGKIAGLDVLRVINEPTAAALAYG 164948 Query 183 LDKKGKEEHVLIFDLGGGTFDVSLLSIEDGIFEVK 217 +DK ++ + ++DLGGGTFD+S+L ++ G+FEV+ Sbjct 164949 MDKADMDKTIAVYDLGGGTFDISILELQSGVFEVR 165053 Score = 133 bits (334), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 90/310 (29%), Positives = 154/310 (50%), Gaps = 27/310 (9%) Frame = -1 Query 61 PSNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKN------FTP 114 P K L+GR +ND VQ + +L++ + + KG + FT Sbjct 219904 PKEAYGKLKNLVGRLYNDSLVQ*FQEGCDNELVEDSLRHTCAFKHKGGAADGQSDTEFTV 219725 Query 115 EQISSMVLGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEP 174 E++ +M L ++ E + V DAV+TVP ++ +R A DA +AG+ VL ++N+ Sbjct 219724 EELIAMQLAYTRDQVEQFAEEVVRDAVLTVPPFWTQHER*ALIDAAELAGIRVLSLMNDE 219545 Query 175 TAAAIAYGLDKKGKEE--HVLIFDLGGGTFDVSLL---SIEDGI------------FEVK 217 TA A+ Y ++ ++ + + +D G G+ SL+ +I++ I +V Sbjct 219544 TAVALNYASTRQFDDKP*YHIFYDFGAGSTVASLIEFRTIQEKISPKSRMTKNVTSIQVL 219365 Query 218 ATAGDTHLGGEDFDNRLVNHFIQEFKR----KNKKDLSTNQRALRRLRTACERAKRTLSS 273 A D LGG +FD RL +++FK K K D++ + RA+ +L R K LS+ Sbjct 219364 AMGYDRTLGGHEFDTRL**LLVRKFKEGVGAKLKDDITKDHRAMSKLLREANRVKHILSA 219185 Query 274 SAQTSVEIDSLFEGIDFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVL 333 + ++ +F IDF T++ RA FE C DL + +P+++V++ + L + VL Sbjct 219184 NTDAQASVEGVFHDIDFKTTVRRAEFENSCQDLLKRIGNPIDQVIQSSNLTLKDI*SFVL 219005 Query 334 VGGSTRIPKV 343 VGG R+P V Sbjct 219004 VGGG*RVPAV 218975 > scaffold-693 Length=1268102 Score = 281 bits (720), Expect = 2e-80, Method: Compositional matrix adjust. Identities = 188/402 (47%), Positives = 257/402 (64%), Gaps = 18/402 (4%) Frame = +1 Query 216 VKATAGDTHLGGEDFDNRLVNHFIQEFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSA 275 VKAT GDT LGGEDFDNRLV + + EFK++ D+S + AL+R+R + E+AK LSS Sbjct 1114528 VKATNGDT*LGGEDFDNRLVQYIVNEFKKEQGVDVSKDMMALQRIRESAEKAKIELSSVP 1114707 Query 276 QTSVEIDSLFEGI----DFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEI 331 QT + + + +TR++FE LCADL T++P +K L DA + S +DE+ Sbjct 1114708 QTEINLPYITADATGPKHINLRLTRSKFESLCADLMNRTIEPCKKALSDANVKPSDLDEV 1114887 Query 332 VLVGGSTRIPKVQKLVTDYFNGKEPNRSINPDEavaygaavqaaILTGDESSKTQDllll 391 +LVGG TR+P+VQ +V D F +EP++++NPDEAVA GAA+Q +L+G+ +S +LLL Sbjct 1114888 ILVGGMTRVPRVQDIVKDLFK-REPSKAVNPDEAVAVGAAIQGGVLSGEVNS----VLLL 1115052 Query 392 dvaplslGIETAGGVMTKLIPRNSTIPTKKSEIFSTYADNQPGVLIQVFEGERAKTKDNN 451 DV PLSLGIET GGV T+LI RN+TIPTKKS +FST AD Q V I+VF GER KDN Sbjct 1115053 DVTPLSLGIETLGGVFTRLIHRNTTIPTKKS*VFSTAADGQTQVEIRVF*GERELCKDNK 1115232 Query 452 LLGKFELSGIPPAPRGVPQIEVTFDVDSNGILNVSAVEKGTGKSNKITITNDKGRLSKED 511 LLG F L+GIPP PRGVPQIEVTFD+D++GI+NVSA +K T ITIT G L + Sbjct 1115233 LLGNF*LTGIPPLPRGVPQIEVTFDIDADGIVNVSAKDKATNVDQSITITASSG-LKDTE 1115409 Query 512 IEKMVaeaekfkeedekeSQRIASKNQLESIAYSLKNTISEAGDKLEQADKDTVTKKAEE 571 IEKM+ EAE+F D+ I + N + + Y + ++E K+ + + + +K +E Sbjct 1115410 IEKMIQEAEQFSSADKDRKDVIEATNMADGLIYETEKNLNEHKSKIGEDEMSKI*EKIQE 1115589 Query 572 TISWL------DSNTTASKEEFDDKLKELQDIANPIMSKLYQ 607 S L D N A E+ K +EL+D + +Y+ Sbjct 1115590 LRSALTQAQEGDGNVKA--EDLRTKTQELKDAGMKMFEIVYK 1115709 Score = 232 bits (591), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 117/215 (54%), Positives = 153/215 (71%), Gaps = 5/215 (2%) Frame = +2 Query 5 VGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFT-DTERLIGDAAKNQAAMNPSN 63 +GIDLGTT SC A +I N +G RTTPS+VAF+ D E L+G AK QA +NP N Sbjct 1113608 IGIDLGTTNSCNALMDGKASRVIENAEGTRTTPSYVAFSKDGELLVGVPAKRQAVINP*N 1113787 Query 64 TVFDAKRLIGRNFNDPEVQADMKHFPFKLI-DVDGKPQIQVEFKGETKNFTPEQISSMVL 122 T F KRLIGR F+DPE+ K P+K++ +G ++ K ++P QI VL Sbjct 1113788 TFFATKRLIGRKFDDPEI*VMKKTVPYKIVKHSNGDAWVE---DTNGKRYSPSQIGGYVL 1113958 Query 123 GKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIAYG 182 GK+K AES+LG V +AVVTVPAYFND+QRQATKDAG IAGL+VLR+INEPTAAA+AYG Sbjct 1113959 GKLKSDAESFLGTPVKNAVVTVPAYFNDAQRQATKDAGKIAGLDVLRVINEPTAAALAYG 1114138 Query 183 LDKKGKEEHVLIFDLGGGTFDVSLLSIEDGIFEVK 217 +DK ++ + ++DLGGGTFD+S+L ++ G+FEV+ Sbjct 1114139 MDKADMDKTIAVYDLGGGTFDISILELQSGVFEVR 1114243 Score = 125 bits (313), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 92/331 (28%), Positives = 161/331 (49%), Gaps = 27/331 (8%) Frame = -1 Query 61 PSNTVFDAKRLIGRNFNDPEVQADMKHFPFKLIDVDGKPQIQVEFKGETKN------FTP 114 P K L+GR +ND VQ + +L++ + + K + F+ Sbjct 1168679 PKEAYGKLKNLVGRLYNDSLVQ*FREGCDNELVEDSLRHTCAFKHKSGASDG*SE*EFSV 1168500 Query 115 EQISSMVLGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEP 174 E++ +M L ++ E + V DAV+TVP ++ +R A DA +AG+ VL ++N+ Sbjct 1168499 EELIAMQLAYTRDQVEQFAEEIVRDAVLTVPPFWTQHER*ALIDAAELAGIRVLSLMNDE 1168320 Query 175 TAAAIAYGLDKKGKEE--HVLIFDLGGGTFDVSLL---SIEDGIF------------EVK 217 TA A+ Y ++ ++ + + +D G G+ SL+ SI++ I +V Sbjct 1168319 TAVALNYASTRQFDDKP*YHIFYDFGAGSTVASLIEFRSIQEKILPKSRMTKNVTSIQVL 1168140 Query 218 ATAGDTHLGGEDFDNRLVNHFIQEFKR----KNKKDLSTNQRALRRLRTACERAKRTLSS 273 A D LGG +FD RL +++FK K K D++ + RA+ +L R K LS+ Sbjct 1168139 AMGYDRTLGGHEFDARL*QLLVRKFKEGVGAKLKDDITKDHRAMSKLLREANRVKHILSA 1167960 Query 274 SAQTSVEIDSLFEGIDFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVL 333 + ++ +F IDF T++ RA FE C DL + +P+++V++ + L + VL Sbjct 1167959 NTDAQASVEGVFHDIDFKTTVRRAEFENSCQDLLKRIGNPIDQVIQSSNLTLKDI*SFVL 1167780 Query 334 VGGSTRIPKVQKLVTDYFNGKEPNRSINPDE 364 VGG R+P V + + + + +N DE Sbjct 1167779 VGGG*RVPAV*QGL*RLVGADKIAKHVNADE 1167687 > unplaced-804 Length=18753 Score = 264 bits (674), Expect = 2e-74, Method: Compositional matrix adjust. Identities = 138/193 (72%), Positives = 167/193 (87%), Gaps = 2/193 (1%) Frame = -1 Query 417 IPTKKSEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFD 476 IPTKKSEIFSTYADNQPGVLIQVFEGERA+T DN+ LGKFEL+GIPPAPRGVPQIEVTFD Sbjct 17964 IPTKKSEIFSTYADNQPGVLIQVFEGERARTADNHQLGKFELTGIPPAPRGVPQIEVTFD 17785 Query 477 VDSNGILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASK 536 +D+NGILNVSA +K TG+SNKITITNDKGRLS+EDIE+MV+EAEK+K++DE+ + R +K Sbjct 17784 IDANGILNVSASDKTTGRSNKITITNDKGRLSQEDIERMVSEAEKYKKQDEEATARFHAK 17605 Query 537 NQLESIAYSLKNTISEAG--DKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKEL 594 N LES AY+L+NT+++ K+++ADK+T+ K ETISWLD N ASKEEF+ K KEL Sbjct 17604 NGLESYAYNLRNTLNDDNLKGKIDEADKETLEKAITETISWLDQNLEASKEEFESKQKEL 17425 Query 595 QDIANPIMSKLYQ 607 + ANPIM+KLYQ Sbjct 17424 EGTANPIMTKLYQ 17386 > scaffold-499 Length=32750 Score = 262 bits (670), Expect = 6e-74, Method: Compositional matrix adjust. Identities = 138/193 (72%), Positives = 168/193 (87%), Gaps = 2/193 (1%) Frame = +1 Query 417 IPTKKSEIFSTYADNQPGVLIQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFD 476 IPTKKSEIFST+ADNQPGVLIQVFEGERA+T DN+ LGKFEL+GIPPAPRGVPQIEVTFD Sbjct 3580 IPTKKSEIFSTHADNQPGVLIQVFEGERARTADNHQLGKFELTGIPPAPRGVPQIEVTFD 3759 Query 477 VDSNGILNVSAVEKGTGKSNKITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASK 536 +D+NGILNVSA +K TG+SNKITITNDKGRLS+EDIE+MV+EAEK+K++DE+ + RI +K Sbjct 3760 IDANGILNVSASDKTTGRSNKITITNDKGRLSQEDIERMVSEAEKYKKQDEEATARIHAK 3939 Query 537 NQLESIAYSLKNTISEAG--DKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKEL 594 N LES AY+L+NT+++ K+++ADK+T+ K ETISWLD N ASKEEF+ K KEL Sbjct 3940 NGLESYAYNLRNTLNDDNLKGKIDEADKETLEKAITETISWLD*NLEASKEEFESKQKEL 4119 Query 595 QDIANPIMSKLYQ 607 + ANPIM+KLYQ Sbjct 4120 EGTANPIMTKLYQ 4158 > unplaced-959 Length=36552 Score = 231 bits (589), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 146/391 (37%), Positives = 223/391 (57%), Gaps = 17/391 (4%) Frame = +1 Query 4 AVGIDLGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTDTERLIGDAAKNQAAMNPSN 63 VG D+G S +A N VD+I N+ NR TPS V+F +R IG++AK+ N N Sbjct 9193 VVGFDIGNMNSIIAVARNRGVDVICNEVSNRATPSLVSFGPKQRHIGESAKSMELGNFKN 9372 Query 64 TVFDAKRLIGRNFNDPEVQ-ADMKHFPFKLIDVDGKPQI--QVEFKGETKNFTPEQISSM 120 TV KRLIGR F++ +V + K+ +L+ ++G +I +V+F E + F+ Q+ +M Sbjct 9373 TVGSLKRLIGRKFSEKDVTDIEAKYLNCELVPINGGNEIGAKVQF*NEERVFSYTQLMAM 9552 Query 121 VLGKMKETAESYLGAKVNDAVVTVPAYFNDSQRQATKDAGTIAGLNVLRIINEPTAAAIA 180 + K+K E L + V D V++VP YF D+QR+A DA IAG+ LR+I + TA+A+ Sbjct 9553 YINKLK*ITE*ELKSVVTDCVISVPLYFTDAQRRALIDAADIAGVKALRLIPDVTASALQ 9732 Query 181 YGLDKK---------GKE----EHVLIFDLGGGTFDVSLLSIEDGIFEVKATAGDTHLGG 227 +G+ K GK ++V D+G VS++ + G VK A D HLGG Sbjct 9733 WGITKTDLPDPEAADGKSGPPIKYVAFVDIG*SDTSVSIVGYQKGKMMVKGVAYDRHLGG 9912 Query 228 EDFDNRLVNHFIQEFKRKNKKDLSTNQRALRRLRTACERAKRTLSSSAQTSVEIDSLFEG 287 +FD LV++F+++ K K D+ +N +AL RLR+ CE+ K+ LS++ Q + ++ L Sbjct 9913 RNFDEVLVDYFVKDI*TKYKMDVRSNPKALFRLRSTCEKTKKILSANLQAPLSVECLMND 10092 Query 288 IDFYTSITRARFEELCADLFRSTLDPVEKVLRDAKLDKSQVDEIVLVGGSTRIPKVQKLV 347 D + I R +FE L L L P+ + L A + K +D + +VGGSTRIP V+ V Sbjct 10093 KDVSSMIERPQFEGLIQPLIERVLVPM*RALDIAGVSKEDIDVVEIVGGSTRIPAVKTAV 10272 Query 348 TDYFNGKEPNRSINPDEavaygaavqaaILT 378 + +F GKE + ++N DE VA G AIL+ Sbjct 10273 S*FF-GKELSTTLN*DECVAKGCTFMCAILS 10362 > scaffold-469 Length=313 Score = 150 bits (380), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 80/104 (77%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = +2 Query 437 IQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSNGILNVSAVEKGTGKSN 496 IQVFEGERA+T DN+ LGKFEL+GIPPAPRGVPQIEVTFD+D+NGILNVSA +K TG+SN Sbjct 2 IQVFEGERARTADNHQLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSASDKTTGRSN 181 Query 497 KITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASKNQLE 540 KITITNDKGRLS+EDIE+MV+EAEK+K++DE+ + RI +KN LE Sbjct 182 KITITNDKGRLSQEDIERMVSEAEKYKKQDEEATARIHAKNGLE 313 > scaffold-418 Length=313 Score = 150 bits (380), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 80/104 (77%), Positives = 96/104 (92%), Gaps = 0/104 (0%) Frame = -2 Query 437 IQVFEGERAKTKDNNLLGKFELSGIPPAPRGVPQIEVTFDVDSNGILNVSAVEKGTGKSN 496 IQVFEGERA+T DN+ LGKFEL+GIPPAPRGVPQIEVTFD+D+NGILNVSA +K TG+SN Sbjct 312 IQVFEGERARTADNHQLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSASDKTTGRSN 133 Query 497 KITITNDKGRLSKEDIEKMVaeaekfkeedekeSQRIASKNQLE 540 KITITNDKGRLS+EDIE+MV+EAEK+K++DE+ + RI +KN LE Sbjct 132 KITITNDKGRLSQEDIERMVSEAEKYKKQDEEATARIHAKNGLE 1 > unplaced-113 Length=266 Score = 122 bits (305), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 60/87 (69%), Positives = 68/87 (78%), Gaps = 1/87 (1%) Frame = +1 Query 9 LGTTYSCVAHFANDRVDIIANDQGNRTTPSFVAFTD-TERLIGDAAKNQAAMNPSNTVFD 67 LGTTYSCVA F +V+IIAND G+R TPS+VAFTD ERLIG+AAKNQA NP NTVFD Sbjct 1 LGTTYSCVAVFQAGKVEIIANDLGSRITPSWVAFTDDGERLIGEAAKNQAPQNPKNTVFD 180 Query 68 AKRLIGRNFNDPEVQADMKHFPFKLID 94 AKRLIGR +ND EVQ + K PF + D Sbjct 181 AKRLIGRKYNDKEVQMEKKSLPFDVTD 261 > scaffold-138 Length=253 Score = 78.6 bits (192), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 41/71 (58%), Positives = 54/71 (76%), Gaps = 2/71 (3%) Frame = -2 Query 539 LESIAYSLKNTISEAG--DKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKELQD 596 LES AY+L+NT+++ K+++ADK+T+ K ETISWLD N ASKEEF+ K KEL+ Sbjct 249 LESYAYNLRNTLNDDNLKGKIDEADKETLEKAITETISWLD*NLEASKEEFESKQKELEG 70 Query 597 IANPIMSKLYQ 607 ANPIM+KLYQ Sbjct 69 TANPIMTKLYQ 37 > scaffold-61 Length=253 Score = 78.6 bits (192), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 41/71 (58%), Positives = 54/71 (76%), Gaps = 2/71 (3%) Frame = +2 Query 539 LESIAYSLKNTISEAG--DKLEQADKDTVTKKAEETISWLDSNTTASKEEFDDKLKELQD 596 LES AY+L+NT+++ K+++ADK+T+ K ETISWLD N ASKEEF+ K KEL+ Sbjct 5 LESYAYNLRNTLNDDNLKGKIDEADKETLEKAITETISWLD*NLEASKEEFESKQKELEG 184 Query 597 IANPIMSKLYQ 607 ANPIM+KLYQ Sbjct 185 TANPIMTKLYQ 217 > unplaced-721 Length=275 Score = 43.9 bits (102), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 20/29 (69%), Positives = 23/29 (79%), Gaps = 0/29 (0%) Frame = -1 Query 579 NTTASKEEFDDKLKELQDIANPIMSKLYQ 607 N ASKEEF+ K KEL+ ANPIM+KLYQ Sbjct 272 NLEASKEEFESKQKELEGTANPIMTKLYQ 186 Lambda K H a alpha 0.313 0.131 0.358 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 4192692642 Database: X5.fasta Posted date: Oct 27, 2018 7:16 PM Number of letters in database: 23,962,143 Number of sequences in database: 1,868 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 13 Window for multiple hits: 40