На главную

Выравнивание последовательностей

Практикум 11: Построение парных выравниваний. Поиск по сходству

Задание 1. Поиск гомологов белка в базе данных NCBI

В задании требовалось собрать выборку белков, гомологичных Metallo-beta-lactamase type 2 [Klebsiella pneumoniae].

Для нахождения белков в базе данных Refseq использовалась программа protein BLAST на сайте NCBI. Мы ограничили поиск таксоном Gammaproteobacteria (taxid:1236), чтобы уменьшить количество находок. Настройки поиска можно посмотреть здесь.

Cравнение белков
Параметр/Находка Лучшая Средняя Худшая
Sequence ID WP_063860854.1 WP_032492622.1 WP_025733014.1
Название белка subclass B1 metallo-beta-lactamase NDM-11 subclass B1 metallo-beta-lactamase IMP-34 adenosylcobinamide kinase/adenosylcobinamide phosphate guanyltransferase
Организм Escherichia coli Klebsiella oxytoca Carnimonas nigrificans
Bit Score 554 133 35.8
% идентичных остатков 99% 34% 34%
% сходных остатков 100% 52% 50%
E-value 0.0 2e-35 4.8
Выравнивание из BLAST WP_063860854.1_with_our.html WP_032492622.1_with_our.html WP_025733014.1_with_our.html

Всего было обнаружено 365 находок, из них 179 оказались гомологичны нашему белку по всей длине (query cover > 80%).

Если руководствоваться критерием (query cover > 70%) & (E-value < 1e-3), обнаруживается 233 гомолога.

Задание 2. Множественное выравнивание выборки

Нами было отобрано 25 не слишком сходных последовательностей (см. таблицу). Эти последовательности были скачаны в формате fasta, к ним добавлена последовательность белка Metallo-beta-lactamase type 2 (см. файл). С помощью программы MUSCLE было построено множественное выравнивание этих последовательностей (см. файл).

 


 
gi|505278934|ref|WP_015466036.1|/1-258gi|1028085750|ref|WP_063844283.1|/1-254gi|1028111000|ref|WP_063865212.1|/1-254gi|1033023311|ref|WP_064339490.1|/1-253gi|1033019354|ref|WP_064335534.1|/1-253gi|953970376|ref|WP_058057118.1|/1-253gi|780229776|ref|WP_045529203.1|/1-253gi|516398145|ref|WP_017787543.1|/1-253gi|643468725|ref|WP_025213553.1|/1-311gi|966389408|ref|WP_058436172.1|/1-313sp|C7C422|BLAN1_KLEPN/1-270gi|1028105829|ref|WP_063860854.1|/1-270gi|1028105830|ref|WP_063860855.1|/1-270gi|505401651|ref|WP_015588753.1|/1-250gi|1011488691|ref|WP_062383369.1|/1-232gi|1044690745|ref|WP_065419570.1|/1-246gi|1028105463|ref|WP_063860584.1|/1-246gi|695269805|ref|WP_032491874.1|/1-246gi|1028105449|ref|WP_063860580.1|/1-245gi|1028110913|ref|WP_063865166.1|/1-266gi|765480818|ref|WP_044727179.1|/1-345gi|695270403|ref|WP_032492472.1|/1-266gi|1028110964|ref|WP_063865177.1|/1-266gi|1028110972|ref|WP_063865184.1|/1-266gi|917434988|ref|WP_052041700.1|/1-256gi|496209978|ref|WP_008927523.1|/1-254blocksConservationQualityConsensus
102030405060708090100110120130140150160170180190200210220230240250260270280290300310320330340350360370380390400410420430440450460470480490----------------------------------------------------------------------------------------------------------------------MLQ-VITIKAFNDNYIWL---------------------IKDSQSQRCILVD----PGDAQPVLEILEQQ-NLSVDAILVTHAHQDHIGGISELLAHFNKEIPIYSKDKLFSSS-------EPVEEGKTLCF------------FEQR--LSLKVMFVPG----------------HTLDHVAYYNDSS--LFCGDTLFSAG---CGRVMEGTHQQMFTSLARISQLKDTTKVYCAHEYTQQNLIFALHLEP-KNR------------------------------------------------ALRAHMQKVAKLRQQGLASVPTTLALEKTLNPFLRCSDTGLKNALQNKLCDEIHDALHCFTKLRQYKDVFIC-------------------------------------------------------------------------------MMKGWIKYGLAGALVLV---------ASFWGG-----SVHAA-AISLTQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSNKPVLEVINTNYHTDRAGGNAYWKSIGAKVVSTRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHDGDFTLQ--EGKVRAFYLG--------------PAHTPDGIFVYFPDQQVLYGNCILKET----LGNLSFADVKAYPQTLERLRA-----------MKLPIKIVVGGHDSPLHGP------------------------------------------------ELIDHYEELIKASPHS---------------------------------------------------------------------------------------------------------------------------------------MMKGWMKCGLAGAVVLM---------ASFWGG-----SVRAA-GISLKQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTAVGATWTPDTARELHKLIKRVSSKPVLEVINTNYHTDRAGGNAYWKSIGAKVVATRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHDGDFKLQ--DGKVRAFYAG--------------PAHTPDGIFVYFPDEQVLYGNCILKEK----LGNLSFANVKEYPQTIERLKA-----------MKLPIKTVIGGHDSPLHGP------------------------------------------------ELIDHYEALIKAAAHS----------------------------------------------------------------------------------------------------------------------------------------MKGWMKCGLAGAVVLM---------ASFWGG-----SVRAA-GISLKQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSSKPVLEVINTNYHTDRAGGNAYWKSIGAKVVATRQTRDLMKSDWAEIVTFTRKGLPEYPDLPLVLPNVVHDGDFTLQ--EGKVRAFYAG--------------PAHTPDGIFVYFPDEQVLYGNCILKEK----LGNLSFANVKEYPQTIERLKA-----------MKLPIKTVIGGHDSPLHGP------------------------------------------------ELIDHYEELIKAAPQS----------------------------------------------------------------------------------------------------------------------------------------MKGWMKCTLAGAVVLM---------ASFWGG-----SVRAA-GIELKQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSSKPVLEVINTNYHTDRAGGNAYWKSIGAKVVATRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHDGDFTLQ--EGKVRAFYAG--------------PAHTPDGIFVYFPDEQVLYGNCILKEK----LGNLSFANVKAYPQTIERLKA-----------MKLPIKTVIGGHDSPLHGP------------------------------------------------ELIDHYEELIKTAPQS----------------------------------------------------------------------------------------------------------------------------------------MKGWMKCGLAGAVVLM---------ASFWGG-----SVRAA-GIELKQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSSKPVLEVINTNYHTDRAGGNAYWKSIGAKVVATRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHDGDFTLQ--EGKVRAFYAG--------------PAHTPDGIFVYFPDEQVLYGNCILKEK----LGNLSFANVKAYPQTIERLKA-----------MKLPIKTVIGGHDSPLHGP------------------------------------------------ELIDHYEELIKAAPQS----------------------------------------------------------------------------------------------------------------------------------------MKGWIKCGLAGAVVLM---------ASFWGG-----SVRAA-GMSLTQVNGP-VYV----------VEDNYYVQENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSRQPVLEVINTNYHTDRAGGNAYWKSIGAKVVSTRQTRDLMKSDWVEIVAFTRKGLPDYPDLPLVLPNVVHQGDFTLQ--EGKLRAFYAG--------------PAHTPDGIFVYFPDQQVLYGNCILKEK----LGNLSFADVKAYPQTLERLKA-----------MKLPIKTVVGGHDSPLHGP------------------------------------------------ELIDHYEALIKAAPQS----------------------------------------------------------------------------------------------------------------------------------------MKGWIKCGLAGALVLM---------ASFWGG-----SVRAA-GISLTQVSGP-VYV----------VEDNYYVKENSMVYFGAKGVTVVGATWTPDTARELHKLIKRVSRQPVLEVINTNYHTDRAGGNAYWKSIGAKVVSTRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHEGDFTLQ--EGKLRAFFAG--------------PAHTPDGIFVYFPDQQVLYGNCILKEK----LGNLSFANVKAYPQTIERLKA-----------MKLPIKTVVGGHDSPLHGP------------------------------------------------ELIDHYEALIKAVPQS----------------------------------------------------------------------------------------------------------------------------------------MRWLLLI----FLSLC---------GPAWAN-------TDY-ALKPRQIAEG-TWLLEGSTENFAKANG-GNIVNTAFIVTDAG-VVVIDTGPSKRYGEALRQAIAATTDKPVVQVLLTHHHPDHVLGNQAFSAVPIGALAG--TTDLLRQQ-------GDAMAENM--Y----RLVGDWMRGTEV--VLPTQVLEPGVLKVGNHSLRLLQLAGHTGADLAILDETTGVLFAGDLVFYERA--LTTPNSPGLDVWLNDLDTLQA-------------LPWKQIVPGHGPV-ATDAKPFAQMRDYLGWLDQLMRDGAVRGDDMAEMIRSPIPERFAGISLSRYELIRSVSHLYPRYERAGMKRVDAPAQ------------------------------------------------------------------------------------------------------------------------------MRWILLVCLSLSLSLS---------LPARAE-------LDY-RLEPRQIAED-TWLLEGGTENFSRDNG-GNIVNTGFIVTEAG-VLVIDTGPSKRYGQALREAIARVTGKPVIQVLLTHHHPDHVLGNQAFSDVPIGALAG--TTELLRQQ-------GDAMAENL--Y----RLVGDWMRGTEV--VLPNRTLEPGVLEIGGHRLRLLALAGHTGADLAIFDERTGVLFAGDLVFYQRA--LTTPNSPGLEVWLKDLDRLQA-------------LPWRLVVPGHGPV-ASSAVPFAQMRDYLGWLHQLLSDGAARGDDMAELIRAPIPARFAAISLTRYELIRSVSHLYPRHERERMQRVDTP--------------------------------------------------------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR------------------------------------------------------------------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GVVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR------------------------------------------------------------------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GLVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLDDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR------------------------------------------------------------------------------------------------------------------------------------------MKNVLVFLILLVALPA----------LAQGHK----------PLEVIKIEDG-VYL----HTSFKNIEGYGLVDSNGLVVLDNNQAYIIDTPWSEEDTKLLLSWATDR-GYQVMASISTHSHEDRTAGIKLLNSKSIPTYTSELTKKLLARE-------GKPVPTHY--F--------KDDEFTLG--NGLIELYYPG--------------AGHTEDNIVAWLPKSKILFGGCLVRSHEWEGLGYVGDASISSWADSIKNIVS-----------KKYPIQMVVPGHGKV-GSS------------------------------------------------DILDHTIDLAESASNKLMQPTAEASAD--------------------------------------------------------------------------------------------------------------------------MSHAQVWA-------------------------------SEELP-PLKIQQLTDS-VYL----HISHKVVDGFGLVDSNGLVVLIGSEAYIVDTPWSTQDTETLLQWINAQ-GFTLKSVVSTHFHEDRTAGIEYLNANAIPTYASARTNKILQRQ-------GRPLAANT--F--------NKDKFSLV--KAHIEVFYPG--------------AGHAQDNVVVWLPEQKLLFGGCLIRANAATSLGNTSDAVLSAWSASVEELQS-----------RYADAKLVVPGHGDV-GDV------------------------------------------------SLLEHTRVLATVGQAVSK--------------------------------------------------------------------------------------------------------------------------------------MSKLSVFFIFLFCSIA-----------TAAE-------SLP-DLKIEKLDEG-VYV----HTSFEEVNGWGVVPKHGLVVLVNAEAYLIDTPFTAKDTEKLVTWFVER-GYKIKGSISSHFHSDSTGGIEWLISRSIPTYASELTNELLKKD-------GKVQATNS--F----SGVNYWL----V--KNKIEVFYPG--------------PGHTPDNVVVWLPERKILFGGCFIKPYG---LGNLGDANIEAWPKSAKLLKS-----------KYGKAKLVVPSHSEV-GDA------------------------------------------------SLLKLTLEQAVKGLNESKKPSKPSN-------------------------------------------------------------------------------------------------------------------------------MSKLSVFFIFLFCSIA-----------TAAE-------SLP-DLKIEKLDEG-VYV----HTSFEEVNGWGVVPKHGLVVLVNAEAYLIDTPFTAKDTEKLVTWFVER-GYKIKGSISSHFHSDSTGGIEWLNSRSIPTYASELTNELLKKD-------GKVQATNS--F----SGVNYWL----V--KNKIEVFYPG--------------PGHTPDNVVVWLPERKILFGGCFIKPYG---LGNLSDANIEAWPKSAKLLKS-----------KYGKAKLVVPGHSEV-GDA------------------------------------------------SLLKLTLEQAVKGLNESKKPSKPSN-------------------------------------------------------------------------------------------------------------------------------MKKLFVLCVCFLCSIT-----------AAGA-------ALP-DLKIEKLEEG-VYV----HTSFEEVNGWGVVSKHGLVVLVNTDAYLIDTPFTATDTEKLVNWFVER-GYKIKGTISSHFHSDSTGGIEWLNSQSIPTYASELTNELLKKD-------GKVQAKNS--F----SGVSYWL----V--KNKIEVFYPG--------------PGHTQDNVVVWLPEKKILFGGCFVKPDG---LGNLGDANLEAWPKSAKILMS-----------KYVKAKLVVSSHSEI-GDA------------------------------------------------SLLKRTWEQAVKGLNESKKPSQPSN-------------------------------------------------------------------------------------------------------------------------------MKKLFVLCIFLFCSIT-----------AAGA-------SLP-DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWFVER-GYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNS--F----SGVSYWL----V--KNKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFVKPYG---LGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHSDI-GDA------------------------------------------------SLLKLTWEQAVKGLNESKKSNTVH-------------------------------------------------------------------------------------------------------------------------------MFKLLSKLLVYLTASIMAIASPLAFSVDSSGEYPTVSEIPVG-EVRLYQIADG-VWS----HIATQSFDG-AVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEVE-------GNEIPTHS--L----EGLSSSG-DAVR--FGPVELFYPG--------------AAHSTDNLVVYVPSASVLYGGCAIYELSRTSAGNVADADLAEWPTSIERIQQ-----------HYPEAQYVIPGHGLP-GGL------------------------------------------------DLLKHTTNVVKAHTNRSVVE----------------------------------------------------MRSRNWSRTLTERSGGSGAVLVFMACYDCFFVQSMPRASKQQARYAVGRCLMLWSSNDVTQQGSRPKTKLCRTHPHGVLMFKLLSKLLVYLTASIMAIASPLAFSVDSSGEYPTVSEIPVG-EVRLYQIADG-VWS----HIATQSFDG-AVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEVE-------GSEIPTHS--L----EGLSSSG-DAVR--FGPVELFYPG--------------AAHSTDNLVVYVPSASVLYGGCAIYELSRTSAGNVADADLAEWPTSIERIQQ-----------HYPEAQFVIPGHGLP-GGL------------------------------------------------DLLKHTTNVVKAHTNRSVVE-----------------------------------------------------------------------------------------------------------------------------------MLKVISSLLVYMTASVMAVASPLAHSGEPSGEYPTVNEIPVG-EVRLYQIADG-VWS----HIATQSFDG-AVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEAE-------GNEIPTHS--L----EGLSSSG-DAVR--FGPVELFYPG--------------AAHSTDNLVVYVPSAKVLYGGCAVHELSRTSAGNVADADLAEWPTSVERIQK-----------HYPEAEVVIPGHGLP-GGL------------------------------------------------DLLQHTANVVKAHKNRSVAE-----------------------------------------------------------------------------------------------------------------------------------MLKVISSLLVYMTASVMAVASPLAHSGEPSGEYPTVNEIPVG-EVRLYQIADG-VWS----HIATQSFDG-AVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAVSTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEAE-------GNEIPTHS--L----EGLSSSG-DAVR--FGPVELFYPG--------------AAHSTDNLVVYVPSANVLYGGCAVLELSSTSAGNVADADLAEWPTSVERIQK-----------HYPEAEVVIPGHGLP-GGL------------------------------------------------DLLQHTANVVKAHKNRSVAE-----------------------------------------------------------------------------------------------------------------------------------MLKVISSLLVYMTASVMAVASPLAHSGEPSGEYPTVNEIPVG-EVRLYQIADG-VWS----HIATQSFDG-AVYPSNGLIVRDGDELLLIDTAWGAKNTAALLAEIEKQIGLPVTRAISTHFHDDRVGGVDVLRAAGVATYASPSTRRLAEAE-------GNEIPTHS--L----EGLSSSG-DAVR--FGPVELFYPG--------------AAHSTDNLVVYVPSANVLYGGCAVHELSSTSAGNVADADLAEWPTSVERIQK-----------HYPEAEVVIPGHGLP-GGL------------------------------------------------DLLQHTANVVKAHKNRSVAE------------------------------------------------------------------------------------------------------------------------------------MRVTQLLLLLFFAPLFIAP------ACLAAETQDRNIIELSADLQVYPLTDD-VWV----HRSWADLAG-TRYPANGLLINNDDTLVLIDTAWGNDSTALLLEWVTVKFGKYPDVAILTHAHADRIGGLPALAEHHIPAYVHPMTRELAASQ----------YPEEQ--LPEVIPGLEKHA--SAR--LEFLEAFYPG--------------PAHSPDNIVVWYDSMKLLFGGCAVKSADAGDIHYVEGSSPEGWEAAIGNIQG-----------RYPEATTVIPGHGAV-GGA------------------------------------------------QLLDHTRKLARQGNQP-------------------------------------------------------------------------------------------------------------------------------------MRLFALVITLLIPLAVLAV----------EPVGK-----PIRLSNDLVVQPLTEH-VWL----HRSWTEYNG-QRVSSNGVIIDDGKTLTLIDTAWGEAPTATLLSWIEQHFGKDPDQAILTHSHDDRMGGSRLLAEKHIRAYVHPKTLSIVKAE------PGKYYDPAS-HF----PQALAFHDRRAR--LGDIELYYPG--------------AAHSPDNIVIWYGRDHLLVGGCAVKGRESTNIGYVAGSAPDHWAAAMDNLIQ-----------AFPQAAMVVPGHGQL-GDE------------------------------------------------QLLFHTRALATQALERR-------------------------------------------------------BBBBBBBBBBBBBBBBBBBBBB--------------------------------------------------------------------------------1000001000101000----------00030-------000-1825237431-6+5----------001-00100127531353075795110123273193326222-341622595953*3*457*5227242253365200534+6236-------00022224--7----0010000-0003--231435735*--------------13*725294553333027697649421----735543642318125732834-----------01334359755*316-251------------------------------------------------4752251165232100-------------------------------------------------------- MRSRNWSRTLTERSGGSGAVLVFMACYDCFFVQSMPRASKQQARYAVGRCLMLWSSNDVTQQGSRPKTKLCRTHMELPNMMKGWSKLLLALAASLMA+ASPLA+SASFWGEYPTVNS+RLG+DLKLYQLADGYV+VLEG+HTSFQ+VEGNGVVPSNGLVVRDGKEVLLIDTAWTAD+TA+LLKWIKRVIG+PVLE+ISTHFHTDR+GGNDYLKSAGI+TYAS+LTRDLAKSDWAEIVAFGRVGA+HSPDLPLVLPGV+HWGDFTLRPNFGKVEVFYPGVL++G+H+LRLL+LPAHTPDNIVVYFP++KVLFGGC+VKEK+ATSLGNLSDADLEAWPQSIERLQALKDTTKVYCAHMYPPAK+VVPGHGSPLGGPA+PFAQMRDYLGWL+QL++DGA+RGDDMAE+IR+PIP+RFA+ISL+RYELLDHTEELAKAAPNSS+KES+PSN++KTLNPFLRCSDTGLKNALQNKLCDEIHDALHCFTKLRQYKDVFIC

В выравнивании присутствуют достаточно длинные невыровненные участки (длины 80 на N-конце и длины 50 на C-конце).

Задание 3. Глобальное и локальное парные выравнивания

Были построены четыре парных выравнивания последовательностей WP_063860580.1 и Metallo-beta-lactamase type 2:

Глобальное, полученное из множественного путем удаления лишних последовательностей (см. fasta-файл)

 
 
sp|C7C422|BLAN1_KLEPN/1-270gi|1028105449|ref|WP_063860580.1|/1-245ConservationQualityConsensus
102030405060708090100110120130140150160170180190200210220230240250260270280290300310320330340350360370380390400410420430440--------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR------------------------------------------------------------------------------------------MKKLFVLCIFLFCSIT-----------AAGA-------SLP-DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWFVER-GYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNS--F----SGVSYWL----V--KNKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFVKPYG---LGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHSDI-GDA------------------------------------------------SLLKLTWEQAVKGLNESKKSNTVH--------------------------------------------------------------------------------*94975*767*798+7-----------77*6-------687-**5887*457-*+6----***95988*9*88855**9*557778799**79*567*47+9**8588-7749578969*7*8*57**996*5868*7***98*9*8*7479-------*7*6*75*--*-----+896*9----8--674+7*****--------------****8**97*6+65+6*7****89*665---****5**87*4+74**85878-----------5976*599*7***56-574------------------------------------------------87+65*688*5*75---------- --------------------------------------------------------------------------MELPNIM+++++L+++L+++++LSGCMPGEIRP++G+QMETGDQ+++-DL++++L+++-V++----HTS+++++G+G+++++GL+V+++++++++DT++T+++T++++NW++++I++++++++++H+H+D++GG+++L++++I+TYA++L+N+L++++-------G+V+A++SLTF----S++++W+EPAT+PN+++++VFYPG--------------PGHT+DN++V++++++I+FGGC++K+++AKSLGNL+DA++E++++SA+++++-----------++++A+++V+SHS++-+++------------------------------------------------+++++T+++A+K++NESKKSNTVH

Глобальное, полученное в программе NEEDLE (см. fasta-файл)  


 
sp|C7C422|BLAN1_KLEPN/1-270gi|1028105449|ref|WP_063860580.1|/1-245ConservationQualityConsensus
102030405060708090100110120130140150160170180190200210220230240250260270MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFGDLVFRQLAPNVWQHTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQEGMVAAQHSLTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAFPKASMIVMSHSAPDSRAAITHTARMADKLR--------------------MKKLFVLCIFLFCSITAAGASLP--------------DLKIEKLEEGVYVHTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKDGKVQAKNSFS-GVSYWLVK------NKIEVFYPGPGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLISKYGNAKLVVPSHSDIGDASLLKLTWEQAVKGLNESKKSNTVH----------8*6568*977*98*7887*455*--------------**5887*457*+6***95988*9*88855**9*557778799**79*567*47+9**8-6*57749578969*7*8*57**996*5868*7***98*9*8*7479*7*6*75*89-+896*944------74+7*********8**97*6+65+6*7****89---*778****5**87*4+74**858785976*599*7***5657487+65*688*5*75---------- MELPNIMHPV+K++++L+++L++S++++G+++PTIGQQMETGDQRFGDL++++L+++V++HTS+++++G+G+++++GL+V+++++++++DT++T+++T++++NW+K+E+++++++++++H+H+D++GG+++L++++I+TYA++L+N+L++++G+V+A++S++F++++W+++ATAPNF++++VFYPGPGHT+DN++V++++++I+FGGC++KDSK+++LGNL+DA++E++++SA+++++++++A+++V+SHS++++++++++T+++A+K++NESKKSNTVH

Локальное, полученное в программе WATER (см. fasta-файл)

 
 
sp|C7C422|BLAN1_KLEPN/1-204gi|1028105449|ref|WP_063860580.1|/1-193ConservationQualityConsensus
102030405060708090100110120130140150160170180190200DLVFRQLAPNVWQHTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQEGMVAAQHSLTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAFPKASMIVMSHSDLKIEKLEEGVYVHTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKDGKVQAKNSFS-GVSYWLVK------NKIEVFYPGPGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLISKYGNAKLVVPSHS**5887*457*+6***95988*9*88855**9*557778799**79*567*47+9**8-6*57749578969*7*8*57**996*5868*7***98*9*8*7479*7*6*75*89-+896*944------74+7*********8**97*6+65+6*7****89---*778****5**87*4+74**858785976*599*7*** DL++++L+++V++HTS+++++G+G+++++GL+V+++++++++DT++T+++T++++NW+K+E+++++++++++H+H+D++GG+++L++++I+TYA++L+N+L++++G+V+A++S++F++++W+++ATAPNF++++VFYPGPGHT+DN++V++++++I+FGGC++KDSK+++LGNL+DA++E++++SA+++++++++A+++V+SHS

Локальное, полученное в результате работы BLAST (см. mfa-файл)

 


 
sp|C7C422|BLAN1_KLEPN/1-204gi|1028105449|ref|WP_063860580.1|/1-193ConservationQualityConsensus
102030405060708090100110120130140150160170180190200DLVFRQLAPNVWQHTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQEGMVAAQHSLTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAFPKASMIVMSHSDLKIEKLEEGVYVHTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKDGKVQAKNSFSGVSYWLVK-------NKIEVFYPGPGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLISKYGNAKLVVPSHS**5887*457*+6***95988*9*88855**9*557778799**79*567*47+9**8-6*57749578969*7*8*57**996*5868*7***98*9*8*7479*7*6*75*89788767*7-------74+7*********8**97*6+65+6*7****89---*778****5**87*4+74**858785976*599*7*** DL++++L+++V++HTS+++++G+G+++++GL+V+++++++++DT++T+++T++++NW+K+E+++++++++++H+H+D++GG+++L++++I+TYA++L+N+L++++G+V+A++S++++++++V+PATAPNF++++VFYPGPGHT+DN++V++++++I+FGGC++KDSK+++LGNL+DA++E++++SA+++++++++A+++V+SHS

Проект JalView

Задание 4. Выравнивание различных выравниваний друг относительно друга

На картинке несовпадающие участки выравнивания.

Участки, найденные программами BLAST и WATER, совпадают.

 


 
sp|C7C422|BLAN1_KLEPN/1-204gi|1028105449|ref|WP_063860580.1|/1-193sp|C7C422|BLAN1_KLEPN/1-204gi|1028105449|ref|WP_063860580.1|/1-193sp|C7C422|BLAN1_KLEPN/1-270gi|1028105449|ref|WP_063860580.1|/1-245sp|C7C422|BLAN1_KLEPN/1-270gi|1028105449|ref|WP_063860580.1|/1-245ConservationQualityConsensus
102030405060708090100110120130140150160170180190200210220230240250260270280290300310320330340350360370380390400410420430440--------------------------------------------------------------------------------------------------------------------------DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNSFSG-----VSYWLVK-------NKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHS--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNSFS------GVSYWLVK------NKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHS--------------------------------------------------------------------------------------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR------------------------------------------------------------------------------------------MKKLFVLCIFLFCSIT-----------AAGA-------SLP-DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWFVER-GYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNS--F----SGVSYWL----V--KNKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFVKPYG---LGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHSDI-GDA------------------------------------------------SLLKLTWEQAVKGLNESKKSNTVH--------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DLVFRQLAPN-VWQ----HTSYLDMPGFGAVASNGLIVRDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQE-------GMVAAQHSLTF-----AANGWVEPATAPNFGPLKVFYPG--------------PGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGA-----------AFPKASMIVMSHSAP-DSR------------------------------------------------AAITHTARMADKLR----------------------------------------------------------------------------------------------MKKLFVLCIFLFCSITAAGASLP---------------DLKIEKLEEG-VYV----HTSFEEVNGWGVASKHGLVVLVNTDAYLIDTPFTAKDTEKLVNWF-VERGYKIKGSISSHFHSDSTGGIEWLNSQSIPTYASVLTNELLKKD-------GKVQAKNSFS------GVSYWLVK------NKIEVFYPG--------------PGHTQDNVVVWLPKNKILFGGCFV---KPYGLGNLDDANVEAWPHSAEKLIS-----------KYGNAKLVVPSHSDI-GDA------------------------------------------------SLLKLTWEQAVKGLNESKKSNTVH--------------------------------------------------------------------------------------------------------------------------**5887*457-*+6----***95988*9*88855**9*557778799**79*567*47+9**8-4807749578969*7*8*57**996*5868*7***98*9*8*7479-------*7*6*75*12------8666790-------74+7*****--------------****8**97*6+65+6*7****89---5122****5**87*4+74**85878-----------5976*599*7***------------------------------------------------------------------------------ --------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFG-DL++++L+++-V++----HTS+++++G+G+++++GL+V+++++++++DT++T+++T++++NW+KQEI++++++++++H+H+D++GG+++L++++I+TYA++L+N+L++++-------G+V+A++SLTF----SAANGWVEPATAPNF++++VFYPG--------------PGHT+DN++V++++++I+FGGC++KDSKAKSLGNL+DA++E++++SA+++++-----------++++A+++V+SHS++-+++------------------------------------------------+++++T+++A+K++NESKKSNTVH

Проект JalView

Задание 5. Парные выравнивания поледовательностей негомологичных белков

Глобальное, полученное в программе NEEDLE (см. fasta-файл)

Локальное, полученное в программе WATER (см. fasta-файл)

Выравнивание друг отностительно друга

 


 
sp|C7C422|BLAN1_KLEPN/1-2704R1L:A|PDBID|CHAIN|SEQUENCE/1-436sp|C7C422|BLAN1_KLEPN/1-974R1L:A|PDBID|CHAIN|SEQUENCE/1-90ConservationQualityConsensus
102030405060708090100110120130140150160170180190200210220230240250260270280290300310320330340350360370380390400410420430440450460470480490500510520530540550560570580------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MELPNIMHPVAKLSTALAAALMLSGCMPGEI---RPTIGQQMETGDQRF---GDLVFRQLAPNVWQHTSY----LDMPG--FGAVASNGLIVRD---GGRVLVVDTAWTDDQT----------AQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQEGMVAAQHSLTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAFPKASMIVMSHSAPDSRAAITHTARMADKLRGMSTQYWEEEIEIMSREKLQELQLQRLKKTINIAANSPYYKEVFSKNGITGDSIQSLDDIRKIPFTTKSDMRANYPFGLVAGDMKRDGVRIHSSSGTTGNPTVIVHSQHDLDSWANLVARCLYMVGIRKTDVFQNSSGYGMFTGGLGFQYGAERLGCLTVPAAAGNSKRQIKFISDFKTTALHAIPSYAIRLAEVFQEEGIDPRETTLKTLVIGAEPHTDEQRRKIERMLNVKAYNSFGMTEMNGPGVAFECQEQNGMHFWEDCYLVEIIDPETGEPVPEGEIGELVLTTLDREMMPLIRYRTRDLTRILPGKCPCGRT---HLRIDRIKGRSDDMFIIKGVNIFPMQVEKI------------LVQ--FPELGSNYLITLETVNNQDEMIVEVELSDLSTDNYIELEKIRRDIIRQLKDEILVTPKVKLV------KKGSLP--QSEGKAVRVKDLRDNK--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MHPVAKLSTALAAALMLSGCMPGEI---RPTIGQQMETGDQRF---GDLVFRQLAPNVWQHTSY----LDMPG--FGAVASNGLIVRD---GGRVLVVDTAWTDDQT----------AQILN-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------LHAIPSYAIRLAEVFQEEGIDPRETTLKTLVIGAEPHTDEQR-----RKIERML--NVKAYNSFGMTEMNGPGVAFECQEQNGM---------------HFWEDCYL----------VEIID-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------9679557464474387345848486---564*5454275976-----3495587--69------------776--*34648*69---------------3376*466----------37*+6----------------------------------------------------------------------------------------------------------------------------------------------------------------------- GMSTQYWEEEIEIMSREKLQELQLQRLKKTINIAANSPYYKEVFSKNGITGDSIQSLDDIRKIPFTTKSDMRANYPFGLVAGDMKRDGVRIHSSSGTTGNPTVIVHSQHDLDSWANLVARCLYMVGIRKTDVFQNSSGYGMFTGGLGFQYGAERLGCLTVPAAAGNSKRQIKFISDFKTTALHAIPSYAIRLAEVFQEEGIDPRETTLKTLVIGAEPHTDEQRRKIERMLNVKAYNSFGMTEMNGPGVAFECQEQNGMHFWEDCYLVEIIDPETGEPVPEGEIGELVL++L+++MHPVAK+ST+LAAALMLSGCMPGE+TLKR+TIGQQMETGDQRFIIKGDL+FR+LAPNVWQHTSYGMTELDMPGVAFGAVASNGLIVRDTVNGGRVLVVDTAWTDDQTDNYIELEKIRAQI+N++K+EI++++++++VTHAHQDK+G+++AL+++G+A+++++L++++APQEGMVAAQHSLTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAFPKASMIVMSHSAPDSRAAITHTARMADKLR


© Екатерина Посицельская, 2016