import sys
import modeller
import _modeller
import modeller.automodel
import requests
env=modeller.environ()
env.io.hetatm=True
! wget http://www.pdb.org/pdb/files/1lmp.pdb
--2020-10-13 03:09:25-- http://www.pdb.org/pdb/files/1lmp.pdb Resolving www.pdb.org (www.pdb.org)... 128.6.244.52 Connecting to www.pdb.org (www.pdb.org)|128.6.244.52|:80... connected. HTTP request sent, awaiting response... 301 Moved Permanently Location: https://www.rcsb.org/pdb/files/1lmp.pdb [following] --2020-10-13 03:09:26-- https://www.rcsb.org/pdb/files/1lmp.pdb Resolving www.rcsb.org (www.rcsb.org)... 128.6.244.65 Connecting to www.rcsb.org (www.rcsb.org)|128.6.244.65|:443... connected. ERROR: The certificate of `www.rcsb.org' is not trusted. ERROR: The certificate of `www.rcsb.org' hasn't got a known issuer.
url = 'http://www.pdb.org/pdb/files/1lmp.pdb'
r = requests.get(url)
with open('1lmp.pdb', 'wb') as f:
f.write(r.content)
! wget http://www.uniprot.org/uniprot/LYS_BUFAN.fasta
--2020-10-13 03:08:30-- http://www.uniprot.org/uniprot/LYS_BUFAN.fasta Resolving www.uniprot.org (www.uniprot.org)... 128.175.245.202, 193.62.193.81 Connecting to www.uniprot.org (www.uniprot.org)|128.175.245.202|:80... connected. HTTP request sent, awaiting response... 301 Moved Permanently Location: https://www.uniprot.org/uniprot/LYS_BUFAN.fasta [following] --2020-10-13 03:08:31-- https://www.uniprot.org/uniprot/LYS_BUFAN.fasta Connecting to www.uniprot.org (www.uniprot.org)|128.175.245.202|:443... connected. HTTP request sent, awaiting response... 302 Location: /uniprot/P85045.fasta [following] --2020-10-13 03:08:32-- https://www.uniprot.org/uniprot/P85045.fasta Reusing existing connection to www.uniprot.org:443. HTTP request sent, awaiting response... 200 Length: 240 [text/plain] Saving to: `LYS_BUFAN.fasta' 100%[======================================>] 240 --.-K/s in 0s 2020-10-13 03:08:33 (131 MB/s) - `LYS_BUFAN.fasta' saved [240/240]
alignm=modeller.alignment(env)
alignm.append(file='LYS_BUFAN.fasta', align_codes='all',alignment_format='FASTA')
# создадим модель
mdl = modeller.model(env, file='1lmp.pdb')
#, model_segment=('FIRST:'+'A', 'LAST:'+'A'))
# и добавим в выравнивание
alignm.append_model(mdl, atom_files='1lmp.pdb', align_codes='1lmp')
# а также присвоим им идентификаторы
alignm[0].code = 'seq'
alignm[1].code = 'mod'
read_pd_459W> Residue type NDG not recognized. 'automodel' model building will treat this residue as a rigid body. To use real parameters, add the residue type to ${LIB}/restyp.lib, its topology to ${LIB}/top_*.lib, and suitable forcefield parameters to ${LIB}/par.lib. read_pd_459W> Residue type NAG not recognized. 'automodel' model building will treat this residue as a rigid body. To use real parameters, add the residue type to ${LIB}/restyp.lib, its topology to ${LIB}/top_*.lib, and suitable forcefield parameters to ${LIB}/par.lib.
alignm.salign()
alignm.write(file='all_in_one.ali', alignment_format='PIR')
SALIGN_____> adding the next group to the alignment; iteration 1
## Выбираем объект для моделирования
s = alignm[0]
pdb = alignm[1]
print(s.code, pdb.code)
## Создаем объект automodel
a = modeller.automodel.automodel(env, alnfile='all_in_one.ali', knowns= pdb.code , sequence = s.code )
a.name='mod'+s.code
a.starting_model = 1
a.ending_model = 2
a.make()
seq mod fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) check_ali___> Checking the sequence-structure alignment. Implied intrachain target CA(i)-CA(i+1) distances longer than 8.0 angstroms: ALN_POS TMPL RID1 RID2 NAM1 NAM2 DIST ---------------------------------------------- END OF TABLE read_to_681_> topology.submodel read from topology file: 3 fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) patch_s_522_> Number of disulfides patched in MODEL: 4 mdtrsr__446W> A potential that relies on one protein is used, yet you have at least one known structure available. MDT, not library, potential is used. iup2crm_280W> No topology library in memory or assigning a BLK residue. Default CHARMM atom type assigned: C1 --> CT2 This message is written only for the first such atom. 0 atoms in HETATM/BLK residues constrained to protein atoms within 2.30 angstroms and protein CA atoms within 10.00 angstroms 0 atoms in residues without defined topology constrained to be rigid bodies condens_443_> Restraints marked for deletion were removed. Total number of restraints before, now: 13044 12008 iupac_m_397W> Atoms were not swapped because of the uncertainty of how to handle the H atom. iupac_m_397W> Atoms were not swapped because of the uncertainty of how to handle the H atom. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 146 Number of all, selected real atoms : 1166 1166 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 12008 12008 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2336 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 878.3400 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 0 0 0.006 0.006 12.519 1.000 2 Bond angle potential : 1618 0 4 2.231 2.231 158.64 1.000 3 Stereochemical cosine torsion poten: 814 0 27 47.331 47.331 280.14 1.000 4 Stereochemical improper torsion pot: 502 0 0 1.349 1.349 20.337 1.000 5 Soft-sphere overlap restraints : 2336 0 0 0.002 0.002 1.0259 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 0 0.135 0.135 41.774 1.000 10 Distance restraints 2 (N-O) : 2564 0 2 0.198 0.198 83.769 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 0 2 4.519 4.519 34.913 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 0 64.604 64.604 23.943 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 0 69.103 69.103 31.002 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 74.948 74.948 24.160 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 81.740 81.740 14.579 1.000 18 Disulfide distance restraints : 4 0 0 0.008 0.008 0.39093E-01 1.000 19 Disulfide angle restraints : 8 0 0 1.768 1.768 0.55200 1.000 20 Disulfide dihedral angle restraints: 4 0 0 32.809 32.809 3.6785 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.409 0.409 44.158 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 18 14 23.130 61.806 48.665 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 0 0.665 0.665 54.444 1.000 27 Distance restraints 5 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: seq.V99990001 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 15111.6699 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 -92.72 -80.20 57.03 1.65 82.20 -136.72 17.38 1 2G 2G N CA 7 8 -130.26 174.10 8.50 2 4149 2G 3K C N 9 11 -100.98 -118.00 17.28 0.61 -62.90 -179.05 21.71 2 3K 3K N CA 11 12 136.10 139.10 -40.80 3 4150 3K 4Y C N 18 20 -131.69 -124.30 19.16 0.77 -63.50 177.17 31.50 3 4Y 4Y N CA 20 21 153.08 135.40 -43.40 4 4152 5I 6S C N 38 40 -70.73 -72.40 4.08 0.30 -64.10 169.01 12.65 4 6S 6S N CA 40 41 156.12 152.40 -35.00 5 4153 6S 7W C N 44 46 -135.70 -124.90 33.32 1.34 -63.00 158.53 24.71 5 7W 7W N CA 46 47 174.92 143.40 -44.20 6 4154 7W 8E C N 58 60 -68.29 -69.30 3.90 0.33 -63.60 173.50 23.39 6 8E 8E N CA 60 61 146.27 142.50 -40.30 7 4155 8E 9D C N 67 69 -68.78 -70.90 14.41 0.61 -63.30 176.13 21.41 7 9D 9D N CA 69 70 136.05 150.30 -40.00 8 4156 9D 10S C N 75 77 -128.20 -136.60 19.98 0.80 -64.10 179.88 11.11 8 10S 10S N CA 77 78 133.06 151.20 -35.00 9 4158 11C 12S C N 87 89 -79.98 -72.40 11.27 0.89 -64.10 179.77 12.17 9 12S 12S N CA 89 90 144.06 152.40 -35.00 10 4159 12S 13Y C N 93 95 -127.81 -124.30 8.86 0.35 -63.50 -175.37 32.52 10 13Y 13Y N CA 95 96 143.53 135.40 -43.40 11 4160 13Y 14L C N 105 107 -87.72 -108.50 22.37 0.99 -63.50 167.20 21.94 11 14L 14L N CA 107 108 124.24 132.50 -41.20 12 4161 14L 15Q C N 113 115 -154.04 -121.10 108.16 4.50 -63.80 118.62 21.48 12 15Q 15Q N CA 115 116 -117.28 139.70 -40.30 13 4162 15Q 16L C N 122 124 -59.78 -70.70 11.07 1.06 -63.50 175.39 24.14 13 16L 16L N CA 124 125 143.45 141.60 -41.20 14 4163 16L 17Q C N 130 132 -113.81 -121.10 20.34 0.82 -63.80 168.60 22.50 14 17Q 17Q N CA 132 133 120.71 139.70 -40.30 15 4198 51E 52S C N 414 416 -138.18 -64.10 82.00 8.62 -64.10 82.00 8.62 15 52S 52S N CA 416 417 0.16 -35.00 -35.00 16 4210 63P 64P C N 504 506 -60.67 -58.70 24.80 2.21 -64.50 157.62 11.97 16 64P 64P N CA 506 507 -55.23 -30.50 147.20 17 4288 141T 142Q C N 1128 1130 -1.28 -63.80 98.26 12.65 -73.00 125.66 9.56 17 142Q 142Q N CA 1130 1131 -116.11 -40.30 140.70 18 4289 142Q 143G C N 1137 1139 -55.77 -62.40 23.83 3.91 82.20 140.56 10.27 18 143G 143G N CA 1139 1140 -18.31 -41.20 8.50 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 0 5 7 38 69 128 117 116 154 192 172 << end of ENERGY. iupac_m_397W> Atoms were not swapped because of the uncertainty of how to handle the H atom. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 146 Number of all, selected real atoms : 1166 1166 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 12008 12008 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2401 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 769.9892 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 0 0 0.006 0.006 11.459 1.000 2 Bond angle potential : 1618 0 5 2.121 2.121 142.79 1.000 3 Stereochemical cosine torsion poten: 814 0 33 47.656 47.656 279.95 1.000 4 Stereochemical improper torsion pot: 502 0 0 1.258 1.258 18.276 1.000 5 Soft-sphere overlap restraints : 2401 0 0 0.002 0.002 0.72514 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 0 0.128 0.128 34.322 1.000 10 Distance restraints 2 (N-O) : 2564 0 0 0.165 0.165 70.749 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 0 1 4.123 4.123 29.068 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 0 64.505 64.505 17.517 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 0 73.445 73.445 33.073 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 62.115 62.115 22.693 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 90.665 90.665 15.219 1.000 18 Disulfide distance restraints : 4 0 0 0.008 0.008 0.45074E-01 1.000 19 Disulfide angle restraints : 8 0 0 1.903 1.903 0.63973 1.000 20 Disulfide dihedral angle restraints: 4 0 0 27.977 27.977 3.0637 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.375 0.375 26.630 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 15 13 21.498 58.307 32.415 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 0 0.544 0.544 31.357 1.000 27 Distance restraints 5 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: seq.V99990002 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 14472.9033 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 -156.16 -167.20 26.98 0.85 82.20 -173.42 14.08 1 2G 2G N CA 7 8 149.98 174.60 8.50 2 4149 2G 3K C N 9 11 -110.33 -118.00 10.45 0.52 -62.90 179.39 26.10 2 3K 3K N CA 11 12 146.20 139.10 -40.80 3 4150 3K 4Y C N 18 20 -68.90 -98.40 30.68 1.36 -63.50 179.86 28.82 3 4Y 4Y N CA 20 21 136.82 128.40 -43.40 4 4152 5I 6S C N 38 40 -139.11 -136.60 3.20 0.10 -64.10 -172.52 18.92 4 6S 6S N CA 40 41 153.17 151.20 -35.00 5 4153 6S 7W C N 44 46 -134.97 -124.90 22.80 0.85 -63.00 168.13 25.86 5 7W 7W N CA 46 47 163.86 143.40 -44.20 6 4154 7W 8E C N 58 60 -111.05 -117.80 7.09 0.31 -63.60 -174.57 22.46 6 8E 8E N CA 60 61 138.95 136.80 -40.30 7 4155 8E 9D C N 67 69 -113.51 -96.50 28.00 1.18 -63.30 -176.56 20.14 7 9D 9D N CA 69 70 136.44 114.20 -40.00 8 4158 11C 12S C N 87 89 -170.21 -136.60 36.35 1.15 -64.10 -168.06 21.57 8 12S 12S N CA 89 90 165.06 151.20 -35.00 9 4159 12S 13Y C N 93 95 -110.43 -98.40 16.86 1.29 -63.50 166.74 24.25 9 13Y 13Y N CA 95 96 116.59 128.40 -43.40 10 4160 13Y 14L C N 105 107 -103.93 -108.50 25.78 1.43 -63.50 165.93 25.52 10 14L 14L N CA 107 108 157.87 132.50 -41.20 11 4161 14L 15Q C N 113 115 -109.63 -121.10 12.11 0.54 -63.80 -178.02 29.21 11 15Q 15Q N CA 115 116 143.59 139.70 -40.30 12 4162 15Q 16L C N 122 124 -97.47 -70.70 62.69 3.94 -63.50 125.21 19.47 12 16L 16L N CA 124 125 -161.71 141.60 -41.20 13 4163 16L 17Q C N 130 132 -78.18 -73.00 46.77 3.16 -63.80 135.29 19.16 13 17Q 17Q N CA 132 133 94.22 140.70 -40.30 14 4198 51E 52S C N 414 416 -138.44 -64.10 82.56 8.64 -64.10 82.56 8.64 14 52S 52S N CA 416 417 0.92 -35.00 -35.00 15 4210 63P 64P C N 504 506 -73.72 -58.70 15.24 1.80 -64.50 175.37 13.58 15 64P 64P N CA 506 507 -27.93 -30.50 147.20 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 0 4 12 48 67 110 114 125 169 186 182 << end of ENERGY. >> Summary of successfully produced models: Filename molpdf ---------------------------------------- seq.B99990001.pdb 878.34003 seq.B99990002.pdb 769.98920
import nglview
import ipywidgets
w1 = nglview.show_structure_file('seq.B99990001.pdb')
w1
w2 = nglview.show_structure_file('seq.B99990002.pdb')
w2
Полученные модели и исходный референс были совмещены для сравнения в Pymol
from IPython.display import Image
Image(filename='overlay1.png')
Зеленым обозначена структура 1lmp, голубым - модели без ограничений, розовым с дополнительными ограничением.
В целом, структуры очень похожи. Видно, что есть отличия по N-концевому участку
Добавим лиганд к последовательности LYS_BUFAN из 1lmp и посроим модель
seq_liganded = ''
for res in alignm[0].residues:
seq_liganded += res.code
for lig in alignm[1].residues[-3:]:
seq_liganded += lig.code
alignm.append_sequence(seq_liganded)
alignm[2].code = 'lys_lig'
del alignm[0]
#добавим в выравнивание
alignm.salign()
alignm.write(file='all_in_one_lig.ali', alignment_format='PIR')
SALIGN_____> adding the next group to the alignment; iteration 1
## Выбираем объект для моделирования
s = alignm[1]
pdb = alignm[0]
print(s.code, pdb.code)
## Создаем объект automodel
a = modeller.automodel.automodel(env, alnfile='all_in_one_lig.ali', knowns=pdb.code, sequence=s.code)
a.name = 'mod' + s.code
a.starting_model = 1
a.ending_model = 2
a.make()
lys_lig mod fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) check_ali___> Checking the sequence-structure alignment. Implied intrachain target CA(i)-CA(i+1) distances longer than 8.0 angstroms: ALN_POS TMPL RID1 RID2 NAM1 NAM2 DIST ---------------------------------------------- END OF TABLE read_to_681_> topology.submodel read from topology file: 3 getf_______W> RTF restraint not found in the atoms list: residue type, indices: 10 146 atom names : C +N atom indices : 1164 0 getf_______W> RTF restraint not found in the atoms list: residue type, indices: 10 146 atom names : C CA +N O atom indices : 1164 1159 0 1165 fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) patch_s_522_> Number of disulfides patched in MODEL: 4 mdtrsr__446W> A potential that relies on one protein is used, yet you have at least one known structure available. MDT, not library, potential is used. iup2crm_280W> No topology library in memory or assigning a BLK residue. Default CHARMM atom type assigned: C1 --> CT2 This message is written only for the first such atom. 43 atoms in HETATM/BLK residues constrained to protein atoms within 2.30 angstroms and protein CA atoms within 10.00 angstroms 43 atoms in residues without defined topology constrained to be rigid bodies condens_443_> Restraints marked for deletion were removed. Total number of restraints before, now: 14445 13409 iupac_m_397W> Atoms were not swapped because of the uncertainty of how to handle the H atom. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 1209 1209 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 13409 13409 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2678 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 1018.6566 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 0 0 0.007 0.007 14.806 1.000 2 Bond angle potential : 1618 0 6 2.278 2.278 165.48 1.000 3 Stereochemical cosine torsion poten: 814 0 29 47.023 47.023 280.47 1.000 4 Stereochemical improper torsion pot: 502 0 0 1.357 1.357 20.886 1.000 5 Soft-sphere overlap restraints : 2678 1 2 0.008 0.008 17.900 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 0 0.119 0.119 37.112 1.000 10 Distance restraints 2 (N-O) : 2564 1 11 0.233 0.233 129.30 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 0 3 4.762 4.762 38.772 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 0 72.352 72.352 27.646 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 2 81.355 81.355 44.959 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 76.353 76.353 26.050 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 86.997 86.997 16.447 1.000 18 Disulfide distance restraints : 4 0 0 0.007 0.007 0.35324E-01 1.000 19 Disulfide angle restraints : 8 0 0 2.841 2.841 1.4259 1.000 20 Disulfide dihedral angle restraints: 4 0 0 28.794 28.794 2.7176 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.404 0.404 44.086 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 22 14 23.505 66.455 64.995 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 1 0.778 0.778 71.077 1.000 27 Distance restraints 5 (X-Y) : 1401 0 0 0.033 0.033 14.496 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: lys_lig.V99990001 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 16793.7031 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 10 : Distance restraints 2 (N-O) List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 8128 67S 87P N O 530 712 8.73 6.49 2.23 4.92 6.49 2.23 4.92 ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 127.20 78.70 48.99 2.29 82.20 -175.94 11.12 1 2G 2G N CA 7 8 -173.02 -166.10 8.50 2 4149 2G 3K C N 9 11 -65.92 -70.20 8.67 0.53 -62.90 173.68 22.39 2 3K 3K N CA 11 12 132.86 140.40 -40.80 3 4150 3K 4Y C N 18 20 -126.28 -124.30 3.38 0.14 -63.50 -170.82 33.18 3 4Y 4Y N CA 20 21 138.14 135.40 -43.40 4 4152 5I 6S C N 38 40 -130.11 -136.60 10.88 0.38 -64.10 -170.66 11.68 4 6S 6S N CA 40 41 142.46 151.20 -35.00 5 4153 6S 7W C N 44 46 -76.43 -71.30 19.33 1.32 -63.00 158.74 20.69 5 7W 7W N CA 46 47 157.63 139.00 -44.20 6 4154 7W 8E C N 58 60 -59.01 -69.30 13.44 0.75 -63.60 174.21 23.48 6 8E 8E N CA 60 61 133.85 142.50 -40.30 7 4155 8E 9D C N 67 69 -93.87 -96.50 7.18 0.30 -63.30 150.66 16.97 7 9D 9D N CA 69 70 107.52 114.20 -40.00 8 4156 9D 10S C N 75 77 -84.09 -72.40 19.29 0.88 -64.10 158.51 12.78 8 10S 10S N CA 77 78 167.75 152.40 -35.00 9 4157 10S 11C C N 81 83 -63.43 -63.00 8.27 1.04 -117.90 178.08 7.68 9 11C 11C N CA 83 84 -49.36 -41.10 141.10 10 4158 11C 12S C N 87 89 -75.76 -72.40 7.75 0.36 -64.10 166.02 12.76 10 12S 12S N CA 89 90 159.38 152.40 -35.00 11 4159 12S 13Y C N 93 95 -77.40 -98.40 22.02 1.01 -63.50 178.95 27.69 11 13Y 13Y N CA 95 96 135.01 128.40 -43.40 12 4160 13Y 14L C N 105 107 -62.10 -70.70 27.90 1.82 -63.50 156.27 21.78 12 14L 14L N CA 107 108 115.06 141.60 -41.20 13 4161 14L 15Q C N 113 115 -141.76 -121.10 21.00 0.87 -63.80 -167.28 25.08 13 15Q 15Q N CA 115 116 135.95 139.70 -40.30 14 4162 15Q 16L C N 122 124 -136.67 -108.50 29.70 1.53 -63.50 179.85 22.11 14 16L 16L N CA 124 125 123.09 132.50 -41.20 15 4163 16L 17Q C N 130 132 -133.47 -121.10 15.35 0.76 -63.80 -175.43 24.15 15 17Q 17Q N CA 132 133 130.61 139.70 -40.30 16 4198 51E 52S C N 414 416 -136.99 -64.10 81.18 8.47 -64.10 81.18 8.47 16 52S 52S N CA 416 417 0.74 -35.00 -35.00 17 4209 62N 63P C N 497 499 -64.45 -64.50 56.36 4.21 -58.70 126.08 10.89 17 63P 63P N CA 499 500 -156.44 147.20 -30.50 18 4210 63P 64P C N 504 506 -57.25 -58.70 24.36 2.13 -64.50 153.56 11.08 18 64P 64P N CA 506 507 -6.19 -30.50 147.20 19 4233 86T 87P C N 704 706 -46.01 -58.70 63.98 4.34 -64.50 121.01 10.03 19 87P 87P N CA 706 707 -93.21 -30.50 147.20 20 4234 87P 88R C N 711 713 -97.75 -125.20 60.46 2.21 57.30 162.53 16.24 20 88R 88R N CA 713 714 86.74 140.60 38.00 21 4288 141T 142Q C N 1128 1130 17.83 -63.80 120.51 15.62 -73.00 128.11 9.94 21 142Q 142Q N CA 1130 1131 -128.96 -40.30 140.70 22 4289 142Q 143G C N 1137 1139 -61.13 -62.40 30.61 4.63 82.20 144.60 10.33 22 143G 143G N CA 1139 1140 -10.62 -41.20 8.50 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 0 7 15 54 95 124 128 156 188 183 230 << end of ENERGY. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 1209 1209 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 13409 13409 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2603 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 886.5428 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 0 0 0.006 0.006 12.115 1.000 2 Bond angle potential : 1618 0 5 2.179 2.179 151.61 1.000 3 Stereochemical cosine torsion poten: 814 0 26 46.358 46.358 275.25 1.000 4 Stereochemical improper torsion pot: 502 0 0 1.336 1.336 20.365 1.000 5 Soft-sphere overlap restraints : 2603 1 2 0.008 0.008 17.710 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 0 0.103 0.103 29.134 1.000 10 Distance restraints 2 (N-O) : 2564 1 7 0.210 0.210 107.41 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 0 1 4.239 4.239 30.723 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 2 72.880 72.880 26.808 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 0 75.174 75.174 32.369 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 69.467 69.467 25.136 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 84.720 84.720 13.165 1.000 18 Disulfide distance restraints : 4 0 0 0.008 0.008 0.40860E-01 1.000 19 Disulfide angle restraints : 8 0 0 1.815 1.815 0.58187 1.000 20 Disulfide dihedral angle restraints: 4 0 0 29.338 29.338 3.2852 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.357 0.357 30.671 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 18 15 21.458 61.540 45.602 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 1 0.686 0.686 52.607 1.000 27 Distance restraints 5 (X-Y) : 1401 0 0 0.030 0.030 11.966 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: lys_lig.V99990002 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 15709.1064 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 10 : Distance restraints 2 (N-O) List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 8128 67S 87P N O 530 712 8.63 6.49 2.14 4.71 6.49 2.14 4.71 ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 86.45 78.70 8.87 0.30 82.20 178.98 8.47 1 2G 2G N CA 7 8 -170.43 -166.10 8.50 2 4149 2G 3K C N 9 11 -71.43 -70.20 3.39 0.22 -62.90 175.84 23.31 2 3K 3K N CA 11 12 143.56 140.40 -40.80 3 4150 3K 4Y C N 18 20 -125.86 -124.30 6.66 0.29 -63.50 -174.48 32.58 3 4Y 4Y N CA 20 21 141.87 135.40 -43.40 4 4152 5I 6S C N 38 40 -71.74 -72.40 7.27 0.40 -64.10 180.00 13.51 4 6S 6S N CA 40 41 145.16 152.40 -35.00 5 4153 6S 7W C N 44 46 -137.82 -124.90 13.27 0.51 -63.00 -174.83 28.18 5 7W 7W N CA 46 47 146.42 143.40 -44.20 6 4154 7W 8E C N 58 60 -76.48 -69.30 8.48 0.77 -63.60 178.75 23.15 6 8E 8E N CA 60 61 137.99 142.50 -40.30 7 4155 8E 9D C N 67 69 -81.06 -96.50 15.77 0.65 -63.30 152.02 17.76 7 9D 9D N CA 69 70 110.98 114.20 -40.00 8 4156 9D 10S C N 75 77 -157.32 -136.60 33.77 1.17 -64.10 174.18 19.35 8 10S 10S N CA 77 78 177.86 151.20 -35.00 9 4159 12S 13Y C N 93 95 -117.15 -124.30 10.39 0.45 -63.50 179.47 25.98 9 13Y 13Y N CA 95 96 127.87 135.40 -43.40 10 4160 13Y 14L C N 105 107 -64.82 -70.70 14.68 0.93 -63.50 169.35 23.44 10 14L 14L N CA 107 108 128.15 141.60 -41.20 11 4161 14L 15Q C N 113 115 -129.38 -121.10 18.63 0.72 -63.80 175.99 29.34 11 15Q 15Q N CA 115 116 156.39 139.70 -40.30 12 4162 15Q 16L C N 122 124 -108.81 -108.50 11.20 0.60 -63.50 -179.14 27.89 12 16L 16L N CA 124 125 143.70 132.50 -41.20 13 4163 16L 17Q C N 130 132 -91.74 -73.00 29.89 2.30 -63.80 160.17 22.18 13 17Q 17Q N CA 132 133 117.41 140.70 -40.30 14 4198 51E 52S C N 414 416 -135.59 -64.10 75.62 8.49 -64.10 75.62 8.49 14 52S 52S N CA 416 417 -10.35 -35.00 -35.00 15 4209 62N 63P C N 497 499 -66.85 -64.50 58.86 4.27 -58.70 123.76 10.89 15 63P 63P N CA 499 500 -153.99 147.20 -30.50 16 4210 63P 64P C N 504 506 -59.05 -58.70 27.48 2.24 -64.50 150.32 10.93 16 64P 64P N CA 506 507 -3.02 -30.50 147.20 17 4233 86T 87P C N 704 706 -46.38 -58.70 65.21 4.47 -64.50 119.65 9.91 17 87P 87P N CA 706 707 -94.53 -30.50 147.20 18 4234 87P 88R C N 711 713 -99.04 -125.20 58.20 2.14 57.30 164.33 16.33 18 88R 88R N CA 713 714 88.61 140.60 38.00 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 0 10 11 60 79 140 133 143 186 186 197 << end of ENERGY. >> Summary of successfully produced models: Filename molpdf ---------------------------------------- lys_lig.B99990001.pdb 1018.65662 lys_lig.B99990002.pdb 886.54285
import nglview
import ipywidgets
w3 = nglview.show_structure_file('lys_lig.B99990001.pdb')
w3.clear_representations()
w3.add_cartoon('protein', color_scheme='residueindex')
w3.add_ball_and_stick('ligand', opacity=0.5)
w3
w0 = nglview.show_structure_file('1lmp.pdb')
w0.clear_representations()
w0.add_cartoon('protein', color_scheme='residueindex')
w0.add_ball_and_stick('ligand', opacity=0.5)
w0
У получившихся моделей метостоположение лиганда совпадает с координатами в рефенсном лизоциме.
Структурно модель почти полностью совпадает с 1lmp, хотя на уровне последовательностей различий много.
from IPython.display import Image
Image(filename='overlay2.png')
Зеленым обозначена референсная структура, голубым - модель выданной последовательности.
Переместим лиганд (добавим случайный шум к его местоположению)
#удаляем файл с автоматическими ограничениями
! rm lys_lig.rsr
env.libs.topology.clear()
class mymodel(modeller.automodel.automodel):
def special_restraints(self, aln):
rsr = self.restraints
at = self.atoms
for x,y in [('CG2:92','O6:147'),('N:78','OD2:118')]:
rsr.add(modeller.forms.gaussian(group=modeller.physical.xy_distance,
feature=modeller.features.distance(
at[x],at[y]),mean=1.0, stdev=0.1))
from modeller import *
from modeller.automodel import *
a = mymodel(env, alnfile='all_in_one_lig.ali', knowns=pdb.code, sequence=s.code)
a.name = 'mod' + s.code
a.starting_model = 1
a.ending_model = 2
a.make()
fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) check_ali___> Checking the sequence-structure alignment. Implied intrachain target CA(i)-CA(i+1) distances longer than 8.0 angstroms: ALN_POS TMPL RID1 RID2 NAM1 NAM2 DIST ---------------------------------------------- END OF TABLE read_to_681_> topology.submodel read from topology file: 3 getf_______W> RTF restraint not found in the atoms list: residue type, indices: 10 146 atom names : C +N atom indices : 1164 0 getf_______W> RTF restraint not found in the atoms list: residue type, indices: 10 146 atom names : C CA +N O atom indices : 1164 1159 0 1165 fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) patch_s_522_> Number of disulfides patched in MODEL: 4 mdtrsr__446W> A potential that relies on one protein is used, yet you have at least one known structure available. MDT, not library, potential is used. iup2crm_280W> No topology library in memory or assigning a BLK residue. Default CHARMM atom type assigned: C1 --> CT2 This message is written only for the first such atom. 43 atoms in HETATM/BLK residues constrained to protein atoms within 2.30 angstroms and protein CA atoms within 10.00 angstroms 43 atoms in residues without defined topology constrained to be rigid bodies condens_443_> Restraints marked for deletion were removed. Total number of restraints before, now: 14447 13411 iupac_m_397W> Atoms were not swapped because of the uncertainty of how to handle the H atom. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 1209 1209 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 13411 13411 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2672 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 4289.7935 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 7 15 0.040 0.040 448.44 1.000 2 Bond angle potential : 1618 18 36 3.986 3.986 468.92 1.000 3 Stereochemical cosine torsion poten: 814 0 32 48.048 48.048 288.89 1.000 4 Stereochemical improper torsion pot: 502 2 4 2.398 2.398 57.506 1.000 5 Soft-sphere overlap restraints : 2672 2 7 0.012 0.012 47.433 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 9 27 0.311 0.311 291.98 1.000 10 Distance restraints 2 (N-O) : 2564 3 25 0.290 0.290 270.60 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 2 4 5.688 5.688 55.328 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 2 68.992 68.992 36.374 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 0 82.848 82.848 33.529 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 78.329 78.329 19.530 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 80.211 80.211 11.504 1.000 18 Disulfide distance restraints : 4 0 0 0.008 0.008 0.42737E-01 1.000 19 Disulfide angle restraints : 8 0 0 2.241 2.241 0.88715 1.000 20 Disulfide dihedral angle restraints: 4 0 0 33.249 33.249 3.9746 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.531 0.531 73.326 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 16 19 22.899 57.780 78.729 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 1 0.761 0.761 65.089 1.000 27 Distance restraints 5 (X-Y) : 1403 8 38 0.268 0.268 2037.7 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: lys_lig.V99990001 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 20074.9531 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 1 : Bond length potential List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 633 77S 78R C N 615 617 1.51 1.35 0.16 5.81 1.35 0.16 5.81 2 641 78R 78R N CA 617 618 1.59 1.43 0.16 5.19 1.43 0.16 5.19 3 770 92T 92T CB CA 749 748 2.17 1.50 0.67 18.46 1.50 0.67 18.46 4 772 92T 92T CG2 CB 751 749 2.22 1.54 0.68 18.74 1.54 0.68 18.74 5 960 118D 118D CB CA 938 937 2.09 1.54 0.55 15.17 1.54 0.55 15.17 6 961 118D 118D CG CB 939 938 2.11 1.52 0.59 15.37 1.52 0.59 15.37 7 963 118D 118D OD2 CG 941 939 1.50 1.26 0.24 10.09 1.26 0.24 10.09 ------------------------------------------------------------------------------------------------- Feature 2 : Bond angle potential List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 2056 77S 77S CA C 612 615 133.85 116.50 17.35 4.99 116.50 17.35 4.99 2 2098 80W 80W N CA 642 643 129.86 107.00 22.86 6.57 107.00 22.86 6.57 3 2236 91N 91N N CA 739 740 123.25 107.00 16.25 4.67 107.00 16.25 4.67 4 2244 91N 92T C N 745 747 145.62 120.00 25.62 5.82 120.00 25.62 5.82 5 2246 92T 92T N CA 747 748 136.56 113.50 23.06 6.20 113.50 23.06 6.20 6 2249 92T 92T CA CB 748 749 161.23 108.50 52.73 12.37 108.50 52.73 12.37 7 2253 92T 92T C CA 752 748 137.79 108.00 29.78 6.90 108.00 29.78 6.90 8 2257 93C 93C N CA 754 755 128.62 107.00 21.62 6.22 107.00 21.62 6.22 9 2508 118D 118D CA CB 937 938 156.79 108.00 48.78 11.31 108.00 48.78 11.31 10 2512 118D 118D CB CG 938 939 150.64 118.00 32.64 6.63 118.00 32.64 6.63 11 2514 118D 118D C CA 942 937 127.59 108.00 19.59 4.54 108.00 19.59 4.54 ------------------------------------------------------------------------------------------------- Feature 13 : Mainchain Omega dihedral restraints List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4368 77S 77S CA C 612 615 -154.29 -180.00 25.71 5.14 -180.00 25.71 5.14 2 4409 118D 118D CA C 937 942 -156.86 -180.00 23.14 4.63 -180.00 23.14 4.63 ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 -63.73 -62.40 2.04 0.37 82.20 154.67 11.78 1 2G 2G N CA 7 8 -42.75 -41.20 8.50 2 4149 2G 3K C N 9 11 -75.24 -70.20 8.37 0.70 -62.90 174.96 22.07 2 3K 3K N CA 11 12 133.72 140.40 -40.80 3 4150 3K 4Y C N 18 20 -133.28 -124.30 28.78 1.20 -63.50 168.94 30.25 3 4Y 4Y N CA 20 21 162.74 135.40 -43.40 4 4152 5I 6S C N 38 40 -60.71 -72.40 24.04 1.11 -64.10 166.43 12.26 4 6S 6S N CA 40 41 131.39 152.40 -35.00 5 4153 6S 7W C N 44 46 -140.86 -124.90 21.62 0.69 -63.00 175.98 27.24 5 7W 7W N CA 46 47 157.98 143.40 -44.20 6 4154 7W 8E C N 58 60 -73.85 -69.30 8.99 0.81 -63.60 175.35 22.83 6 8E 8E N CA 60 61 134.74 142.50 -40.30 7 4156 9D 10S C N 75 77 -66.92 -72.40 11.00 0.51 -64.10 177.89 12.72 7 10S 10S N CA 77 78 142.86 152.40 -35.00 8 4158 11C 12S C N 87 89 -162.94 -136.60 32.29 0.99 -64.10 -176.06 20.46 8 12S 12S N CA 89 90 169.87 151.20 -35.00 9 4159 12S 13Y C N 93 95 -114.91 -124.30 18.82 0.74 -63.50 170.43 24.65 9 13Y 13Y N CA 95 96 119.10 135.40 -43.40 10 4160 13Y 14L C N 105 107 -135.42 -108.50 42.39 1.95 -63.50 169.56 27.99 10 14L 14L N CA 107 108 165.25 132.50 -41.20 11 4161 14L 15Q C N 113 115 -126.65 -121.10 43.72 1.98 -63.80 150.40 25.40 11 15Q 15Q N CA 115 116 -176.94 139.70 -40.30 12 4162 15Q 16L C N 122 124 -65.46 -70.70 6.37 0.64 -63.50 173.57 24.21 12 16L 16L N CA 124 125 145.24 141.60 -41.20 13 4163 16L 17Q C N 130 132 -102.53 -121.10 29.47 1.00 -63.80 161.83 21.95 13 17Q 17Q N CA 132 133 116.82 139.70 -40.30 14 4198 51E 52S C N 414 416 -137.07 -64.10 81.25 8.48 -64.10 81.25 8.48 14 52S 52S N CA 416 417 0.74 -35.00 -35.00 15 4210 63P 64P C N 504 506 -81.49 -58.70 30.40 2.28 -64.50 158.49 12.74 15 64P 64P N CA 506 507 -10.38 -30.50 147.20 16 4224 77S 78R C N 615 617 -115.26 -63.00 53.42 8.23 -63.00 53.42 8.23 16 78R 78R N CA 617 618 -30.00 -41.10 -41.10 ------------------------------------------------------------------------------------------------- Feature 27 : Distance restraints 5 (X-Y) List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 13410 92T 147. CG2 O6 751 1178 6.99 1.00 5.99 59.85 1.00 5.99 59.85 2 13411 78R 118D N OD2 617 941 5.55 1.00 4.55 45.48 1.00 4.55 45.48 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 2 13 22 64 80 129 127 157 169 203 193 << end of ENERGY. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 1209 1209 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 13411 13411 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 2718 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 4438.9775 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 1197 8 16 0.040 0.040 458.55 1.000 2 Bond angle potential : 1618 15 38 4.021 4.021 481.41 1.000 3 Stereochemical cosine torsion poten: 814 0 28 47.379 47.379 285.55 1.000 4 Stereochemical improper torsion pot: 502 2 5 2.412 2.412 58.352 1.000 5 Soft-sphere overlap restraints : 2718 2 6 0.012 0.012 46.161 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 9 26 0.311 0.311 293.11 1.000 10 Distance restraints 2 (N-O) : 2564 4 30 0.328 0.328 318.65 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 1 5 5.701 5.701 55.584 1.000 14 Sidechain Chi_1 dihedral restraints: 126 0 2 77.108 77.108 41.028 1.000 15 Sidechain Chi_2 dihedral restraints: 93 0 1 72.643 72.643 32.921 1.000 16 Sidechain Chi_3 dihedral restraints: 37 0 0 75.057 75.057 27.432 1.000 17 Sidechain Chi_4 dihedral restraints: 21 0 0 86.954 86.954 13.256 1.000 18 Disulfide distance restraints : 4 0 0 0.008 0.008 0.49403E-01 1.000 19 Disulfide angle restraints : 8 0 0 2.742 2.742 1.3278 1.000 20 Disulfide dihedral angle restraints: 4 0 0 35.384 35.384 4.2531 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 1582 0 0 0.497 0.497 74.450 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 19 21 25.223 63.238 103.25 1.000 26 Distance restraints 4 (SDCH-SDCH) : 744 0 0 0.807 0.807 77.278 1.000 27 Distance restraints 5 (X-Y) : 1403 8 40 0.270 0.270 2066.4 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: lys_lig.V99990002 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 21082.1699 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 1 : Bond length potential List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 633 77S 78R C N 615 617 1.51 1.35 0.17 6.02 1.35 0.17 6.02 2 641 78R 78R N CA 617 618 1.59 1.43 0.16 5.18 1.43 0.16 5.18 3 770 92T 92T CB CA 749 748 2.18 1.50 0.68 18.60 1.50 0.68 18.60 4 772 92T 92T CG2 CB 751 749 2.23 1.54 0.69 18.89 1.54 0.69 18.89 5 960 118D 118D CB CA 938 937 2.10 1.54 0.56 15.34 1.54 0.56 15.34 6 961 118D 118D CG CB 939 938 2.12 1.52 0.60 15.62 1.52 0.60 15.62 7 963 118D 118D OD2 CG 941 939 1.50 1.26 0.24 10.20 1.26 0.24 10.20 8 966 118D 118D C CA 942 937 1.65 1.49 0.16 4.53 1.49 0.16 4.53 ------------------------------------------------------------------------------------------------- Feature 2 : Bond angle potential List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 2056 77S 77S CA C 612 615 134.60 116.50 18.10 5.20 116.50 18.10 5.20 2 2098 80W 80W N CA 642 643 128.17 107.00 21.17 6.09 107.00 21.17 6.09 3 2244 91N 92T C N 745 747 147.53 120.00 27.53 6.26 120.00 27.53 6.26 4 2246 92T 92T N CA 747 748 136.38 113.50 22.88 6.15 113.50 22.88 6.15 5 2249 92T 92T CA CB 748 749 159.89 108.50 51.38 12.05 108.50 51.38 12.05 6 2253 92T 92T C CA 752 748 136.76 108.00 28.75 6.66 108.00 28.75 6.66 7 2257 93C 93C N CA 754 755 128.15 107.00 21.15 6.08 107.00 21.15 6.08 8 2508 118D 118D CA CB 937 938 151.58 108.00 43.57 10.10 108.00 43.57 10.10 9 2512 118D 118D CB CG 938 939 151.24 118.00 33.24 6.75 118.00 33.24 6.75 ------------------------------------------------------------------------------------------------- Feature 13 : Mainchain Omega dihedral restraints List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4368 77S 77S CA C 612 615 -154.63 -180.00 25.37 5.07 -180.00 25.37 5.07 ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 4148 1S 2G C N 5 7 -79.43 -80.20 5.87 0.27 82.20 -124.40 7.14 1 2G 2G N CA 7 8 179.91 174.10 8.50 2 4149 2G 3K C N 9 11 -74.95 -70.20 11.31 0.71 -62.90 168.97 22.62 2 3K 3K N CA 11 12 150.66 140.40 -40.80 3 4150 3K 4Y C N 18 20 -76.13 -98.40 22.29 0.85 -63.50 173.08 26.82 3 4Y 4Y N CA 20 21 129.22 128.40 -43.40 4 4152 5I 6S C N 38 40 -159.47 -136.60 42.45 1.56 -64.10 167.78 19.04 4 6S 6S N CA 40 41 -173.04 151.20 -35.00 5 4153 6S 7W C N 44 46 -57.56 -71.30 27.18 1.68 -63.00 159.84 20.32 5 7W 7W N CA 46 47 115.55 139.00 -44.20 6 4154 7W 8E C N 58 60 -140.05 -117.80 44.14 1.67 -63.60 163.73 26.08 6 8E 8E N CA 60 61 174.92 136.80 -40.30 7 4156 9D 10S C N 75 77 -121.63 -136.60 16.93 0.52 -64.10 -172.65 11.54 7 10S 10S N CA 77 78 143.30 151.20 -35.00 8 4159 12S 13Y C N 93 95 -128.02 -124.30 17.40 1.08 -63.50 174.20 24.77 8 13Y 13Y N CA 95 96 118.41 135.40 -43.40 9 4160 13Y 14L C N 105 107 -147.03 -108.50 44.52 1.96 -63.50 -175.97 30.71 9 14L 14L N CA 107 108 154.81 132.50 -41.20 10 4161 14L 15Q C N 113 115 -129.34 -121.10 11.20 0.36 -63.80 -175.55 30.59 10 15Q 15Q N CA 115 116 147.28 139.70 -40.30 11 4162 15Q 16L C N 122 124 -127.07 -108.50 62.23 3.14 -63.50 141.94 23.64 11 16L 16L N CA 124 125 -168.11 132.50 -41.20 12 4163 16L 17Q C N 130 132 -73.76 -73.00 2.89 0.20 -63.80 178.49 25.73 12 17Q 17Q N CA 132 133 137.91 140.70 -40.30 13 4209 62N 63P C N 497 499 -63.40 -64.50 57.13 4.32 -58.70 125.27 10.73 13 63P 63P N CA 499 500 -155.68 147.20 -30.50 14 4210 63P 64P C N 504 506 -58.66 -58.70 27.87 2.31 -64.50 149.94 10.88 14 64P 64P N CA 506 507 -2.63 -30.50 147.20 15 4224 77S 78R C N 615 617 -112.48 -63.00 50.05 7.89 -63.00 50.05 7.89 15 78R 78R N CA 617 618 -33.51 -41.10 -41.10 16 4279 132C 133K C N 1054 1056 -51.99 -62.90 29.29 3.38 -70.20 152.72 11.48 16 133K 133K N CA 1056 1057 -67.98 -40.80 140.40 17 4280 133K 134G C N 1063 1065 -73.84 -62.40 13.42 2.03 82.20 161.77 12.11 17 134G 134G N CA 1065 1066 -34.17 -41.20 8.50 18 4288 141T 142Q C N 1128 1130 25.53 -63.80 129.91 16.87 -73.00 129.92 10.12 18 142Q 142Q N CA 1130 1131 -134.62 -40.30 140.70 19 4289 142Q 143G C N 1137 1139 -61.42 -62.40 29.51 4.45 82.20 145.03 10.40 19 143G 143G N CA 1139 1140 -11.71 -41.20 8.50 ------------------------------------------------------------------------------------------------- Feature 27 : Distance restraints 5 (X-Y) List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 13410 92T 147. CG2 O6 751 1178 7.06 1.00 6.06 60.61 1.00 6.06 60.61 2 13411 78R 118D N OD2 617 941 5.53 1.00 4.53 45.31 1.00 4.53 45.31 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 2 16 20 80 88 127 164 151 149 207 208 << end of ENERGY. >> Summary of successfully produced models: Filename molpdf ---------------------------------------- lys_lig.B99990001.pdb 4289.79346 lys_lig.B99990002.pdb 4438.97754
from IPython.display import Image
Image(filename='overlay3.png')
Структуры моделей с (голубой) и без (зеленый) дополнительных ограничений
Построим структуру лизоцима с лигандом где все аминокислоты аланин и сравним скор функцию с "нативной" последовательностью.
alignm = modeller.alignment(env)
mdl = modeller.model(env, file='1lmp.pdb')
alignm.append_model(mdl, atom_files='1lmp.pdb', align_codes='1lmp')
alanin = 'A' * 146 + '...'
alignm.append_sequence(alanin)
alignm[0].code = 'pdb'
alignm[1].code = 'alanin'
alignm.salign()
alignm.write(file='all_in_one_ala.ali', alignment_format='PIR')
SALIGN_____> adding the next group to the alignment; iteration 1
## Выбираем объект для моделирования
s = alignm[1]
pdb = alignm[0]
print(s.code, pdb.code)
## Создаем объект automodel
a = modeller.automodel.automodel(env, alnfile='all_in_one_ala.ali', knowns=pdb.code, sequence=s.code)
a.name='mod'+s.code
a.starting_model = 1
a.ending_model = 2
a.make()
alanin pdb automodel__W> Topology and/or parameter libraries already in memory. These will be used instead of the automodel defaults. If this is not what you want, clear them before creating the automodel object with env.libs.topology.clear() and env.libs.parameters.clear() fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) check_ali___> Checking the sequence-structure alignment. Implied intrachain target CA(i)-CA(i+1) distances longer than 8.0 angstroms: ALN_POS TMPL RID1 RID2 NAM1 NAM2 DIST ---------------------------------------------- END OF TABLE getf_______W> RTF restraint not found in the atoms list: residue type, indices: 1 146 atom names : C +N atom indices : 729 0 getf_______W> RTF restraint not found in the atoms list: residue type, indices: 1 146 atom names : C CA +N O atom indices : 729 727 0 730 fndatmi_285W> Only 129 residues out of 132 contain atoms of type CA (This is usually caused by non-standard residues, such as ligands, or by PDB files with missing atoms.) mdtrsr__446W> A potential that relies on one protein is used, yet you have at least one known structure available. MDT, not library, potential is used. iup2crm_280W> No topology library in memory or assigning a BLK residue. Default CHARMM atom type assigned: C1 --> CT2 This message is written only for the first such atom. 43 atoms in HETATM/BLK residues constrained to protein atoms within 2.30 angstroms and protein CA atoms within 10.00 angstroms 43 atoms in residues without defined topology constrained to be rigid bodies condens_443_> Restraints marked for deletion were removed. Total number of restraints before, now: 10506 9783 >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 774 774 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 9783 9783 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 1491 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 900.8751 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 730 0 0 0.007 0.007 9.1025 1.000 2 Bond angle potential : 1021 1 6 2.325 2.325 114.12 1.000 3 Stereochemical cosine torsion poten: 296 0 37 76.908 76.908 287.10 1.000 4 Stereochemical improper torsion pot: 292 0 0 1.032 1.032 9.1105 1.000 5 Soft-sphere overlap restraints : 1491 1 2 0.012 0.012 23.802 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 7 0.333 0.333 102.02 1.000 10 Distance restraints 2 (N-O) : 2564 1 8 0.426 0.426 174.37 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 1 7 5.371 5.371 49.334 1.000 14 Sidechain Chi_1 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 15 Sidechain Chi_2 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 16 Sidechain Chi_3 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 17 Sidechain Chi_4 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 18 Disulfide distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 19 Disulfide angle restraints : 0 0 0 0.000 0.000 0.0000 1.000 20 Disulfide dihedral angle restraints: 0 0 0 0.000 0.000 0.0000 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 722 0 0 0.278 0.278 7.2790 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 36 23 35.947 85.484 101.70 1.000 26 Distance restraints 4 (SDCH-SDCH) : 63 0 0 0.740 0.740 3.5256 1.000 27 Distance restraints 5 (X-Y) : 1401 0 0 0.040 0.040 19.413 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: alanin.V99990001 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 15812.7422 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 10 : Distance restraints 2 (N-O) List of the RVIOL violations larger than : 4.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 6626 93A 80A N O 461 400 5.13 2.92 2.20 5.52 2.92 2.20 5.52 ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 2354 15A 16A C N 74 76 78.92 55.40 27.44 1.99 -62.50 155.62 31.61 1 16A 16A N CA 76 77 24.06 38.20 -40.90 2 2357 18A 19A C N 89 91 139.41 -134.00 151.68 5.59 -62.50 165.10 33.24 2 19A 19A N CA 91 92 -88.48 147.00 -40.90 3 2358 19A 20A C N 94 96 57.92 -68.20 133.83 12.64 -68.20 133.83 12.64 3 20A 20A N CA 96 97 -169.92 145.30 145.30 4 2359 20A 21A C N 99 101 -78.31 -68.20 57.92 5.02 -62.50 130.14 20.61 4 21A 21A N CA 101 102 88.27 145.30 -40.90 5 2360 21A 22A C N 104 106 58.84 55.40 14.86 0.64 -62.50 137.48 27.96 5 22A 22A N CA 106 107 23.74 38.20 -40.90 6 2375 36A 37A C N 179 181 66.50 55.40 13.78 0.89 -62.50 147.21 29.94 6 37A 37A N CA 181 182 30.03 38.20 -40.90 7 2376 37A 38A C N 184 186 62.43 55.40 10.57 0.51 -62.50 143.80 29.24 7 38A 38A N CA 186 187 30.31 38.20 -40.90 8 2387 48A 49A C N 239 241 85.43 55.40 40.83 2.28 -62.50 156.62 31.65 8 49A 49A N CA 241 242 10.54 38.20 -40.90 9 2392 53A 54A C N 264 266 92.67 -134.00 138.40 3.39 -62.50 -154.35 32.30 9 54A 54A N CA 266 267 -175.86 147.00 -40.90 10 2395 56A 57A C N 279 281 60.36 55.40 14.47 1.28 -62.50 153.90 31.16 10 57A 57A N CA 281 282 51.79 38.20 -40.90 11 2401 62A 63A C N 309 311 -75.03 -68.20 12.84 0.82 -62.50 163.41 27.41 11 63A 63A N CA 311 312 156.18 145.30 -40.90 12 2402 63A 64A C N 314 316 -158.73 -134.00 26.03 0.62 -62.50 -169.85 36.29 12 64A 64A N CA 316 317 155.10 147.00 -40.90 13 2403 64A 65A C N 319 321 -83.17 -68.20 15.28 1.19 -62.50 172.02 29.25 13 65A 65A N CA 321 322 148.33 145.30 -40.90 14 2404 65A 66A C N 324 326 -70.62 -68.20 14.62 1.26 -62.50 171.98 27.80 14 66A 66A N CA 326 327 130.89 145.30 -40.90 15 2405 66A 67A C N 329 331 -79.41 -68.20 11.35 1.02 -62.50 176.44 29.77 15 67A 67A N CA 331 332 143.47 145.30 -40.90 16 2406 67A 68A C N 334 336 -74.05 -68.20 7.59 0.75 -62.50 179.00 29.92 16 68A 68A N CA 336 337 140.47 145.30 -40.90 17 2407 68A 69A C N 339 341 -157.97 -134.00 28.50 0.69 -62.50 -176.51 35.15 17 69A 69A N CA 341 342 162.41 147.00 -40.90 18 2408 69A 70A C N 344 346 -108.17 -134.00 32.42 0.84 -62.50 174.39 26.73 18 70A 70A N CA 346 347 127.40 147.00 -40.90 19 2409 70A 71A C N 349 351 159.11 -134.00 70.97 1.66 -62.50 -157.08 40.35 19 71A 71A N CA 351 352 170.70 147.00 -40.90 20 2410 71A 72A C N 354 356 -72.33 -68.20 5.64 0.56 -62.50 177.91 29.65 20 72A 72A N CA 356 357 141.46 145.30 -40.90 21 2411 72A 73A C N 359 361 -81.03 -68.20 13.36 1.00 -62.50 171.09 28.98 21 73A 73A N CA 361 362 149.02 145.30 -40.90 22 2412 73A 74A C N 364 366 -130.41 -134.00 15.36 0.77 -62.50 -174.18 28.03 22 74A 74A N CA 366 367 132.06 147.00 -40.90 23 2413 74A 75A C N 369 371 67.18 -68.20 135.39 11.42 -62.50 -141.89 32.91 23 75A 75A N CA 371 372 143.73 145.30 -40.90 24 2414 75A 76A C N 374 376 -112.70 -134.00 34.73 1.21 -62.50 168.14 25.61 24 76A 76A N CA 376 377 119.57 147.00 -40.90 25 2415 76A 77A C N 379 381 63.12 -68.20 131.48 11.36 -62.50 -150.83 31.59 25 77A 77A N CA 381 382 151.85 145.30 -40.90 26 2416 77A 78A C N 384 386 -159.93 -134.00 27.96 0.64 -62.50 -171.27 36.12 26 78A 78A N CA 386 387 157.47 147.00 -40.90 27 2417 78A 79A C N 389 391 175.73 -134.00 51.42 2.12 -62.50 -145.08 32.33 27 79A 79A N CA 391 392 136.20 147.00 -40.90 28 2422 83A 84A C N 414 416 74.98 55.40 52.19 1.91 -62.50 140.86 28.13 28 84A 84A N CA 416 417 -10.18 38.20 -40.90 29 2426 87A 88A C N 434 436 53.46 55.40 5.40 0.20 -62.50 143.27 29.04 29 88A 88A N CA 436 437 43.24 38.20 -40.90 30 2429 90A 91A C N 449 451 49.95 55.40 21.69 0.91 -62.50 150.54 30.27 30 91A 91A N CA 451 452 59.19 38.20 -40.90 31 2431 92A 93A C N 459 461 -76.25 -62.50 36.79 6.77 -62.50 36.79 6.77 31 93A 93A N CA 461 462 -75.02 -40.90 -40.90 32 2432 93A 94A C N 464 466 -94.62 -62.50 37.18 7.56 -62.50 37.18 7.56 32 94A 94A N CA 466 467 -59.61 -40.90 -40.90 33 2459 120A 121A C N 599 601 81.60 -134.00 158.33 3.60 -62.50 179.59 28.78 33 121A 121A N CA 601 602 -148.08 147.00 -40.90 34 2472 133A 134A C N 664 666 49.13 55.40 12.18 0.46 -62.50 143.10 28.91 34 134A 134A N CA 666 667 48.64 38.20 -40.90 35 2480 141A 142A C N 704 706 -53.77 -62.50 10.56 1.71 -68.20 168.47 14.09 35 142A 142A N CA 706 707 -46.85 -40.90 145.30 36 2483 144A 145A C N 719 721 57.74 55.40 4.84 0.48 -62.50 146.29 29.68 36 145A 145A N CA 721 722 42.43 38.20 -40.90 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 1 8 11 60 57 81 68 104 126 114 108 << end of ENERGY. >> ENERGY; Differences between the model's features and restraints: Number of all residues in MODEL : 149 Number of all, selected real atoms : 774 774 Number of all, selected pseudo atoms : 0 0 Number of all static, selected restraints : 9783 9783 COVALENT_CYS : F NONBONDED_SEL_ATOMS : 1 Number of non-bonded pairs (excluding 1-2,1-3,1-4): 1450 Dynamic pairs routine : 2, NATM x NATM cell sorting Atomic shift for contacts update (UPDATE_DYNAMIC) : 0.390 LENNARD_JONES_SWITCH : 6.500 7.500 COULOMB_JONES_SWITCH : 6.500 7.500 RESIDUE_SPAN_RANGE : 0 99999 NLOGN_USE : 15 CONTACT_SHELL : 4.000 DYNAMIC_PAIRS,_SPHERE,_COULOMB,_LENNARD,_MODELLER : T T F F F SPHERE_STDV : 0.050 RADII_FACTOR : 0.820 Current energy : 826.4486 Summary of the restraint violations: NUM ... number of restraints. NUMVI ... number of restraints with RVIOL > VIOL_REPORT_CUT[i]. RVIOL ... relative difference from the best value. NUMVP ... number of restraints with -Ln(pdf) > VIOL_REPORT_CUT2[i]. RMS_1 ... RMS(feature, minimally_violated_basis_restraint, NUMB). RMS_2 ... RMS(feature, best_value, NUMB). MOL.PDF ... scaled contribution to -Ln(Molecular pdf). # RESTRAINT_GROUP NUM NUMVI NUMVP RMS_1 RMS_2 MOL.PDF S_i ------------------------------------------------------------------------------------------------------ 1 Bond length potential : 730 0 0 0.007 0.007 9.2107 1.000 2 Bond angle potential : 1021 0 5 2.322 2.322 111.17 1.000 3 Stereochemical cosine torsion poten: 296 0 42 79.299 79.299 304.42 1.000 4 Stereochemical improper torsion pot: 292 0 0 1.096 1.096 8.9672 1.000 5 Soft-sphere overlap restraints : 1450 1 2 0.012 0.012 23.407 1.000 6 Lennard-Jones 6-12 potential : 0 0 0 0.000 0.000 0.0000 1.000 7 Coulomb point-point electrostatic p: 0 0 0 0.000 0.000 0.0000 1.000 8 H-bonding potential : 0 0 0 0.000 0.000 0.0000 1.000 9 Distance restraints 1 (CA-CA) : 2405 0 2 0.281 0.281 73.149 1.000 10 Distance restraints 2 (N-O) : 2564 0 7 0.398 0.398 158.51 1.000 11 Mainchain Phi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 12 Mainchain Psi dihedral restraints : 0 0 0 0.000 0.000 0.0000 1.000 13 Mainchain Omega dihedral restraints: 145 1 6 5.472 5.472 51.206 1.000 14 Sidechain Chi_1 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 15 Sidechain Chi_2 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 16 Sidechain Chi_3 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 17 Sidechain Chi_4 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 18 Disulfide distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 19 Disulfide angle restraints : 0 0 0 0.000 0.000 0.0000 1.000 20 Disulfide dihedral angle restraints: 0 0 0 0.000 0.000 0.0000 1.000 21 Lower bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 22 Upper bound distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 23 Distance restraints 3 (SDCH-MNCH) : 722 0 0 0.247 0.247 5.2027 1.000 24 Sidechain Chi_5 dihedral restraints: 0 0 0 0.000 0.000 0.0000 1.000 25 Phi/Psi pair of dihedral restraints: 144 37 19 30.984 86.828 58.438 1.000 26 Distance restraints 4 (SDCH-SDCH) : 63 0 0 0.506 0.506 1.4629 1.000 27 Distance restraints 5 (X-Y) : 1401 0 0 0.042 0.042 21.307 1.000 28 NMR distance restraints 6 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 29 NMR distance restraints 7 (X-Y) : 0 0 0 0.000 0.000 0.0000 1.000 30 Minimal distance restraints : 0 0 0 0.000 0.000 0.0000 1.000 31 Non-bonded restraints : 0 0 0 0.000 0.000 0.0000 1.000 32 Atomic accessibility restraints : 0 0 0 0.000 0.000 0.0000 1.000 33 Atomic density restraints : 0 0 0 0.000 0.000 0.0000 1.000 34 Absolute position restraints : 0 0 0 0.000 0.000 0.0000 1.000 35 Dihedral angle difference restraint: 0 0 0 0.000 0.000 0.0000 1.000 36 GBSA implicit solvent potential : 0 0 0 0.000 0.000 0.0000 1.000 37 EM density fitting potential : 0 0 0 0.000 0.000 0.0000 1.000 38 SAXS restraints : 0 0 0 0.000 0.000 0.0000 1.000 39 Symmetry restraints : 0 0 0 0.000 0.000 0.0000 1.000 # Heavy relative violation of each residue is written to: alanin.V99990002 # The profile is NOT normalized by the number of restraints. # The profiles are smoothed over a window of residues: 1 # The sum of all numbers in the file: 16143.9502 List of the violated restraints: A restraint is violated when the relative difference from the best value (RVIOL) is larger than CUTOFF. ICSR ... index of a restraint in the current set. RESNO ... residue numbers of the first two atoms. ATM ... IUPAC atom names of the first two atoms. FEAT ... the value of the feature in the model. restr ... the mean of the basis restraint with the smallest difference from the model (local minimum). viol ... difference from the local minimum. rviol ... relative difference from the local minimum. RESTR ... the best value (global minimum). VIOL ... difference from the best value. RVIOL ... relative difference from the best value. ------------------------------------------------------------------------------------------------- Feature 25 : Phi/Psi pair of dihedral restraints List of the RVIOL violations larger than : 6.5000 # ICSR RESNO1/2 ATM1/2 INDATM1/2 FEAT restr viol rviol RESTR VIOL RVIOL 1 2354 15A 16A C N 74 76 78.39 55.40 25.99 2.01 -62.50 156.00 31.70 1 16A 16A N CA 76 77 26.08 38.20 -40.90 2 2357 18A 19A C N 89 91 63.38 55.40 16.88 0.62 -62.50 141.31 28.73 2 19A 19A N CA 91 92 23.32 38.20 -40.90 3 2359 20A 21A C N 99 101 55.22 55.40 3.08 0.20 -62.50 140.14 28.47 3 21A 21A N CA 101 102 35.13 38.20 -40.90 4 2360 21A 22A C N 104 106 63.23 55.40 11.94 0.57 -62.50 143.94 29.27 4 22A 22A N CA 106 107 29.18 38.20 -40.90 5 2361 22A 23A C N 109 111 -69.22 -62.50 6.74 1.31 -134.00 -176.54 11.30 5 23A 23A N CA 111 112 -41.36 -40.90 147.00 6 2375 36A 37A C N 179 181 67.29 55.40 15.60 0.92 -62.50 146.99 29.89 6 37A 37A N CA 181 182 28.11 38.20 -40.90 7 2376 37A 38A C N 184 186 63.13 55.40 13.06 0.55 -62.50 143.12 29.11 7 38A 38A N CA 186 187 27.67 38.20 -40.90 8 2387 48A 49A C N 239 241 84.61 55.40 39.50 2.23 -62.50 156.20 31.58 8 49A 49A N CA 241 242 11.61 38.20 -40.90 9 2392 53A 54A C N 264 266 91.72 -134.00 139.06 3.44 -62.50 -154.40 32.24 9 54A 54A N CA 266 267 -176.87 147.00 -40.90 10 2395 56A 57A C N 279 281 58.86 55.40 14.06 1.13 -62.50 152.72 30.91 10 57A 57A N CA 281 282 51.83 38.20 -40.90 11 2401 62A 63A C N 309 311 -71.31 -68.20 68.26 5.62 -62.50 118.34 18.98 11 63A 63A N CA 311 312 77.11 145.30 -40.90 12 2402 63A 64A C N 314 316 173.07 -134.00 55.38 1.33 -62.50 -160.61 39.17 12 64A 64A N CA 316 317 163.30 147.00 -40.90 13 2403 64A 65A C N 319 321 -43.06 -68.20 32.68 2.09 -62.50 166.47 28.28 13 65A 65A N CA 321 322 124.43 145.30 -40.90 14 2404 65A 66A C N 324 326 163.53 -134.00 64.76 1.59 -62.50 -155.11 40.51 14 66A 66A N CA 326 327 164.08 147.00 -40.90 15 2405 66A 67A C N 329 331 -144.20 -134.00 12.13 0.65 -62.50 -163.54 36.57 15 67A 67A N CA 331 332 140.44 147.00 -40.90 16 2406 67A 68A C N 334 336 -164.14 -134.00 30.81 1.27 -62.50 -154.61 39.09 16 68A 68A N CA 336 337 140.63 147.00 -40.90 17 2407 68A 69A C N 339 341 59.78 -68.20 130.33 11.82 -62.50 -167.10 29.29 17 69A 69A N CA 341 342 169.92 145.30 -40.90 18 2408 69A 70A C N 344 346 -135.61 -134.00 2.20 0.06 -62.50 -174.38 34.34 18 70A 70A N CA 346 347 148.49 147.00 -40.90 19 2409 70A 71A C N 349 351 -152.61 -134.00 19.00 0.50 -62.50 -169.11 36.10 19 71A 71A N CA 351 352 150.83 147.00 -40.90 20 2410 71A 72A C N 354 356 -125.28 -134.00 24.93 1.15 -62.50 176.11 26.60 20 72A 72A N CA 356 357 123.64 147.00 -40.90 21 2411 72A 73A C N 359 361 -172.72 -134.00 43.42 0.99 -62.50 -171.87 36.64 21 73A 73A N CA 361 362 166.64 147.00 -40.90 22 2412 73A 74A C N 364 366 -117.25 -134.00 22.06 0.61 -62.50 -178.04 27.70 22 74A 74A N CA 366 367 132.63 147.00 -40.90 23 2413 74A 75A C N 369 371 161.97 -134.00 70.95 1.61 -62.50 -164.02 39.04 23 75A 75A N CA 371 372 177.54 147.00 -40.90 24 2414 75A 76A C N 374 376 -65.58 -68.20 2.67 0.24 -62.50 173.28 28.55 24 76A 76A N CA 376 377 145.85 145.30 -40.90 25 2415 76A 77A C N 379 381 -68.12 -68.20 2.04 0.17 -62.50 171.85 28.44 25 77A 77A N CA 381 382 147.34 145.30 -40.90 26 2416 77A 78A C N 384 386 -124.70 -134.00 34.18 1.68 -62.50 167.02 25.18 26 78A 78A N CA 386 387 114.11 147.00 -40.90 27 2417 78A 79A C N 389 391 -21.27 -68.20 84.36 8.17 -62.50 111.59 16.83 27 79A 79A N CA 391 392 -144.59 145.30 -40.90 28 2422 83A 84A C N 414 416 80.69 55.40 54.63 1.99 -62.50 146.44 29.21 28 84A 84A N CA 416 417 -10.22 38.20 -40.90 29 2426 87A 88A C N 434 436 44.95 55.40 37.70 1.52 -62.50 157.62 31.34 29 88A 88A N CA 436 437 74.42 38.20 -40.90 30 2429 90A 91A C N 449 451 62.25 55.40 12.34 1.28 -62.50 153.45 31.11 30 91A 91A N CA 451 452 48.46 38.20 -40.90 31 2432 93A 94A C N 464 466 64.98 55.40 9.83 1.14 -62.50 151.19 30.72 31 94A 94A N CA 466 467 40.39 38.20 -40.90 32 2459 120A 121A C N 599 601 81.13 -134.00 159.59 3.62 -62.50 178.01 28.59 32 121A 121A N CA 601 602 -146.05 147.00 -40.90 33 2471 132A 133A C N 659 661 -55.61 -62.50 14.04 2.10 -68.20 162.06 13.51 33 133A 133A N CA 661 662 -53.13 -40.90 145.30 34 2472 133A 134A C N 664 666 -111.90 -134.00 56.78 2.54 -62.50 144.31 21.83 34 134A 134A N CA 666 667 94.69 147.00 -40.90 35 2473 134A 135A C N 669 671 -167.13 -134.00 36.03 0.82 -68.20 100.19 7.93 35 135A 135A N CA 671 672 161.14 147.00 145.30 36 2481 142A 143A C N 709 711 71.88 55.40 42.07 1.53 -62.50 140.32 28.25 36 143A 143A N CA 711 712 -0.51 38.20 -40.90 37 2483 144A 145A C N 719 721 66.17 55.40 12.65 0.91 -62.50 147.68 30.03 37 145A 145A N CA 721 722 31.58 38.20 -40.90 report______> Distribution of short non-bonded contacts: DISTANCE1: 0.00 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 DISTANCE2: 2.10 2.20 2.30 2.40 2.50 2.60 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 FREQUENCY: 0 0 0 0 0 7 7 49 68 83 78 81 113 114 94 << end of ENERGY. >> Summary of successfully produced models: Filename molpdf ---------------------------------------- alanin.B99990001.pdb 900.87506 alanin.B99990002.pdb 826.44861
w4 = nglview.show_structure_file('alanin.B99990001.pdb')
w4.clear_representations()
w4.add_cartoon('protein', color_scheme='residueindex')
w4.add_ball_and_stick('ligand', opacity=0.5)
w4
Итоговая структура получилась очень похожей на лизоцим, что, разумеется, невозможно.
from IPython.display import Image
Image(filename='overlay4.png')
Синим обозначена исходная структура 1lmp, розовым - получившаяся модель.
Очевидно, что использовать пакет Modeller подобным образом не стоит. Референсные структуры нужно подбирать с уверенностью в гомологии с изучаемой последовательностью.
Сравним скор-функции
import pandas as pd
data = [['Модель 1', 878.34003, 1018.65662, 4289.79346, 900.87506],['Модель 2', 769.98920, 886.54285, 4438.97754, 826.44861]]
pd.DataFrame(data, columns=["", "Без лиганда", "С лигандом", "С перемещенным лигандом", "Замещенный на аланины"])
Без лиганда | С лигандом | С перемещенным лигандом | Замещенный на аланины | ||
---|---|---|---|---|---|
0 | Модель 1 | 878.34003 | 1018.65662 | 4289.79346 | 900.87506 |
1 | Модель 2 | 769.98920 | 886.54285 | 4438.97754 | 826.44861 |
У моделей со смещенным лигандом очень высокий показатель, поскольку введенные дополнительные ограничения не соответствуют изначальной структуре лизоцима 1lmp. Скор аланиновой последовательности сопоставим с моделями без лиганда, возможно по причине большой разницы первичных последовательностей 1lmp и выданного белка.