



MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= memeout/meme.fasta
ALPHABET= ACDEFGHIKLMNPQRSTVWY
Sequence name            Weight Length  Sequence name            Weight Length
NP_001253007.1           1.0000    203  NP_001186687.1           1.0000    146
YP_002049390.1           1.0000    180  XP_002898071.1           1.0000    210
XP_002771748.1           1.0000    227  XP_001742699.1           1.0000    197
NP_011110.1              1.0000    211  XP_002677773.1           1.0000    225
NP_390369.1              1.0000    187  ZP_23953076.1            1.0000    190
YP_006776038.1           1.0000    187  YP_006862846.1           1.0000    173
YP_875063.1              1.0000    173  YP_006544936.1           1.0000    203
ZP_21930915.1            1.0000    190  YP_007249493.1           1.0000    186
YP_002248514.1           1.0000    183  YP_004628550.1           1.0000    193
YP_002250946.1           1.0000    181  ZP_21044980.1            1.0000    199
ZP_09717097.1            1.0000    203  YP_003968700.1           1.0000    191
YP_006640390.1           1.0000    210  YP_002729868.1           1.0000    182
ZP_23724314.1            1.0000    205  ZP_19272575.1            1.0000    176
YP_005097506.1           1.0000    179  ZP_23957523.1            1.0000    197
YP_003504642.1           1.0000    157  YP_001998586.1           1.0000    196
ZP_08554557.1            1.0000    192  YP_004203297.1           1.0000    180
YP_001939367.1           1.0000    197  ZP_07685973.1            1.0000    198
YP_004169895.1           1.0000    180  YP_328474.1              1.0000    178
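
The number of sequences and the total residue count can be recomputed from the table above; they should agree with the N= and n= values on the data: line of the command line summary. The following is a minimal sketch of that check. The function name parse_training_set is hypothetical and is not part of the MEME software; the rows are copied from the table.

```python
# Rows copied from the TRAINING SET table (two records per row).
TRAINING_SET = """\
NP_001253007.1 1.0000 203 NP_001186687.1 1.0000 146
YP_002049390.1 1.0000 180 XP_002898071.1 1.0000 210
XP_002771748.1 1.0000 227 XP_001742699.1 1.0000 197
NP_011110.1 1.0000 211 XP_002677773.1 1.0000 225
NP_390369.1 1.0000 187 ZP_23953076.1 1.0000 190
YP_006776038.1 1.0000 187 YP_006862846.1 1.0000 173
YP_875063.1 1.0000 173 YP_006544936.1 1.0000 203
ZP_21930915.1 1.0000 190 YP_007249493.1 1.0000 186
YP_002248514.1 1.0000 183 YP_004628550.1 1.0000 193
YP_002250946.1 1.0000 181 ZP_21044980.1 1.0000 199
ZP_09717097.1 1.0000 203 YP_003968700.1 1.0000 191
YP_006640390.1 1.0000 210 YP_002729868.1 1.0000 182
ZP_23724314.1 1.0000 205 ZP_19272575.1 1.0000 176
YP_005097506.1 1.0000 179 ZP_23957523.1 1.0000 197
YP_003504642.1 1.0000 157 YP_001998586.1 1.0000 196
ZP_08554557.1 1.0000 192 YP_004203297.1 1.0000 180
YP_001939367.1 1.0000 197 ZP_07685973.1 1.0000 198
YP_004169895.1 1.0000 180 YP_328474.1 1.0000 178
"""


def parse_training_set(text):
    """Return (name, weight, length) tuples from whitespace-separated rows."""
    records = []
    for line in text.splitlines():
        fields = line.split()
        # Each row holds one or two (name, weight, length) triples.
        for i in range(0, len(fields) - 2, 3):
            name, weight, length = fields[i:i + 3]
            records.append((name, float(weight), int(length)))
    return records


records = parse_training_set(TRAINING_SET)
num_sequences = len(records)
total_residues = sum(length for _, _, length in records)
print(num_sequences, total_residues)
```

For this table the script prints 36 6865, matching N= 36 and n= 6865 in the report.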

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme memeout/meme.fasta -mod zoops -prior dirichlet -nostatus -protein -oc memeout/ 
model: mod= zoops nmotifs= 1 evt= inf
object function= E-value of product of p-values
width: minw= 8 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 36 wnsites= 0.8
theta: prob= 1 spmap= pam spfuzz= 120
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 6865 N= 36
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.066 C 0.013 D 0.062 E 0.071 F 0.042 G 0.068 H 0.021 I 0.072 K 0.065 L 0.105 M 0.023 N 0.023 P 0.057 Q 0.038 R 0.069 S 0.051 T 0.041 V 0.067 W 0.009 Y 0.037
Background letter frequencies (from dataset with add-one prior applied):
A 0.066 C 0.013 D 0.062 E 0.071 F 0.042 G 0.068 H 0.021 I 0.072 K 0.065 L 0.105 M 0.023 N 0.024 P 0.057 Q 0.038 R 0.069 S 0.051 T 0.041 V 0.067 W 0.009 Y 0.037
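
The two frequency lists differ only for N (0.023 versus 0.024). A small sketch of how an add-one prior can produce such a shift is given below. It assumes that "add-one prior" means adding one pseudocount to each of the 20 letter counts before normalising; the report does not spell out the formula. The count of 161 for N is a hypothetical value chosen to be consistent with the rounded frequencies, not a number taken from the report.

```python
def add_one_frequency(count, total, alphabet_size):
    """Frequency after adding one pseudocount to every letter (assumed formula)."""
    return (count + 1) / (total + alphabet_size)


TOTAL_RESIDUES = 6865   # n= from the data: line of the report
ALPHABET_SIZE = 20      # letters in ALPHABET= ACDEFGHIKLMNPQRSTVWY
N_COUNT = 161           # hypothetical count of 'N', for illustration only

raw = N_COUNT / TOTAL_RESIDUES
smoothed = add_one_frequency(N_COUNT, TOTAL_RESIDUES, ALPHABET_SIZE)
print(f"{raw:.3f} {smoothed:.3f}")
```

With these assumed inputs the raw frequency rounds to 0.023 and the smoothed one to 0.024. Letters with larger counts change by less than the three printed decimals.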

MOTIF 1 width = 29 sites = 35 llr = 1890 E-value = 2.3e-499

SEQUENCE LOGO (image not shown)
Information Content: 85.4 (bits)
Relative Entropy: 77.9 (bits)
NAME START P-VALUE SITES
YP_002250946.1 110 2.58e-29 QPEKSTIVKH FDLIIVPGIAFDKRGYRIGYGKGYYDKFL NKMKETIKIG
ZP_23724314.1 128 3.87e-29 LKEKRVPIDR IDLILVPGVAFDRQGGRIGHGKGYYDKLL AGARTDTLRV
YP_004203297.1 109 7.16e-28 PTTQAVDPQV LDLVVVPGLAFDEEGYRLGHGKGYYDRFL ATVRAEKLGV
YP_006862846.1 101 9.98e-28 EPLPYGPVDR MDLLVVPGIAFDRKGYRLGYGKGYYDKFL AKRKVVFSIG
ZP_21930915.1 118 2.60e-27 RSQKRVPPEE IDLVLLPGLAFDRKGGRIGYGKGYFDRFL DRLNDRAGRI
NP_001253007.1 128 4.78e-27 VREEALSTGG LDLIFMPGLGFDKHGNRLGRGKGYYDTYL KRCLQHQEVK
ZP_23953076.1 115 8.62e-27 NKQNVADVNT IDLVLLPGIAFDKRGNRIGHGAGYYDRLL GKYLHAKRAG
YP_003968700.1 117 1.15e-26 EEKSKAAAEE LDLVVVPGVGFDINGYRVGYGGGFYDKFF DGIDKEVSKV
XP_001742699.1 119 1.32e-26 AETAALAHEG LDVILVPGMAFDKAGRRLGRGKGYYDRYF ARCAQFATAH
ZP_21044980.1 125 2.01e-26 PTAPLISPAL VDLILVPAVACDVQGYRLGYGGGFYDRLL SQPLWRGKPT
NP_001186687.1 71 2.31e-26 VREEALSTGG LDLIFMPGLGFDKHGNRLGRGKGYYDAYL KRCLQHQEVK
NP_390369.1 115 3.03e-26 EKTKEVNPSQ IDLMIVPGVCFDVNGFRVGFGGGYYDRYL SEYEGKTVSL
YP_002729868.1 108 5.14e-26 PEGEAFPVED IDIVVVPAVAYDLRGHRLGYGKGYYDRLL KRIKGLKVGL
YP_003504642.1 84 6.66e-26 EPVECTDFKH IDIAVVPGVAFDRALHRIGYGKGYYDRLL GAVRFGIIAG
ZP_19272575.1 113 8.59e-26 VGAPFTTYEQ IEVVVVPGVSFDANGNRLGRGRGYYDRFL LQVPQAYKLG
YP_006544936.1 127 8.59e-26 GHELPARPED VQVVIIPMVAFDAKGNRLGYGAGYYDRFL CRYPHPIKIG
YP_001939367.1 121 9.74e-26 LGCVEASIQD IDLILIPGVGFDRQGHRLGRGLGFYDRCL SLLSPRACRI
YP_001998586.1 116 9.74e-26 VPLELTDERC FDAVIVPLVGFDRQGGRIGFGKGWYDRFF EELSTRGISP
ZP_09717097.1 127 1.10e-25 AGKIHTPGEK AALMVVPGMAFDKQGNRMGHGKGYYDKFF AKLDGLGVPY
XP_002898071.1 129 2.03e-25 PRDDAVQGDD LELVLLPGVAFDRRGGRVGHGKGYYDSFL RRLTEHYDAI
ZP_07685973.1 117 2.29e-25 RRIIPALPGD WDLTFVPLLGFDRQGYRIGYGKGYYDQLL GAATTYAVGL
YP_002248514.1 112 2.29e-25 KNGLKAFIEE IDVIAVPGIAFDYKCFRIGYGGGYYDRVL ENKKGVAVGL
XP_002771748.1 141 2.90e-25 VNAMDCRPPA LNVILVPGLGFDNHCRRLGRGKGFYDRYL SRFSAKTGSM
ZP_08554557.1 119 3.67e-25 YGCELCDKET IDLIIMPGLAFDAKGGRVGYGGGFYDHFI NSMETSVRKL
YP_007249493.1 115 1.25e-24 GSEIPARGED VDTIILPMLGFDRTGARIGYGAGYYDRFL EKFPSLRKIG
YP_004169895.1 107 1.90e-24 ADAPRVDRAR VDAVLLPALAFDEVGFRLGYGGGFYDRLL AGWAVPTVGV
YP_005097506.1 108 1.11e-23 EPEGTEYTEY IDIYIVPGVAFDLDLYRLGYGGGFFDRYF SVHKKTQLIG
YP_004628550.1 118 2.50e-23 FENPELKPEN FEIIFVPGVAFDLKKGRIGYGGGFYDKIL KKTKAFKIGV
ZP_23957523.1 120 3.23e-23 DIRFLKPVRE LDVICTPLVGFDSVGHRLGMGGGYYDRTL SRWFSTGEGA
YP_875063.1 103 4.92e-23 PRNDCEECID PDVIIVPAVGTARDGSRLGYGRGYYDRFL AGLDIPSIVP
YP_006640390.1 124 8.01e-23 PRRGTEALGL ADLVICPALAVDRRGMRLGRGAGWYDRAL EYKRPDAPAL
YP_328474.1 113 1.38e-22 IEGEEIEAQQ IAAALIPAIVFDQNKFRLGYGGGYYDRFL SKYPYIWTIG
XP_002677773.1 126 3.34e-22 NLLDSLNGDS KLLVIVPGLAFDYSNRRLGRGKGHYDTFF EKLDDLQGRI
YP_002049390.1 107 3.59e-22 IYQSRLHPNQ VGLLLAPALAFDNRGIRLGYGGGYYDRLR SDPLWRGIIA
YP_006776038.1 115 4.43e-22 PKDDCPVNNN LDVILVPTVAISPTGVRLGYGHGFYDKFL AKNKTATISL
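
The aligned SITES column is the raw material for a position-specific probability matrix. The sketch below computes plain per-column letter frequencies from the first five sites in the table. It is an illustration only: MEME's own matrix is estimated from all 35 sites with a Dirichlet prior, so its values will differ. The function name column_frequencies is hypothetical.

```python
from collections import Counter

# First five sites from the table above (all of width 29).
SITES = [
    "FDLIIVPGIAFDKRGYRIGYGKGYYDKFL",  # YP_002250946.1
    "IDLILVPGVAFDRQGGRIGHGKGYYDKLL",  # ZP_23724314.1
    "LDLVVVPGLAFDEEGYRLGHGKGYYDRFL",  # YP_004203297.1
    "MDLLVVPGIAFDRKGYRLGYGKGYYDKFL",  # YP_006862846.1
    "IDLVLLPGLAFDRKGGRIGYGKGYFDRFL",  # ZP_21930915.1
]


def column_frequencies(sites):
    """Return one {letter: frequency} dict per alignment column."""
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all sites must have the same width")
    matrix = []
    for column in zip(*sites):
        counts = Counter(column)
        matrix.append({aa: n / len(sites) for aa, n in counts.items()})
    return matrix


pspm = column_frequencies(SITES)
for position, freqs in enumerate(pspm[:3], start=1):
    print(position, sorted(freqs.items()))
```

Fully conserved columns, such as the D at position 2 and the P at position 7, come out with frequency 1.0 in this subset, while position 1 is split between F, I, L and M.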

Motif 1 block diagrams


Name              Lowest p-value  Motifs
YP_002250946.1    2.58e-29        1
ZP_23724314.1     3.87e-29        1
YP_004203297.1    7.16e-28        1
YP_006862846.1    9.98e-28        1
ZP_21930915.1     2.60e-27        1
NP_001253007.1    4.78e-27        1
ZP_23953076.1     8.62e-27        1
YP_003968700.1    1.15e-26        1
XP_001742699.1    1.32e-26        1
ZP_21044980.1     2.01e-26        1
NP_001186687.1    2.31e-26        1
NP_390369.1       3.03e-26        1
YP_002729868.1    5.14e-26        1
YP_003504642.1    6.66e-26        1
ZP_19272575.1     8.59e-26        1
YP_006544936.1    8.59e-26        1
YP_001939367.1    9.74e-26        1
YP_001998586.1    9.74e-26        1
ZP_09717097.1     1.10e-25        1
XP_002898071.1    2.03e-25        1
ZP_07685973.1     2.29e-25        1
YP_002248514.1    2.29e-25        1
XP_002771748.1    2.90e-25        1
ZP_08554557.1     3.67e-25        1
YP_007249493.1    1.25e-24        1
YP_004169895.1    1.90e-24        1
YP_005097506.1    1.11e-23        1
YP_004628550.1    2.50e-23        1
ZP_23957523.1     3.23e-23        1
YP_875063.1       4.92e-23        1
YP_006640390.1    8.01e-23        1
YP_328474.1       1.38e-22        1
XP_002677773.1    3.34e-22        1
YP_002049390.1    3.59e-22        1
YP_006776038.1    4.43e-22        1

SCALE: sequence positions 1 to 225 (block diagram graphics not shown)

Motif 1 in BLOCKS format


Motif 1 position-specific scoring matrix


Motif 1 position-specific probability matrix


Motif 1 regular-expression

[IL]D[LV][IV][LIV]VP[GA][VL][AG]FDRxGYR[LI]G[YR]G[KG]G[YF]YD[RK][FL]L
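
The regular expression is a compact summary of the motif, not a filter that every reported site passes. The sketch below translates it into a Python pattern and tests two strings. It assumes that the lowercase x stands for "any residue", which the report itself does not state. The sequence named ideal is a hypothetical string constructed to satisfy the pattern; observed is the site listed for YP_002250946.1.

```python
import re

# Regular expression exactly as printed in the report.
MEME_REGEX = "[IL]D[LV][IV][LIV]VP[GA][VL][AG]FDRxGYR[LI]G[YR]G[KG]G[YF]YD[RK][FL]L"


def meme_regex_to_python(pattern):
    """Translate 'x' (assumed to mean any residue) into the wildcard '.'."""
    return re.compile(pattern.replace("x", "."))


motif = meme_regex_to_python(MEME_REGEX)

# Hypothetical sequence built to satisfy every position of the pattern.
ideal = "IDLIVVPGVAFDRQGYRLGYGKGYYDRFL"
# Site reported for YP_002250946.1 in the sites table.
observed = "FDLIIVPGIAFDKRGYRIGYGKGYYDKFL"

ideal_matches = motif.fullmatch(ideal) is not None
observed_matches = motif.fullmatch(observed) is not None
print(ideal_matches, observed_matches)
```

This prints True False: the best-scoring site starts with F, which the first bracket [IL] does not allow, so the expression should be read as a consensus rather than used for exact matching.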

Time 6.52 secs.

SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name              Combined p-value  Motifs
NP_001253007.1    8.37e-25          1
NP_001186687.1    2.72e-24          1
YP_002049390.1    5.46e-20          1
XP_002898071.1    3.70e-23          1
XP_002771748.1    5.78e-23          1
XP_001742699.1    2.24e-24          1
NP_011110.1       3.00e-05          1
XP_002677773.1    6.59e-20          1
NP_390369.1       4.81e-24          1
ZP_23953076.1     1.40e-24          1
YP_006776038.1    7.04e-20          1
YP_006862846.1    1.45e-25          1
YP_875063.1       7.13e-21          1
YP_006544936.1    1.50e-23          1
ZP_21930915.1     4.21e-25          1
YP_007249493.1    1.97e-22          1
YP_002248514.1    3.55e-23          1
YP_004628550.1    4.12e-21          1
YP_002250946.1    3.95e-27          1
ZP_21044980.1     3.44e-24          1
ZP_09717097.1     1.93e-23          1
YP_003968700.1    1.87e-24          1
YP_006640390.1    1.46e-20          1
YP_002729868.1    7.92e-24          1
ZP_23724314.1     6.84e-27          1
ZP_19272575.1     1.27e-23          1
YP_005097506.1    1.68e-21          1
ZP_23957523.1     5.46e-21          1
YP_003504642.1    8.59e-24          1
YP_001998586.1    1.64e-23          1
ZP_08554557.1     6.01e-23          1
YP_004203297.1    1.09e-25          1
YP_001939367.1    1.65e-23          1
ZP_07685973.1     3.90e-23          1
YP_004169895.1    2.89e-22          1
YP_328474.1       2.07e-20          1

SCALE: sequence positions 1 to 225 (block diagram graphics not shown)
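
With a single motif, the combined p-values listed here appear to be consistent with correcting each site p-value for the number of positions at which a motif of width 29 could start in the sequence. This is an observation from the numbers in this report, not a formula stated in it. The sketch below works it through for YP_002250946.1 (length 181, site p-value 2.58e-29); the function name combined_p_value is hypothetical.

```python
import math


def combined_p_value(site_p, seq_length, motif_width):
    """P-value of the best site among all possible start positions (assumed model)."""
    positions = seq_length - motif_width + 1
    # 1 - (1 - p)**positions, written with log1p/expm1 so that very
    # small p-values are not lost to floating-point rounding.
    return -math.expm1(positions * math.log1p(-site_p))


# YP_002250946.1: length 181, site p-value 2.58e-29, motif width 29.
estimate = combined_p_value(2.58e-29, 181, 29)
print(f"{estimate:.2e}")
```

This prints 3.95e-27, the value listed for YP_002250946.1. Because the site p-values in the report are rounded to three significant digits, other sequences may differ from the listed value in the last digit.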

Motif summary in machine readable format.
Stopped because nmotifs = 1 reached.



CPU: kodomo.fbb.msu.ru


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: