Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= pasted_sequences
ALPHABET= ACDEFGHIKLMNPQRSTVWY
Sequence name Weight Length Sequence name Weight Length
IDH_BACSU 1.0000 423 IDH_STAAR 1.0000 422
IDH_ECOLI 1.0000 416 IDH_COXBU 1.0000 427
IDH_HELPY 1.0000 425 IDH1_COLMA 1.0000 415
IDH_STRMU 1.0000 393 IDH_ANASP 1.0000 473
IDH_SYNY3 1.0000 475 IDH_AQUAE 1.0000 426

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme sequences -sf pasted_sequences -protein -mod zoops -nmotifs 3 -minw 6 -maxw 50 -time 7200 -maxsize 60000 -oc . -nostatus 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 6 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 10 wnsites= 0.8
theta: prob= 1 spmap= pam spfuzz= 120
em: prior= megap b= 21475 maxiter= 50
distance= 1e-05
data: n= 4295 N= 10
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.089 C 0.007 D 0.054 E 0.080 F 0.031 G 0.089 H 0.013 I 0.088 K 0.077 L 0.073 M 0.024 N 0.042 P 0.050 Q 0.031 R 0.035 S 0.043 T 0.053 V 0.071 W 0.016 Y 0.034
Background letter frequencies (from dataset with add-one prior applied):
A 0.089 C 0.008 D 0.054 E 0.080 F 0.031 G 0.089 H 0.013 I 0.088 K 0.077 L 0.073 M 0.024 N 0.042 P 0.050 Q 0.032 R 0.035 S 0.043 T 0.053 V 0.070 W 0.016 Y 0.034

P N
MOTIF 1 width = 13 sites = 2 llr = 73 E-value = 1.2e+001

SEQUENCE LOGO PNG LOGOS 
require CONVERT from ImageMagick; see MEME installation guide
Information Content
47.2 (bits)
Relative Entropy
52.8 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
IDH_SYNY3 173 9.57e-17 GDHGLLKKRP WAPPVFIWGSWFY EASGRKEAVL
IDH_HELPY 223 1.92e-15 GKLAVNKMDG YCSSYMIWLMIFY LFANHTQQVI

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
IDH_SYNY3 9.57e-17

1
IDH_HELPY 1.92e-15

1
SCALE
| | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Motif 1 regular-expression

[WY][AC][PS][PS][VY][FM]IW[GL][MS][IW]FY

Time 10.35 secs.

P N
MOTIF 2 width = 16 sites = 2 llr = 88 E-value = 2.0e+001

SEQUENCE LOGO PNG LOGOS 
require CONVERT from ImageMagick; see MEME installation guide
Information Content
58.2 (bits)
Relative Entropy
63.2 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
IDH_ECOLI 315 2.67e-20 ETPVNPEAKK HNYFPCAPMAQLDYFM EGPAIYVNRE
IDH1_COLMA 246 3.53e-18 GGAGRQTSNE AFYFKTALLRGLDCHS ANSSVADIKK

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
IDH_ECOLI 2.67e-20

2
IDH1_COLMA 3.53e-18

2
SCALE
| | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Motif 2 regular-expression

[AH][FN]YF[KP][CT]A[LP][LM][AR][GQ]LD[CY][FH][MS]

Time 20.95 secs.

P N
MOTIF 3 width = 7 sites = 2 llr = 42 E-value = 5.3e+001

SEQUENCE LOGO PNG LOGOS 
require CONVERT from ImageMagick; see MEME installation guide
Information Content
28.3 (bits)
Relative Entropy
30.3 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
IDH_ANASP 174 3.27e-10 VQGQGKCTAT WLRKGPM KAGIYGVFPI
IDH_ECOLI 0 8.21e-10 WLRTSPM EMEELARIEQ

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
IDH_ANASP 3.27e-10

3
IDH_ECOLI 8.21e-10

3
SCALE
| | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Motif 3 regular-expression

WLR[KT][GS]PM

Time 31.28 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
IDH_ECOLI 2.74e-21

3
2
IDH_COXBU 1.25e-01

2
IDH_HELPY 2.96e-10

1
IDH1_COLMA 2.54e-13

2
IDH_ANASP 1.10e-05

3
IDH_SYNY3 8.85e-13

1
SCALE
| | | | | | | | | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 3 reached..



CPU: compute-0-0.local


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: