



MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= mememe/meme.fasta
ALPHABET= ACDEFGHIKLMNPQRSTVWY
Sequence name            Weight Length  Sequence name            Weight Length
NP_005129.2              1.0000    266  XP_001884427.1           1.0000    215
XP_004243684.1           1.0000    213  XP_003860425.1           1.0000    231
XP_002364979.1           1.0000    184  XP_004359854.1           1.0000    328
XP_004031837.1           1.0000    257  XP_002788261.1           1.0000    285
XP_002903102.1           1.0000    228  XP_002286932.1           1.0000    187
XP_001742585.1           1.0000    274  XP_002673616.1           1.0000    605
XP_004348211.1           1.0000    330  XP_004345816.1           1.0000    311
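
As a quick consistency check, the lengths listed in the training set can be summed and compared with the dataset size that MEME reports in its command line summary (data: n= 3914 N= 14). The following is a minimal Python sketch; the names TRAINING_SET and dataset_size are illustrative and are not part of MEME.

```python
# Sequence names and lengths copied from the TRAINING SET table.
TRAINING_SET = {
    "NP_005129.2": 266,
    "XP_001884427.1": 215,
    "XP_004243684.1": 213,
    "XP_003860425.1": 231,
    "XP_002364979.1": 184,
    "XP_004359854.1": 328,
    "XP_004031837.1": 257,
    "XP_002788261.1": 285,
    "XP_002903102.1": 228,
    "XP_002286932.1": 187,
    "XP_001742585.1": 274,
    "XP_002673616.1": 605,
    "XP_004348211.1": 330,
    "XP_004345816.1": 311,
}


def dataset_size(lengths):
    """Return (number of sequences, total number of residues)."""
    return len(lengths), sum(lengths.values())


num_seqs, total_residues = dataset_size(TRAINING_SET)
print(f"N= {num_seqs}  n= {total_residues}")
```

If the two numbers disagree with the "data:" line of the report, the FASTA file has probably changed since the run.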

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme mememe/meme.fasta -mod zoops -nmotifs 3 -prior dirichlet -nostatus -protein -oc mememe/ 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 8 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 14 wnsites= 0.8
theta: prob= 1 spmap= pam spfuzz= 120
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 3914 N= 14
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.066 C 0.015 D 0.064 E 0.062 F 0.051 G 0.068 H 0.021 I 0.054 K 0.068 L 0.082 M 0.019 N 0.035 P 0.054 Q 0.046 R 0.054 S 0.070 T 0.060 V 0.060 W 0.008 Y 0.042
Background letter frequencies (from dataset with add-one prior applied):
A 0.066 C 0.015 D 0.064 E 0.062 F 0.051 G 0.068 H 0.021 I 0.054 K 0.068 L 0.081 M 0.019 N 0.035 P 0.054 Q 0.047 R 0.054 S 0.070 T 0.059 V 0.059 W 0.008 Y 0.042
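
The phrase "add-one prior" suggests that one pseudocount per letter is added before the counts are normalized. The sketch below shows that kind of smoothing in Python. It is an assumption about the general technique, not a copy of MEME's code, and the raw residue counts are not printed in this report, so the counts used here are made up for illustration.

```python
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"  # same alphabet as the ALPHABET= line


def add_one_background(counts):
    """Add-one (Laplace) smoothing: (count + 1) / (total + alphabet size)."""
    total = sum(counts.get(letter, 0) for letter in ALPHABET) + len(ALPHABET)
    return {letter: (counts.get(letter, 0) + 1) / total for letter in ALPHABET}


# Hypothetical counts; letters that are absent are treated as zero.
toy_counts = {"A": 8, "L": 10, "W": 0}
background = add_one_background(toy_counts)
print(round(background["L"], 3), round(background["W"], 3))
```

With smoothing of this kind no letter gets a background frequency of zero, and for a dataset of several thousand residues the values move only in the third decimal place, which is consistent with the small differences between the two frequency lines in this report.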

MOTIF 1 width = 36 sites = 13 llr = 959 E-value = 3.9e-191

Information content: 107.3 bits
Relative entropy: 106.5 bits
NAME START P-VALUE SITES
XP_001742585.1 122 5.93e-36 WTLTDMHGEK RHNTDFLGQWHLLYFGFTFCPDVCPEELDKMAEVIN HLDAQSKLPK
XP_001884427.1 58 5.93e-36 FSLTTDTGKP FTDEDLLGKWSLVYFGFTNCPDICPAELDKVTSVLD SIGTYPLASI
XP_004348211.1 141 1.48e-35 FTLVDTNGKR WTEEDLKGRWTLIYFGFTFCPDVCPDELDKMTEIVN TIDNTPDIGP
XP_004359854.1 154 3.56e-35 FSLIDENGKA VSDLDFRGKYMFLYFGFTYCPDACPAELDKMTIVLN NLEKHNLLDS
XP_004345816.1 146 4.11e-35 FTLVDQDGHV VTNHTFRGRYMLVYFGFTFCPDICPAELAKVTKTLK ILEEEEGITP
XP_002903102.1 80 4.11e-35 TLVDCDTRRA VTDASFRGKYSLLYFGFTHCPDICPNELVRIGDVLD TLEAENCPEV
XP_002364979.1 28 8.22e-35 WTLVDMHGRV RGSEEFEGAYQLLYFGFTFCPDICPQELEKMAQVID IIDKEFGEVV
NP_005129.2 113 2.36e-34 FHLLDHRGRA RCKADFRGQWVLMYFGFTHCPDICPDELEKLVQVVR QLEAEPGLPP
XP_002286932.1 34 5.65e-34 WSLVDLDGNL VTNKSFEGKWTLLYFGFARCPDICPSEMVKVGKVMD TLKKEHPELA
XP_002788261.1 119 8.09e-34 TLVDCRNGKP VASEQLRGKYYLIYFGFTFCPDICPQELEKAGKTVD IIEKEFGAGT
XP_002673616.1 451 4.81e-33 FTLVNTKGEV VTDSEFRGKFMFMYFGFTNCPDVCPTEMKKMTKALQ KIEKENPELA
XP_004031837.1 84 2.41e-30 WQLYNTEGKQ FGSDDLKGYYYIIYFGFCKCPDICPNALQKISQSIR KVQDTPEGRL
XP_003860425.1 47 1.24e-29 LRESRTGKYI TSDELFQNKWTLLYFGFSKCSEICPNTLRFISEVMK ACDAEYGSDS
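
Each row of the sites table holds the sequence name, the start position, the p-value, and the site itself between its two 10-residue flanks. A minimal Python sketch for reading such a row is given below. The Site type and parse_site_row function are hypothetical helpers, and the sketch assumes that both flanks are present; a site at the very start or end of a sequence may yield fewer fields.

```python
from typing import NamedTuple


class Site(NamedTuple):
    name: str
    start: int          # 1-based start position of the site
    p_value: float
    left_flank: str
    site: str
    right_flank: str

    @property
    def end(self):
        """1-based position of the last residue of the site."""
        return self.start + len(self.site) - 1


def parse_site_row(line):
    """Split one whitespace-separated row of the sites table into a Site."""
    fields = line.split()
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}: {line!r}")
    name, start, p_value, left, site, right = fields
    return Site(name, int(start), float(p_value), left, site, right)


row = ("XP_001742585.1 122 5.93e-36 WTLTDMHGEK "
       "RHNTDFLGQWHLLYFGFTFCPDVCPEELDKMAEVIN HLDAQSKLPK")
parsed = parse_site_row(row)
print(parsed.name, parsed.start, parsed.end, len(parsed.site))
```

The length of the parsed site should equal the width given in the MOTIF line (36 for motif 1), which is a convenient check that a row was not truncated.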

Motif 1 block diagrams

Name             Lowest p-value  Motifs
XP_001742585.1   5.93e-36        1
XP_001884427.1   5.93e-36        1
XP_004348211.1   1.48e-35        1
XP_004359854.1   3.56e-35        1
XP_004345816.1   4.11e-35        1
XP_002903102.1   4.11e-35        1
XP_002364979.1   8.22e-35        1
NP_005129.2      2.36e-34        1
XP_002286932.1   5.65e-34        1
XP_002788261.1   8.09e-34        1
XP_002673616.1   4.81e-33        1
XP_004031837.1   2.41e-30        1
XP_003860425.1   1.24e-29        1

Motif 1 regular-expression

[VR]T[DNS]ED[FL]RGK[WY][MT]L[LI]YFGFTFCPD[IV]CP[AN]EL[DE]K[MIV][TG][KEQ]V[LIV][DN]
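
The regular expression appears to summarize the most common residues in each column, so an individual site does not have to match it exactly. The Python sketch below splits the expression into one set of allowed residues per position and counts how many positions of a site agree with it. The helper names tokenize and count_agreements are illustrative, and the site used is the motif 1 site of XP_004348211.1 from the sites table.

```python
import re

MOTIF1_REGEX = ("[VR]T[DNS]ED[FL]RGK[WY][MT]L[LI]YFGFTFCPD[IV]CP"
                "[AN]EL[DE]K[MIV][TG][KEQ]V[LIV][DN]")


def tokenize(pattern):
    """Return one set of allowed residues per motif position."""
    tokens = re.findall(r"\[[A-Z]+\]|[A-Z]", pattern)
    return [set(token.strip("[]")) for token in tokens]


def count_agreements(pattern, site):
    """Count positions where the site residue is allowed by the pattern."""
    positions = tokenize(pattern)
    if len(positions) != len(site):
        raise ValueError("site length does not match motif width")
    return sum(residue in allowed for residue, allowed in zip(site, positions))


site = "WTEEDLKGRWTLIYFGFTFCPDVCPDELDKMTEIVN"  # XP_004348211.1, start 141
width = len(tokenize(MOTIF1_REGEX))
exact = re.fullmatch(MOTIF1_REGEX, site) is not None
agreeing = count_agreements(MOTIF1_REGEX, site)
print(f"width={width} exact_match={exact} agreeing_positions={agreeing}")
```

For database searches the position-specific matrices used by MAST and FIMO are the better choice; a plain regular-expression search of this kind will miss sites that deviate in even one column.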

Time 2.63 secs.

MOTIF 2 width = 29 sites = 13 llr = 738 E-value = 2.6e-129

Information content: 84.0 bits
Relative entropy: 81.9 bits
NAME START P-VALUE SITES
XP_001742585.1 227 9.73e-34 YSRPQVDGDE DYLVDHSIIQYLMDPEGHFVAYYGQNMTA EQMLESVQDH
XP_002788261.1 224 1.28e-33 YNQGIRTDSE DYLVDHSIIHYLMGPNGKFIDFYGKNMTA EEIAGKIGKE
XP_002364979.1 132 1.13e-30 YNEGIKSSDA DYLVDHSIIQYFMGKNGKFKDFFGKNMTV NEIAERIAKH
XP_004348211.1 250 1.41e-28 SAGMPENPAD DYLVDHTIIQYFMNPEGKFATYYGQTTTA QDAAKRLIQS
XP_002673616.1 559 7.90e-28 NAPDYKEGSQ DYLVDHSIFIYLMDPYGHLSEYFAQNTTA DKIYESVSTA
XP_001884427.1 164 4.37e-27 STPPNADPNG DYLVDHSIFVYLMDPHGKFVEAFGQSVGE EVVKTKINEA
XP_004359854.1 257 4.12e-26 VFISKAGKGD SYLVDHTIIEYLIGPDGKFIEFYGSNLNA DQVTEKILER
XP_002903102.1 186 1.08e-25 KADENEDDDD DYLVDHSIVMYLVGPDGEFLDFFTQAARV DDIAAKIKTY
XP_004345816.1 254 5.25e-25 STSQHSEEDE DYLVDHSIFLYLMDKEGSFLSHHGSQYDA HALAQRIATD
XP_002286932.1 141 1.29e-24 MSKADETEDG DYLVDHSIVIYFHDETGDIADCFTQSMRP SDVVDKVVER
NP_005129.2 218 1.53e-24 YNAGPKDEDQ DYIVDHSIAIYLLNPDGLFTDYYGRSRSA EQISDSVRRH
XP_003860425.1 176 4.84e-23 AGVAAPRIDD TYQFDHSSAIYLVGPDGKMKDFFFKEMGL NDTVSRIGVH
XP_004031837.1 207 5.24e-22 GKDKLGQLQY NYTIDHTVISYLMDDEGQYLIHLGPNLNE NQLSRIIIDK

Motif 2 block diagrams

Name             Lowest p-value  Motifs
XP_001742585.1   9.73e-34        2
XP_002788261.1   1.28e-33        2
XP_002364979.1   1.13e-30        2
XP_004348211.1   1.41e-28        2
XP_002673616.1   7.90e-28        2
XP_001884427.1   4.37e-27        2
XP_004359854.1   4.12e-26        2
XP_002903102.1   1.08e-25        2
XP_004345816.1   5.25e-25        2
XP_002286932.1   1.29e-24        2
NP_005129.2      1.53e-24        2
XP_003860425.1   4.84e-23        2
XP_004031837.1   5.24e-22        2

Motif 2 regular-expression

DYLVDH[ST]I[IF][IQ]Y[LF]M[DG]P[DE]GKFL[DE][FY][FY]G[QK][NS]MTA

Time 4.70 secs.

MOTIF 3 width = 28 sites = 12 llr = 687 E-value = 6.9e-121

Information content: 85.5 bits
Relative entropy: 82.5 bits
NAME START P-VALUE SITES
XP_002673616.1 522 1.02e-29 SCTAVIEYLQ DYHPRFVGLTGTPDQISRICKKYRVYYN APDYKEGSQD
XP_001742585.1 191 1.02e-29 TLPKIQAYVE QFHPRLLGLTGTHEQIKHICKKFRVYYS RPQVDGDEDY
XP_004345816.1 217 2.19e-27 TVGKIRSYLK DFHPSFVGLTGTPQQVESMARSFRVYSS TSQHSEEDED
XP_002788261.1 188 2.50e-27 TCAQTSLYLS EFDPRTIGLTGTHEQIKDITRKFRVYYN QGIRTDSEDY
XP_002364979.1 96 5.41e-27 TVAQVKSYCE EFHPRLIGFTGTPAQIKDVTRKFRVYYN EGIKSSDADY
XP_004348211.1 213 6.93e-27 KIMGEYLAAN AFHPRIVGLTGTTEEVHQVARAYRVYFS AGMPENPADD
XP_002903102.1 148 8.82e-27 TIAQMQAYKA DFHPKFKMLTGTRDQVADITKAYRVYFS KADENEDDDD
XP_002286932.1 105 2.75e-26 SLKALRDYAK DFHPSYVFLTGAPQQVQAMAKKYRVYMS KADETEDGDY
NP_005129.2 182 1.94e-25 DVEAMARYVQ DFHPRLLGLTGSTKQVAQASHSYRVYYN AGPKDEDQDY
XP_004359854.1 223 5.25e-25 TVEQIKQYIH EFHPKFVGLTGTPEQITKLAKGYRVFIS KAGKGDSYLV
XP_001884427.1 127 8.90e-25 SQSRISRYLQ DFHPSFTGLFGSYDATKAVCKAYRVYFS TPPNADPNGD
XP_003860425.1 124 2.04e-24 KPDVVEQFVC KYDPRVRGLCGTREEVEAAARAWRVYYS SVDETDEERD

Motif 3 block diagrams

Name             Lowest p-value  Motifs
XP_002673616.1   1.02e-29        3
XP_001742585.1   1.02e-29        3
XP_004345816.1   2.19e-27        3
XP_002788261.1   2.50e-27        3
XP_002364979.1   5.41e-27        3
XP_004348211.1   6.93e-27        3
XP_002903102.1   8.82e-27        3
XP_002286932.1   2.75e-26        3
NP_005129.2      1.94e-25        3
XP_004359854.1   5.25e-25        3
XP_001884427.1   8.90e-25        3
XP_003860425.1   2.04e-24        3

Motif 3 regular-expression

[DE]FHP[RS][FL]VGLTGTP[ED]Q[VI]K[AD][IV][ACT][KR][KA][YF]RVY[YF][SN]

Time 6.36 secs.

SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001

Name             Combined p-value  Motifs
NP_005129.2      1.39e-71          1 3 2
XP_001884427.1   2.43e-75          1 3 2
XP_003860425.1   1.24e-64          1 3 2
XP_002364979.1   3.44e-80          1 3 2
XP_004359854.1   3.22e-73          1 3 2
XP_004031837.1   3.98e-45          1 3 2
XP_002788261.1   8.36e-82          1 3 2
XP_002903102.1   5.04e-75          1 3 2
XP_002286932.1   1.20e-72          1 3 2
XP_001742585.1   1.86e-86          1 3 2
XP_002673616.1   1.27e-76          1 3 2
XP_004348211.1   6.98e-78          1 3 2
XP_004345816.1   1.71e-74          1 3 2
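
The command line summary names the objective function as "E-value of product of p-values". As background, the p-value of a product of n independent, uniformly distributed p-values has a closed form: P = p * sum over i from 0 to n-1 of (-ln p)^i / i!, where p is the product. The Python sketch below implements this textbook formula. It is not a reconstruction of how the combined p-values in this report were derived, which may involve further adjustments not shown here, and the function name product_p_value is an assumption.

```python
import math


def product_p_value(p_values):
    """p-value of observing a product of independent p-values this small."""
    n = len(p_values)
    if n == 0:
        raise ValueError("at least one p-value is required")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    # Work with logarithms so that very small products do not underflow early.
    log_product = sum(math.log(p) for p in p_values)
    x = -log_product
    term = 1.0   # (x ** i) / i!, starting at i = 0
    total = 1.0
    for i in range(1, n):
        term *= x / i
        total += term
    return math.exp(log_product) * total


print(product_p_value([0.5, 0.5]))
```

The correction factor matters: two p-values of 0.5 multiply to 0.25, but a product that small arises by chance more often than a quarter of the time, so the p-value of the product is larger than the product itself.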

Stopped because nmotifs = 3 reached.



CPU: kodomo.fbb.msu.ru


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: