Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= species_genes.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
1 1.0000 277 2 1.0000 100
3 1.0000 100 4 1.0000 100
5 1.0000 100 6 1.0000 100
7 1.0000 100

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme species_genes.fasta -dna -nmotifs 10 -minw 6 -o meme_out_species 
model: mod= zoops nmotifs= 10 evt= inf
object function= E-value of product of p-values
width: minw= 6 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 7 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 877 N= 7
strands: +
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.268 C 0.182 G 0.209 T 0.341
Background letter frequencies (from dataset with add-one prior applied):
A 0.268 C 0.183 G 0.209 T 0.341

P N
MOTIF 1 width = 10 sites = 4 llr = 47 E-value = 1.1e+001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
17.4 (bits)
Relative Entropy
17.1 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
1 48 9.85e-07 GGCTATTCTT CTCAACTAAA CGAAATTTTC
6 88 2.43e-06 TACATTTAGT CTAAACTAAA CA
5 90 3.92e-06 TTTTAGATGT CTAAACGAAA
7 88 5.36e-06 CTAGCGCTTG ATCAACTAAA TT

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
1 9.85e-07

+1
6 2.43e-06

+1
5 3.92e-06

+1
7 5.36e-06

+1
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

[CA]T[AC]AAC[TG]AAA

Time 0.39 secs.

P N
MOTIF 2 width = 7 sites = 5 llr = 43 E-value = 3.4e+001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
11.6 (bits)
Relative Entropy
12.3 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
5 5 2.79e-05 TATGT CAGGTGC TGTGTATAGA
2 77 2.79e-05 ATTAGGAATA CAGGTGC CATTTGTGGC
1 240 7.35e-05 GTGAGGATTC CAGGTTC TGTCTTGGTT
6 33 1.44e-04 GACTTTTCGG CAGTTGC TACAGATAAT
3 89 1.83e-04 GGAAAAGGTT CACGTTC AGTA

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
5 2.79e-05

+2
2 2.79e-05

+2
1 7.35e-05

+2
6 1.44e-04

+2
3 1.83e-04

+2
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

CA[GC][GT]T[GT]C

Time 0.71 secs.

P N
MOTIF 3 width = 14 sites = 7 llr = 72 E-value = 9.0e+001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.3 (bits)
Relative Entropy
14.9 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
6 11 7.55e-08 TTATGTCCGT GCTAAAAATGGCGA CTTTTCGGCA
4 85 2.12e-06 CTAATATTCT TCTCAACTTTGCGA A
3 32 3.72e-06 CACATCTTGT GCAACACTTTGTGA TTGTCGTGGT
5 73 9.08e-06 AATTGACCCC GCTCCAATTTTAGA TGTCTAAACG
1 108 9.08e-06 TGTAAGTGTT GCTGAAATTTCTGT AGGTTGTCTA
7 71 2.61e-05 GATCATTGAA GATGAAACTAGCGC TTGATCAACT
2 46 3.14e-05 ATTAGGACTT GTTAAAAATGGTAA GTTACTAATT

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
6 7.55e-08

+3
4 2.12e-06

+3
3 3.72e-06

+3
5 9.08e-06

+3
1 9.08e-06

+3
7 2.61e-05

+3
2 3.14e-05

+3
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

GCT[ACG][AC]A[AC][TA]T[TG]G[CT]GA

Time 1.00 secs.

P N
MOTIF 4 width = 6 sites = 5 llr = 39 E-value = 1.5e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
10.6 (bits)
Relative Entropy
11.1 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
7 16 1.71e-04 AAACTTTGGC TGACAC AACTCATGAT
4 33 1.71e-04 TTTATATTTT TGACAC ACGACCTTGT
1 200 1.71e-04 TATGTCATAG TGACAC CCTCTTTCTG
5 66 2.88e-04 TTATGCGAAT TGACCC CGCTCCAATT
2 21 5.07e-04 AAGATAAAGA TGTCAC TGATGTTGTA

Motif 4 block diagrams


Name
Lowest
p-value
Motifs
7 1.71e-04

+4
4 1.71e-04

+4
1 1.71e-04

+4
5 2.88e-04

+4
2 5.07e-04

+4
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 4 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 4 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 4 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 4 regular-expression

TG[AT]C[AC]C

Time 1.22 secs.

P N
MOTIF 5 width = 6 sites = 7 llr = 44 E-value = 7.5e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
9.4 (bits)
Relative Entropy
9.1 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
5 49 3.68e-04 TGAACTTTAC AAAGAT TTTATGCGAA
3 57 3.68e-04 TGTCGTGGTA AAAGAT TACAACAGTA
2 10 3.68e-04 TGTTTCTTTG AAAGAT AAAGATGTCA
6 52 6.19e-04 CAGATAATGG AACGAT GTCCGAAAAA
7 32 9.05e-04 AACTCATGAT GAAGAT GAGGTCGAGA
4 62 1.23e-03 GTTGATGTCA AAACAT TTAAACCCTA
1 21 2.07e-03 TCTTTGAAGG GACCAT GCACGTCTGT

Motif 5 block diagrams


Name
Lowest
p-value
Motifs
5 3.68e-04

+5
3 3.68e-04

+5
2 3.68e-04

+5
6 6.19e-04

+5
7 9.05e-04

+5
4 1.23e-03

+5
1 2.07e-03

+5
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 5 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 5 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 5 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 5 regular-expression

[AG]A[AC][GC]AT

Time 1.40 secs.

P N
MOTIF 6 width = 6 sites = 2 llr = 18 E-value = 7.9e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
12.0 (bits)
Relative Entropy
13.1 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
4 16 1.17e-04 CATTAATGGT GACCTC TTTTATATTT
1 160 1.17e-04 AGTGGCCTTT GACCTC TTTGTTGGTG

Motif 6 block diagrams


Name
Lowest
p-value
Motifs
4 1.17e-04

+6
1 1.17e-04

+6
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 6 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 6 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 6 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 6 regular-expression

GACCTC

Time 1.54 secs.

P N
MOTIF 7 width = 8 sites = 2 llr = 22 E-value = 1.5e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
14.0 (bits)
Relative Entropy
15.7 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
1 1 3.49e-06 C GGCCCGTC CTTCTTTGAA
5 20 1.96e-05 GCTGTGTATA GACCAGTC TATCGAGTTT

Motif 7 block diagrams


Name
Lowest
p-value
Motifs
1 3.49e-06

+7
5 1.96e-05

+7
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 7 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 7 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 7 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 7 regular-expression

G[AG]CC[AC]GTC

Time 1.65 secs.

P N
MOTIF 8 width = 6 sites = 4 llr = 31 E-value = 1.9e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
10.4 (bits)
Relative Entropy
11.2 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
3 0 1.94e-04 GTGGCT GTTGTAGTTG
2 89 1.94e-04 GGTGCCATTT GTGGCT ACTAA
6 70 4.03e-04 CCGAAAAAGA GCGGCT TTTACATTTA
1 151 4.03e-04 TAAGTGCTGA GTGGCC TTTGACCTCT

Motif 8 block diagrams


Name
Lowest
p-value
Motifs
3 1.94e-04

+8
2 1.94e-04

+8
6 4.03e-04

+8
1 4.03e-04

+8
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 8 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 8 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 8 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 8 regular-expression

G[TC]GGC[TC]

Time 1.76 secs.

P N
MOTIF 9 width = 6 sites = 2 llr = 16 E-value = 4.6e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
11.0 (bits)
Relative Entropy
11.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
3 21 1.50e-04 AGTTGTTTAT CCACAT CTTGTGCAAC
4 3 3.41e-04 TAA CCTCAT TAATGGTGAC

Motif 9 block diagrams


Name
Lowest
p-value
Motifs
3 1.50e-04

+9
4 3.41e-04

+9
SCALE
| | | | |
1 25 50 75

Motif 9 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 9 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 9 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 9 regular-expression

CC[AT]CAT

Time 1.90 secs.

P N
MOTIF 10 width = 6 sites = 2 llr = 16 E-value = 5.4e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
11.0 (bits)
Relative Entropy
11.7 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
3 65 1.35e-04 TAAAAGATTA CAACAG TACGAAGTGG
1 219 3.06e-04 CTTTCTGTTC CATCAG TTAGAGTGAG

Motif 10 block diagrams


Name
Lowest
p-value
Motifs
3 1.35e-04

+10
1 3.06e-04

+10
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 10 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 10 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 10 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 10 regular-expression

CA[AT]CAG

Time 2.02 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
1 1.63e-08

+7
+1
+3
+2
2 9.09e-04

+3
+2
3 1.67e-05

+3
4 1.36e-04

+3
5 1.16e-08

+2
+7
+3
+1
6 3.90e-07

+3
+1
7 7.22e-04

+3
+1
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 10 reached..



CPU: kodomo


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: