Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= upstream.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
NC_028833.1_gene_1 1.0000 213 NC_028833.1_gene_2 1.0000 100
NC_028833.1_gene_3 1.0000 100 NC_028833.1_gene_4 1.0000 100
NC_028833.1_gene_5 1.0000 100 NC_028833.1_gene_6 1.0000 100

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme upstream.fasta -dna -nmotifs 10 -minw 6 -o meme_out_10 
model: mod= zoops nmotifs= 10 evt= inf
object function= E-value of product of p-values
width: minw= 6 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 6 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 713 N= 6
strands: +
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.254 C 0.206 G 0.219 T 0.321
Background letter frequencies (from dataset with add-one prior applied):
A 0.254 C 0.206 G 0.219 T 0.321

P N
MOTIF 1 width = 44 sites = 6 llr = 146 E-value = 3.9e+000

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
35.5 (bits)
Relative Entropy
35.2 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_3 31 3.35e-14 CTGCATCGTG TGCCGGCTGTTGTGACTGCAGAGGTAAGAGGCTTCAACGTTACG AAGTGGAGAA
NC_028833.1_gene_2 50 3.04e-13 GTTGTTTTTA CAGTGACTGTTGTTCACTGTCAAAATGCTTGCAACCAAGTTTTG CCGGGT
NC_028833.1_gene_1 141 2.18e-12 GTGATACTCC CCTCTTCTGTTCTGTCAGTCGAGGTGTGTGGATACCAGGTTCCG TCTTGGTTGA
NC_028833.1_gene_6 19 1.23e-11 GATCCAAGAA TGGCGACTTTTCGGCATTAGCTACAGATAACGAGACAATGACCG AAAGAGAGCG
NC_028833.1_gene_5 42 2.52e-11 ATGCTTATGC TATTTACAGGGATTTCATGAGAATAGACCCCCATCCAGTTTTAG ATGTCTAAAC
NC_028833.1_gene_4 6 4.28e-11 TATTAA TGGCGACTTTTTGTACATCTTTGCTAGCGAGCCTGCTATAAGTG TCACATTCAT

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_3 3.35e-14

+1
NC_028833.1_gene_2 3.04e-13

+1
NC_028833.1_gene_1 2.18e-12

+1
NC_028833.1_gene_6 1.23e-11

+1
NC_028833.1_gene_5 2.52e-11

+1
NC_028833.1_gene_4 4.28e-11

+1
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

[TC][GA][GT][CT][GT]ACT[GT]TT[CG][TG][GT][ACT][CA][AT][TG][CG][AT][GC][AT][AG][CG][AT][GA][AG][CG][AT][AG][GC]C[AT][TA]CCA[AG][GT]T[TA][CT][CT]G

Time 0.30 secs.

P N
MOTIF 2 width = 11 sites = 5 llr = 50 E-value = 5.2e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
14.7 (bits)
Relative Entropy
14.6 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_4 51 1.46e-06 CTATAAGTGT CACATTCATCA ATGATGTAAA
NC_028833.1_gene_3 89 2.64e-06 GGAGAAGATC CACATTCAGTA
NC_028833.1_gene_5 28 1.74e-05 CAAGCCAGTG CACAATGCTTA TGCTATTTAC
NC_028833.1_gene_1 25 1.90e-05 TGTGTTATTG CACACACATGA TTGTGTTTGT
NC_028833.1_gene_2 27 3.51e-05 CCATGTATAA AACATTCCTTT TGGTTGTTTT

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_4 1.46e-06

+2
NC_028833.1_gene_3 2.64e-06

+2
NC_028833.1_gene_5 1.74e-05

+2
NC_028833.1_gene_1 1.90e-05

+2
NC_028833.1_gene_2 3.51e-05

+2
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

[CA]ACA[TAC][TA][CG][AC][TG][TCG][AT]

Time 0.44 secs.

P N
MOTIF 3 width = 11 sites = 3 llr = 38 E-value = 4.2e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
18.6 (bits)
Relative Entropy
18.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_5 89 4.99e-07 GTTTTAGATG TCTAAACGAAA
NC_028833.1_gene_6 87 1.49e-06 TTGCATTTAG TCTAAACCGAA AC
NC_028833.1_gene_4 85 2.02e-06 GATCTATTCT TCTCAACTAAA CACT

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_5 4.99e-07

+3
NC_028833.1_gene_6 1.49e-06

+3
NC_028833.1_gene_4 2.02e-06

+3
SCALE
| | | | |
1 25 50 75

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

TCT[AC]AAC[CGT][AG]AA

Time 0.56 secs.

P N
MOTIF 4 width = 15 sites = 2 llr = 38 E-value = 9.4e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
27.0 (bits)
Relative Entropy
27.4 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_5 13 1.60e-09 GACAAGCGCT GTCTACAAGCCAGTG CACAATGCTT
NC_028833.1_gene_1 115 4.80e-09 GGTTAGTTCC GTCTACCAGTAAGTG TGTGATACTC

Motif 4 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_5 1.60e-09

+4
NC_028833.1_gene_1 4.80e-09

+4
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 4 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 4 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 4 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 4 regular-expression

GTCTAC[AC]AG[CT][AC]AGTG

Time 0.66 secs.

P N
MOTIF 5 width = 6 sites = 2 llr = 17 E-value = 9.3e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
11.0 (bits)
Relative Entropy
12.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_2 4 9.42e-05 CAAC CACCCC ACTTAAGCCA
NC_028833.1_gene_1 95 2.41e-04 TGGGTTCATA CACCTC TTCTGGTTAG

Motif 5 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_2 9.42e-05

+5
NC_028833.1_gene_1 2.41e-04

+5
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 5 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 5 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 5 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 5 regular-expression

CACC[CT]C

Time 0.74 secs.

P N
MOTIF 6 width = 8 sites = 2 llr = 21 E-value = 1.6e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.0 (bits)
Relative Entropy
15.4 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_1 203 1.01e-05 GAAACCTATA ACTGCAAC TC
NC_028833.1_gene_3 20 2.28e-05 TAGCTGTATC ACTGCATC GTGTGCCGGC

Motif 6 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_1 1.01e-05

+6
NC_028833.1_gene_3 2.28e-05

+6
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 6 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 6 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 6 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 6 regular-expression

ACTGCA[AT]C

Time 0.82 secs.

P N
MOTIF 7 width = 8 sites = 2 llr = 21 E-value = 5.6e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.0 (bits)
Relative Entropy
15.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_3 81 1.32e-05 TACGAAGTGG AGAAGATC CACATTCAGT
NC_028833.1_gene_4 71 2.99e-05 AATGATGTAA AGTAGATC TATTCTTCTC

Motif 7 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_3 1.32e-05

+7
NC_028833.1_gene_4 2.99e-05

+7
SCALE
| | | | |
1 25 50 75

Motif 7 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 7 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 7 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 7 regular-expression

AG[AT]AGATC

Time 0.89 secs.

P N
MOTIF 8 width = 13 sites = 2 llr = 30 E-value = 8.4e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
22.0 (bits)
Relative Entropy
21.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_1 186 4.57e-08 CAGGTTCCGT CTTGGTTGAAACC TATAACTGCA
NC_028833.1_gene_6 1 3.24e-07 T CTTTGTTAGATCC AAGAATGGCG

Motif 8 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_1 4.57e-08

+8
NC_028833.1_gene_6 3.24e-07

+8
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 8 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 8 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 8 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 8 regular-expression

CTT[GT]GTT[AG][AG]A[AT]CC

Time 0.97 secs.

P N
MOTIF 9 width = 9 sites = 2 llr = 22 E-value = 1.6e+004

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
16.0 (bits)
Relative Entropy
15.6 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_1 79 4.23e-06 AAGTTGTTAA GTGTAGTGG GTTCATACAC
NC_028833.1_gene_3 4 2.07e-05 GTGG GTGTTGTAG CTGTATCACT

Motif 9 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_1 4.23e-06

+9
NC_028833.1_gene_3 2.07e-05

+9
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif 9 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 9 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 9 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 9 regular-expression

GTGT[AT]GT[AG]G

Time 1.04 secs.

P N
MOTIF 10 width = 6 sites = 2 llr = 15 E-value = 8.6e+003

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
10.0 (bits)
Relative Entropy
11.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NC_028833.1_gene_6 68 2.34e-04 GACCGAAAGA GAGCGG CTTTTGCATT
NC_028833.1_gene_5 6 3.65e-04 GATGAC AAGCGC TGTCTACAAG

Motif 10 block diagrams


Name
Lowest
p-value
Motifs
NC_028833.1_gene_6 2.34e-04

+10
NC_028833.1_gene_5 3.65e-04

+10
SCALE
| | | | |
1 25 50 75

Motif 10 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 10 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 10 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 10 regular-expression

[AG]AGCG[CG]

Time 1.09 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
NC_028833.1_gene_1 1.09e-19

+2
+9
+4
+1
+8
+6
NC_028833.1_gene_2 9.32e-10

+5
+2
+1
NC_028833.1_gene_3 3.59e-15

+9
+6
+1
+7
+2
NC_028833.1_gene_4 2.49e-11

+1
+2
+7
+3
NC_028833.1_gene_5 2.48e-16

+4
+2
+1
+3
NC_028833.1_gene_6 3.84e-12

+8
+1
+3
SCALE
| | | | | | | | |
1 25 50 75 100 125 150 175 200

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 10 reached..



CPU: kodomo


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: