Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= upstream.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
NA39_gp1 1.0000 302 NA39_gp2 1.0000 100
NA39_gp3 1.0000 100 NA39_gp4 1.0000 100
NA39_gp5 1.0000 100 NA39_gp6 1.0000 100
NA39_gp7 1.0000 100 NA39_gp8 1.0000 100
NA39_gp9 1.0000 100

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme upstream.fasta -oc result -dna -mod zoops -nmotifs 5 -minw 6 -maxw 50 -minsites 2 
model: mod= zoops nmotifs= 5 evt= inf
object function= E-value of product of p-values
width: minw= 6 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 9 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 1102 N= 9
strands: +
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.286 C 0.214 G 0.196 T 0.304
Background letter frequencies (from dataset with add-one prior applied):
A 0.286 C 0.214 G 0.196 T 0.304

P N
MOTIF 1 width = 10 sites = 9 llr = 89 E-value = 1.0e-006

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
14.3 (bits)
Relative Entropy
14.2 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NA39_gp6 46 1.12e-06 GTAGTGTAAT CTAAACGAAC GTTTAGTTAT
NA39_gp5 63 1.12e-06 TGAAAGGAGT CTAAACGAAC TTACGTTTGG
NA39_gp2 89 1.12e-06 AATTGTTAAT CTAAACGAAC C
NA39_gp1 67 1.12e-06 GTTAGACTCT CTAAACGAAC TTTAAAACTG
NA39_gp9 72 8.26e-06 TAAAATTAAA ATAAACGAAC TATTCTCTCA
NA39_gp4 61 1.41e-05 TTTTGACGAG CCAGACGAAC CTATTAAATA
NA39_gp8 89 4.14e-05 TAACGTGCAA AACAACGAAC T
NA39_gp7 58 1.21e-04 ATCTGAAGAT CAAGAAGATC TGCTAGTTGT
NA39_gp3 32 1.34e-04 AATCGTAGCC CGCAAAGAAG GGCTTTACGA

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
NA39_gp6 1.12e-06

+1
NA39_gp5 1.12e-06

+1
NA39_gp2 1.12e-06

+1
NA39_gp1 1.12e-06

+1
NA39_gp9 8.26e-06

+1
NA39_gp4 1.41e-05

+1
NA39_gp8 4.14e-05

+1
NA39_gp7 1.21e-04

+1
NA39_gp3 1.34e-04

+1
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

[CA][TA][AC][AG]A[CA]GAAC

Time 0.62 secs.

P N
MOTIF 2 width = 11 sites = 6 llr = 61 E-value = 1.6e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.1 (bits)
Relative Entropy
14.7 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NA39_gp9 35 4.67e-06 TATTGTTTGT GTTTAATCAGA TTAATTTTAC
NA39_gp8 43 5.64e-06 CTGGCATTTA GTGTCATCAGA ACATGGAATA
NA39_gp1 14 7.29e-06 GCTTCGGCTT GTTGCATAGGA CCGGAAAGGT
NA39_gp6 18 8.61e-06 ACTACAACCT GTGGAATCTGA GCTCCTAGTA
NA39_gp7 73 1.24e-05 AGATCTGCTA GTTGTATCGTA AGAGATCACG
NA39_gp3 61 1.85e-05 AGTGTGTGAT ATTTTATCGGA CTTACGCCGT

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
NA39_gp9 4.67e-06

+2
NA39_gp8 5.64e-06

+2
NA39_gp1 7.29e-06

+2
NA39_gp6 8.61e-06

+2
NA39_gp7 1.24e-05

+2
NA39_gp3 1.85e-05

+2
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

GT[TG][GT][ACT]ATC[GA]GA

Time 1.12 secs.

P N
MOTIF 3 width = 13 sites = 4 llr = 51 E-value = 4.8e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
17.6 (bits)
Relative Entropy
18.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NA39_gp6 79 2.00e-07 ACTTTTATTC GTCCCGATAGCTC TTGTAGCC
NA39_gp1 204 2.36e-07 CTACGGTTTC GTCCGTGTCGCTC ACAATTATCC
NA39_gp5 13 1.12e-06 TATTTATGAG GTCCCTTGCGCTA CTATTGATGT
NA39_gp7 31 2.44e-06 AAACCACCGA GTGCCAAATGCTC GCGCATCTGA

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
NA39_gp6 2.00e-07

+3
NA39_gp1 2.36e-07

+3
NA39_gp5 1.12e-06

+3
NA39_gp7 2.44e-06

+3
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

GT[CG]C[CG][TAG][AGT][TAG][CAT]GCT[CA]

Time 1.53 secs.

P N
MOTIF 4 width = 6 sites = 6 llr = 43 E-value = 5.3e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
10.3 (bits)
Relative Entropy
10.3 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NA39_gp5 55 2.57e-04 AGGTCACTTG AAAGGA GTCTAAACGA
NA39_gp2 19 2.57e-04 TCTCGTTGAA AAAGGA AGACTCCTTA
NA39_gp4 16 5.30e-04 CAGTTGCTTT AAAGGT CTATGTGCTT
NA39_gp3 83 5.30e-04 TTACGCCGTT AAAGGT TACGGAACAA
NA39_gp1 29 5.30e-04 ATAGGACCGG AAAGGT ACTATCTACC
NA39_gp8 63 7.06e-04 AACATGGAAT AGAGGA TTTACCTGTA

Motif 4 block diagrams


Name
Lowest
p-value
Motifs
NA39_gp5 2.57e-04

+4
NA39_gp2 2.57e-04

+4
NA39_gp4 5.30e-04

+4
NA39_gp3 5.30e-04

+4
NA39_gp1 5.30e-04

+4
NA39_gp8 7.06e-04

+4
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif 4 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 4 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 4 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 4 regular-expression

AAAGG[AT]

Time 1.89 secs.

P N
MOTIF 5 width = 13 sites = 4 llr = 51 E-value = 9.2e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
18.1 (bits)
Relative Entropy
18.5 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NA39_gp2 0 2.01e-07 GGTTTTGGGTCTC GTTGAAAAAG
NA39_gp4 38 3.68e-07 GCTTGTAAGC GCTGTTGCGACTC TTTTGACGAG
NA39_gp1 174 1.56e-06 TCCTTGTCTA GCTTTTGGTTGGC AGTGAGCCTA
NA39_gp5 82 1.66e-06 CTTACGTTTG GCTTTTGTCACAC GTTAA

Motif 5 block diagrams


Name
Lowest
p-value
Motifs
NA39_gp2 2.01e-07

+5
NA39_gp4 3.68e-07

+5
NA39_gp1 1.56e-06

+5
NA39_gp5 1.66e-06

+5
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif 5 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 5 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 5 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 5 regular-expression

G[CG]T[TG]TTG[GCT][GCT][AT][CG][TAG]C

Time 2.20 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
NA39_gp1 1.84e-10

+2
+1
+5
+3
NA39_gp2 3.00e-07

+5
+1
NA39_gp3 7.85e-04

+2
NA39_gp4 4.18e-06

+5
+1
NA39_gp5 1.27e-09

+3
+1
+5
NA39_gp6 9.82e-09

+2
+1
+3
NA39_gp7 1.25e-05

+3
+2
NA39_gp8 2.38e-04

+2
+1
NA39_gp9 1.93e-04

+2
+1
SCALE
| | | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275 300

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 5 reached..



CPU: kodomo


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: