Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= motiv_ecoli_2/meme.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
guaB 1.0000 100 guaA 1.0000 100
purB 1.0000 100 purR 1.0000 100
purL 1.0000 100 purF 1.0000 100
purN 1.0000 100 purA 1.0000 100
purT 1.0000 100 purU 1.0000 100
purK 1.0000 100 purE 1.0000 100
folD 1.0000 100 purD 1.0000 100
purM 1.0000 100 purH 1.0000 100
purC 1.0000 100

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme motiv_ecoli_2/meme.fasta -mod zoops -nmotifs 3 -prior dirichlet -revcomp -nostatus -dna -oc motiv_ecoli_2/ 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 8 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 17 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 1700 N= 17
strands: + -
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.265 C 0.235 G 0.235 T 0.265
Background letter frequencies (from dataset with add-one prior applied):
A 0.265 C 0.235 G 0.235 T 0.265

P N
MOTIF 1 width = 17 sites = 8 llr = 122 E-value = 4.8e-005

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
22.1 (bits)
Relative Entropy
22.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purT + 47 2.55e-10 ATAAAGACAC ACGCAAACGTTTTCGTT TATACTGCGC
purM + 21 4.31e-09 TAAAGCAGTC TCGCAAACGTTTGCTTT CCCTGTTAGA
purL + 8 6.85e-09 TTATTTCC ACGCAAACGGTTTCGTC AGCGCATCAG
purE + 13 9.89e-09 TTTCACAGCC ACGCAACCGTTTTCCTT GCTCTCTTTC
purR + 40 1.19e-08 AGGTGTGTAA AGGCAAACGTTTACCTT GCGATTTTGC
guaB + 31 1.35e-07 AAAGGGGTAG ATGCAATCGGTTACGCT CTGTATAATG
purA + 78 1.88e-07 ATCCATTTTT AAGCAAACGGTGATTTT GAAAA
purN - 62 1.86e-06 CACGCGTTGT TCGGAATCAGAGGCTTT GATGATACCG

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
purT 2.55e-10

+1
purM 4.31e-09

+1
purL 6.85e-09

+1
purE 9.89e-09

+1
purR 1.19e-08

+1
guaB 1.35e-07

+1
purA 1.88e-07

+1
purN 1.86e-06

-1
SCALE
| | | | |
1 25 50 75

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

[AT]CGCAA[AT]CG[GT]T[TG][ATG]C[GTC]TT

Time 1.54 secs.

P N
MOTIF 2 width = 19 sites = 16 llr = 157 E-value = 1.2e+001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
14.5 (bits)
Relative Entropy
14.2 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purL + 25 1.37e-09 CGGTTTCGTC AGCGCATCAGATTCTTTAT AATGACGCCC
purB + 18 9.56e-08 GCGGCGGACG TCTGCAACTGATGTTTTCT CGTAATCGCC
purR + 6 2.12e-06 CGGCGT ACCGCAACACTTTTGTTGT GCGTAAGGTG
purE + 74 4.94e-06 AGCCGAGAGT TGTGCACCACAGGAGTTTT AAGACGC
guaB - 3 4.94e-06 TCTACCCCTT TTTGCAAAAAATGCTTGCT ATC
purC + 53 6.64e-06 GGCACACCAG ACAGCAAAAGATTTTAAAA CGTTAATTCA
purK + 22 6.64e-06 ATGATAAAGA ACTGCACCAGCGTCTGAAT GACTGGCGCA
purU - 26 7.31e-06 ACCATTATTG GCCGCAGCACTTTTTAAAT TTTTTACCTG
folD + 55 9.66e-06 TGACAAAATA GGCGCATCCCCTTCGATCT ACGTAACAGA
purF + 2 1.16e-05 TT TTATCATCAGATGTTTTTT TGATTATCTG
purA + 28 1.50e-05 GTCATTTTTG AGTGCAAAAAGTGCTGTAA CTCTGAAAAA
purM + 48 2.67e-05 CCCTGTTAGA ATTGCGCCGAATTTTATTT TTCTACCGCA
guaA - 62 3.12e-05 TTCCGAGGCA AGTGAAACAGATAATATAA ATCGCCCGAC
purH + 27 4.85e-05 TGCCCCGTTA GGGGCGTTAGCTGAGTTTT TCGCGAAAAA
purT - 20 8.33e-05 GTGTGTCTTT ATTGCTGATGTTGATTTCT CAACCGAAAA
purD - 12 1.36e-04 CTCGTCGGCG GCGGCAATCACTTCGTCAT CACGGATAGA

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
purL 1.37e-09

+2
purB 9.56e-08

+2
purR 2.12e-06

+2
purE 4.94e-06

+2
guaB 4.94e-06

-2
purC 6.64e-06

+2
purK 6.64e-06

+2
purU 7.31e-06

-2
folD 9.66e-06

+2
purF 1.16e-05

+2
purA 1.50e-05

+2
purM 2.67e-05

+2
guaA 3.12e-05

-2
purH 4.85e-05

+2
purT 8.33e-05

-2
purD 1.36e-04

-2
SCALE
| | | | |
1 25 50 75

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

[AGT][CGT][TC]GCA[AT][CA]A[GAC][AC]T[TG][CTA][TG][TA]T[ACT]T

Time 2.81 secs.

P N
MOTIF 3 width = 10 sites = 3 llr = 39 E-value = 1.1e+004

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
18.2 (bits)
Relative Entropy
18.7 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purK + 53 6.53e-07 CTGGCGCAAA GCCCAGACCG ACGAAGTGCT
purF + 51 1.96e-06 CAAGTTTCTT GCCCAGAGCG TAAGTGCTCT
guaA - 16 1.96e-06 AGAATCAGGA GCCCAGACGG TAGTTCGGGG

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
purK 6.53e-07

+3
purF 1.96e-06

+3
guaA 1.96e-06

-3
SCALE
| | | | |
1 25 50 75

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

GCCCAGA[CG][CG]G

Time 3.70 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
guaB 3.00e-07

-2
+1
guaA 8.32e-05

-3
-2
purB 3.23e-04

+2
purR 7.67e-08

+2
+1
purL 5.16e-11

+1
+2
purF 4.71e-05

+2
+3
purN 6.45e-04

-1
purA 1.04e-05

+2
+1
purT 1.44e-07

-2
+1
purU 1.69e-02

-2
purK 1.61e-05

+2
+3
purE 1.64e-07

+1
+2
folD 2.25e-02

+2
purM 6.16e-07

+1
+2
purH 2.78e-02

+2
purC 9.71e-03

+2
SCALE
| | | | |
1 25 50 75

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 3 reached..



CPU: kodomo.fbb.msu.ru


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: