Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= meme_new/meme.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
purT_VC1228 1.0000 99 purA_VC2602 1.0000 99
folD_VC1942 1.0000 99 purK_VC0051 1.0000 99
guaA_VC0768 1.0000 99 purD_VC0275 1.0000 99
purR_VC1721 1.0000 99 purM_VC2226 1.0000 99
purH_VC0276 1.0000 99 purC_VC1190 1.0000 99

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme meme_new/meme.fasta -mod zoops -nmotifs 3 -prior dirichlet -revcomp -nostatus -dna -oc meme_new/ 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 8 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 10 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 990 N= 10
strands: + -
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.285 C 0.215 G 0.215 T 0.285
Background letter frequencies (from dataset with add-one prior applied):
A 0.285 C 0.215 G 0.215 T 0.285

P N
MOTIF 1 width = 9 sites = 6 llr = 67 E-value = 8.2e-001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
16.0 (bits)
Relative Entropy
16.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purC_VC1190 + 0 4.02e-06 CGTTTGCTT AAATTTATCC
guaA_VC0768 + 0 4.02e-06 CGTTTGCTT AAATGCTTGA
folD_VC1942 - 89 4.02e-06 A CGTTTGCTT TGTTCTGTTT
purT_VC1228 + 70 4.02e-06 CAAAAGAAAA CGTTTGCTT CCCGCTGTAG
purM_VC2226 - 20 1.01e-05 AAAAGGAAAA CGTTTGCGT AAGATTTTTA
purK_VC0051 + 41 2.57e-05 ACACAGATTG AGTCTGCTT AGCGCGGAAA

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
purC_VC1190 4.02e-06

+1
guaA_VC0768 4.02e-06

+1
folD_VC1942 4.02e-06

-1
purT_VC1228 4.02e-06

+1
purM_VC2226 1.01e-05

-1
purK_VC0051 2.57e-05

+1
SCALE
| | | | |
1 25 50 75

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

CGTTTGCTT

Time 0.64 secs.

P N
MOTIF 2 width = 15 sites = 6 llr = 81 E-value = 6.7e+001

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
18.5 (bits)
Relative Entropy
19.4 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purD_VC0275 + 81 2.39e-08 GACGCATACC GGTGAAGATCATCGC CAT
purH_VC0276 + 53 1.03e-07 GTGCGGTGTT GGGGAAGCTCGCTGC AAAGCCTCAC
purK_VC0051 - 53 1.38e-07 GATGAGTAAA GTGGAAGCTTTCCGC GCTAAGCAGA
purM_VC2226 - 84 2.09e-07 GGGGAAGTTCCTCGG TCTAGTTGGA
folD_VC1942 + 3 8.60e-07 AAC GGGCTAGATCAACGC CTTTGTCTAA
guaA_VC0768 + 51 2.23e-06 ACTCGCCTCC GTTGAAGAATCCCCC GTATTAATTC

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
purD_VC0275 2.39e-08

+2
purH_VC0276 1.03e-07

+2
purK_VC0051 1.38e-07

-2
purM_VC2226 2.09e-07

-2
folD_VC1942 8.60e-07

+2
guaA_VC0768 2.23e-06

+2
SCALE
| | | | |
1 25 50 75

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

G[GT][GT]GAAG[AC]T[CT][AC][CT]CGC

Time 1.15 secs.

P N
MOTIF 3 width = 8 sites = 10 llr = 76 E-value = 1.0e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
11.1 (bits)
Relative Entropy
11.0 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME STRAND START P-VALUE SITES
purC_VC1190 + 85 3.28e-05 TGCTTAATAA AAAACAGC TCACAG
purM_VC2226 - 62 3.28e-05 GTTGGATATG AAAACGGC GCTATTCTAC
purA_VC2602 + 87 3.28e-05 AAATTCATAT AAAACGGC TTTG
purT_VC1228 + 29 3.28e-05 ACATTAGCAA AAAACAGC GGCCATGATA
guaA_VC0768 + 85 1.15e-04 CTCGGCTGAT AAGACTGC TAAACA
folD_VC1942 + 78 1.73e-04 CAACTGCCGA AAAACAGA ACAAAGCAAA
purH_VC0276 - 45 3.56e-04 GAGCTTCCCC AACACCGC ACAGCAGATC
purK_VC0051 + 30 3.56e-04 TGGGTTATCC AACACAGA TTGAGTCTGC
purR_VC1721 - 76 4.51e-04 GTTCCTGTCG ATGACTGC TGCCAGCGAA
purD_VC0275 + 60 5.99e-04 ATGCCAATTA ATGACGGA AGTGACGCAT

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
purC_VC1190 3.28e-05

+3
purM_VC2226 3.28e-05

-3
purA_VC2602 3.28e-05

+3
purT_VC1228 3.28e-05

+3
guaA_VC0768 1.15e-04

+3
folD_VC1942 1.73e-04

+3
purH_VC0276 3.56e-04

-3
purK_VC0051 3.56e-04

+3
purR_VC1721 4.51e-04

-3
purD_VC0275 5.99e-04

+3
SCALE
| | | | |
1 25 50 75

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

A[AT][AGC]AC[AGT]G[CA]

Time 1.59 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
purT_VC1228 3.87e-05

+3
+1
purA_VC2602 7.91e-03

+3
folD_VC1942 7.05e-07

+2
-1
purK_VC0051 1.36e-06

+1
-2
guaA_VC0768 1.16e-06

+1
+2
purD_VC0275 1.71e-05

+2
purM_VC2226 1.01e-07

-1
-3
-2
purH_VC0276 6.94e-05

+2
purC_VC1190 2.43e-04

+1
+1
+3
SCALE
| | | | |
1 25 50 75

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 3 reached..



CPU: kodomo.fbb.msu.ru


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: