Command line Training Set First Motif Summary of Motifs Termination Explanation


Search sequence databases for the best combined matches with these motifs using MAST.
Search sequence databases for all matches with these motifs using FIMO.
Find Genome Ontology terms associated with upstream sequences matching these motifs using GOMO.
Submit these motifs to BLOCKS multiple alignment processor.


MEME - Motif discovery tool

MEME version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)

For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.sdsc.edu.

This file may be used as input to the MAST algorithm for searching sequence databases for matches to groups of motifs. MAST is available for interactive use and downloading at http://meme.sdsc.edu.


REFERENCE

If you use this program in your research, please cite:

Timothy L. Bailey and Charles Elkan, "Fitting a mixture model by expectation maximization to discover motifs in biopolymers", Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28-36, AAAI Press, Menlo Park, California, 1994.



TRAINING SET

DATAFILE= upstream.fasta
ALPHABET= ACGT
Sequence name Weight Length Sequence name Weight Length
orf1ab_upstream 1.0000 277 S_protein_upstream 1.0000 100
NS3_protein_upstream 1.0000 100 NS4A_protein_upstream 1.0000 100
NS4B_protein_upstream 1.0000 100 NS5_protein_upstream 1.0000 100
envelope_protein_upstream 1.0000 100 membrane_protein_upstream 1.0000 100
nucleoprotein_upstream 1.0000 100 ORF8b_protein 1.0000 100

COMMAND LINE SUMMARY

This information can also be useful in the event you wish to report a
problem with the MEME software.

command: meme upstream.fasta -oc result -dna -mod zoops -nmotifs 3 -minsites 2 -maxsites 600 -minw 6 -maxw 50 
model: mod= zoops nmotifs= 3 evt= inf
object function= E-value of product of p-values
width: minw= 6 maxw= 50 minic= 0.00
width: wg= 11 ws= 1 endgaps= yes
nsites: minsites= 2 maxsites= 10 wnsites= 0.8
theta: prob= 1 spmap= uni spfuzz= 0.5
em: prior= dirichlet b= 0.01 maxiter= 50
distance= 1e-05
data: n= 1177 N= 10
strands: +
sample: seed= 0 seqfrac= 1
Letter frequencies in dataset:
A 0.259 C 0.223 G 0.199 T 0.319
Background letter frequencies (from dataset with add-one prior applied):
A 0.259 C 0.224 G 0.199 T 0.318

P N
MOTIF 1 width = 10 sites = 10 llr = 103 E-value = 4.4e-010

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
15.3 (bits)
Relative Entropy
14.9 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
membrane_protein_upstream 82 1.44e-06 ACGAGTGGGT TTAACGAACT CCTTCATA
NS5_protein_upstream 90 1.44e-06 ATCCAGGATT TTAACGAACT
NS4A_protein_upstream 88 1.44e-06 ACTCAGTTAA TTAACGAACT CT
NS3_protein_upstream 86 1.44e-06 TGTTCACTAA TTAACGAACT ATTA
S_protein_upstream 46 1.44e-06 GAGAGTCAAA TTAACGAACT CGTAATATCT
orf1ab_upstream 59 1.44e-06 AACTTTGATT TTAACGAACT TAAATAAAAG
nucleoprotein_upstream 76 1.52e-05 TTAATTGATT TTAACGAATC TCAATTTCAT
envelope_protein_upstream 90 4.14e-05 GGACATATGG AAAACGAACT
ORF8b_protein 43 1.10e-04 TACACTGGGC TTACCCAACA CGGGAAAGTC
NS4B_protein_upstream 71 1.17e-04 AGGACGCAGC TCAGCGAATC GCTTGGTTGC

Motif 1 block diagrams


Name
Lowest
p-value
Motifs
membrane_protein_upstream 1.44e-06

+1
NS5_protein_upstream 1.44e-06

+1
NS4A_protein_upstream 1.44e-06

+1
NS3_protein_upstream 1.44e-06

+1
S_protein_upstream 1.44e-06

+1
orf1ab_upstream 1.44e-06

+1
nucleoprotein_upstream 1.52e-05

+1
envelope_protein_upstream 4.14e-05

+1
ORF8b_protein 1.10e-04

+1
NS4B_protein_upstream 1.17e-04

+1
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif 1 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 1 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 1 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 1 regular-expression

TTAACGAA[CT][TC]

Time 0.67 secs.

P N
MOTIF 2 width = 8 sites = 4 llr = 41 E-value = 1.3e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
13.7 (bits)
Relative Entropy
14.7 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NS3_protein_upstream 55 8.21e-06 AGGAATACGA CCTCGAGC CGCATAAGGT
envelope_protein_upstream 2 1.40e-05 CG CCCCGAGC TCGCTTATCG
ORF8b_protein 0 2.07e-05 CCACGAGC TGCACCAAAT
nucleoprotein_upstream 35 3.23e-05 ACTTGCATTG CTTCGAGC TTAGGCTCTT

Motif 2 block diagrams


Name
Lowest
p-value
Motifs
NS3_protein_upstream 8.21e-06

+2
envelope_protein_upstream 1.40e-05

+2
ORF8b_protein 2.07e-05

+2
nucleoprotein_upstream 3.23e-05

+2
SCALE
| | | | |
1 25 50 75

Motif 2 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 2 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 2 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 2 regular-expression

C[CT][TAC]CGAGC

Time 1.23 secs.

P N
MOTIF 3 width = 24 sites = 4 llr = 81 E-value = 1.4e+002

SEQUENCE LOGO PNG LOGOS require CONVERT from ImageMagick; see MEME installation guide
Information Content
29.8 (bits)
Relative Entropy
29.3 (bits)
Download LOGO
Without SSC:[EPS][PNG]
With SSC:[EPS][PNG]
NAME START P-VALUE SITES
NS4B_protein_upstream 7 1.36e-10 CAGCTAT CCTTTGCTGGTTATACTGAATCTG CTGTTAATTC
NS4A_protein_upstream 24 1.93e-10 TCAACTTCAA CTCATGATGGTCCTACCGAACATG TTACTAGTGT
nucleoprotein_upstream 6 5.17e-10 GTCCGC CTATTACGGCGGATATTGAACTTG CATTGCTTCG
NS5_protein_upstream 32 8.29e-10 ATACGGTCTT CCAATCAGGGTAATAAACAAATTG TTCATTCTTA

Motif 3 block diagrams


Name
Lowest
p-value
Motifs
NS4B_protein_upstream 1.36e-10

+3
NS4A_protein_upstream 1.93e-10

+3
nucleoprotein_upstream 5.17e-10

+3
NS5_protein_upstream 8.29e-10

+3
SCALE
| | | | |
1 25 50 75

Motif 3 in BLOCKS format


to BLOCKS multiple alignment processor.
Motif 3 position-specific scoring matrix


Scan sequence databases for the best match in each sequence using MAST.
Motif 3 position-specific probability matrix


Scan sequence databases for all matches with this motif using FIMO.
Compare to known motifs in motif databases using Tomtom.
Find Genome Ontology terms associated with upstream regions matching this motif using GOMO.
Motif 3 regular-expression

C[CT][ACT][AT]T[GAC][AC][GT]G[GC][TG][ACGTA][AC]TA[CAT][TAC][GC]AA[CAT][TAC]TG

Time 1.74 secs.

P N
SUMMARY OF MOTIFS


Combined block diagrams: non-overlapping sites with p-value < 0.0001


Name
Combined
p-value

Motifs
orf1ab_upstream 3.90e-03

+1
S_protein_upstream 5.27e-03

+1
NS3_protein_upstream 3.11e-06

+2
+1
NS4A_protein_upstream 6.16e-10

+3
+1
NS4B_protein_upstream 3.11e-08

+3
NS5_protein_upstream 2.38e-09

+3
+1
envelope_protein_upstream 5.69e-05

+2
+1
membrane_protein_upstream 5.46e-04

+1
nucleoprotein_upstream 7.66e-11

+3
+2
+1
ORF8b_protein 9.11e-05

+2
SCALE
| | | | | | | | | | | |
1 25 50 75 100 125 150 175 200 225 250 275

Motif summary in machine readable format.
Stopped because Stopped because nmotifs = 3 reached..



CPU: kodomo


EXPLANATION OF MEME RESULTS


The MEME results consist of:

MOTIFS

For each motif that it discovers in the training set, MEME prints the following information: