MAST - Motif Alignment and Search Tool
MAST version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE out_meme_out/meme.fasta (nucleotide)
Last updated on Tue May 12 19:01:53 2015
Database contains 50 sequences, 1750 residues
Scores for positive and reverse complement strands are combined.
MOTIFS out_meme_out/meme.txt (nucleotide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 14 ATTTTGTACGAAAA
Random model letter frequencies (from non-redundant database):
A 0.274 C 0.225 G 0.225 T 0.274
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 13 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| 31
|
| 0.0046
| 35
|
| 46
|
| 0.0092
| 35
|
| 32
|
| 0.0092
| 35
|
| 5
|
| 0.02
| 35
|
| 50
|
| 0.024
| 35
|
| 26
|
| 0.047
| 35
|
| 8
|
| 0.1
| 35
|
| 44
|
| 0.13
| 35
|
| 30
|
| 5.4
| 35
|
| 21
|
| 5.7
| 35
|
| 2
|
| 5.8
| 35
|
| 28
|
| 6.1
| 35
|
| 22
|
| 7.9
| 35
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.001.
A minus sign indicates that the occurrence is on the
reverse complement strand.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| 31
| 0.0046
|
|
| 46
| 0.0092
|
|
| 32
| 0.0092
|
|
| 5
| 0.02
|
|
| 50
| 0.024
|
|
| 26
| 0.047
|
|
| 8
| 0.1
|
|
| 44
| 0.13
|
|
| 30
| 5.4
|
|
| 21
| 5.7
|
|
| 2
| 5.8
|
|
| 28
| 6.1
|
|
| 22
| 7.9
|
|
SCALE
|
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence (a minus sign indicates that
the occurrence is on the reverse complement strand),
- the position p-value of the occurrence,
- the best possible match to the motif (or its reverse complement), and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
31
LENGTH = 35 COMBINED P-VALUE = 9.16e-05 E-VALUE = 0.0046
DIAGRAM: 4-[+1]-17
[+1]
2.1e-06
ATTTTGTACGAAAA
++++++ ++++ ++
1 CCCTATTTTGGAAGGCAAGAAAAGGTTGTGATGTG
46
LENGTH = 35 COMBINED P-VALUE = 1.84e-04 E-VALUE = 0.0092
DIAGRAM: 5_[+1]_16
[+1]
4.2e-06
ATTTTGTACGAAAA
++ +++++++++++
1 CCCTCATCTTGCACGAAAAAGGGAAAGGGACAGTC
32
LENGTH = 35 COMBINED P-VALUE = 1.84e-04 E-VALUE = 0.0092
DIAGRAM: 19_[+1]_2
[+1]
4.2e-06
ATTTTGTACGAAAA
++++++ + +++++
1 GAATCTGACGGATTGGGACATTTTGGAGGAAAATG
5
LENGTH = 35 COMBINED P-VALUE = 4.07e-04 E-VALUE = 0.02
DIAGRAM: 14_[+1]_7
[+1]
9.3e-06
ATTTTGTACGAAAA
+++++ +++++ ++
1 AAGTGACTTCACCGATATTTTAAGAGGATGCAGCA
50
LENGTH = 35 COMBINED P-VALUE = 4.90e-04 E-VALUE = 0.024
DIAGRAM: 17_[+1]_4
[+1]
1.1e-05
ATTTTGTACGAAAA
+++ +++++++ ++
1 CTTGGCCCCGACCCGGGCTTCTGTAAGGTGACGCC
26
LENGTH = 35 COMBINED P-VALUE = 9.33e-04 E-VALUE = 0.047
DIAGRAM: 14_[+1]_7
[+1]
2.1e-05
ATTTTGTACGAAAA
+++++++++ + ++
1 TATCGGATACGATACTTTTGCACCGGGAGTGGTAC
8
LENGTH = 35 COMBINED P-VALUE = 2.04e-03 E-VALUE = 0.1
DIAGRAM: 15_[+1]_6
[+1]
4.7e-05
ATTTTGTACGAAAA
+++++++ +++ +
1 GCGATCTACACCCGGATATTGCGCGACTATTTCGC
44
LENGTH = 35 COMBINED P-VALUE = 2.68e-03 E-VALUE = 0.13
DIAGRAM: 5_[+1]_16
[+1]
6.1e-05
ATTTTGTACGAAAA
+++ +++ +++++
1 CTTGGCTTCTGTTCGGAGTTCAGGAGCACGACGAG
30
LENGTH = 35 COMBINED P-VALUE = 1.07e-01 E-VALUE = 5.4
DIAGRAM: 35
21
LENGTH = 35 COMBINED P-VALUE = 1.13e-01 E-VALUE = 5.7
DIAGRAM: 35
2
LENGTH = 35 COMBINED P-VALUE = 1.15e-01 E-VALUE = 5.8
DIAGRAM: 35
28
LENGTH = 35 COMBINED P-VALUE = 1.21e-01 E-VALUE = 6.1
DIAGRAM: 35
22
LENGTH = 35 COMBINED P-VALUE = 1.57e-01 E-VALUE = 7.9
DIAGRAM: 35
Debugging Information
CPU: kodomo.fbb.msu.ru
Time 0.000000 secs.
mast out_meme_out/meme.txt -ev 10.000000 -mt 0.001000
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information