MAST - Motif Alignment and Search Tool
MAST version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE ./seed.fasta (peptide)
Last updated on Fri Apr 26 18:59:28 2013
Database contains 10 sequences, 2092 residues
MOTIFS ./meme.txt (peptide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 50 DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
2 50 GKLPVVNFAAGGIATPADAALMMWLGCDGVFVGSGIFKSQNPEKRARAIV
3 50 QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
PAIRWISE MOTIF CORRELATIONS:
MOTIF 1 2
----- ----- -----
2 0.12
3 0.09 0.16
No overly similar pairs (correlation > 0.60) found.
Random model letter frequencies (from non-redundant database):
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 10 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| PDXS_MYCLE/17-225
|
| 4.2e-97
| 209
|
| PDX1_SCHPO/6-214
|
| 2e-96
| 209
|
| PDX1_CERNC/53-261
|
| 9.4e-95
| 209
|
| PDX1_HEVBR/20-228
|
| 1.3e-94
| 209
|
| PDXS_METTH/4-211
|
| 1.1e-89
| 208
|
| PDXS_BACSU/4-212
|
| 1.3e-89
| 209
|
| SNZ1_YEAST/3-212
|
| 1.9e-83
| 210
|
| PDXS_ARCFU/10-219
|
| 1.3e-81
| 210
|
| PDXS_HAEIN/3-211
|
| 3.3e-81
| 209
|
| PDXS_METJA/3-212
|
| 5.8e-77
| 210
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.0001.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| PDXS_MYCLE/17-225
| 4.2e-97
|
|
| PDX1_SCHPO/6-214
| 2e-96
|
|
| PDX1_CERNC/53-261
| 9.4e-95
|
|
| PDX1_HEVBR/20-228
| 1.3e-94
|
|
| PDXS_METTH/4-211
| 1.1e-89
|
|
| PDXS_BACSU/4-212
| 1.3e-89
|
|
| SNZ1_YEAST/3-212
| 1.9e-83
|
|
| PDXS_ARCFU/10-219
| 1.3e-81
|
|
| PDXS_HAEIN/3-211
| 3.3e-81
|
|
| PDXS_METJA/3-212
| 5.8e-77
|
|
SCALE
|
| |
| |
| |
| |
| |
| |
| |
| |
1 |
25 |
50 |
75 |
100 |
125 |
150 |
175 |
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence,
- the position p-value of the occurrence,
- the best possible match to the motif, and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
PDXS_MYCLE/17-225
LENGTH = 209 COMBINED P-VALUE = 4.21e-98 E-VALUE = 4.2e-97
DIAGRAM: 11-[3]-37-[1]-61
[3]
8.6e-49
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
++++++++++++ ++++++++ ++++++++++++++++++++++++++++
1 TGTARVKRGMAEMLKGGVIMDVVIPEQARIAEGSGAVAVMALERVPSDIRAQGGVSRMSDPDMIESIIAAVTIPV
[1]
1.9e-57
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++++++++++++++++++++++++++++++++++++++++++
76 MAKARIGHFVEAQILQSLGVDYIDESEVLTPADYTHHIDKWKFTVPFVCGATNLGEALRRINEGAAMIRSKGEAG
PDX1_SCHPO/6-214
LENGTH = 209 COMBINED P-VALUE = 2.00e-97 E-VALUE = 2e-96
DIAGRAM: 11-[3]-37-[1]-61
[3]
4.5e-57
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
++++++++++++++++++++++++++++++++++++++++++++++++++
1 KGSTQVKAGLAQMLKGGVIMDVVNAEQARIAEAAGACAVMALERVPADIRAQGGVARMSDPSMIKEIQAAVSIPV
[1]
5.7e-49
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
+++++++++++ +++ +++++++++++ ++++++++++++++++++++++
76 MAKVRIGHFVEAQILESIGVDYIDESEVLTPADDINHIEKSKFTVPFVCGSRNLGEALRRISEGAAMIRTKGEAG
PDX1_CERNC/53-261
LENGTH = 209 COMBINED P-VALUE = 9.37e-96 E-VALUE = 9.4e-95
DIAGRAM: 11-[3]-37-[1]-61
[3]
1.4e-57
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
++++++++++++++++++++++++++++++++++++++++++++++++++
1 QSSFAVKVGLAQMLKGGVIMDVVNAEQARIAEEAGACAVMALERVPADIRKDGGVARMSDPQMIKDIMNAVTIPV
[1]
8.0e-47
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++ +++++++ + +++++++++++++++++++++++++++++
76 MAKSRIGHFVECQILQAIGVDYIDESEVLTPADPVNHIDKSVYNVPFVCGCKNLGEALRRISEGAAMIRTKGEAG
PDX1_HEVBR/20-228
LENGTH = 209 COMBINED P-VALUE = 1.30e-95 E-VALUE = 1.3e-94
DIAGRAM: 11-[3]-37-[1]-61
[3]
3.6e-56
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
+++ ++++++++++++++++++++++++++++++++++++++++++++++
1 KSPFSVKVGLAQMLRGGVIMDVVNPEQARIAEEAGACAVMALERVPADIRAQGGVARMSDPQLIKEIKQSVTIPV
[1]
1.6e-47
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++++++++++++ ++++++++++++++++ +++++++++++
76 MAKARIGHFVEAQILEAIGIDYVDESEVLTPADEENHINKHNFRIPFVCGCRNLGEALRRIREGAAMIRTKGEAG
PDXS_METTH/4-211
LENGTH = 208 COMBINED P-VALUE = 1.08e-90 E-VALUE = 1.1e-89
DIAGRAM: 10-[3]-37-[1]-61
[3]
1.9e-47
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
++ ++++++++++++++ ++++++++++++++++++++++++++++++++
1 GTEVLKKGFAKMTKGGVIMDVVNAEQAAIAEDSGAVAVMALEKVPADIRASGGVARMADPNKVQEIMDAVSIPVM
[1]
4.7e-52
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
+++++++++++++++++ ++++++++++++++++++++++++++++++++
76 AKVRIGHFVEAQVLEALGVDMIDESEVLTPADERFHIDKKKFTVPFVCGARNLGEALRRIDEGAAMIRTKGEPGT
PDXS_BACSU/4-212
LENGTH = 209 COMBINED P-VALUE = 1.28e-90 E-VALUE = 1.3e-89
DIAGRAM: 11-[3]-37-[1]-61
[3]
3.4e-51
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
+++++++++++ ++++++++++++++++++++++++++++++++++++++
1 TGTERVKRGMAEMQKGGVIMDVINAEQAKIAEEAGAVAVMALERVPADIRAAGGVARMADPTIVEEVMNAVSIPV
[1]
3.3e-48
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
+++++++++++++++++ ++++++++++++++++ +++++++++++++++
76 MAKARIGHIVEARVLEAMGVDYIDESEVLTPADEEFHLNKNEYTVPFVCGCRDLGEATRRIAEGASMLRTKGEPG
SNZ1_YEAST/3-212
LENGTH = 210 COMBINED P-VALUE = 1.88e-84 E-VALUE = 1.9e-83
DIAGRAM: 11-[3]-37-[1]-62
[3]
3.1e-43
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
+++++++++++++++++++++ +++++++++++++++ ++++ + +++++
1 GEDFKIKSGLAQMLKGGVIMDVVTPEQAKIAEKSGACAVMALESIPADMRKSGKVCRMSDPKMIKDIMNSVSIPV
[1]
1.2e-49
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++ ++++ + ++++++++++++++++++++++++++++++++
76 MAKVRIGHFVEAQIIEALEVDYIDESEVLTPADWTHHIEKDKFKVPFVCGAKDLGEALRRINEGAAMIRTKGEAG
PDXS_ARCFU/10-219
LENGTH = 210 COMBINED P-VALUE = 1.29e-82 E-VALUE = 1.3e-81
DIAGRAM: 11-[3]-37-[1]-62
[3]
5.3e-48
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
+++++++++++++++++ +++++++++++++ ++++++++++++++++++
1 FGTELIKRGFAKMQKGGVIMDVTNAEQAAIAEEAGAVAVMALHKVPADIRAAGGVARMADPKKIQEIIDAVTIPV
[1]
2.1e-42
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++ ++++++++ +++++++++++++ +++ +++++++++++
76 MAKCRIGHIAEARALEALGVDMIDESEVLTPADVFFHIDKRKFVVPFVCGARSLGEAVRRIWEGAAMIRTKGEAG
PDXS_HAEIN/3-211
LENGTH = 209 COMBINED P-VALUE = 3.28e-82 E-VALUE = 3.3e-81
DIAGRAM: 11-[3]-37-[1]-61
[3]
2.8e-46
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
+++++++++++ +++++++++++++ ++++++++++++++ +++++++++
1 ENRYELNKNLAQMLKGGVIMDVQNPEQARIAEAAGAAAVMALERIPADIRAVGGVSRMSDPKMIKEIQGAVSIPV
[1]
6.7e-45
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++ +++ ++ ++ +++++++++++++++++++++++++++++
76 MAKVRIGHFVEAQILEAIEIDYIDESEVLSPADNRFHVDKKEFQVPFVCGAKDLGEALRRIAEGASMIRTKGEPG
PDXS_METJA/3-212
LENGTH = 210 COMBINED P-VALUE = 5.78e-78 E-VALUE = 5.8e-77
DIAGRAM: 11-[3]-37-[1]-62
[3]
1.5e-47
QMQKGGVIMDVTNPEQAKIAEDAGACAVMALERVPYDIRKQGGVSRMSDP
++ + +++++++++++++++++++++++++++++++++++++++++++++
1 KGTDLLKKGFAKMVKHGVVMDVTNVEQAQIAEEAGAVAVMALERVPADIRAAGGVARMSDPALIEEIMDAVSIPV
[1]
1.5e-38
DESEVLTPADYQYHIWKWEFTVPFVCGCRNLCEALRRINEGASMIRTKGE
++++++++++ +++ + ++ +++++++++++++ +++ +++++++++++
76 MAKCRIGHTTEALVLEAIGVDMIDESEVLTQADPFFHIYKKKFNVPFVCGARNLGEAVRRIWEGAAMIRTKGEAG
Debugging Information
CPU: kodomo.fbb.msu.ru
Time 0.024001 secs.
mast ./meme.txt -d ./seed.fasta -ev 10.000000 -mt 0.000100
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information