MAST - Motif Alignment and Search Tool
MAST version 4.1.1 (Release date: Tue Apr 21 15:00:06 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE ./seed.fasta (peptide)
Last updated on Wed May 19 02:28:16 2010
Database contains 15 sequences, 3423 residues
MOTIFS ./memeout.txt (peptide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 47 FWDKTDNNENRGEQLLRYVERVLKETGAKKVDIIAHSMGGPNTRYYI
2 47 CSTHFGKVIRDDYHWNHLDEINQLLGIHSAFEEDPVTVYRQHVNRLK
3 43 PGTDPNQKILYTSIYSSNDMIVMNYLSRLDGARNIQIHGIGHI
PAIRWISE MOTIF CORRELATIONS:
MOTIF 1 2
----- ----- -----
2 0.11
3 0.10 0.10
No overly similar pairs (correlation > 0.60) found.
Random model letter frequencies (from non-redundant database):
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 14 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| O33969_STRCJ/52-253
|
| 1.3e-10
| 202
|
| Q19310_CAEEL/67-318
|
| 1.3e-07
| 252
|
| Q20437_CAEEL/67-316
|
| 0.00038
| 250
|
| Q23548_CAEEL/63-276
|
| 0.0032
| 214
|
| Q22060_CAEEL/67-317
|
| 0.0092
| 251
|
| O16380_CAEEL/83-333
|
| 0.015
| 251
|
| Q17622_CAEEL/67-310
|
| 0.037
| 244
|
| Q19462_CAEEL/62-276
|
| 0.13
| 215
|
| O17377_CAEEL/64-277
|
| 1.3
| 214
|
| Q19941_CAEEL/63-295
|
| 2.7
| 233
|
| O44824_CAEEL/55-269
|
| 3.6
| 215
|
| O76556_CAEEL/63-275
|
| 4.6
| 213
|
| O17376_CAEEL/64-276
|
| 6.8
| 213
|
| O17375_CAEEL/64-288
|
| 9.3
| 225
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.0001.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| O33969_STRCJ/52-253
| 1.3e-10
|
|
| Q19310_CAEEL/67-318
| 1.3e-07
|
|
| Q20437_CAEEL/67-316
| 0.00038
|
|
| Q23548_CAEEL/63-276
| 0.0032
|
|
| Q22060_CAEEL/67-317
| 0.0092
|
|
| O16380_CAEEL/83-333
| 0.015
|
|
| Q17622_CAEEL/67-310
| 0.037
|
|
| Q19462_CAEEL/62-276
| 0.13
|
|
| O17377_CAEEL/64-277
| 1.3
|
|
| Q19941_CAEEL/63-295
| 2.7
|
|
| O44824_CAEEL/55-269
| 3.6
|
|
| O76556_CAEEL/63-275
| 4.6
|
|
| O17376_CAEEL/64-276
| 6.8
|
|
| O17375_CAEEL/64-288
| 9.3
|
|
SCALE
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
1 |
25 |
50 |
75 |
100 |
125 |
150 |
175 |
200 |
225 |
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence,
- the position p-value of the occurrence,
- the best possible match to the motif, and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
O33969_STRCJ/52-253
LENGTH = 202 COMBINED P-VALUE = 8.76e-12 E-VALUE = 1.3e-10
DIAGRAM: 37-[1]-52-[3]-23
[1]
8.7e-14
FWDKTDNNENRGEQLLRYVERVLKETGAKKVDIIAHSM
+ ++ + +++ + ++++ +++++++++
1 PQPVVLVNGTFKLMAENWSKLSPKLKEAGYCVFAFNYGHFETDPIPESAAELRDFVEAVRGATGAAKVDIVGHSQ
[3]
1.6e-05
GGPNTRYYI PGTDPNQKILYTSI
++ +++ + ++ +
76 GGMLPRYYVKFLGGADKVDDLVGIVPSNHGTKNPLAIPAGWTFCPSCVDQQAGSELLQKLNAGDETPAGPDYTVI
YSSNDMIVMNYLSRLDGARNIQIHGIGHI
+ ++ + + + + +
151 TTRYDEVVIPYASALLTGDKEHLTNVVLQDKCPLDLYMHDQATKDPVVAQWV
Q19310_CAEEL/67-318
LENGTH = 252 COMBINED P-VALUE = 8.57e-09 E-VALUE = 1.3e-07
DIAGRAM: 57-[1]-148
[1]
8.8e-13
FWDKTDNNENRGEQLLRY
++ ++ + ++
1 KEPVIFIHGSGDTALFTQQPLATGFSRSIQYFLEQNYTEAELYATTWGDTWGSGSMLDTYSTIHTCGNLIYLRRF
VERVLKETGAKKVDIIAHSMGGPNTRYYI
+ + ++++++++++++ + + + +
76 LEAVIGYTGAKKVDIIAHSVGVPLMRKVVKGGTLIGTDGNCTLGPPLGAKVDTFLGIAGPNFGLCVCQLAQTVPA
Q20437_CAEEL/67-316
LENGTH = 250 COMBINED P-VALUE = 2.54e-05 E-VALUE = 0.00038
DIAGRAM: 54-[1]-149
[1]
1.7e-08
FWDKTDNNENRGEQLLRYVER
+ + ++ +
1 NTPIVFIHGNSDSALKFGDGPFQSGWDEILKYFMSKDYTLAELYGITYGDRNISNSYIRHFNCQNVQLHRRFLEF
VLKETGAKKVDIIAHSMGGPNTRYYI
++ + ++ ++++++++ + +
76 VLHYTKAKQINIVAHSMGVSIARKAIQGGEFVTKTESCDIGPNLSHRIGTFIALASANYGMCPCQHAFAFPACGM
Q23548_CAEEL/63-276
LENGTH = 214 COMBINED P-VALUE = 2.15e-04 E-VALUE = 0.0032
DIAGRAM: 45-[1]-122
[1]
2.8e-08
FWDKTDNNENRGEQLLRYVERVLKETGAKK
+ ++ + +++ +++ +
1 DRPVVFVHGLNNEAGSLWKIARDFTNANYPRQYLFATTWGKGTEPLNLNVAMSCTHVQHVRRFIETVLKYTGAQK
VDIIAHSMGGPNTRYYI
+ ++ +++ + + +
76 IDVIGYSMGSPIARKAILGGKCVDNPNVALGPSLHSRVHTYISVAGANQGSQLCALPFFDICNMKTGLMCNSKFL
Q22060_CAEEL/67-317
LENGTH = 251 COMBINED P-VALUE = 6.15e-04 E-VALUE = 0.0092
DIAGRAM: 52-[1]-152
[1]
7.1e-07
FWDKTDNNENRGEQLLRYVERVL
+ + + + + ++
1 NTPVVFIHGNSDAALHVSSAATGWSNSIEYFLEHGYTVAELYATSWQDTNALHAASRTHNCKDLVRLRKFLEAVL
KETGAKKVDIIAHSMGGPNTRYYI
+ + + + + ++++ + +
76 AYTAAPKISVVTHSMGVTLARKIIKGGSVSASDGSCDLGLPLNKKVEVMIGISGANYGLCNCEGGSATLEKTCNR
O16380_CAEEL/83-333
LENGTH = 251 COMBINED P-VALUE = 9.84e-04 E-VALUE = 0.015
DIAGRAM: 52-[1]-152
[1]
5.0e-07
FWDKTDNNENRGEQLLRYVERVL
+ + + ++
1 KTPVIFIHGNSDSALKTSSMATGWDNTISYFSTQGFTSAELYATSWQDTNSLKAETRTHDCKDLTRLRKFFEAVL
KETGAKKVDIIAHSMGGPNTRYYI
+ ++ ++ + ++++ + ++
76 AYTGYAKVSVISHSMGVTLGRKVIQGGIVNGTDGNCNLGVSISGKVDVFVGIAGANYGLCGCAGTVGLFAPTCNK
Q17622_CAEEL/67-310
LENGTH = 244 COMBINED P-VALUE = 2.45e-03 E-VALUE = 0.037
DIAGRAM: 54-[1]-143
[1]
9.3e-07
FWDKTDNNENRGEQLLRYVER
+ + +
1 HSPIVFIHGNSDSALKYGNDQYQSGWDTVVQYFMDRGYTLAELYGISYGDRNMTQSVFRVFNCQDLLLHRQLLLS
VLKETGAKKVDIIAHSMGGPNTRYYI
++ +++ ++++++++ + +
76 VLVYTGARHIDIIAHSMGVSIARAIIQGGKFTSEKESCDLGSDFSSLIGTFIAISSANYGMCSCEAGTAFPACDK
Q19462_CAEEL/62-276
LENGTH = 215 COMBINED P-VALUE = 8.80e-03 E-VALUE = 0.13
DIAGRAM: 45-[1]-123
[1]
6.3e-06
FWDKTDNNENRGEQLLRYVERVLKETGAKK
+ + + + +
1 GLPIVVIHGLAGYAGLSKKSLLNKYYSYGQPKGTVFATTYARGKLMDSLQHGMQCDYVLKVRTLILAVHEYTQKP
VDIIAHSMGGPNTRYYI
+++++ +++ + ++ +
76 VNIIACSMGSPITRKAILGGHCVDSGVNLGRPINHLIHNYVSVGGANHGAIMCAKQPFVNGICSLTHGLDCRSKF
O17377_CAEEL/64-277
LENGTH = 214 COMBINED P-VALUE = 8.94e-02 E-VALUE = 1.3
DIAGRAM: 214
Q19941_CAEEL/63-295
LENGTH = 233 COMBINED P-VALUE = 1.82e-01 E-VALUE = 2.7
DIAGRAM: 233
O44824_CAEEL/55-269
LENGTH = 215 COMBINED P-VALUE = 2.41e-01 E-VALUE = 3.6
DIAGRAM: 215
O76556_CAEEL/63-275
LENGTH = 213 COMBINED P-VALUE = 3.07e-01 E-VALUE = 4.6
DIAGRAM: 213
O17376_CAEEL/64-276
LENGTH = 213 COMBINED P-VALUE = 4.52e-01 E-VALUE = 6.8
DIAGRAM: 213
O17375_CAEEL/64-288
LENGTH = 225 COMBINED P-VALUE = 6.17e-01 E-VALUE = 9.3
DIAGRAM: 225
Debugging Information
CPU: kodomo-count
Time 0.040002 secs.
mast ./memeout.txt -d ./seed.fasta -ev 10.000000 -mt 0.000100
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information