MAST - Motif Alignment and Search Tool
MAST version 4.3.0 (Release date: Sat Sep 26 01:51:56 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE ./seed.fasta (peptide)
Last updated on Sun May 26 16:57:51 2013
Database contains 10 sequences, 2022 residues
MOTIFS ./meme.txt (peptide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 29 HWENINQRMPLCRFGYFHMYNNYYVHWEM
2 41 CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS
3 50 SGINSRMGAKIKVEQNYFENMKDPLMTFYTQQIGYWQVRDNMFDNVTWEE
PAIRWISE MOTIF CORRELATIONS:
MOTIF 1 2
----- ----- -----
2 0.10
3 0.20 0.08
No overly similar pairs (correlation > 0.60) found.
Random model letter frequencies (from non-redundant database):
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 10 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| PLY18_SOLLC/134-319
|
| 1e-58
| 186
|
| PLY56_SOLLC/127-312
|
| 6.3e-28
| 186
|
| PLYD_ERWCH/81-312
|
| 7.2e-23
| 232
|
| SBP_CRYJA/102-289
|
| 1.5e-20
| 188
|
| MPAA2_AMBAR/129-313
|
| 3.6e-20
| 185
|
| PLY1_ERWCT/70-278
|
| 5.1e-16
| 209
|
| PLYA_ERWCH/84-319
|
| 4.4e-15
| 236
|
| PLYC_ERWCH/71-279
|
| 2.3e-13
| 209
|
| PELD_PSEMA/46-240
|
| 5.1e-05
| 195
|
| PELD_ERWCA/46-241
|
| 0.00025
| 196
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.0001.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| PLY18_SOLLC/134-319
| 1e-58
|
|
| PLY56_SOLLC/127-312
| 6.3e-28
|
|
| PLYD_ERWCH/81-312
| 7.2e-23
|
|
| SBP_CRYJA/102-289
| 1.5e-20
|
|
| MPAA2_AMBAR/129-313
| 3.6e-20
|
|
| PLY1_ERWCT/70-278
| 5.1e-16
|
|
| PLYA_ERWCH/84-319
| 4.4e-15
|
|
| PLYC_ERWCH/71-279
| 2.3e-13
|
|
| PELD_PSEMA/46-240
| 5.1e-05
|
|
| PELD_ERWCA/46-241
| 0.00025
|
|
SCALE
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
1 |
25 |
50 |
75 |
100 |
125 |
150 |
175 |
200 |
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence,
- the position p-value of the occurrence,
- the best possible match to the motif, and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
PLY18_SOLLC/134-319
LENGTH = 186 COMBINED P-VALUE = 1.01e-59 E-VALUE = 1e-58
DIAGRAM: 83-[2]-15-[1]-18
[2] [1]
1.8e-38 1.7e-29
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS HWENINQRMPL
+++++++++++++++++++++++++++++++++++++++++ +++++++++++
76 GKNIWVDHCSLSNCHDGLIDAIHGSTAITISNNYFTHHDKVMLLGHSDSFTQDKGMQVTVAFNHFGEGLVQRMPR
CRFGYFHMYNNYYVHWEM
++++++++++++++++++
151 CRHGYFHVVNNDYTHWEMYAIGGSAAPTINSQGNRF
PLY56_SOLLC/127-312
LENGTH = 186 COMBINED P-VALUE = 6.30e-29 E-VALUE = 6.3e-28
DIAGRAM: 83-[2]-15-[1]-18
[2] [1]
2.3e-15 6.8e-22
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS HWENINQRMPL
+ ++ ++++++ +++ +++++ ++++ ++++++ + +++++++++++
76 SHDIWIDHISMSRATDGLIDAVAGSTNITISNCHFTDHEKVMLFGANDHAEEDRGMKITLAYNHFGKRLDQRMPR
CRFGYFHMYNNYYVHWEM
++++ ++ +++++++++
151 CRFGFFHLVNNDYTHWERYAIGGSSGATIISQGNRF
PLYD_ERWCH/81-312
LENGTH = 232 COMBINED P-VALUE = 7.21e-24 E-VALUE = 7.2e-23
DIAGRAM: 121-[2]-16-[1]-25
[2]
1.8e-19
CHEKDCYDGLFDMKHDSQYVTISYNYFHH
+ ++ +++ +++++++++++++ + +
76 APHYEEGDGWNAEWDAAVIDNSTRVWVDHVTISDGSFTDDKYTTKNGEKYVQHDGALDIKKGSDYVTISSSRFEL
[1]
2.5e-12
HWKVMLWGHSDS HWENINQRMPLCRFGYFHMYNNYYVHWEM
++++ +++++++ ++++ ++++ +++ + +++ + + +
151 HDKTILIGHSDSNGSQDSGKLRVTFHNNVFDRVTERTPRVRFGSIHAYNNVYLGDVKNSVYPYLYSFGLGTSGTI
SBP_CRYJA/102-289
LENGTH = 188 COMBINED P-VALUE = 1.47e-21 E-VALUE = 1.5e-20
DIAGRAM: 85-[2]-15-[1]-18
[2] [1]
4.1e-18 5.8e-11
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS HWENINQRM
+ ++ +++++ + ++ ++++++ ++++ +++++++ + ++++ +++
76 RTATNIWIDHNSFSNSSDGLVDVTLSSTGVTISNNLFFNHHKVMLLGHDDAYSDDKSMKVTVAFNQFGPNCGQRM
PLCRFGYFHMYNNYYVHWEM
++ + + +++ ++ ++ +
151 PRARYGLVHVANNNYDPWTIYAIGGSSNPTILSEGNSF
MPAA2_AMBAR/129-313
LENGTH = 185 COMBINED P-VALUE = 3.60e-21 E-VALUE = 3.6e-20
DIAGRAM: 82-[2]-15-[1]-18
[2] [1]
7.7e-15 1.3e-13
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS HWENINQRMPLC
+ ++ +++++ +++ +++++ +++++ ++ +++ ++ + + +++++
76 SDIWIDHCTLSKSFDGLVDVNWGSTGVTISNCKFTHHEKAVLLGASDTHFQDLKMHVTLAYNIFTNTVHERMPRC
RFGYFHMYNNYYVHWEM
+++ + +++++++ +++
151 RFGFFQIVNNFYDRWDKYAIGGSSNPTILSQGNKF
PLY1_ERWCT/70-278
LENGTH = 209 COMBINED P-VALUE = 5.14e-17 E-VALUE = 5.1e-16
DIAGRAM: 110-[2]-11-[1]-18
[2]
1.0e-09
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSD
+++++ ++ +++++++ + ++++ +++
76 PGGAKDGDAIRIDNSPNVWIDHNEIFAKNFECAGTPDNDTTFESAVDIKKASTNVTVSYNFIHGVKKVGLSGSSN
[1]
2.7e-15
S HWENINQRMPLCRFGYFHMYNNYYVHWEM
+ +++++++ ++++++ +++ +++++
151 TDTGRNLTYHHNIYSDVNSRLPLQRGGQVHAYNNLYDGIKSSGFNVRQKGIALIESNWF
PLYA_ERWCH/84-319
LENGTH = 236 COMBINED P-VALUE = 4.40e-16 E-VALUE = 4.4e-15
DIAGRAM: 125-[2]-16-[1]-25
[2]
6.8e-16
CHEKDCYDGLFDMKHDSQYVTISYN
+ + +++ +++++++++++++++
76 IDVEPHYEKGDGWNAEWDAMNITNGAHHVWIDHVTISDGNFTDDMYTTKDGETYVQHDGALDIKRGSDYVTISNS
[1]
7.8e-08
YFHHHWKVMLWGHSDS HWENINQRMPLCRFGYFHMYNNYYVHWEM
+++++++++ + + ++ ++++ + + + +++ + + +
151 LIDQHDKTMLIGHNDTNSAQDKGKLHVTLFNNVFNRVTERAPRVRYGSIHSFNNVFKGDAKDPVYRYQYSFGIGT
PLYC_ERWCH/71-279
LENGTH = 209 COMBINED P-VALUE = 2.29e-14 E-VALUE = 2.3e-13
DIAGRAM: 110-[2]-11-[1]-18
[2]
4.4e-10
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSD
+++++ ++ +++++++ + ++++ +++
76 PGGAKDGDMIRVDDSPNVWVDHNELFAANHECDGTPDNDTTFESAVDIKGASNTVTVSYNYIHGVKKVGLDGSSS
[1]
1.6e-11
S HWENINQRMPLCRFGYFHMYNNYYVHWEM
+ ++ ++ ++++ +++ ++ +++ ++++
151 SDTGRNITYHHNYYNDVNARLPLQRGGLVHAYNNLYTNITGSGLNVRQNGQALIENNWF
PELD_PSEMA/46-240
LENGTH = 195 COMBINED P-VALUE = 5.12e-06 E-VALUE = 5.1e-05
DIAGRAM: 90-[2]-18-[1]-17
[2] [
1.5e-06 1
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS H
++ ++ + +++ ++++ ++ + + +++ +
76 SKYWIDHCSFVGHSWSTTDGSEDKLLYIGEKADYATISNCFFGSHKYGLIFGHPADDNNAAFNGYPRLTLCHNRF
1]
.9e-06
WENINQRMPLCRFGYFHMYNNYYVHWEM
+ + + +++++++++ + +
151 DNMEVRAPGLMRYGYFHVYNNYINKFHLGFTLAQNANILSESNYF
PELD_ERWCA/46-241
LENGTH = 196 COMBINED P-VALUE = 2.53e-05 E-VALUE = 0.00025
DIAGRAM: 91-[2]-18-[1]-17
[2]
3.0e-06
CHEKDCYDGLFDMKHDSQYVTISYNYFHHHWKVMLWGHSDS
+ ++ + ++++++++ + ++ + +++
76 GKGYWVDHCSWPGHTWSDNDGSLDKLIYIGEKADYITISNCLFSNHKYGCIFGHPADDNNSAYNGYPRLTICHNY
[1]
3.7e-06
HWENINQRMPLCRFGYFHMYNNYYVHWEM
+ + + + +++++++++ + +
151 YENIQVRAPGLMRYGYFHVFNNYVNKFQLAFTVAQNANVISERNVF
Debugging Information
CPU: kodomo.fbb.msu.ru
Time 0.016001 secs.
mast ./meme.txt -d ./seed.fasta -ev 10.000000 -mt 0.000100
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information