MAST - Motif Alignment and Search Tool
MAST version 4.1.1 (Release date: Tue Apr 21 15:00:06 PDT 2009)
For further information on how to interpret these results or to get
a copy of the MAST software please access http://meme.nbcr.net.
REFERENCE
If you use this program in your research, please cite:
Timothy L. Bailey and Michael Gribskov,
"Combining evidence using p-values: application to sequence homology
searches", Bioinformatics, 14(48-54), 1998.
DATABASE AND MOTIFS
DATABASE ./seed.fasta (peptide)
Last updated on Mon May 31 17:37:23 2010
Database contains 71 sequences, 8874 residues
MOTIFS ./memeout.txt (peptide)
MOTIF WIDTH BEST POSSIBLE MATCH
----- ----- -------------------
1 50 TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYTKQTEN
2 50 MEALKRKIREYGIVLSDQVLKVDAFLNHQIDPQLMQQIGQEFARRFRNTG
3 50 DDFLANGQAAHGLISIIHQAGATVAGIGIVIEKSFQPGRDELREQGYRVE
PAIRWISE MOTIF CORRELATIONS:
MOTIF 1 2
----- ----- -----
2 0.10
3 0.10 0.11
No overly similar pairs (correlation > 0.60) found.
Random model letter frequencies (from non-redundant database):
A 0.073 C 0.018 D 0.052 E 0.062 F 0.040 G 0.069 H 0.022 I 0.056 K 0.058
L 0.092 M 0.023 N 0.046 P 0.051 Q 0.041 R 0.052 S 0.074 T 0.059 V 0.064
W 0.013 Y 0.033
SECTION I: HIGH-SCORING SEQUENCES
- Each of the following 39 sequences has E-value less than 10.
- The E-value of a sequence is the expected number of sequences
in a random database of the same size that would match the motifs as
well as the sequence does and is equal to the combined p-value of the
sequence times the number of sequences in the database.
- The combined p-value of a sequence measures the strength of the
match of the sequence to all the motifs and is calculated by
- finding the score of the single best match of each motif
to the sequence (best matches may overlap),
- calculating the sequence p-value of each score,
- forming the product of the p-values,
- taking the p-value of the product.
- The sequence p-value of a score is defined as the
probability of a random sequence of the same length containing
some match with as good or better a score.
- The score for the match of a position in a sequence to a motif
is computed by by summing the appropriate entry from each column of
the position-dependent scoring matrix that represents the motif.
- Sequences shorter than one or more of the motifs are skipped.
- The table is sorted by increasing E-value.
Links | Sequence Name | Description | E-value | Length
|
---|
| XPT1_CLOPS/26-155
|
| 0.00019
| 130
|
| Q980Q2_SULSO/96-223
|
| 0.007
| 128
|
| Q4DVM9_TRYCR/46-173
|
| 0.008
| 128
|
| Q5JJD9_PYRKO/87-214
|
| 0.0098
| 128
|
| A5TS89_FUSNP/29-157
|
| 0.016
| 129
|
| A2BJL0_HYPBU/91-207
|
| 0.02
| 117
|
| Q72I83_THET2/29-157
|
| 0.054
| 129
|
| PYRE_PROAC/45-157
|
| 0.072
| 113
|
| Q6NF43_CORDI/12-134
|
| 0.15
| 123
|
| A7E5A9_SCLS1/59-194
|
| 0.27
| 136
|
| C1DM73_AZOVD/1-118
|
| 0.32
| 118
|
| UPP_AQUAE/45-158
|
| 0.33
| 114
|
| Q4UG33_THEAN/46-167
|
| 0.36
| 122
|
| Q1PV22_9BACT/11-173
|
| 0.42
| 163
|
| A3MXE2_PYRCJ/188-316
|
| 1.2
| 129
|
| APT_TRIEI/24-148
|
| 1.5
| 125
|
| UPP_SALPA/36-160
|
| 2.2
| 125
|
| Q72L93_THET2/2-119
|
| 2.3
| 118
|
| PYREL_HALMA/64-184
|
| 2.4
| 121
|
| APT1_METMP/30-151
|
| 2.4
| 122
|
| Q317F5_DESDG/43-188
|
| 2.5
| 146
|
| Q0K973_RALEH/9-131
|
| 2.7
| 123
|
| Q8XK54_CLOPE/177-285
|
| 2.8
| 109
|
| A8LJR1_DINSH/152-260
|
| 2.9
| 109
|
| Q1V0T8_9RICK/6-134
|
| 3.2
| 129
|
| Q9YD32_AERPE/90-214
|
| 3.5
| 125
|
| A3XF44_9RHOB/32-157
|
| 3.8
| 126
|
| Q2YAL8_NITMU/17-176
|
| 4.9
| 160
|
| Q4XNN5_PLACH/69-207
|
| 5.1
| 139
|
| Q3SMI9_THIDA/9-141
|
| 5.7
| 133
|
| PYRE_BRASB/39-158
|
| 6.1
| 120
|
| Q14JN0_FRAT1/8-132
|
| 6.9
| 125
|
| PYRE_HYPNA/36-146
|
| 7
| 111
|
| A3DN49_STAMF/119-244
|
| 7.4
| 126
|
| PYRE_BIFLO/38-164
|
| 7.4
| 127
|
| A1KRZ3_NEIMF/3-127
|
| 7.8
| 125
|
| APT_PROM5/26-150
|
| 9.2
| 125
|
| A5LPK5_STRPN/106-234
|
| 9.5
| 129
|
| Q0EYW3_9PROT/15-136
|
| 9.6
| 122
|
SECTION II: MOTIF DIAGRAMS
- The ordering and spacing of all non-overlapping motif occurrences
are shown for each high-scoring sequence listed in Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001.
- The POSITION p-value of a match is the probability of
a single random subsequence of the length of the motif
scoring at least as well as the observed match.
- For each sequence, all motif occurrences are shown unless there
are overlaps. In that case, a motif occurrence is shown only if its
p-value is less than the product of the p-values of the other
(lower-numbered) motif occurrences that it overlaps.
- The table also shows the E-value of each sequence.
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- Spacers and motif occurences are indicated by
- occurrence of motif `n' with p-value less than 0.0001.
- Sequences longer than 1000 are not shown to scale and are indicated by thicker lines.
Links | Name | Expect | Motifs
|
---|
| XPT1_CLOPS/26-155
| 0.00019
|
|
| Q980Q2_SULSO/96-223
| 0.007
|
|
| Q4DVM9_TRYCR/46-173
| 0.008
|
|
| Q5JJD9_PYRKO/87-214
| 0.0098
|
|
| A5TS89_FUSNP/29-157
| 0.016
|
|
| A2BJL0_HYPBU/91-207
| 0.02
|
|
| Q72I83_THET2/29-157
| 0.054
|
|
| PYRE_PROAC/45-157
| 0.072
|
|
| Q6NF43_CORDI/12-134
| 0.15
|
|
| A7E5A9_SCLS1/59-194
| 0.27
|
|
| C1DM73_AZOVD/1-118
| 0.32
|
|
| UPP_AQUAE/45-158
| 0.33
|
|
| Q4UG33_THEAN/46-167
| 0.36
|
|
| Q1PV22_9BACT/11-173
| 0.42
|
|
| A3MXE2_PYRCJ/188-316
| 1.2
|
|
| APT_TRIEI/24-148
| 1.5
|
|
| UPP_SALPA/36-160
| 2.2
|
|
| Q72L93_THET2/2-119
| 2.3
|
|
| PYREL_HALMA/64-184
| 2.4
|
|
| APT1_METMP/30-151
| 2.4
|
|
| Q317F5_DESDG/43-188
| 2.5
|
|
| Q0K973_RALEH/9-131
| 2.7
|
|
| Q8XK54_CLOPE/177-285
| 2.8
|
|
| A8LJR1_DINSH/152-260
| 2.9
|
|
| Q1V0T8_9RICK/6-134
| 3.2
|
|
| Q9YD32_AERPE/90-214
| 3.5
|
|
| A3XF44_9RHOB/32-157
| 3.8
|
|
| Q2YAL8_NITMU/17-176
| 4.9
|
|
| Q4XNN5_PLACH/69-207
| 5.1
|
|
| Q3SMI9_THIDA/9-141
| 5.7
|
|
| PYRE_BRASB/39-158
| 6.1
|
|
| Q14JN0_FRAT1/8-132
| 6.9
|
|
| PYRE_HYPNA/36-146
| 7
|
|
| A3DN49_STAMF/119-244
| 7.4
|
|
| PYRE_BIFLO/38-164
| 7.4
|
|
| A1KRZ3_NEIMF/3-127
| 7.8
|
|
| APT_PROM5/26-150
| 9.2
|
|
| A5LPK5_STRPN/106-234
| 9.5
|
|
| Q0EYW3_9PROT/15-136
| 9.6
|
|
SCALE
|
| |
| |
| |
| |
| |
| |
1 |
25 |
50 |
75 |
100 |
125 |
|
---|
SECTION III: ANNOTATED SEQUENCES
- The positions and p-values of the non-overlapping motif occurrences
are shown above the actual sequence for each of the high-scoring
sequences from Section I.
- A motif occurrence is defined as a position in the sequence whose
match to the motif has POSITION p-value less than 0.0001 as
defined in Section II.
- For each sequence, the first line specifies the name of the sequence.
- The second (and possibly more) lines give a description of the
sequence.
- Following the description line(s) is a line giving the length,
combined p-value, and E-value of the sequence as defined in Section I.
- The next line reproduces the motif diagram from Section II.
- The entire sequence is printed on the following lines.
- Motif occurrences are indicated directly above their positions in the
sequence on lines showing
- the motif number of the occurrence,
- the position p-value of the occurrence,
- the best possible match to the motif, and
- columns whose match to the motif has a positive score (indicated
by a plus sign).
XPT1_CLOPS/26-155
LENGTH = 130 COMBINED P-VALUE = 2.73e-06 E-VALUE = 0.00019
DIAGRAM: 31-[1]-49
[1]
5.6e-09
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSY
+ + ++++++ + ++ + + + + ++
1 LNHQMDIKFLNEVGKEFRKRFEGEKVDKILTIEASGIAIAGIASQYFDYVPVVFAKKTESLNLDKDVYESNVHSF
TKQTEN
+++
76 TKKKDYKVRVGKQFLNKGERVLIIDDFLAQGCATKGMIDLVEQAGAELVGIGIVI
Q980Q2_SULSO/96-223
LENGTH = 128 COMBINED P-VALUE = 9.92e-05 E-VALUE = 0.007
DIAGRAM: 31-[1]-47
[1]
6.5e-07
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSY
+ + ++ ++ + + +++ +++ + + +
1 SNLLFYPNLLKTFLELYLIKLNKVSDVTKIVGIASNGIPFATIVASILEKPLIISKKHKDSTQIQYIEENIRESN
TKQTEN
+
76 GVISTLYIRGDYISKKDKILIVDDVIRSGKTLLSLYNLIGKANASVVGALIIA
Q4DVM9_TRYCR/46-173
LENGTH = 128 COMBINED P-VALUE = 1.12e-04 E-VALUE = 0.008
DIAGRAM: 33-[1]-45
[1]
1.0e-06
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVY
+++ + +++++++++ + + +
1 SSITENPPLFRKVIHFLAQRYRDMGDAGPTHVLCVESRGYLVGSPLALELGVPLVLASATKRFPSTFLPEGQGPM
SYTKQTEN
+ +
76 VLPKSYSVRNGSISSDSRVLIVDDFIATGRSLVSVLRLMAIVGAKVIEAVALC
Q5JJD9_PYRKO/87-214
LENGTH = 128 COMBINED P-VALUE = 1.39e-04 E-VALUE = 0.0098
DIAGRAM: 30-[1]-48
[1]
4.4e-06
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYT
+ ++ ++ + +++ ++ + + + + +
1 MPALSDTALMSLIAEEVASKYMDKDIDKVLTAETDGIVLGAHIARELGADLIYAKKKKEVGVEKFYEVNYVPSAS
KQTEN
+
76 GSVMTLYLPQWALKKGETVLIVDDVIRSGETQRALVELAHQAGAKPVGMFFLI
A5TS89_FUSNP/29-157
LENGTH = 129 COMBINED P-VALUE = 2.31e-04 E-VALUE = 0.016
DIAGRAM: 77-[2]-2
[2]
6.4e-05
MEALKRKIREYGIVLSDQVLKVDAFLNHQIDPQLMQQIGQEFARRFRNTG
++ + + + + + ++ +++ + + +++
76 NSQKLYLNNQDVNKIKGKRVALIDDVISTGQSLKALERLVEKAGANVVAKAAIL
A2BJL0_HYPBU/91-207
LENGTH = 117 COMBINED P-VALUE = 2.78e-04 E-VALUE = 0.02
DIAGRAM: 30-[1]-37
[1]
6.8e-06
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYT
+++++ ++ + ++ + ++ + +
1 SVVLTDPLYLTLASLYYLERLRDIEPTRILVPEASGIPLATSLSLLLGAPFTVARRYMGYGLCSSSEPRFCIRPG
KQTEN
+
76 QLGRSDRVLIVDDIVETGMTLRALEEIADSIGARVEAVAALV
Q72I83_THET2/29-157
LENGTH = 129 COMBINED P-VALUE = 7.62e-04 E-VALUE = 0.054
DIAGRAM: 29-[1]-50
[1]
3.6e-06
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYTK
+ +++ + + ++++ + ++ ++ +
1 VEFLGDPEFTRAAAEALRPLVPKEAEILFTTETSPIPLTHVLAEALGLPYVVARRRRRPYMEDPIIQEVQTLTLG
QTEN
+
76 VGEVLWLDRRFAEKLLNQRVVLVSDVVASGETMRAMEKMVLRAGGHVVARLAVF
PYRE_PROAC/45-157
LENGTH = 113 COMBINED P-VALUE = 1.02e-03 E-VALUE = 0.072
DIAGRAM: 4-[3]-59
[3]
2.6e-05
DDFLANGQAAHGLISIIHQAGATVAGIGIVIEKSFQPGRDELREQGYRVE
+ + +++++++ ++ + + + + + ++
1 RVTLSDPQTRDLIADGLASLVRTNFHQVDVVAGTATAGIAHAALAADRLGAPMAYVRSVPKDHGRGNQIEGRIPA
Q6NF43_CORDI/12-134
LENGTH = 123 COMBINED P-VALUE = 2.07e-03 E-VALUE = 0.15
DIAGRAM: 4-[3]-69
[3]
5.5e-05
DDFLANGQAAHGLISIIHQAGATVAGIGIVIEKSFQPGRDELREQGYRVE
+ + ++ + + + + + + + +++ + ++
1 TKEILTWEGFGIANRELAQQIVDSNYTPDIIVAVARGGLVPAGALSYSMGIKLSDAINVEFYTDVNETLPDPVLL
A7E5A9_SCLS1/59-194
LENGTH = 136 COMBINED P-VALUE = 3.75e-03 E-VALUE = 0.27
DIAGRAM: 33-[1]-53
[1]
6.3e-06
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVY
+ + + + + ++++ + ++ + + +
1 ASLLINQASIDVVDELSSFLSEKVAPFNPDVIIGLPTLGLTLAPIVARKLGLKRYIPLGYSRKFWYDDALSAAVS
SYTKQTEN
+ +
76 SITSPELGLKNTYLDPHLLPLLIGKKCVIIDDAVSSGTTLKTTWDLLERIGCEIVGCGVVM
C1DM73_AZOVD/1-118
LENGTH = 118 COMBINED P-VALUE = 4.51e-03 E-VALUE = 0.32
DIAGRAM: 118
UPP_AQUAE/45-158
LENGTH = 114 COMBINED P-VALUE = 4.58e-03 E-VALUE = 0.33
DIAGRAM: 24-[1]-40
[1]
6.7e-05
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYTKQTEN
+ + + ++ + ++ + + + + + + +
1 DILLEEKEVRTWIGNKRFNYLNEEEIVFVPILRAGLSFLEGALQVVPNAKVGFLGIKRNEETLESHIYYSRLPEL
Q4UG33_THEAN/46-167
LENGTH = 122 COMBINED P-VALUE = 5.08e-03 E-VALUE = 0.36
DIAGRAM: 62-[2]-10
[2]
2.9e-05
MEALKRKIREYGI
++ + +
1 SGVLTDCESFDLMTDLMIAKLVESKVEFDAFHGCPYKAIPLVSTLCLKYYKLTGKKVYFGYHRKEVKDHGEGKLF
VLSDQVLKVDAFLNHQIDPQLMQQIGQEFARRFRNTG
+ + + + + + + + + ++ + +
76 VGSPKVFEENSRLVVVDDILTMGTSVSESLSLLKTTPAKVVCVLILL
Q1PV22_9BACT/11-173
LENGTH = 163 COMBINED P-VALUE = 5.96e-03 E-VALUE = 0.42
DIAGRAM: 19-[1]-94
[1]
2.7e-05
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYTKQTEN
+ ++ + + + + + + +++ + +++
1 RNKLYVFKDRREAGKLLSQQLVGHKNTDAIVLGIPSGGVPVAAEIANALVLPLDLLIVRKVQIPYNTESGFGAVS
A3MXE2_PYRCJ/188-316
LENGTH = 129 COMBINED P-VALUE = 1.72e-02 E-VALUE = 1.2
DIAGRAM: 4-[3]-75
[3]
9.1e-05
DDFLANGQAAHGLISIIHQAGATVAGIGIVIEKSFQPGRDELREQGYRVE
+ + +++ +++ + + + + ++ + ++
1 DNVIDGVEVAETRAKLAEALAEKTRAEVDVAIGVPETGMFYASALARRLGVWSPLAFVATARGRSALLDEVKERL
APT_TRIEI/24-148
LENGTH = 125 COMBINED P-VALUE = 2.15e-02 E-VALUE = 1.5
DIAGRAM: 125
UPP_SALPA/36-160
LENGTH = 125 COMBINED P-VALUE = 3.07e-02 E-VALUE = 2.2
DIAGRAM: 69-[3]-6
[3]
7.1e-0
DDFLAN
+
1 GSLLTYEATADLETEKVTIEGWNGPVEIDQIKGKKITVVPILRAGLGMMEGVLENVPSARISVVGMYRNEETLEP
5
GQAAHGLISIIHQAGATVAGIGIVIEKSFQPGRDELREQGYRVE
+ +++ ++ + ++ + +++++ +
76 VPYFQKLVSNIDERMALIVDPMLATGGSVIATIDLLKKAGCSSIKVLVLV
Q72L93_THET2/2-119
LENGTH = 118 COMBINED P-VALUE = 3.19e-02 E-VALUE = 2.3
DIAGRAM: 118
PYREL_HALMA/64-184
LENGTH = 121 COMBINED P-VALUE = 3.35e-02 E-VALUE = 2.4
DIAGRAM: 121
APT1_METMP/30-151
LENGTH = 122 COMBINED P-VALUE = 3.42e-02 E-VALUE = 2.4
DIAGRAM: 29-[1]-43
[1]
6.3e-05
TIESSGIAPAVMTGLEMGVPVIFARKHKSLTLTDNMYTASVYSYTK
+ ++ ++ + + + + +++ + + +
1 GVPLVEPELLRDVSTRVIKMIDTDVDKIVTAEAMGIPIVTAVSIATDIPYVIMRKREYLLEGEIPVHQETGYSKG
QTEN
76 ELYLNGINKGDKVIILDDVISTGGTLVAIINALKRAGADIKDVLCII
Q317F5_DESDG/43-188
LENGTH = 146 COMBINED P-VALUE = 3.51e-02 E-VALUE = 2.5
DIAGRAM: 146
Q0K973_RALEH/9-131
LENGTH = 123 COMBINED P-VALUE = 3.76e-02 E-VALUE = 2.7
DIAGRAM: 123
Q8XK54_CLOPE/177-285
LENGTH = 109 COMBINED P-VALUE = 3.96e-02 E-VALUE = 2.8
DIAGRAM: 109
A8LJR1_DINSH/152-260
LENGTH = 109 COMBINED P-VALUE = 4.05e-02 E-VALUE = 2.9
DIAGRAM: 109
Q1V0T8_9RICK/6-134
LENGTH = 129 COMBINED P-VALUE = 4.57e-02 E-VALUE = 3.2
DIAGRAM: 129
Q9YD32_AERPE/90-214
LENGTH = 125 COMBINED P-VALUE = 4.95e-02 E-VALUE = 3.5
DIAGRAM: 125
A3XF44_9RHOB/32-157
LENGTH = 126 COMBINED P-VALUE = 5.33e-02 E-VALUE = 3.8
DIAGRAM: 126
Q2YAL8_NITMU/17-176
LENGTH = 160 COMBINED P-VALUE = 6.87e-02 E-VALUE = 4.9
DIAGRAM: 160
Q4XNN5_PLACH/69-207
LENGTH = 139 COMBINED P-VALUE = 7.19e-02 E-VALUE = 5.1
DIAGRAM: 139
Q3SMI9_THIDA/9-141
LENGTH = 133 COMBINED P-VALUE = 8.07e-02 E-VALUE = 5.7
DIAGRAM: 133
PYRE_BRASB/39-158
LENGTH = 120 COMBINED P-VALUE = 8.57e-02 E-VALUE = 6.1
DIAGRAM: 120
Q14JN0_FRAT1/8-132
LENGTH = 125 COMBINED P-VALUE = 9.70e-02 E-VALUE = 6.9
DIAGRAM: 125
PYRE_HYPNA/36-146
LENGTH = 111 COMBINED P-VALUE = 9.85e-02 E-VALUE = 7
DIAGRAM: 111
A3DN49_STAMF/119-244
LENGTH = 126 COMBINED P-VALUE = 1.04e-01 E-VALUE = 7.4
DIAGRAM: 126
PYRE_BIFLO/38-164
LENGTH = 127 COMBINED P-VALUE = 1.04e-01 E-VALUE = 7.4
DIAGRAM: 127
A1KRZ3_NEIMF/3-127
LENGTH = 125 COMBINED P-VALUE = 1.10e-01 E-VALUE = 7.8
DIAGRAM: 125
APT_PROM5/26-150
LENGTH = 125 COMBINED P-VALUE = 1.29e-01 E-VALUE = 9.2
DIAGRAM: 125
A5LPK5_STRPN/106-234
LENGTH = 129 COMBINED P-VALUE = 1.34e-01 E-VALUE = 9.5
DIAGRAM: 129
Q0EYW3_9PROT/15-136
LENGTH = 122 COMBINED P-VALUE = 1.35e-01 E-VALUE = 9.6
DIAGRAM: 122
Debugging Information
CPU: kodomo-count
Time 0.096006 secs.
mast ./memeout.txt -d ./seed.fasta -ev 10.000000 -mt 0.000100
Button Help
Links to Entrez database at NCBI
Links to sequence scores (section I)
Links to motif diagrams (section II)
Links to sequence/motif annotated alignments (section III)
This information