Дзиковский Ю. А. П4
Дано выравнивание. GC генома равен 0.6. p(b) – частота буквы b. eps(b) = псевдоотсчёт для буквы b.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
C |
A |
A |
A |
C |
G |
T |
T |
T |
G |
C |
T |
T |
T |
C |
C |
C |
A |
A |
A |
C |
G |
T |
T |
T |
T |
C |
G |
T |
T |
T |
A |
C |
A |
A |
A |
C |
G |
G |
T |
T |
T |
C |
G |
T |
C |
A |
G |
C |
A |
A |
C |
C |
G |
T |
T |
T |
T |
C |
C |
T |
T |
G |
C |
C |
A |
A |
A |
C |
G |
T |
G |
T |
G |
C |
G |
T |
C |
T |
G |
C |
A |
A |
T |
C |
G |
G |
T |
T |
A |
C |
C |
T |
T |
G |
A |
C |
A |
A |
A |
C |
G |
T |
T |
T |
T |
C |
G |
T |
T |
A |
C |
В рассчетах и создании таблиц использовался скрипт на языке программирования Python>
Предоставленное выравнивание в csv-формате: align10.txt
Получена следующая таблица чисел букв по столбцам:
Base: | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | A | 0 | 7 | 7 | 5 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 2 |
1 | T | 0 | 0 | 0 | 1 | 0 | 0 | 5 | 6 | 7 | 4 | 0 | 1 | 7 | 5 | 2 | 0 |
2 | G | 0 | 0 | 0 | 0 | 0 | 7 | 2 | 1 | 0 | 2 | 0 | 4 | 0 | 0 | 2 | 2 |
3 | C | 7 | 0 | 0 | 1 | 7 | 0 | 0 | 0 | 0 | 0 | 7 | 2 | 0 | 2 | 1 | 3 |
4 | Consensus | C | A | A | A | C | G | T | T | T | T | C | G | T | T | t | C |
5 | Pattern+ | C | A | A | H | C | G | K | K | T | D | C | B | T | Y | N | V |
6 | Pattern- | G | T | T | D | G | C | M | M | A | H | G | V | A | R | N | B |
PWM и вес последовательности относительно PWM:
p(b) | e(b) | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
A | 0.2 | 1 | -1.72247 | 1.277534 | 1.277534 | 0.862496 | -1.72247 | -1.72247 | -1.72247 | -1.72247 | -1.72247 | -0.72247 | -1.72247 | -1.72247 | -1.72247 | -1.72247 | -0.1375 | -0.1375 |
T | 0.2 | 1 | -1.72247 | -1.72247 | -1.72247 | -0.72247 | -1.72247 | -1.72247 | 0.862496 | 1.084889 | 1.277534 | 0.599462 | -1.72247 | -0.72247 | 1.277534 | 0.862496 | -0.1375 | -1.72247 |
G | 0.3 | 1 | -1.1375 | -1.1375 | -1.1375 | -1.1375 | -1.1375 | 1.862496 | 0.447459 | -0.1375 | -1.1375 | 0.447459 | -1.1375 | 1.184425 | -1.1375 | -1.1375 | 0.447459 | 0.447459 |
C | 0.3 | 1 | 1.862496 | -1.1375 | -1.1375 | -0.1375 | 1.862496 | -1.1375 | -1.1375 | -1.1375 | -1.1375 | -1.1375 | 1.862496 | 0.447459 | -1.1375 | 0.447459 | -0.1375 | 0.862496 |
W= | 17.85 | C | A | A | A | C | G | T | T | T | G | C | T | T | T | C | C |