EMBOSS
task 1 (Compile multiple .fasta files in one)
task 2 (Split .fasta file)
task 3 (Getting coding regions from .gb file)
bench-mark data
seqret coli.gb[275:3691]
seqret coli.gb[2756:3692:r]
seqret coli.gb[27566:28922]
Result 1
Result 2
Result 3
task 4 (Translation of sequences)
task 5 (Getting the orf's with minimal size)
task 6 (Translating the alignments in different formats)
task 8 (Translating the annotations from .gb to .gff)
task 9 (Getting the CDS from .gb file)
task 10 (Shuffling sequences)
bench-mark data
shuffleseq -sequence DRD1.fasta -outseq DRD1_shuffled.fasta -shuffle 1
Result 1
task 11 (Making the random sequences)
makenucseq -amount 3 -length 100
Result 1
task 12 (Getting the codon frequency table)
task 14 (Degapping the alignments)
task 15 (Changing the format of a carriage return symbols)
bench-mark data
noreturn DRD4.fasta DRD4.noreturn -system unix
Result 1
task 15 (Changing the format of the sra files)
task 3 (Comparing expected and observed frequencies in the bacterial genome)
script
bench-mark data
The script takes the name of a .fasta file as a command line argument, and outputs expected and observed frequencies in a line: "The most differ dinucleotide is XY.
The difference from expected value is 0.000000"
task 4 (Converting gb/embl files in fasta files with CDS and descriptions)
script
bench-mark data 1
bench-mark data 2
The script takes the name of a .gb or .embl file (With different extensions error occurs), and creates the .fasta file with CDS.
© Gumerov Ruslan, 2017