1. Given the following sequence from Saccharomyces cerevisease: GATCTCACGATATAT
a. What is the most likely translation frame?
b. What is this sequence? How can you tell?
2. Given the following DNA sequence: GGTGTAAAGAATCTT
a. Construct a keyword tree
b. Construct a suffix tree
3. Given the following distance matrix, construct a neighbor-joining distance tree. Show all your work! (20 points).
|
1
|
2
|
3
|
4
|
1
|
0
|
|
|
|
2
|
21
|
0
|
|
|
3
|
5
|
25
|
0
|
|
4
|
15
|
30
|
23
|
0
|
4. Describe in detail how the BLAST algorithm works.
5. What are the criteria for confidence for the strength of a BLAST hit?
6. What is the measure of confidence used for phylogenetic tree construction? (5 points)
7. What is meant by "wobble" in reference to the genetic code?
8. How many different nucleotide sequences can code for the following protein sequence: (15 points): Arg-Lys-Pro-Val-Ser-Ile-Ala