Assignment
i. What does a VCF file contain and how is the data formatted? How is it generated? Be sure to include any relavant fields that are standard to the format?
ii. What are local optima in the text of phylogeny? What are some approaches to avoid them?
iii. In Linux, what does the apt command do? When would you use it?
iv. You have a spreadsheet with the five columns: ID, condition, and 100 columns of protein mass spec measurement values. Would you use PCA or LDA to analyze this data and why? How many dimensions of data are there? How many axes?
v. What is ANN (Artificial Neural Networks), Feed-Forward neural Network, Convolutional Neural Networks (CNN), and Generative Adversarial networks (GAN). What is the different between them?
vi. Describe how Kmers can be used to assemble a genome from shotgun sequence data
vii. What is the difference between the input, hidden, and output layers in an ANN?
viii. Which Linux command would you use to create a new directory? Give an example
ix. Describe how a hash function works, using both descriptive text and diagram with examples.
Format your assignment according to the following formatting requirements:
i) The answer should be typed, using Times New Roman font (size 12), double spaced, with one-inch margins on all sides.
ii) The response also includes a cover page containing the title of the assignment, the student's name, the course title, and the date. The cover page is not included in the required page length.
iii) Also include a reference page. The Citations and references must follow APA format. The reference page is not included in the required page length.