--%>

understanding of genes and their products

For this assignment you will have to make use of information and skills gained throughout the 4 weeks of the introductory programme. This may be understanding of genes & their products, how to find and reference information from different sources, and specifically how to execute code in Perl, Java and R - it will also test your logical thinking and a little bit of basic maths!

Part A. 

You are provided with an HGNC gene symbol - HTRA1. This protein is involved with Human disease. Prepare a one-page summary of the gene, its product, the disease and the protein/gene's role in the disease.

Part B. 

Obtain the sequence for the gene and the sequence for the RNA transcript in FASTA format. If there is more than one transcript choose and appropriate one and explain your choice. Using Perl, convert the FASTA format files into a simple strings containing only nucleotides - save those for later -  and determine the amino acid sequence of the protein.

Part C. 

Using the sequences prepared with Perl, calculate the molecular weight of the Gene vs mRNA vs protein using Java.

Part D. You now have sequences for gene, RNA and protein. Write an R script to calculate answers to the following:

By taking the current estimate of global population and multiplying that by the estimate for the average number of cells in the human body, determine the total number of nucleotides representing the coding part of the gene in living humans.

If all of these nucleotides were printed out using 12pt Arial font on A4 with 3cm margins and the sheets laid end-to-end, how long would it take to drive along the paper at 30km/h ? For comparison, how long would it take to drive the length of this sequence of nucleotides at the same speed, if it was in the form of a molecule of double-stranded DNA helix laid end-to-end ?

Presentation. 

There are also 10 marks available for good presentation of part A, correct referencing and well formatted/commented code.

Submission

You should submit your one page summary, Perl, Java and R scripts along with the solution to the two final questions to the Digital Drop Box on Blackboard .

Throughout the assignment, reference your sources of information appropriately.

   Related Questions in Biology

  • Q : Vascular lesions Vascular lesions

    Vascular lesions caused by the leeches on the blood vessels of their host cause blood naturally to coagulate. How does the leech resolve this trouble as it could be predicted that the ingested blood would coagulate within its body?

  • Q : Explain Process Modeling Process

    Process Modeling: The word process model is employed in various contexts. For illustration, in Business process modeling the enterprise process model is frequently termed to as the business process model. Process models are core perceptions in the dis

  • Q : Substrates of enzymatic reactions

    Illustrate out the substrates of enzymatic reactions?

  • Q : Animal phylum which comprises creatures

    Most of the insects have wings. Name the other animal phylum which comprises creatures with analogous organs?

  • Q : Function of myelin sheath Write down

    Write down the various function of the myelin sheath? Whether all axons consist of the myelin sheath?

  • Q : Describe the lateral lines of fishes

    Describe the lateral lines of fishes?

  • Q : Factors - Group Effectiveness -

    There are several factors that influence the effectiveness of a group. Certain characteristics of the group, the types of tasks they perform, the work setting, the technology used and the operating dynamics of the group within the work setting all have an influence on group effectiveness, we can

  • Q : Endocrine function of placenta Explain

    Explain the endocrine function of placenta? Briefly explain it.

  • Q : Pancreatic juice in intestine Besides

    Besides pancreatic juice in intestine there exists releasing of enteric juice which consists of the digestive enzymes too. State these enzymes and which kind of molecule do these enzymes break?

  • Q : Significance of the R group in an amino

    What is the significance of the –R group (that is, variable radical) in an amino acid molecule?