
Principles of data analysis

For the data analysis project, you will address some questions that interest you with the statistical methodology we are learning in class. You choose the questions; you decide how to collect data; you do the analyses. The questions can address almost any topic, including economics, psychology, sociology, natural science, education, medicine, public policy, sports, and law.

The project requires you to synthesize all the materials from the course. Hence, it's one of the best ways to solidify your understanding of statistical methods. Plus, you get answers to issues that pique your intellectual curiosity.

In twenty (20) or more PowerPoint slides, please create a presentation that adequately addresses and answers your statistical question(s). Include your random sampling, calculations, graphs, charts, hypothesis, conclusion, and anything else pertinent to your statistical question(s).

The most important aspects of any statistical analysis are stating questions and collecting data. To get the full experience of running your own study, the project requires you to analyze data that you collect. It is not permissible to use data sets that have been put together by others. You are permitted to collect data from the web; however, you must be the one who decides on the analyses and puts the data set together.

Good projects begin with very clear and well-defined hypotheses. You should think of questions that interest you first, and then worry about how to collect and analyze data to address those questions. Generally, vague topics lead to uninteresting projects. For example, surveying Harvard undergraduates to see which sex studies more does not yield many interesting conclusions. On the other hand, it would be interesting to hypothesize why men or women study more, and then figure out how to collect and analyze data to test your hypotheses.

Practical Advice: It is often easier to collect accurate experimental data than accurate survey data. Non-responses tend to be less of an issue with projects based on experiments than with those based on surveys. I strongly encourage you to consider experiments as opposed to surveys. For those who want to do surveys, consider using students in dorms or certain courses as target populations. Make every effort to get a random sample, and try to keep track of the characteristics of non-respondents. You will have non-responses; however, your project will not be penalized for a non-response as long as you document it and hypothesize how it might affect your results.
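For those doing surveys, the following is a minimal sketch of one way to draw a simple random sample and keep track of non-respondents. It assumes you have a complete list of the target population (a sampling frame). The student IDs, the sample size of 40, the seed, and the function names are all hypothetical and chosen only for illustration; the course does not prescribe this approach.

```python
import random


def draw_simple_random_sample(frame, n, seed=None):
    """Draw n units without replacement, each unit equally likely."""
    rng = random.Random(seed)  # a fixed seed makes the draw reproducible
    return rng.sample(frame, n)


def response_summary(sample, respondents):
    """Split the sample into respondents and non-respondents.

    Returns (responded, missing, response_rate).
    """
    responded = [unit for unit in sample if unit in respondents]
    missing = [unit for unit in sample if unit not in respondents]
    rate = len(responded) / len(sample)
    return responded, missing, rate


# Hypothetical sampling frame: 200 students in a dorm.
frame = [f"student_{i:03d}" for i in range(1, 201)]

# Draw 40 students at random (illustrative sample size and seed).
sample = draw_simple_random_sample(frame, 40, seed=1)

# Made-up outcome for illustration: suppose the first 30 sampled students answered.
respondents = set(sample[:30])

responded, missing, rate = response_summary(sample, respondents)
print(f"Response rate: {rate:.0%}")
print("Non-respondents to document:", missing)
```

Keeping the list of non-respondents lets you compare their known characteristics (for example, class year or dorm) with those of respondents when you discuss how non-response might affect your results.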
