--%>

Principles of data analysis

For the data analysis project, you will address some questions that interest you with the statistical methodology we are learning in class. You choose the questions; you decide how to collect data; you do the analyses. The questions can address almost any topic, including topics in economics, psychology, sociology, natural science, education, medicine, public policy, sports, law, etc.

The project requires you to synthesize all the materials from the course. Hence, it's one of the best ways to solidify your understanding of statistical methods. Plus, you get answers to issues that pique your intellectual curiosity.

In twenty (20) PowerPoint slides or more, please create a presentation that adequately addresses and answers your statistical question(s). Include your random sampling, calculations, graphs, charts, hypothesis, conclusion, and anything pertinent to your

“statistical question(s).”

The most important aspects of any statistical analysis are stating questions and collecting data. To get the full experience of running your own study, the project requires you to analyze data that you collect. It is not permissible to use data sets that have been put together by others. You are permitted to collect data off of the web; however, you must be the one who decides on the analyses and puts the data set together.

Good projects begin with very clear and well-defined hypotheses. You should think of questions that interest you first, and then worry about how to collect and analyze data to address those questions. Generally, vague topics lead to uninteresting projects. For example, surveying Harvard Undergraduates to see which sex studies more does not yield a whole lot of interesting conclusions. On the other hand, it would be interesting to hypothesize why men or women study more, and then figure out how to collect and analyze data to test your hypotheses.

Practical Advice: It is often easier to collect accurate experimental data than accurate survey data. Non-responses tend to be less of an issue with projects based on experiments than with those based on surveys. I strongly encourage you to consider experiments as opposed to surveys. For those who want to do surveys, consider using students in dorms or certain courses as target populations. Make every effort to get a random sample, and try to keep track of the characteristics of non-respondents. You will have non-responses; however, your project will not be penalized for a non-response as long as you document it and hypothesize how it might affect your results.

   Related Questions in Basic Statistics

  • Q : Explain Service times Service times: A)

    Service times:A) In most cases, servicing a request takes a “short” time, but in a few occasions requests take much longer.B) The probability of completing a service request by time t, is independent of how much tim

  • Q : What is Interactive Response Time Law

    Interactive Response Time Law: • R = (L/X) - Z• Applies to closed systems.• Z is the think time. The time elapsed since&nb

  • Q : State the hypotheses At Western

    At Western University the historical mean of scholarship examination score for freshman applications is 900. Population standard deviation is assumed to be known as 180. Each year, the assistant dean uses a sample of applications to determine whether the mean ex

  • Q : Assumptions in Queuing system

    Assumptions in Queuing system: • Flow balance implies that the number of arrivals in an observation period is equal to the

  • Q : Data Description 1. If the mean number

    1. If the mean number of hours of television watched by teenagers per week is 12 with a standard deviation of 2 hours, what proportion of teenagers watch 16 to 18 hours of TV a week? (Assume a normal distribution.) A. 2.1% B. 4.5% C. 0.3% D. 4.2% 2. The probability of an offender having a s

  • Q : Computing Average revenue using

    Can anyone help me in the illustrated problem? The airport branch of a car rental company maintains a fleet of 50 SUVs. The inter-arrival time between the requests for an SUV is 2.4 hrs, on an average, with a standard deviation of 2.4 hrs. There is no indication of a

  • Q : Develop the most appropriate regression

    Predicting Courier Costs The law firm of Adams, Babcock, and Connors is located in the Dallas-Fort metroplex.  Randall Adams is the senior and founding partner of the firm.  John Babcock has been a partne

  • Q : Networks of queues Networks of queues •

    Networks of queues • Typically, the flow of customers/request through a system may involve a number of different processing nodes.– IP packets through a computer network– Orders through a manufactur

  • Q : Sample z test and Sample t test A

    A random sample X1, X2, …, Xn is from a normal population with mean µ and variance σ2. If σ is unknown, give a 95% confidence interval of the population mean, and interpret it. Discuss the major diff

  • Q : Statistics basic question This week you

    This week you will analyze if women drink more sodas than men.  For the purposes of this Question, assume that in the past there has been no difference.  However, you have seen lots of women drinking sodas the past few months.  You will perform a hypothesis test to determine if women now drink more