A short paper, for the drop box, which says the following:
A. What is the subject of the envisaged project? A quick statement of what you are investigating and hope to demonstrate. For example, and I'm not saying that either of the following is true, here are some fairly flippant things to look at: "clothing designers prefer brighter colors in years of economic growth" or "are there baseball teams that win more often on cold days than they win on hot days?".
Judging by the earlier assignments, a number of people are overly ambitious - it is not likely that you are going to be able to predict the stock market or the results of the NFL season in a few weeks of class work.
Pick something that you can imagine describing in a few pages with a few charts, and that will lead to some conclusion. It's more important to have visualizations of your results than statistical measures. The late Richard Hamming (inventor of error-correcting codes) once said, "The purpose of computing is insight, not numbers."
You are welcome to use my examples for your project, or variations on them. But you should pick something you understand. If you can't tell a herring from a halibut, you might want to avoid the fish project. I assume most people will do something they thought of themselves.
B. What are the data you are going to use? You should have found one or more datasets, with accessible numerical data. You may explain that you are going to have to do some editing of the data; I'm always going through files removing extraneous characters, reformatting, etc.
If you aren't familiar with editors or with a language in which character editing is simple (for example Python or Perl, rather than R), be careful to choose data that doesn't need much preprocessing. Please include in the paper a URL (or printed source) and the column headings (data schema) of what you want to use. In fact, include one sample row of data, to show that you can get through the basic download and extraction work.
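As a rough illustration of that download-and-extraction check, here is a minimal Python sketch that pulls the column headings and one sample row out of a comma-separated file. The function name `schema_and_sample` and the little dataset are hypothetical, invented only for this example; your real file will have its own columns and may need more cleanup than stripping stray spaces.

```python
import csv
import io


def schema_and_sample(text):
    """Return (column headings, first data row) from CSV text."""
    reader = csv.reader(io.StringIO(text))
    # Strip stray whitespace, a common bit of cleanup in downloaded files.
    header = [h.strip() for h in next(reader)]
    first_row = [v.strip() for v in next(reader)]
    return header, first_row


# Hypothetical stand-in for a downloaded file; the values are invented.
raw = "date, team, temp_f, won\n2001-04-03, Example Team, 48, 1\n"

header, row = schema_and_sample(raw)
print("columns:", header)
print("sample: ", row)
```

With a real dataset you would read the text from the file you downloaded instead of the `raw` string, and paste the two printed lines into your paper.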
C. What is an example of a conclusion about data that you like and might want to model your paper on? This can be a popular article or a scientific article, but something that uses the kind of data you are going to work with and says something about it. Find such an article and provide a brief summary and explanation of why it's an example of using data for investigation or argument. Here are some examples which address important public policies and have gotten a lot of attention.