
What Is Imperfect Data?

Most studies start with imperfect data: few datasets cover the entire population of interest.

Typically, the data has been gathered by others for specific purposes and may therefore carry built-in biases or representational problems. As a consumer of analytical research, you should check whether the authors properly describe the source of their data and the limitations that source imposes. Surveys of populations frequently report their confidence intervals. At the national, economy-wide, or sectoral level of analysis, data often has relatively narrow confidence intervals across space and over time.

As the data is subdivided to represent subsets of the source population (e.g., the Labour Force Survey unemployment rate in manufacturing in Saskatchewan vs. the unemployment rate for Canada as a whole), the confidence intervals widen significantly, because each estimate rests on fewer respondents. The intervals may widen to the point where differences of ±10% to 20% are not statistically significant. Authors should carefully consider the provenance and reliability of their data.
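A rough illustration of why subdividing widens the intervals: the sketch below computes the approximate 95% margin of error for an estimated rate at two sample sizes. The rate and both sample sizes are hypothetical, not actual Labour Force Survey figures, and the formula assumes simple random sampling, which real surveys with complex designs only approximate.

```python
import math


def margin_of_error(p, n, z=1.96):
    """Half-width of an approximate 95% confidence interval for a
    proportion p estimated from n respondents (simple random sampling)."""
    return z * math.sqrt(p * (1 - p) / n)


# All figures below are hypothetical, chosen only for illustration.
rate = 0.06          # estimated unemployment rate
national_n = 50_000  # respondents behind the national estimate
subgroup_n = 150     # respondents in one industry within one province

national_moe = margin_of_error(rate, national_n)
subgroup_moe = margin_of_error(rate, subgroup_n)

print(f"national: {rate:.1%} +/- {national_moe:.2%}")
print(f"subgroup: {rate:.1%} +/- {subgroup_moe:.2%}")
print(f"subgroup margin as a share of the estimate: {subgroup_moe / rate:.0%}")
```

The margin of error grows with the square root of the ratio of sample sizes, so a subgroup drawn from a few hundred respondents can carry an interval many times wider than the headline figure.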

A second problem is that authors quite often report that they have “cleaned” a dataset, for example by dropping outliers in panel data or lopping off the tips or tails of longitudinal data. Any time you hear this, your antennae should go up. Cleaning should be done very carefully, and any changes to the data should be fully disclosed, discussed and analyzed rather than simply accepted.
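To see why cleaning deserves scrutiny, the sketch below drops outliers from a small made-up wage series and reports what was removed alongside the before-and-after means. The data, the `drop_outliers` helper, and the choice of the 1.5 × IQR fence rule are all assumptions made for illustration; they are not a method taken from any particular study.

```python
import statistics


def drop_outliers(values, k=1.5):
    """Split values into (kept, dropped) using the IQR fence rule:
    anything outside [Q1 - k*IQR, Q3 + k*IQR] is treated as an outlier."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    kept = [v for v in values if low <= v <= high]
    dropped = [v for v in values if not (low <= v <= high)]
    return kept, dropped


# Hypothetical hourly wages; the last observation is an extreme value.
wages = [18, 19, 20, 21, 21, 22, 23, 24, 25, 95]

kept, dropped = drop_outliers(wages)

# Report what was removed and how the headline statistic moved,
# so a reader can judge whether the cleaning was justified.
print("dropped observations:", dropped)
print("mean before cleaning:", round(statistics.mean(wages), 2))
print("mean after cleaning: ", round(statistics.mean(kept), 2))
```

A single dropped observation moves the mean substantially here, which is why a careful author reports both versions and explains the exclusion rule.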
