Professional Documents
Culture Documents
Question 1. For each of the following examples, state whether the data are numerical or categorical, and
state whether they are cross-sections, time series, or panel data.
(a) Quarterly data on the level of new housing construction in Australia, from 2000 to 2015.
(b) Quarterly data on the level of new housing construction in Australia, from 2000 to 2015, broken down
by state or territory.
(c) Data on the number of doctor visits in 2015 for a sample of 192 individuals.
(d) Data on the total number of doctor visits in Australia, for every week in 2015.
(e) Data on the usual mode of transportation used to commute to work, for a sample of 151 individuals.
(f) The closing price of the Dow Jones Index, for every trading day in 2015.
(g) Data on whether the closing price of the Dow Jones Index was higher or lower than the day before,
for every trading day in 2015.
(h) Data on whether the closing prices of the Dow Jones Index, the Nasdaq Index, and the S&P 500 Index
were higher or lower than the day before, for every trading day in 2015.
(i) Data on which of the Dow Jones Index, the Nasdaq Index, and the S&P 500 Index gained the most in
value, for every trading day in 2015.
n
X
Question 2. Assume that n = 5. Compute zi in each of the following cases.
i=1
(a) zi = 1.
(b) zi = i − 3.
(c) zi = i2 / 20.
(d) zi = 1/i.
Question 3. Consider the following set of five data points: {11, 14, 6, 0, 14}.
(a) Compute the median, mean, standard deviation, coefficient of variation, skewness statistic, and kur-
tosis statistic. Note: for the skewness and kurtosis statistic, various slightly different formulas are given
– just pick the ones that you find easiest to calculate.
(b) For each of the six statistics you computed in part (a), discuss what the interpretation is.
Question 6. The following histogram and summary statistics were obtained using Stata. The variable is
the number of days that each of the 196 employees of a certain firm called in sick in February 2016.
80
60
Frequency
40
20
0
0 5 10 15 20
sick_days
sick_days
-------------------------------------------------------------
Percentiles Smallest
1% 0 0
5% 0 0
10% 0 0 Obs 196
25% 0 0 Sum of Wgt. 196
(a) Based on the histogram, do the data appear to be symmetric, right-skewed, or left-skewed? Also
provide an intuitive reason why they should be.
(b) Find three indications in the summary statistics that support your answer to part (a).
(c) What does the kurtosis statistic of 8.9 tell you about the data? Does this seem reasonable, given the
histogram?