Cases
Much football player salary information is public. National Football
League data can be accessed, for example, at http://www.spotrac.com/nfl/, where
you can find year-by-year listings of salaries for NFL players (neglect players with
salaries below US$300,000, since they correspond to non-full-time players). The file
NFL.xlsx includes salaries for 5 randomly picked teams in 2015.
1. Consider just the first available team; report the highest salary, the
lowest, the mean, and the median salary for the team. What can you say about
the shape of the distribution of players' salaries for this team based on
these statistics?
2. Graph the salaries of all players of the team you have chosen. Does the
shape of the data correspond to your answer to question one above?
3. Which measure of central tendency is preferred for this type of data, the
mean or the median, and why? What other summary statistics would you
want in order to get a clear picture of the distribution of salaries?
4. Do other teams appear to have similar distributions?
5. Which of the available teams seems to have the highest paid players?
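The summary statistics asked for in question 1 can be sketched as follows. This is only an illustration: the salary figures are hypothetical and are not taken from NFL.xlsx, and the variable names are assumptions. Comparing the mean with the median gives a first hint about the shape of the distribution.

```python
import statistics

# Hypothetical salaries in US$ for one team (NOT taken from NFL.xlsx).
salaries = [435_000, 510_000, 585_000, 660_000, 745_000, 900_000,
            1_200_000, 2_500_000, 6_000_000, 17_000_000, 250_000]

# The case asks to neglect salaries below 300000 (non-full-time players).
full_time = [s for s in salaries if s >= 300_000]

summary = {
    "max": max(full_time),
    "min": min(full_time),
    "mean": statistics.mean(full_time),
    "median": statistics.median(full_time),
}

# A mean well above the median suggests a right-skewed distribution.
right_skewed = summary["mean"] > summary["median"]

for name, value in summary.items():
    print(f"{name:>6}: {value:,.0f}")
print("mean > median (right skew suggested):", right_skewed)
```

With real data, the list would be replaced by the salary column of the chosen team; the same comparison can then be repeated team by team for questions 4 and 5.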
Most countries of the world have their own currency, and when the buyer of a
product deals in a different currency than the seller, some exchange has to be
made. Markets set exchange rates for most major currencies, but these market
levels vary over time. Therein lies the problem for business.
Assume that you have made a deal to sell something that you price at US$130
for Eur100, based on an exchange rate of US$1.30 = Eur1.00. Before you
complete the deal, however, the rate changes to US$1.25 = Eur1.00. Consequently, the
Eur100 you agreed to take for your product are now only worth $125. You just
lost almost 4% of the purchase price simply because of exchange rate
movements.
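The arithmetic behind this loss can be checked with a short sketch. The numbers are those of the example above; the variable names are assumptions introduced for illustration.

```python
price_eur = 100.0        # agreed price in euros
rate_at_deal = 1.30      # US$ per euro when the deal was struck
rate_at_payment = 1.25   # US$ per euro when the deal is completed

expected_usd = price_eur * rate_at_deal      # what you priced the product at
received_usd = price_eur * rate_at_payment   # what the euros are worth now
loss_usd = expected_usd - received_usd
loss_pct = 100 * loss_usd / expected_usd     # loss as a share of the expected price

print(f"Loss: ${loss_usd:.2f} ({loss_pct:.2f}% of the expected price)")
```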
There are ways to protect against these risks, called hedging. Hedging costs
money too, however, so the "insurance" is not free. Thus, businesspeople are
constantly monitoring exchange rates, trying to predict their movements, and
deciding how much risk (and insurance) to take on in international deals.
National monetary authorities constantly monitor and publish exchange rate
information (see for example the site of the Monetary Authority of Singapore,
https://eservices.mas.gov.sg/Statistics/msb/ExchangeRates.aspx). Assume that
your company's headquarters are in Singapore, and you are planning to conduct a
business deal in the US. The data in the file exchange_rates.xlsx, taken from the MAS
website, report the relevant exchange rates (Singapore $ vs US$, from Jan
2000 to Mar 2015, end-of-month values).
1. Transform the data into growth rates: growth rate in month t =
ln(exchange rate in month t) - ln(exchange rate in month t-1).
2. Assume that growth rates are independent and normally distributed. Denote
the expected value by μ and the standard deviation by σ. Use the
sample average as an estimate of μ and the sample standard deviation as
an estimate of σ.
3. Using your estimates, assess:
1. The probability that exchange rates will grow next month.
2. The probability that exchange rates will decrease next month.
4. Suppose that the profitability of the business deal for both partners
depends on the exchange rate staying within 2% of the last value in either
direction. What is the probability that the deal will turn out to be profitable?
5. All of this planning is based on the assumption that the observations in
this sample are independent and drawn from a Gaussian population. Is there any
evidence against these assumptions?
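Steps 1 to 4 above can be sketched as follows, under the stated assumption of independent, normally distributed growth rates. The exchange rates in the list are hypothetical and are not taken from exchange_rates.xlsx; the variable names are assumptions. "Within 2% of the last value" is read here as the ratio of next month's rate to the last rate lying between 0.98 and 1.02, which is one possible interpretation.

```python
import math
from statistics import NormalDist, mean, stdev

# Hypothetical end-of-month S$/US$ rates (NOT taken from exchange_rates.xlsx).
rates = [1.360, 1.352, 1.371, 1.365, 1.348, 1.339, 1.351, 1.344, 1.330, 1.338]

# Step 1: growth rate in month t = ln(rate_t) - ln(rate_{t-1}).
growth = [math.log(curr / prev) for prev, curr in zip(rates, rates[1:])]

# Step 2: sample mean and sample standard deviation as estimates of mu and sigma.
mu_hat = mean(growth)
sigma_hat = stdev(growth)
model = NormalDist(mu_hat, sigma_hat)

# Step 3: probability of a decrease (growth < 0) and of an increase (growth > 0).
p_down = model.cdf(0.0)
p_up = 1.0 - p_down

# Step 4: probability that next month's rate stays within 2% of the last value,
# i.e. that the log growth rate lies between ln(0.98) and ln(1.02).
p_within = model.cdf(math.log(1.02)) - model.cdf(math.log(0.98))

print(f"mu_hat={mu_hat:.5f}, sigma_hat={sigma_hat:.5f}")
print(f"P(up)={p_up:.3f}, P(down)={p_down:.3f}, P(within 2%)={p_within:.3f}")
```

For step 5, a histogram or normal probability plot of the growth rates, together with a plot of the rates over time, would be natural starting points for checking the assumptions.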
Using the dataset on NFL salaries used for the first case, answer the following
questions:
1. Based only on the data of the first team, provide a point and a 95%
confidence interval estimate of the average salary of NFL players.
2. The average salary in the UK Premier League (Soccer) is reported to be
£1.16 million a year
(http://soccerlens.com/finance-in-english-football-wage-disparities-between-the-divisions/92692/).
Based only on the data of
the first team you picked, test the null hypothesis that the average salary
of US NFL football players is equal to the average salary of UK Premier
League soccer players (do not forget to convert £ to $!).
3. Repeat steps 1 and 2 using all teams. Explain the difference between the
results with one team and with all five teams. Which is more reliable?
4. Do you think your sample can be assumed to be random and
independent? Is the population from which the sample has been drawn
normal?
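Questions 1 and 2 can be sketched as below. Several things here are assumptions: the salaries are hypothetical (not taken from NFL.xlsx), the £ to $ conversion rate is a placeholder to be replaced with an actual rate, and the sketch uses the normal quantile instead of the Student t quantile, which is a large-sample approximation. With a small sample, the t distribution with n-1 degrees of freedom would give a wider interval.

```python
import math
from statistics import NormalDist, mean, stdev

# Hypothetical salaries in US$ millions (NOT taken from NFL.xlsx).
salaries = [0.45, 0.52, 0.60, 0.75, 0.90, 1.10, 1.50, 2.40, 4.00, 9.50]

usd_per_gbp = 1.50                  # ASSUMED placeholder rate, replace with a real one
benchmark = 1.16 * usd_per_gbp      # Premier League average, in US$ millions

n = len(salaries)
x_bar = mean(salaries)                   # point estimate of the average salary
se = stdev(salaries) / math.sqrt(n)      # estimated standard error of the mean

# 95% confidence interval, normal approximation (z instead of t).
z = NormalDist().inv_cdf(0.975)
ci = (x_bar - z * se, x_bar + z * se)

# Two-sided test of H0: mean salary equals the benchmark.
z_stat = (x_bar - benchmark) / se
p_value = 2 * (1 - NormalDist().cdf(abs(z_stat)))

print(f"point estimate: {x_bar:.3f}, 95% CI: ({ci[0]:.3f}, {ci[1]:.3f})")
print(f"z = {z_stat:.3f}, p-value = {p_value:.3f}")
```

Note that rejecting the null hypothesis at the 5% level is equivalent to the benchmark falling outside the 95% interval computed with the same quantile.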
Regression models are often used to estimate house prices. The models can
include many independent variables such as square footage, number of
bedrooms and bathrooms, acreage, whether the house has a fireplace,
basement or patio, the number of stories, the age of the home, etc. The
dependent variable is usually price. This exercise allows you to build your own
real estate model.
Visit the Realtor.com site (http://www.realtor.com/) to obtain data on houses for
sale, their descriptions and their prices. Data on price (dependent variable),
square feet, acres, number of bedrooms and number of bathrooms are usually
available for most houses. It is also possible to collect additional information
(such as the presence of a fireplace, basement, swimming pool …).
The file real_estate_Valparaiso2015.xlsx includes data on price (dependent
variable) and some relevant independent variables for a random sample of 50
houses in Valparaiso, zip code 46385, single-family homes, 0-5 years old.1
1. Illustrate the main characteristics of the real estate market in the area through
appropriate descriptive statistics.
2. Run a regression using all of the explanatory variables. Analyze the results in
terms of t-test significance (which variables should be kept in the model and which
are not significant predictors?), adjusted R² (how much of the variability of price
is explained by the model?) and residual standard deviation. Drop the insignificant
variables: does the model improve in terms of adjusted R²?
3. Try to interpret the sign and magnitude of each coefficient. Are they as
expected?
4. Analyze the residuals using appropriate graphs and statistics.
5. Assume you are interested in buying a house in the area, with given
characteristics (number of square feet, ...).2 Based on your model, work out
the expected price (point estimate and 95% confidence interval) for the house
you are interested in.
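The quantities mentioned in questions 2 and 5 can be illustrated with a deliberately simplified sketch: a least-squares fit with a single explanatory variable (square feet) rather than the full multiple regression the case asks for. The data are hypothetical and are not taken from real_estate_Valparaiso2015.xlsx; the variable names and the 2,500 square foot house are assumptions. Only the point estimate is computed; the interval around it would additionally require a Student t quantile.

```python
import math
from statistics import mean

# Hypothetical data (NOT taken from real_estate_Valparaiso2015.xlsx).
sqft = [1500, 1800, 2100, 2400, 2700, 3000, 3300, 3600]
price = [182_000, 205_000, 241_000, 259_000, 296_000, 318_000, 351_000, 372_000]

n = len(sqft)
k = 1  # number of explanatory variables in this simplified model

# Least-squares slope and intercept for price = intercept + slope * sqft.
x_bar, y_bar = mean(sqft), mean(price)
sxx = sum((x - x_bar) ** 2 for x in sqft)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(sqft, price))
slope = sxy / sxx
intercept = y_bar - slope * x_bar

# Residuals and goodness-of-fit measures.
residuals = [y - (intercept + slope * x) for x, y in zip(sqft, price)]
sse = sum(e ** 2 for e in residuals)           # unexplained variation
sst = sum((y - y_bar) ** 2 for y in price)     # total variation of price
r2 = 1 - sse / sst
r2_adj = 1 - (1 - r2) * (n - 1) / (n - k - 1)
resid_sd = math.sqrt(sse / (n - k - 1))        # residual standard deviation

# Point estimate of the price of a hypothetical 2,500 square foot house.
predicted = intercept + slope * 2500

print(f"slope={slope:.2f} $/sqft, intercept={intercept:.0f}")
print(f"R2={r2:.4f}, adjusted R2={r2_adj:.4f}, residual sd={resid_sd:.0f}")
print(f"predicted price for 2,500 sqft: {predicted:,.0f}")
```

The residuals list is also the natural input for question 4, for example a plot of residuals against fitted values.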