Ask Statistics and Probability Expert

Summary Statistics and Hypothesis Testing Problem Set

Useful Commands for this Problem Set -

The underlined portion of each command indicates the abbreviation that can be used for that command.  You should also refer to the guide from the Stata tutorial as you work through this problem set.

codebook: Provides a summary of a variable, including the mean, standard deviation, number of unique values, etc.

summarize: Provides summary statistics for a variable.

ttest: Performs a t-test of the mean.

Part I - Note: Part I does not require the use of Stata.

Variable

Definition

Hurricaned

Coded 1 if a FL county was hit by a hurricane, 0 otherwise.

Bush04

The % of the vote for Bush in the 2004 presidential election in FL counties (among the 2 major parties).

. sum bush04

    Variable |       Obs        Mean    Std. Dev.       Min        Max

-------------+--------------------------------------------------------

    bush04 |        67         .6    .1079281         .3        .78

1. Using the Stata output presented above, provide the following summary statistics. If you perform a calculation using the statistics provided in the Stata output, please show your work.

  • What is the mean of the bush04 variable (X ¯)?
  • What is the variance of the bush04 variable (SX2)
  • What percentage of the vote share did Bush win in the county with the highest % of the vote in his favor?

2. Research by political scientists, Larry Bartels and Chris Achen, suggests that the act of voting can be more irrational than previously thought. They examine whether voters tend to punish incumbents for acts that could not in any way be the fault of an incumbent or an incumbent party. For instance, they looked at the effects of shark attacks in New Jersey in 1916 on the vote for Wilson in the presidential election. Findings suggested the shark attacks had a negative impact on the vote for Wilson. The counties that experienced the attacks failed to support the incumbent in that election. It seemed voters blamed the incumbent for something entirely out of his control. Bartels and Achen examined this hypothesis on other contexts, noting that voters tend to blame incumbents for droughts and other natural events outside of the government's influence.

Do voters blame incumbents for situations the government has no control over? To test this hypothesis, this question focuses on the impact of hurricanes in the 2004 presidential election in Florida at the county level. That year, Florida experienced four back to back hurricanes within months and weeks of the election. Did voters blame incumbent president Bush? The dependent variable is the Bush vote (of the 2-party vote) in the 2004 presidential election in Florida.

The variable Hurricaned equals 1 if a FL county was hit by a hurricane (hurricane-hit) and 0 otherwise (no-hurricane). Notice that 4 counties were not hit by a hurricane.

  • What is % mean bush04 vote among counties that were not hit by a hurricane (X ¯no-hurricane)?
  • What is mean % bush04 vote among hurricane-hit counties (X ¯hurricane-hit)?
  • What is the difference between these means (X ¯no-hurricane- X ¯hurricane-hit)?
  • What is the standard error of the difference between the means?
  • State the null hypothesis and the alternative hypothesis to test that the mean vote for Bush in '04 in counties that were not hit by hurricanes is the same as the meanvote for Bush in hurricane-hit counties.
  • Calculate the t-statistic you would use to test the null hypothesis.
  • Test the null hypothesis at the 5% significance level. Do you reject or fail to reject the null hypothesis?
  • Calculate a 95% confidence interval for the difference between the means (show your work).
  • Does this evidence support the argument that voters punish the incumbent for situations outside their control (e.g. natural disasters)? In your explanation, consider whether this analysis excludes any important variables.

Part II - In this section you will use Stata to conduct an analysis on the relationship between having a college education and support for Trump in the 2016 presidential election.

This past year, it was widely reported that Trump's favorability was highest among those without a college education. Do education levels explain Trump's popularity in the recent election? Using the cnn2016election.dta dataset, determine whether there is a difference in support for Trump among those with and without a college education.

3. To accomplish this task, you will first need to create a new variable for college education with two categories, where 1 contains all individuals with at least some college education, and 0 contains all those without any college education. Label the new variable "colleduc."

  • State the null and alternative hypothesis for this test.
  • What is the t-statistic? Show how this t-statistic was calculated.
  • What is the result of the hypothesis test?
  • Do you believe this analysis provides a satisfactory answer to the question of whether education levels explain support for Trump in '16? Why or why not?

4. In Stata, repeat the t-test above but, instead of comparing college educated and non-college educated respondents, compare males and females. To do this, create a new variable labeled "gender" (from the variable "sex"), where male equals 0 and female equals 1.

  • What is the t-statistic reported in the Stata output?
  • Is there a statistically significant difference in support for Trump between men and women?

Attachment:- Assignment.rar

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M92185999
  • Price:- $60

Priced at Now at $60, Verified Solution

Have any Question?


Related Questions in Statistics and Probability

Introduction to epidemiology assignment -assignment should

Introduction to Epidemiology Assignment - Assignment should be typed, with adequate space left between questions. Read the following paper, and answer the questions below: Sundquist K., Qvist J. Johansson SE., Sundquist ...

Question 1 many high school students take the ap tests in

Question 1. Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year,of the 211,693 students who too ...

Basic statisticsactivity 1define the following terms1

BASIC STATISTICS Activity 1 Define the following terms: 1. Statistics 2. Descriptive Statistics 3. Inferential Statistics 4. Population 5. Sample 6. Quantitative Data 7. Discrete Variable 8. Continuous Variable 9. Qualit ...

Question 1below you are given the examination scores of 20

Question 1 Below you are given the examination scores of 20 students (data set also provided in accompanying MS Excel file). 52 99 92 86 84 63 72 76 95 88 92 58 65 79 80 90 75 74 56 99 a. Construct a frequency distributi ...

Question 1 assume you have noted the following prices for

Question: 1. Assume you have noted the following prices for paperback books and the number of pages that each book contains. Develop a least-squares estimated regression line. i. Compute the coefficient of determination ...

Question 1 a sample of 81 account balances of a credit

Question 1: A sample of 81 account balances of a credit company showed an average balance of $1,200 with a standard deviation of $126. 1. Formulate the hypotheses that can be used to determine whether the mean of all acc ...

5 of females smoke cigarettes what is the probability that

5% of females smoke cigarettes. What is the probability that the proportion of smokers in a sample of 865 females would be greater than 3%

Armstrong faber produces a standard number-two pencil

Armstrong Faber produces a standard number-two pencil called Ultra-Lite. The demand for Ultra-Lite has been fairly stable over the past ten years. On average, Armstrong Faber has sold 457,000 pencils each year. Furthermo ...

Sppose a and b are collectively exhaustive in addition pa

Suppose A and B are collectively exhaustive. In addition, P(A) = 0.2 and P(B) = 0.8. Suppose C and D are both mutually exclusive and collectively exhaustive. Further, P(C|A) = 0.7 and P(D|B) = 0.5. What are P(C) and P(D) ...

The time to complete 1 construction project for company a

The time to complete 1 construction project for company A is exponentially distributed with a mean of 1 year. Therefore: (a) What is the probability that a project will be finished in one and half years? (b) What is the ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As