Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Statistics and Probability Expert

Part A

Answer all of these questions using R Commander. Include all graphs, outputs from R Commander .

This assignment uses the dataset called "Anscombe". This contains data on four different variables for different states in the USA.
Data in attached packages -> Anscombe

The dataset contains:
education - Per-capita education expenditure.
income - Per-capita income.
young - Number of under 18 year olds per 1000 people.
urban - Number of urban dwellers per 1000 people for each state in the USA.

Of the four variables, you will need investigate two of them.

To randomize which variables each student will be using, x = the last digit of your student ID

If x = 0 - 1, use education and income
If x = 2 - 3, use education and young
If x = 4 - 5, use education and urban
If x = 6 - 7, use income and young
If x = 8, use income and urban
If x = 9, use young and urban

  1. Write a brief description of the data you're working with.
  2. Produce a histogram of both variables, and describe both distributions.
  3. Using R Commander, test to see which of the two variables is most normal. Justify your choice.
  4. For the most normal of your two variables, assume the data is taken from a normally distributed population to answer the following questions:
    a. Find the mean and standard deviation.
    b. Find the 80th percentile. Explain in words what this value means.
    c. What is the probability of a randomly selected member of the population being within 17 units of the mean? Use R Commander to plot the normal distribution and use vertical lines to show the region of interest.
    d. What is the z score of the point 33 units above the mean? Interpret the meaning of this value.
  5. Make a scatterplot showing the relationship between the two variables. Describe the relationship. Relate this back to the meaning of the data.
  6. Produce r, the correlation coefficient. Explain in words what this value shows?
  7. Find the other correlation coefficients for the relationships between the other variables in Anscombe. Compare and contrast with the correlation coefficient found for your data, and relate these back to the meaning of the data.
  8. Produce r2. Explain in words what this value shows?  

 

Part B

To gather the above information in each state, surveys were done in each state. 4 random districts were selected from a list of all of the districts in the state, and the phone directories were used to select and survey 1000 random people from each district.

  1. What is the name for this kind of sampling?
  2. What were the sampling frames used here? Could better sampling frames have been used? If you answered "yes", suggest an example of a better sampling frame.
  3. What is the difference between a sample and a census? Why do you think a census was not used in this instance?
  4. List three possible sources of sampling bias for this sampling method. Justify each one. 

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M9131891

Have any Question?


Related Questions in Statistics and Probability

A coin is biased so that the probability of obtaining heads

A coin is biased so that the probability of obtaining heads is ¾, What is the probability of obtaining at least three heads in four tosses of the coin?

Five percent of the eyeglasses sold at an optical retailer

Five percent of the eyeglasses sold at an optical retailer have tinted lenses. Forty pairs of glasses are sold on a particular day and six have tinted lenses. Identify n, p, and x.

According to annbspairline flights on a certain route are

According to an? airline, flights on a certain route are on time 80% of the time. Suppose 13 flights are randomly selected and the number of?flights is recorded. ?a)  Explain why this is a binomial experiment. ?b)  Deter ...

Two candidates face each other in an election the

Two candidates face each other in an election. The Democratic candidate is supported by 58% of the population, and the Republican candidate is supported by 42%. In other words, if you randomly chose a voter and asked the ...

54 of public high school students are provided a computer

54% of public high school students are provided a computer by their school district. 40 students are selected at random. The random variable represents the number of students who have been provided a computer by their sc ...

Choose the correct problem formulation it is known that 70

Choose the correct problem formulation: It is known that 70% of the customers in a sporting goods store purchase a pair of running shoes. A random sample of 25 customers is selected. Assume that customers' purchases are ...

1 the personnel office at a large electronics firm

1. The personnel office at a large electronics firm regularly schedules job interviews and maintains records of the interviews. From the past records, they have found that the length of a first interview is normally dist ...

The time between accidents in a day in town a is given by t

The time between accidents in a day in town A is given by T and the time between accidents in a day in town B is given by  S. The joint density function for  T and  S is   f ( t , s )= e^  -( t + s )    , for  t ≥0  and  ...

A researcher reports that the size of an effect in

A researcher reports that the size of an effect in Population A is d = 0.10 and the effect size in Population B is d = 0.34. Which population is associated with greater power to detect an effect? A) Population A B) Popul ...

Consider the study of two insect populations at two

Consider the study of two insect populations at two experimental stations. At station A, the egg hatch rate (computed from a set of 100 eggs at a time) is known to follow approximately a normal distribution with mean 62% ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As