Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Statistics and Probability Expert

All work must be done independently.

A retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case of CHD. Many of the CHD positive men have undergone blood pressure reduction treatment and other programs to reduce their risk factors after their CHD event. In some cases the measurements were made after these treatments. These data are taken from a larger dataset, described in Rousseauw et al, 1983, South African Medical Journal.

There are 463 observations in the dataset. The variables in the dataset are:

sbp - systolic blood pressure
tobacco - cumulative tobacco (kg)
ldl - low density lipoprotein cholesterol adiposity
famhist - family history of heart disease (Present, Absent)
typea - type-A-behavior
obesity
alcohol - current alcohol consumption age - age at onset
chd - response, coronary heart diseease

The data can be found and read into R by the following command:

read.table(" http://www-stat.stanford.edu/ tibs/ElemStatLearn/datasets/SAheart.data" , sep=",",head=T,row.names=1)

If you would prefer to analyze this data in using some other statistical package, you will need to export the data from R using something like a write.table command (or some variation thereof).

The following questions are of practical interest:

1. What are significant predictors of CHD ? What would a final model look like and can you provide an estimate of its predictive accuracy (i.e. do model selection and then evaluate predictive accuracy)? What functional forms are most appropriate for the various predictors in your final model ?

2. Since high Idl often precedes a diagnosis of CHD, will a two stage model which first uses ldl as a response in stage 1 and then CHD as a response in stage 2, provide more accurate predictions of CHD than the model built question 1 above ?

3. There are often situations where finding just one obviously best sub-model is difficult. There may be many good competing sub-models. However, you might decide to bring together multiple models to im¬prove predictive performance. Develop a strategy for doing this on this dataset, being careful to clearly compare and contrast (to the single model approach) predictive performance. Also, make sure to clearly motivate your strategy giving enough intuition so that I can follow things easily.

Please provide complete justifications for why you chose a particular mod¬eling strategy including the underlying assumptions you are making. Analyze the data and provide some overall inferences with regards to the questions being posed. Write a (maximum) 5 page report (tables and figures inclusive) that details your analysis. Computer output may be attached as supplemen¬tary material.

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M9955130
  • Price:- $30

Priced at Now at $30, Verified Solution

Have any Question?


Related Questions in Statistics and Probability

A bottle of water is supposed to have 12 ounces the

A bottle of water is supposed to have 12 ounces. The bottling company has determined that 98% of bottles have the correct amount. Which of the following describes a binomial experiment that would determine the probabilit ...

Phillips owns 25 of mintor inc a private company in 2007

Phillips owns 25 % of Mintor, Inc. (a private company). In 2007, Mintor had sales of $22,00,000, had net income of $82,000 and Mintor paid $32,000 in dividends. If Phillips uses the Equity Income method, what did Phillip ...

According to the current results website the state of

According to the Current Results website, the state of California has a mean annual rainfall of 21 inches, whereas the state of New York has a mean annual rainfall of 55 inches. Assume that the standard deviation for bot ...

Advantage and disadvantage of sole proprietorship than a

Advantage and disadvantage of sole proprietorship than a corporation? Example of a business that is more suitable for both sole proprietorship and corporation structure.

How much of the opposing side should you share in a

How much of the opposing side should you share in a presentation to a multiple-perspective audience, and what techniques would you use?

1 overproduction of uric acid in the body can be an

1. Overproduction of uric acid in the body can be an indication of cell breakdown. This may be an advance indication of illness such as gout, leukemia, or lymphoma.† Over a period of months, an adult male patient has tak ...

Te height of woman ages 20-29 is normally distributed

The height of woman ages 20-29 is normally distributed , with a mean of 64.3 inches. assuming the standard diviation = 2.4 inches. are you more likely to randomly select 1 woman with a height less than 66.2 inches or are ...

Consider the probability distribution shown belowx 0 1 2 px

Consider the probability distribution shown below. x 0 1 2 P(x) 0.65 0.30 0.05 Compute the expected value of the distribution. Compute the standard deviation of the distribution. (Round your answer to four decimal places ...

It has been a bad day for the stock market and you have

It has been a bad day for the stock market and you have heard that only 30% of all stocks gained value. Suppose you have a portfolio of 10 securities and assume a binomial distribution for the number of your stocks that ...

According to a field poll 79 of california adults actual

According to a Field Poll, 79% of California adults (actual results are 400 out of 506 surveyed) feel that "education and our schools" is one of the top issues facing California. We wish to construct a 90% confidence int ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As