Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Homework Help/Study Tips Expert

Data:

When you google "General Social Survey Data 2006", you will find many links connecting GSS-2006 data. For example, you can use the following link:

http://www.thearda.com/Archive/Files/Descriptions/GSS2006.asp

Then you can download the dataset, say for example, from the section called "Microsoft Excel File", and save it in your computer. Of course you can rename your data file to, say for example, "gss06".

You can download the codebook for all the variables in the survey as well.

Of course you will only use a few variables (columns) of the data for the final exam. So you can delete as many variables (columns) as you like from the original data file. This will save your computer memory when you analyze the data in R or SAS.

Problem 1:

Please work on this problem using SAS.

Import the data into SAS (referred as gss06), and report the number of subjects and number of variables.

Create a dataset (referred as gss06_sub) including only the following 8 variables, AGEWED, AGEKDBRN, WRKSTAT, AGE, CHILDS, EDUC, SEX, and RACE. Read the codebook for the details of these 8 variables.

Print the first 20 observations of the above data named gss06_sub.

Descriptive analysis of two variables "AGEWED" and "AGEKDBRN". For each variable, describe the number of observations, number of missing values (treat "Don't know" and "No answer" as missing), mean and standard deviation, and five-number summary.

Create two datasets; one dataset named "gss1" excluding those subjects who had missing value on variable "AGEWED", and the other dataset named "gss2" excluding those subjects who had missing value of variable "AGEKDBRN". Export these two datasets as CSV data files to be used in R.

Problem 2:

Please work on this problem using R.

Using R to calculate a number based on your initial. For example, the instructor's name is Yixin Fang, so his initial is "yf". The location of "y" in the letters is 25 and the location of "f" in the letters is 6. Then his number is 256, which is equal to 25 × 10 + 6. Hint: You may want to use built-in R object "letters" and R functions "strsplit" and "which".

Use the number you just calculate as the seed to generate a random number between 0 and 1. If the random number is less than 0.5, then you will use the dataset named "gss1" for the following problems. If the random number is bigger than 0.5, then you will use the dataset named "gss2" for the following problems. Name the selected dataset as "mygss". Hint: Use your seed number to generate random numbers so that the grader can reproduce your results.

In your dataset named "mygss", there are 8 variables, but you will only use 7 variables of them. For example, if your dataset is gss1 originally, then the dependent variable is "AGEWED", and then you can ignore variable "AGEKDBRN" and consider the remaining variables (WRKSTAT, AGE, CHILDS, EDUC, SEX, and RACE) as independent variable. Of course, if your dataset is gss2 originally, then the dependent variable is "AGEKDBRN", and then you can ignore variable "AGEWED". Use R to describe and summarize those 7 variables in your dataset named "mygss".

Randomly divide your dataset "mygss" into two halves. In order to this, use R function "sample" to randomly sample m subjects from n subjects without replacement. Here n is the sample size of your dataset "mygss" and m is the largest integer less than n/2. Name the dataset consisting of these m subjects as "mygss_train" and name the data consisting of the remaining subjects as "mygss_test".

Problem 3:

Please work on this problem using SAS.

Use dataset "gss1" to test if "AGEWED" is marginally associated with "WRKSTAT", "AGE", "CHILDS", "EDUC", "SEX", and "RACE", respectively. If one categorical variable has too many categories, you can decide whether or not to dichotomize it.

Use dataset "gss2" to test if "AGEKDBRN" is marginally associated with "WRKSTAT", "AGE", "CHILDS", "EDUC", "SEX", and "RACE", respectively. If one categorical variable has too many categories, you can decide whether or not to dichotomize it.

Summarize and interpret statistical findings you obtained from the above bivariate tests.

Problem 4:

Please work on this problem using R.

Fit a linear regression model using the dataset named "mygss_train", with "AGEWED" or "AGEKDBRN" as dependent variable, and the other 6 variables as independent variables.

Summarize and interpret statistical findings you obtain from the above regression analysis. Report the statistic called "adjusted R-square".

Identify those independent variables that are significantly associated with the dependent variable under significance level of 0.05.

Fit a linear regression model using the dataset named "mygss", with "AGEWED" or "AGEKDBRN" as dependent variable, and but considering only those independent variables that are identified as significant in the preceding step.

Summarize and interpret statistical findings you obtain from the above regression analysis. Report the statistic called "adjusted R-square".

Homework Help/Study Tips, Others

  • Category:- Homework Help/Study Tips
  • Reference No.:- M92055788

Have any Question?


Related Questions in Homework Help/Study Tips

Question topic 1 influenzatarnatural history of

Question: Topic 1 Influenza Tar Natural History of Influenzas The influenza virus was once called the Spanish Flu. This was because at the end of May 1918, the King of Spain, King Alfonso XIII, was the first to openly ad ...

1 prenatal developmentdesign an education course for

1. Prenatal Development Design an education course for expectant parents. At the minimum, prepare an outline of all the topics that you would cover in this course and include a description of any activities and resources ...

Define repression denial displacement regression projection

Define Repression, Denial, Displacement, Regression, Projection, Reaction Formation, & Sublimation, and provide an example of what this would look like in real life.

Introduction this assignment will assist in your gaining a

Introduction: This assignment will assist in your gaining a better understanding of the culture influence in society This assignment fulfills/supports Module Outcome: You will be able to identify some ways that the diffe ...

Question details this assignment contains four parts as

Question: Details: This assignment contains four parts, as identified and described below. Please include four subheadings in your paper that clearly identifies each part. In a 1,500-word paper, include the following: Pa ...

The first draft is a partial draft just to get you to start

The first draft is a partial draft, just to get you to start writing. It needs to be 3 pages long, double spaced. The final draft should be at least 5 pages double spaced. But it could be up to 8. Just be sure that you a ...

Assignment project data mining using rthe goal of this

Assignment Project: Data Mining using R The goal of this project is to applying association rule mining, classification and clustering methods on theMushroom or Ionosphere and groceriesdata sets. For detailed information ...

Big data and analytics assignment - analytic report and

Big Data and Analytics Assignment - ANALYTIC REPORT and PRESENTATION Analytic Report Purpose: The purpose of this task is to provide students with practical experience in working in teams to write a Data Analytical repor ...

Questions 1 what were the main expectations for mr and mrs

Questions: 1. What were the main expectations for Mr. and Mrs. Wade? 2. When and how were these expectations formed? 3. If you were the MOD, would you have gone to all that trouble to help the Wades? 4. Was Mary right in ...

Question select two real companies or businesses you will

Question: Select two real companies or businesses. You will have to give a new name to the companies selected. Your selection will be kept on the secret until the presentation of your final project. Watch these two video ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As