Assignment: Multivariate Data Analysis

Part A

Refer to the data in FoodConsumptionNutrients en.xls. It has information for about 175 countries. Choose 30 or so countries that interest you to work on. Be sure that you use countries from at least three different country groups from different regions (see the sheet CountryGroupComposition to get some ideas for groupings that you might use). Collect the information on energy consumption, fat consumption and protein consumption for your chosen countries onto a single sheet. Create a variable for the country group.

1. Choose two of the three original variables. Draw a scatterplot with the country group of each point indicated. Comment.

2. Generate classification rules using

• Linear discriminant analysis

• Quadratic discriminant analysis

• Multinomial logistic regression

• Classification trees

3. Using the confusion matrix and the apparent error rate, compare the effectiveness of each of the classification rules.
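As a rough illustration of what question 3 asks for, the sketch below cross-tabulates actual against predicted groups and computes the apparent error rate, that is, the proportion of the fitting data that the rule misclassifies. It is written in plain Python purely to show the arithmetic; the function names and the eight example labels are hypothetical and are not taken from the food consumption data. Because the apparent error rate is computed on the same observations used to build the rule, it tends to be optimistic.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Cross-tabulate actual vs. predicted group labels.

    Returns (labels, matrix), where matrix[i][j] counts observations whose
    actual group is labels[i] and whose predicted group is labels[j].
    """
    labels = sorted(set(actual) | set(predicted))
    counts = Counter(zip(actual, predicted))
    matrix = [[counts[(a, p)] for p in labels] for a in labels]
    return labels, matrix


def apparent_error_rate(matrix):
    """Proportion of observations that lie off the diagonal."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return (total - correct) / total


# Hypothetical labels for eight countries (illustration only, not real data).
actual = ["Africa", "Africa", "Africa", "Asia", "Asia",
          "Europe", "Europe", "Europe"]
predicted = ["Africa", "Asia", "Africa", "Asia", "Asia",
             "Europe", "Asia", "Europe"]

labels, matrix = confusion_matrix(actual, predicted)
aper = apparent_error_rate(matrix)

for label, row in zip(labels, matrix):
    print(label, row)
print("Apparent error rate:", aper)
```

Running the same two functions on the predictions from each of the four rules gives one confusion matrix and one error rate per rule, which can then be compared side by side.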

4. Assume that you did not know which countries were in which groups. Use the following methods to group the observations.

• One hierarchical implementation of cluster analysis

• K-means cluster analysis

• Multidimensional scaling

Do any of these correctly divide all the observations into the original groups?
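One way to answer this last question is sketched below, assuming each clustering method has already produced one label per country. Cluster labels are arbitrary (cluster 2 need not correspond to the second group), so the check is whether groups and clusters are in one-to-one correspondence rather than whether the labels are equal. The function names and example labels are hypothetical. Multidimensional scaling produces coordinates rather than labels, so a check like this would apply only after groups have been read off the resulting plot.

```python
from collections import Counter


def cross_tabulate(groups, clusters):
    """Count observations for each (original group, cluster label) pair."""
    return Counter(zip(groups, clusters))


def recovers_groups(groups, clusters):
    """True if the clusters partition the observations exactly as the groups do.

    Every group must fall into a single cluster and every cluster must hold
    a single group. That is the case exactly when the number of distinct
    (group, cluster) pairs equals both the number of groups and the number
    of clusters.
    """
    pairs = set(zip(groups, clusters))
    return len(pairs) == len(set(groups)) == len(set(clusters))


# Hypothetical labels for six countries (illustration only, not real data).
groups = ["Africa", "Africa", "Asia", "Asia", "Europe", "Europe"]
kmeans_labels = [2, 2, 0, 0, 1, 1]   # same partition, different names
hclust_labels = [1, 1, 1, 2, 3, 3]   # one Asian country joins cluster 1

print(recovers_groups(groups, kmeans_labels))
print(recovers_groups(groups, hclust_labels))
print(cross_tabulate(groups, hclust_labels))
```

When the check fails, the cross-tabulation shows which groups were split or merged, which is usually more informative than the yes/no answer alone.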

Part B

Find two datasets using online sources that you can use to demonstrate the techniques that you have learned in this subject. Some good places to find interesting data are:

• http://blog.visual.ly/data-sources/
• http://blog.bigml.com/2013/02/28/data-data-data-thousands-of-public-data-sources/
• http://www.tableausoftware.com/public/community/sample-data-sets
• https://www.kaggle.com/
• http://lib.stat.cmu.edu/DASL/
• http://www.models.kvl.dk/datasets
• http://research.library.gsu.edu/c.php?g=115854&p=754836
• http://www.stat.ufl.edu/~winner/datasets.html
• http://www.statsci.org/data/

You must get approval from me for your datasets before you begin. I may not approve two students using the same dataset.

Some datasets are quite extensive and you may feel that you can illustrate a range of techniques with different subsets of the same dataset. If you think this applies to your chosen dataset talk to me about this when you are getting approval for your dataset.

If you are having trouble thinking about what you need to be able to do, think back over the broad areas that we have covered in class: inferences about mean vectors, MANOVA (one- and two-way), multivariate linear regression, PCA and factor analysis, canonical correlation, and discrimination and classification, including clustering. You don't need to show that you can do all of these, but I would hope (read: expect) to see at least 5 of these broad areas represented in your answer.

For each of your chosen datasets, you need to pose one or more questions that you believe you can (try to) address using the dataset. You then need to use appropriate techniques to analyse the data to address the research question(s) that you have posed. Finally, you will need to reflect on the adequacy of the dataset to address the questions that you have posed, and make suggestions about how you might collect the data differently to better address your question (consider what to collect or how to collect, for instance).

Your answer to this question should include (separately for each of the two datasets, if appropriate):

• A report that describes the data, poses the research question(s), analyses the data to address the research question(s) and reflects on the usefulness of the data for answering the question(s). This should be in a report format, with essential output in the report and any other output that you use in an appendix. You should also indicate where you obtained the data (e.g. a reference to a paper or a URL).

• A .R file containing your code.

• A .csv file containing the data set (if it is not already in your .R file)

Attachment: FoodConsumptionNutrients en.xlsx
