Introduction to Data Science

Task

Background

A research team planned to study the health development of the world over the past 15 years. The team retrieved a dataset of Health and Population Statistics for 2001 to 2015 from the World Bank (http://databank.worldbank.org).

The dataset contains the following attributes:
- Birth rate, crude (per 1,000 people)
- Fertility rate, total (births per woman)
- Adolescent fertility rate (births per 1,000 women ages 15-19)
- Death rate, crude (per 1,000 people)
- Cause of death, by communicable diseases and maternal, prenatal and nutrition conditions (% of total)
- Cause of death, by injury (% of total)
- Cause of death, by non-communicable diseases (% of total)
- Mortality caused by road traffic injury (per 100,000 people)
- Health expenditure per capita (current US$)
- GNI per capita, Atlas method (current US$)
- Health expenditure, private (% of GDP)
- Health expenditure, public (% of GDP)
- Health expenditure, total (% of GDP)
- Maternal mortality ratio (national estimate, per 100,000 live births)
- Immunization, BCG (% of one-year-old children)
- Life expectancy at birth, male (years)
- Life expectancy at birth, female (years)
- Life expectancy at birth, total (years)
- School enrollment, primary (% gross)
- School enrollment, secondary (% gross)
- School enrollment, tertiary (% gross)
- School enrollment, tertiary, female (% gross)
- Total alcohol consumption per capita (liters of pure alcohol, projected estimates, 15+ years of age)
- Unemployment, female (% of female labor force) (modeled ILO estimate)
- Unemployment, male (% of male labor force) (modeled ILO estimate)
- Unemployment, total (% of total labor force) (modeled ILO estimate)

More details about the data attributes and data content can be found in the attached documents.

Assignment Task

You are a member of the team and need to perform data analysis on countries in the East Asia & Pacific region.

The team has not set any specific goal for the analysis. You are therefore free to explore the data and report anything you find interesting or significant.

You have been requested to prepare a data analysis report about your work and explain your findings. The potential audiences include other researchers, business representatives, and government agencies. They may have limited ICT or mathematical knowledge.

To prepare the report, please use the following outline:

1. Introduction
Provide an introduction to the problem. Include background material as appropriate: who cares about this problem, what impact it has, and where the data comes from.

2. Data Setup
Describe how to load the data and which libraries are needed. Provide an overview of the data, including its dimensions and structure.
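A minimal sketch of this step in base R is shown below. It assumes the World Bank extract has been saved as a CSV file; the file name, column names, and values are placeholders invented for illustration, and a tiny stand-in file is written first so that the snippet runs on its own.

```r
# Hypothetical stand-in for the real export: three rows with made-up values.
sample_rows <- data.frame(
  Country = c("Australia", "Japan", "Fiji"),
  Year    = c(2015, 2015, 2015),
  LifeExp = c(82.4, 83.8, 70.1),  # illustrative numbers, not World Bank data
  stringsAsFactors = FALSE
)
csv_path <- file.path(tempdir(), "health_sample.csv")  # assumed file name
write.csv(sample_rows, csv_path, row.names = FALSE)

# Load the data; base R is enough here, so no extra libraries are needed.
health <- read.csv(csv_path, stringsAsFactors = FALSE)

# Overview of dimensions and structure.
dim(health)      # number of rows and columns
str(health)      # column names and types
summary(health)  # basic statistics per column
```

With the real file, only the `read.csv()` path would change; `dim()` and `str()` give the overview of size and structure that this section asks for.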

3. Exploratory Data Analysis
Perform three one-variable analyses. Plot at least one graph for each variable. Explain why the selected graph is appropriate.
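As a rough illustration of a one-variable analysis, the sketch below draws a histogram and a boxplot for a single numeric attribute. The values are randomly generated stand-ins for life expectancy, not figures from the dataset, and the variable name `life_exp` is an assumption.

```r
# Made-up life expectancy values (years) for illustration only.
set.seed(1)
life_exp <- round(rnorm(30, mean = 74, sd = 6), 1)

# Histogram: shows the shape of the distribution of one numeric variable.
hist(life_exp,
     main = "Life expectancy at birth (illustrative data)",
     xlab = "Years", col = "grey80")

# Boxplot: shows the median, quartiles, and possible outliers at a glance.
boxplot(life_exp, horizontal = TRUE,
        main = "Life expectancy at birth (illustrative data)",
        xlab = "Years")

# Numeric summary to accompany the plots.
summary(life_exp)
```

A histogram suits a continuous variable because it reveals skew and multiple peaks; a bar chart would be the usual alternative for a categorical variable.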

Perform two two-variable analyses. Plot at least one graph for each pair of variables. Explain why the selected graph is appropriate.
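One possible shape for a two-variable analysis is a scatter plot with a correlation coefficient, sketched below. The two variables are simulated stand-ins for GNI per capita and life expectancy; the numbers and the relationship between them are invented for illustration.

```r
# Made-up values for two numeric attributes; not World Bank data.
set.seed(2)
gni      <- runif(25, min = 1000, max = 50000)         # GNI per capita (US$)
life_exp <- 60 + 5 * log10(gni) + rnorm(25, sd = 1.5)  # life expectancy (years)

# Scatter plot: each point is one observation, so the joint pattern is visible.
plot(gni, life_exp, log = "x",
     main = "Life expectancy vs GNI per capita (illustrative data)",
     xlab = "GNI per capita (US$, log scale)",
     ylab = "Life expectancy (years)")

# Pearson correlation between log income and life expectancy.
r <- cor(log10(gni), life_exp)
r
```

A scatter plot is the natural choice when both variables are numeric; a grouped boxplot would fit better when one of them is categorical, such as country or year.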

The analysis can be performed on all years and all countries, or on a subset of your interest.

4. Advanced Analysis
Clustering
Briefly explain the concept of clustering and k-means.
Try to do a clustering analysis to group countries according to some selected attributes.
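For orientation, k-means splits observations into k groups by repeatedly assigning each observation to its nearest cluster centre and then moving each centre to the mean of its members. The sketch below applies `kmeans()` from base R to eight hypothetical countries; the country names, attribute values, and the choice of k = 2 are assumptions made for illustration.

```r
# Hypothetical countries with made-up values; not World Bank data.
set.seed(42)
countries <- data.frame(
  name       = paste("Country", LETTERS[1:8]),
  birth_rate = c(9, 10, 12, 11, 24, 27, 22, 25),  # per 1,000 people
  life_exp   = c(82, 81, 80, 79, 66, 64, 68, 65)  # years
)

# Standardise the attributes so that neither dominates the distance measure.
features <- scale(countries[, c("birth_rate", "life_exp")])

# Run k-means with k = 2; nstart repeats the random initialisation.
km <- kmeans(features, centers = 2, nstart = 10)
countries$cluster <- km$cluster
countries

# Plot the countries coloured by their assigned cluster.
plot(countries$birth_rate, countries$life_exp, col = countries$cluster, pch = 19,
     main = "k-means clusters (illustrative data)",
     xlab = "Birth rate (per 1,000 people)", ylab = "Life expectancy (years)")
```

Scaling matters because k-means relies on Euclidean distance, so an attribute measured in large units would otherwise outweigh the others.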

Linear Regression
Briefly explain the concept of linear regression.
Try to do two linear regression analyses. Plot the learned models.
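Linear regression fits a straight line that predicts one numeric variable from another by minimising the squared prediction errors. A minimal sketch with `lm()` follows; the data are simulated stand-ins for health expenditure and life expectancy, and the coefficients used to generate them are invented.

```r
# Simulated data for illustration only; not World Bank figures.
set.seed(7)
health_exp <- seq(100, 3000, length.out = 20)               # US$ per capita
life_exp   <- 65 + 0.005 * health_exp + rnorm(20, sd = 1)   # years

# Fit the model: life expectancy as a linear function of health expenditure.
fit <- lm(life_exp ~ health_exp)
coef(fit)               # intercept and slope
summary(fit)$r.squared  # share of variance explained

# Plot the observations and overlay the learned regression line.
plot(health_exp, life_exp, pch = 19,
     main = "Linear regression (illustrative data)",
     xlab = "Health expenditure per capita (US$)",
     ylab = "Life expectancy (years)")
abline(fit, col = "red", lwd = 2)
```

The slope reads as the expected change in life expectancy for each additional dollar of health expenditure, which is a form of result that non-technical readers can usually follow.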

The analysis can be performed on all years and all countries, or on a subset of your interest.

5. Conclusion

6. Reflections
In this part, discuss any difficulties you had performing the analysis and how you solved those difficulties. Reflect on how the analysis process went for you, what you learnt, and what you might do differently next time.

For the data analysis, you need to provide both the R code and an explanation of the code and its results. For sections 2 to 4, present each R code snippet in a box with comments. For example:

Report Format

Your report should be at least 1,200 words and preferably no longer than 2,000 words. All comments and graph titles are counted.

The report MUST be formatted using the following guidelines:
- Paragraph text - 12 point Calibri single line spacing
- Headings - Arial in an appropriate type size
- Margins - 2.5cm on all margins
- Header - Report title
- Footer - page number (including the word "Page")
- Page numbering - roman numerals (i, ii, iii, iv) up to and including the Table of Contents, restart numbering using conventional numerals (1, 2, 3, 4) from the first page after the Table of Contents.
- Title Page - Must not contain headers or footers. Include your name as the report's author but DO NOT include any reference to your student ID, course code or course name.
- The report is to be created as a single Microsoft Word document. No other format is acceptable; submitting in another format will result in a deduction of marks.
