Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Applied Statistics Expert

The Big Data Assignment is comprised of two parts:

- The first part is to create the algorithms in the tasks, namely: Decision Tree, Gradient Boosted Tree and Linear regression and then to apply them to the bike sharing dataset provided. Try and produce the output given in the task sections (also given in the Big-Data Assignment.docx provided on Blackboard).

- The second part is then use those algorithms created in the first part and apply them to another dataset chosen from Kaggle (other than the bike sharing dataset provided).

1. Utilising Python 3 Build the following regression models:
- Decision Tree
- Gradient Boosted Tree
- Linear regression
2. Select a dataset (other than the example dataset given in section 3) and apply the Decision Tree and Linear regression models created above. Choose a dataset
3. Build the following in relation to the gradient boost tree and the dataset choosen in step 2
a) Gradient boost tree iterations (see Big-Data Assignment.docx section 6.1)
b) Gradient boost tree Max Bins (see Big-Data Assignment.docx section 7.2)
4. Build the following in relation to the decision tree and the dataset choosen in step 2
a) Decision Tree Categorical features
b) Decision Tree Log (see Big-Data Assignment.docxsection 5.4)
c) Decision Tree Max Bins (see Big-Data Assignment.docx section 7.2)
d) Decision Tree Max Depth (see Big-Data Assignment.docx section 7.1)
5. Build the following in relation to the linear regression and the dataset choosen in step 2
a) Linear regression Cross Validation
i. Intercept (see Big-Data Assignment.docx section 6.5)
ii. Iterations (see Big-Data Assignment.docx section 6.1)
iii. Step size (see Big-Data Assignment.docxsection 6.2)
iv. L1 Regularization (see Big-Data Assignment.docx section 6.4)
v. L2 Regularization (see Big-Data Assignment.docx section 6.3)
b) Linear regression Log (see Big-Data Assignment.docx section 5.4)
6. Follow the provided example of the Bike sharing data set and the guide lines in the sections that follow this section to develop the requirements given in steps 1,3,4 and 5

3.1 Task 1
Task 1 is comprised of developing:
1. Decision Tree
a) Decision Tree Categorical features
b) Decision Tree Log (see Big-Data Assignment.docx section 5.4)
c) Decision Tree Max Bins (see Big-Data Assignment.docx section 7.2)
d) Decision Tree Max Depth (see Big-Data Assignment.docx section 7.1)

3.2 Task 2
Task 2 is compromised of developing:
1. Gradient boost tree
a) Gradient boost tree iterations (see Big-Data Assignment.docx section 6.1)
b) Gradient boost tree Max Bins (see Big-Data Assignment.docxsection 7.2)
c) Gradient boost tree Max Depth (see Big-Data Assignment.docx section 7.1)

3.3 Task 3
Task 3 is compromised of developing:
1. Linear regression model
a) Linear regression Cross Validation
i. Intercept (see Big-Data Assignment.docx section 6.5)
ii. Iterations (see Big-Data Assignment.docx section 6.1)
iii. Step size (see Big-Data Assignment.docx section 6.2)
iv. L1 Regularization (see Big-Data Assignment.docx section 6.4)
v. L2 Regularization (see Big-Data Assignment.docx section 6.3)
b) Linear regression Log (see Big-Data Assignment.docx section 5.4)

Attachment:- Marking Creiteria.rar

Applied Statistics, Statistics

  • Category:- Applied Statistics
  • Reference No.:- M93083077

Have any Question?


Related Questions in Applied Statistics

Business data analysis computer assignment -part 1

Business Data Analysis Computer Assignment - PART 1 - Economists believe that high rates of unemployment are linked to decreased life satisfaction ratings. To investigate this relationship, a researcher plans to survey a ...

Business data analysis facts from figures assignment

BUSINESS DATA ANALYSIS: FACTS FROM FIGURES Assignment - Question 1 - Private capital expenditure for 12 successive quarters are presented in the following table: Quarter Millions 1 31,920 2 25,120 3 30,350 4 24,650 5 30, ...

Assignment -in this assignment ms excel must be used to

Assignment - In this assignment MS Excel must be used to perform any calculations/graphical presentations as required in this assignment. Question 1 - Below you are given the examination scores of 20 students (data set a ...

Business analytics and statistics research report

Business Analytics and Statistics Research Report Assignment - This assignment is based on fictional data - do not contact the company listed below. You are creating a business report for the CEO of a retail company call ...

Business analytics and statistics research report -this

Business Analytics and Statistics Research Report - This assignment is based on fictional data. You are creating a business report for the CEO of a retail company called, Athlete Panda. It must be professional in present ...

Exercise -q1 do the example data in table 35-2 meet the

Exercise - Q1. Do the example data in Table 35-2 meet the assumptions for the Pearson χ 2 test? Provide a rationale for your answer. Q2. Compute the χ 2 test. What is the χ 2 value? Q3. Is the χ 2 significant at α = 0.05 ...

Question 1 for the prostate data set fit a model with lpsa

Question 1. For the prostate data set, fit a model with lpsa as the response, and the other variables as predictors. (a) Suppose a new patient with the following values arrives: lcavol = 1.45000, lweight = 3.59801, age = ...

Part a -question 1 - true or false in data collection the

Part A - Question 1 - True or False: In data collection, the most common technique to ensure proper representation of the population is to use a random sample. True False Question 2 - Most analysts focus on the cost of H ...

Business analytics and statistics research reportthis

Business Analytics and Statistics Research Report This assignment is based on fictional data. You are creating a business report for the CEO of a retail company called, Athlete Panda. It must be professional in presentat ...

You are expected to work in groups and write a research

You are expected to work in groups and write a research report. When you work on your report, you need to use the dataset, and other sources such as journal articles. If you use website material, please pay attention to ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As