Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Homework Help/Study Tips Expert

Machine Learning Project Assessment -

Learning Outcomes -

  • Perform linear regression, classification using logistic regression and linear Support Vector Machines.
  • Perform non-linear classification using Support Vector Machines with kernels, Decision trees and Random forests.
  • Understand the concept of maximum likelihood and Bayesian estimation.
  • Construct a multi-layer neural network using backpropagation training algorithm.
  • Perform model selection and compute relevant evaluation measure for a given problem.

Purpose - This assessment is an extensive machine learning project. Students will be given a specific data set for analysis and will be required to develop and compare various classification techniques. Each student must demonstrate skills acquired in data representation, classification and evaluation.

Instructions

  • the dataset consists of training and testing data in "train" and "test" folders. Use training data: X_train.txt labels: y_train.txt and testing data: X_test.txt labels: y_test.txt. There are other files that also come with the dataset and may be useful in understanding the dataset better.
  • Please read the pdf file "dataset-paper.pdf" to answer Part 1.

Task A: Understanding the data

Answer the following questions briefly, after reading the paper

  • What is the objective of the data collection process?
  • What human activity types does this dataset have? How many subjects/people have performed these activities?
  • How many instances are available in the training and test sets? How many features are used to represent each instance? Summarize the type of features extracted in 2-3 sentences.
  • Describe briefly what machine learning model is used in this paper for activity recognition and how is it trained. How much is the maximum accuracy achieved?

Task B: K-Nearest Neighbor Classification

Build a K-Nearest Neighbor classifier for this data.

  • Let K take values from 1 to 50. For choosing the best K, use 10-fold cross-validation. Choose the best value of K based on model F1-score.
  • Show a plot of cross-validation accuracy with respect to K.
  • Using the best K value, evaluate the model performance on the supplied test set. Report the confusion matrix, multi-class averaged F1-score and accuracy.

Task C: Multiclass Logistic Regression with Elastic Net

Build an elastic-net regularized logistic regression classfier for this data.

  • Elastic-net regularizer takes in 2 parameters: alpha and l1-ratio. Use the following values for alpha: 1e-4,3e-4,1e-3,3e-3, 1e-2,3e-2. Use the following values for l1-ratio: 0,0.15,0.5,0.7,1.
  • Choose the best values of alpha and l1-ratio using 10-fold cross-validation, based on model F1-score.
  • Draw a surface plot of F1-score with respect to alpha and l1-ratio values.
  • Use the best value of alpha and l1-ratio to re-train the model on the training set and use it to predict the labels of the test set. Report the confusion matrix, multi-class averaged F1-score and accuracy.

Task D: Support Vector Machine (RBF Kernel)

Build a SVM (with RBF Kernel) classfier for this data.

  • SVM with RBF takes 2 parameters: gamma (length scale of the RBF kernel) and C (the cost parameter). Use the following values for gamma: 1e-3, 1e-4. Use the following values for C: 1, 10, 100, 1000.
  • Choose the best values of gamma and C using 10-fold cross-validation, based on model F1-score.
  • Draw a surface plot of F1-score with respect to gamma and C.
  • Use the best value of gamma and C to re-train the model on the training set and use it to predict the labels of the test set. Report the confusion matrix, multi-class averaged F1-score and accuracy.

Task E: Random Forest

Build a Random forest classifier for this data.

  • Random forest uses two parameters: the tree-depth for each decision tree and the number of trees. Use the following values for the tree-depth: 300,500,600. Use the following values for the number of trees: 200,500,700.
  • Choose the best values of tree-depth and number of trees using 10-fold cross-validation, based on model F1-score.
  • Draw a surface plot of F1-score with respect to tree-depth and number of trees.
  • Use the best value of tree-depth and number of trees to re-train the model on the training set and use it to predict the labels of the test set. Report the confusion matrix, multi-class averaged F1-score and accuracy.

Task F: Discussion

Write a brief discussion about which classification method achieved the best performance. Your thoughts on the reason behind this. What method performed the worst? Could you do better or worse than the results in the dataset paper? Do you have any suggestions to further improve model performances?

Homework Help/Study Tips, Others

  • Category:- Homework Help/Study Tips
  • Reference No.:- M93109440
  • Price:- $120

Guranteed 48 Hours Delivery, In Price:- $120

Have any Question?


Related Questions in Homework Help/Study Tips

Question in the example of the pin makers who each owned

Question: In the example of the pin makers who each owned specialized equipment, sales of half-finished pins between them are costly transactions. As noted in the text, each of them has market power as both buyer and sel ...

Access to restrooms for transgender people has been a

Access to restrooms for transgender people has been a heated public policy debate over the last few years. If you are unfamiliar with these laws, review this NPR article: 'I Hope This Will Set A Precedent,' Says Trans Te ...

Question this assignment is a take-home essay consisting of

Question: This assignment is a take-home essay consisting of 3 questions, 2 pages total, to test knowledge and assimilation of the course objectives. Please exclusively use the course materials to support each answer. To ...

For this online discussion you will need to review chapter

For this online discussion, you will need to review Chapter 8, answer the following questions, and share with the class: 1. Which stage do you think best describes your current stage of self-regulatory ability? 2. Why di ...

Question must be one page minimum be sure to fully and

Question: Must be one page minimum. Be sure to fully and completely answer each question. I am not looking for your to regurgitate what is in the textbook. Rather, students who analyze, synthesize, and evaluate course ma ...

Assessment d - case studycase studyjjs bistro is located in

ASSESSMENT D - CASE STUDY Case study JJ's Bistro is located in Jackson's hotel. It seats 210 people and is open for lunch and dinner, seven days a week. The hotel promotes a family environment and has a playroom for youn ...

Assignment 2 recognizing the impact of diversity on the

Assignment 2: Recognizing the Impact of Diversity on the Workplace BANKS Industries continues to work on bridging cultural gaps as it embraces the diversity that resulted from its merger. You have been asked to develop a ...

Question scenario health systems daniel a dnp prepared ed

Question: Scenario: Health Systems: Daniel, a DNP prepared ED NP identified that patients seen in the ED for various reasons had undiagnosed HTN. He observed that the patient's chief compliant was well addressed but the ...

Question in this assignment you will have the opportunity

Question: In this assignment, you will have the opportunity to select two Spanish-speaking countries of your choice, each one from a different continent. You will continue with these countries for your individual assignm ...

Question in this assignment you will be creating a

Question: In this assignment, you will be creating a PowerPoint presentation based on the application of the functional health assessment of a movie character. To complete this assignment, choose a movie from the followi ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As