Assignment Project: Data Mining using R

The goal of this project is to apply association rule mining, classification, and clustering methods to the Mushroom or Ionosphere data set and to the Groceries data set. For detailed information about the Mushroom or Ionosphere data set, refer to the Machine Learning Repository provided by the University of California, Irvine. You can download the data and read more about it there.

The Groceries Data Set
Imagine 10,000 receipts sitting on your table. Each receipt represents a transaction and records the items that went into a customer's basket. That is exactly what the Groceries data set contains: a collection of receipts, with each line representing one receipt and the items purchased. Each line is called a transaction, and each column in a row represents an item.
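To make the receipt layout concrete, here is a minimal sketch in Python (the assignment itself is to be carried out in R; this only illustrates the idea). It assumes each receipt is one comma-separated line of item names. The three sample receipts and the function name `parse_transactions` are invented for illustration and are not taken from the Groceries data.

```python
from collections import Counter

def parse_transactions(lines):
    """Turn each comma-separated receipt line into a list of item names."""
    transactions = []
    for line in lines:
        items = [item.strip() for item in line.split(",") if item.strip()]
        if items:  # skip blank lines
            transactions.append(items)
    return transactions

# Hypothetical receipts, one per line (assumed layout, not real data).
sample_lines = [
    "whole milk,bread,butter",
    "bread,butter",
    "whole milk,yogurt",
]

transactions = parse_transactions(sample_lines)

# Count in how many receipts each item appears (an item counts once per receipt).
item_counts = Counter(item for t in transactions for item in set(t))

print(len(transactions))     # number of receipts
print(item_counts["bread"])  # number of receipts that contain bread
```

Note that the rows have different lengths, which is why a transaction format is usually handled differently from an ordinary rectangular CSV table.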

Task 1: Data Pre-processing

Read the data into R. There are many ways to read CSV tables into R. For more details, refer to the R Data Import/Export manual.

For the clustering experiments, the column of class labels needs to be removed. Refer to lecture Module 10 to see how to do so.

Check whether any other pre-processing would benefit the analysis, for example replacing missing values, normalizing attribute ranges, or converting numeric or string values to nominal values.
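The pre-processing steps named in this task can be sketched as follows. This is an illustrative Python sketch, not the R code the assignment expects, and it is not the method from the lecture module. The helper names (`drop_column`, `impute_mean`, `min_max_normalize`) and the three sample records are hypothetical. Mean imputation and min-max scaling are just one possible choice for each step.

```python
def drop_column(rows, index):
    """Return rows without the column at `index` (e.g. the class label)."""
    return [row[:index] + row[index + 1:] for row in rows]

def impute_mean(rows):
    """Replace None entries with the mean of the observed values in that column.

    Assumes every column has at least one observed (non-None) value.
    """
    columns = list(zip(*rows))
    means = []
    for col in columns:
        observed = [v for v in col if v is not None]
        means.append(sum(observed) / len(observed))
    return [[means[j] if v is None else v for j, v in enumerate(row)]
            for row in rows]

def min_max_normalize(rows):
    """Rescale every column to the range [0, 1]; constant columns map to 0."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]

# Hypothetical records: two numeric attributes followed by a class label.
records = [
    [1.0, 10.0, "g"],
    [3.0, None, "b"],
    [5.0, 30.0, "g"],
]

features = drop_column(records, 2)       # remove the label before clustering
features = min_max_normalize(impute_mean(features))
print(features)
```

Normalization matters mainly for distance-based methods such as k-means, where an attribute with a large range would otherwise dominate the distance.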

Task 2: Data Mining

- Association Rule Mining experiments: Use R to explore association rules on the Groceries data set. Try out different algorithms. Visualize the results you find. Report any interesting association rules discovered in the experiments and explain why they are interesting.

- Classification experiments: Use R to construct classifiers on the Mushroom or Ionosphere data set. Randomly split the data into a training set and a test set (80% vs. 20%). Select at least one classifier from each of the following two categories of classifiers: tree-based models, Bayes classifiers, and rule-based classifiers. Compare the results of the chosen classifiers.

- Clustering experiments: Use R to explore clusters on the Mushroom or Ionosphere data set. Select and compare two clustering algorithms available in R (e.g. k-means vs. density-based). Use R to visually explore the resulting clusters.

- For all the above experiments, try different parameter settings to fine-tune the outcome. In principle, select methods that work well on the given data set.
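When judging whether an association rule is interesting, the usual measures are support, confidence, and lift. The following Python sketch computes them directly from their definitions (the assignment itself expects R, so treat this only as an illustration of the quantities a rule-mining tool reports). The function names and the four sample baskets are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)

def rule_metrics(transactions, lhs, rhs):
    """Return (support, confidence, lift) of the rule lhs -> rhs.

    Assumes both lhs and rhs occur in at least one transaction.
    """
    s_both = support(transactions, set(lhs) | set(rhs))
    s_lhs = support(transactions, lhs)
    s_rhs = support(transactions, rhs)
    confidence = s_both / s_lhs   # P(rhs | lhs)
    lift = confidence / s_rhs     # > 1 means lhs makes rhs more likely
    return s_both, confidence, lift

# Hypothetical baskets, invented for illustration only.
baskets = [
    ["milk", "bread"],
    ["milk", "bread", "butter"],
    ["bread"],
    ["milk"],
]

sup, conf, lift_value = rule_metrics(baskets, ["milk"], ["bread"])
print(sup, conf, lift_value)
```

A lift close to 1 means the two sides co-occur about as often as independence would predict, so high confidence alone does not make a rule interesting. Minimum support and minimum confidence thresholds are typical examples of the parameter settings to vary.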

Task 3: Prepare a report

Your report should contain the following:

- Theoretical Discussion: Limited to two pages, discussing the data pre-processing steps, the motivation for selecting a particular method, and how the parameters were chosen.

- Results: Include results and screenshots of the above experiments.

- Discussion and error analysis: Try to interpret the results of your model. Discuss intuitions or hypotheses that can be drawn from visual inspection of the resulting classes or clusters. State any assumptions, and discuss issues that might have affected the model's performance.

- References: If you use information from sources other than the R manual and the official website, you should cite them.
