Question on data mining

Your task is to predict the output variable "choice" from 16 input features: x1, x2, ..., x15, x16. The output "choice" is a categorical variable that can take 5 possible values: "M", "B", "J", "P", and "O". The first 8 input features (x1, x2, ..., x8) are binary variables. The last 8 input features (x9, x10, ..., x16) are continuous variables.
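As a rough illustration of the data layout described above, the sketch below parses one record into 16 typed features plus a label. It assumes, without confirmation from the question, that the CSV files have a header row with columns named x1 to x16 and choice. The helper names (parse_row, load_examples) and the label "M" attached to the sample row are made up for the demonstration.

```python
import csv
import io

BINARY = [f"x{i}" for i in range(1, 9)]        # x1..x8, expected to be 0 or 1
CONTINUOUS = [f"x{i}" for i in range(9, 17)]   # x9..x16, real-valued
CLASSES = {"M", "B", "J", "P", "O"}            # the 5 values of "choice"

def parse_row(row):
    """Convert one CSV record (dict of strings) to (features, label)."""
    features = []
    for name in BINARY:
        value = int(row[name])
        if value not in (0, 1):
            raise ValueError(f"{name} must be 0 or 1, got {value}")
        features.append(value)
    for name in CONTINUOUS:
        features.append(float(row[name]))
    label = row["choice"].strip()
    if label not in CLASSES:
        raise ValueError(f"unknown class {label!r}")
    return features, label

def load_examples(handle):
    """Read every record from an open CSV file (header row assumed)."""
    return [parse_row(record) for record in csv.DictReader(handle)]

# In-memory stand-in for a file; the column names and the label "M" are assumptions.
header = ",".join(BINARY + CONTINUOUS + ["choice"])
sample = header + "\n1,1,1,1,1,0,1,0,0.0284,0.2196,0.5259,0.6206,0.0950,0.3350,0.2470,0.9676,M\n"
examples = load_examples(io.StringIO(sample))
```

With a real file, the same function could be called as load_examples(open("finalQ3Train.csv", newline="")), provided the header matches the assumed names.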

1. Train a decision tree inductive learning model on the data from the CSV file "finalQ3Train.csv" that contains 1500 examples.

2. Express your trained model in the form of IF ... THEN rules. Test your trained model on the 500 examples from the CSV file "finalQ3Test.csv" and present your confusion matrix.

3. Predict values for "choice" for the 8 examples in the CSV file "finalQ3newCases.csv". The examples are shown below.

x1  x2  x3  x4  x5  x6  x7  x8  x9      x10     x11     x12     x13     x14     x15     x16
1   1   1   1   1   0   1   0   0.0284  0.2196  0.5259  0.6206  0.0950  0.3350  0.2470  0.9676
1   1   0   1   1   0   0   1   0.7419  0.9260  0.4711  0.8340  0.8770  0.1129  0.4805  0.7469
0   0   1   0   1   0   1   1   0.3867  0.9002  0.4240  0.6029  0.5547  0.6674  0.1499  0.4527
0   1   0   1   1   0   0   0   0.8848  0.0752  0.1195  0.3625  0.1565  0.1205  0.7666  0.4188
1   0   0   0   1   1   1   0   0.2893  0.0067  0.1855  0.6999  0.5777  0.5959  0.0324  0.8211
1   1   1   1   1   1   1   1   0.7549  0.3705  0.3349  0.8772  0.9453  0.2476  0.3782  0.1878
1   1   1   1   0   1   1   1   0.7921  0.1539  0.9011  0.5596  0.7125  0.1035  0.0587  0.2399
0   0   1   0   1   0   0   0   0.7190  0.8441  0.5841  0.8670  0.7620  0.8794  0.3351  0.4677
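The three tasks (train a tree, express it as IF ... THEN rules, build a confusion matrix and predict new cases) can be sketched in plain Python as follows. This is only a minimal illustration, assuming a CART-style greedy split on Gini impurity; the question does not say which tree algorithm or tool is expected, and a library implementation would normally be used on the real files. The function names and the four-row toy dataset are invented for the example and are not taken from the CSV files.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a non-empty list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(X, y):
    """Return (feature index, threshold) giving the lowest weighted Gini, or None."""
    best, best_score = None, gini(y)
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):       # midpoints between distinct values
            t = (lo + hi) / 2
            left = [lab for row, lab in zip(X, y) if row[j] <= t]
            right = [lab for row, lab in zip(X, y) if row[j] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if score < best_score - 1e-12:           # require a real improvement
                best, best_score = (j, t), score
    return best

def build_tree(X, y, depth=0, max_depth=5):
    """Internal nodes are tuples (feature, threshold, left, right); leaves are labels."""
    split = best_split(X, y) if depth < max_depth else None
    if split is None:
        return Counter(y).most_common(1)[0][0]       # leaf: majority class
    j, t = split
    left = [(r, lab) for r, lab in zip(X, y) if r[j] <= t]
    right = [(r, lab) for r, lab in zip(X, y) if r[j] > t]
    return (j, t,
            build_tree([r for r, _ in left], [lab for _, lab in left], depth + 1, max_depth),
            build_tree([r for r, _ in right], [lab for _, lab in right], depth + 1, max_depth))

def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while isinstance(tree, tuple):
        j, t, left, right = tree
        tree = left if row[j] <= t else right
    return tree

def to_rules(tree, conditions=()):
    """Flatten the tree into one IF ... THEN rule per leaf."""
    if not isinstance(tree, tuple):
        body = " AND ".join(conditions) or "TRUE"
        return [f"IF {body} THEN choice = {tree}"]
    j, t, left, right = tree
    name = f"x{j + 1}"
    return (to_rules(left, conditions + (f"{name} <= {t:g}",))
            + to_rules(right, conditions + (f"{name} > {t:g}",)))

def confusion_matrix(actual, predicted, classes):
    """Rows are actual classes, columns are predicted classes."""
    index = {c: i for i, c in enumerate(classes)}
    matrix = [[0] * len(classes) for _ in classes]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix

# Toy data (invented): one binary feature and one continuous feature.
X = [[1, 0.2], [1, 0.8], [0, 0.3], [0, 0.9]]
y = ["M", "M", "B", "J"]
tree = build_tree(X, y)
rules = to_rules(tree)
matrix = confusion_matrix(y, [predict(tree, row) for row in X], ["M", "B", "J"])
```

On the real task, the tree would be built from the 1500 training examples, the confusion matrix computed from the 500 test examples, and predict applied to each of the 8 new cases listed above. Note that a split such as x1 <= 0.5 on a binary feature simply means x1 = 0.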

  • Category:- Statistics and Probability
  • Reference No.:- M91873580
  • Price:- $15
