Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Computer Engineering Expert

Data Mining - Practical: Using Naïve Bayes Classifier for Classification

This is a combined practical and exercise session, in which we have two tasks. First, we use Weka to run two different classifiers (Naïve Bayes and J48) and compare their performances. Second, we use Weka to help us to calculate and decide whether someone is likely to get sunburned or not as Naïve Bayes Classifier does. This will help us to familiarise ourselves with the theories on Naïve Bayes Classifier we have learned in today's lecture.

Step 1:Launching Weka and Loading Training Data

Launch Weka by clicking on: RunWeka.bat

Select ‘Explorer' from the list of Applications.

Select the ‘Preprocess' tab and click on ‘Open File'. Choose the file ‘sunburn.arff' which contains the training data set.

Step 2: Exploring Naïve Bayes Classifier

Select the ‘Classify' tab and make sure that "NaiveBayes" is chosen from the classifier list and "Use training set" is clicked as the test option. Note that attribute "burned" is chosen by default as the class attribute but the class attribute can be changed if needed.

Click ‘Start'will create a classification model/classifier from the training dataset. The classifier is listed in the Result list while the details about the classifier are displayed in the ‘Classifier output' window.

Step 3: Comparing Naïve Bayes Classifier with Decision Tree-based Classifier J48 Using 10-fold Cross-validation

First, you can easily run Naïve Bayes Classifier again with a different evaluation method. Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well the learned model performs on the cross-validation.

Second, you can also easily run J48 with 10-fold cross-validation. Select the ‘Classify' tab and make sure that "J48" is chosen from the classifier list and Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well J48 performs on the cross-validation.

Finally, compare the two sets of evaluation results.

Q1. Provide a summary of the comparison between the classifier outputs by Naïve Bayes Classifier and J48.

Step 4: Discovering Naïve Bayes Classifier

We now focus on discovering how the Naïve Bayes Classifier produces the output. We have just covered the Naïve Bayes Classifier in today's lecture. We now want to familiarise ourselves with the theories that we have learned in today's lecture by working through an example using Weka.

Select the ‘Preprocess' tab and go back to examine the training data set. Click on attribute burned in the attributes window on the left and you will see that the class attribute has two class labels "burned" and ‘none".

Q2. Calculate the probability estimates of these two class labels, i.e., P(burned = "burned") and P(burned = "none") based on their class distributions. Click on attribute burned in the attributes window will display the class distribution, i.e., 9/16 with class label "burned" while 7/16 with class label "none"

P(burned = "burned") =
P(burned = "none") =

Click "Edit" to view the sunburn relation table that shows the value distribution of each attribute given a class label. Click on attribute burned will sort the instances in the table in alphabetic order. For example, among the 9 instances with class attribute value "burned", 2/9 has hair = "red".

Q3. Calculate the conditional probability of each attribute on each class, e.g., P(hair = "blonde"|burned = "burned"). Note that you would have 6 + 6 + 6 + 4 = 22 conditional probabilities to calculate.

P(hair = "blonde"|burned = "burned") =
P(hair = "brown"|burned = "burned") =
P(hair = "red"|burned = "burned") =
P(hair = "blonde"|burned = "none") =
P(hair = "brown"|burned = "none") =
P(hair = "red"|burned = "none") =
...

Q4. Calculate the classification probability of each class, given hair = "blonde", height = "short", weight = "average", lotion = "none".

P(burned = "burned"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

P(burned = "none"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

Q5. Is the given person likely to get sunburned based on the prediction by Naïve Bayes Classifier?

Attachment:- Practical2.rar

Computer Engineering, Engineering

  • Category:- Computer Engineering
  • Reference No.:- M92283588
  • Price:- $70

Guranteed 36 Hours Delivery, In Price:- $70

Have any Question?


Related Questions in Computer Engineering

Going to concerts and reading books take time and money

Going to concerts and reading books take time and money. Suppose a book costs $10 and takes 3 hours to read and a concert costs $20 and takes 3 hours. For a person with $80 to spend and 18 hours for these activities: a. ...

On the spot courier services grew and changed over the

On the Spot courier services grew and changed over the years. At first, Bill received requests for package pickups on his mobile phone, recorded that information in a log, and would then drive around to retrieve all the ...

How is international trade regulated what is involved in

How is international trade regulated? What is involved in "trade agreements"?

Question students will turn a basic python set of files

Question : Students will turn a basic Python set of files into one single batch file. This will demonstrate students understanding of sequential scripting. Student will developed a simple 3-6 line code script file. Using ...

Review the interactive session on turner broadcasting and

Review the Interactive Session on Turner Broadcasting and e-commerce in the Management Information Systems: Managing the Digital Firm on pages 381-382. Then write a short paper (400 to 800 words) that answers all four Ca ...

1nbsphillary wants to go to disneyland in 425 years she

1) Hillary wants to go to Disneyland in 4.25 years. She wants to take her partner and 2 kids (4 people in Total). If it is going to cost $453.27 per person to go on the trip. -What will the cost be for the entire trip? - ...

Think about a good or service for which you believe there

Think about a good or service for which you believe there has been a shift in demand or supply. Explain the reasons behind the shift and how that has influenced the equilibrium price.

Identify at least two 2 factors that have led to the

Identify at least two (2) factors that have led to the explosive growth of digital crime over the past a few decades. Next, describe the most common forms of digital crime, and give your opinion as to why those forms you ...

Suppose you have an object made up of two w-bit integers x

Suppose you have an object made up of two w-bit integers, x and y. Suppose that the hash code for your object is defined by some deterministic function h(x; y) that produces a single w-bit integer. Prove that there exist ...

Question the discussion board db is part of the core of

Question: The Discussion Board (DB) is part of the core of online learning. Classroom discussion in an online environment requires the active participation of students and the instructor to create robust interaction and ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As