Ask Programming Language Expert

1) Label each case as describing either data mining (DM), or the use of the results of data mining (Use).
a) _____ Choose customers who are most likely to respond to an on-line ad.

b) _____ Discover rules that indicate when an account has been defrauded.

c) _____ Find patterns indicating what customer behavior is more likely to lead to response to an on-line ad.

d) _____ Estimate probability of default for a credit application.

e) _____ Predict whether a customer is pregnant

2) Plumbing Inc. has been selling plumbing supplies for the last 20 years. The owner, Joe, decides that next year it is finally time to diversify by adding gardening tools to his products. Having had success using customer data to build predictive models to guide direct mail campaigns for special plumbing offers, he considers that data mining could help him to identify a subset of customers who should be good prospects for his new set of products. Is Joe ready to solve this as a supervised learning problem?

If yes - what would you suggest as the target variable?


If no - why not? What would you recommend that Joe do to achieve his business goal?

3) Choose a problem from a past job, hobby, or interest that would make for a good predictive modeling classification application. Describe it in one page or less, using the relevant concepts introduced in classes 1 & 2 and Ch. 1 - 3 in the book. Your description should be as complete and precise as possible, referring to the concepts introduced in class/in the book. Please do not choose one of the applications we have discussed already (churn, targeted marketing, default prediction, pregnancy prediction).
Include answers to the following:

a) What exactly is the business decision you want to support with this solution? (Specifically, what is the business action you are considering? Discuss briefly the timing of the decision and the eventual outcome.)

b) Describe the use phase.

c) Why did you select this as a good predictive modeling problem?

d) How and where would you get the data?

e) Explain precisely why and how you expect doing the predictive modeling will add value.

f) What exactly is the quantity that you inherently do not know and need to predict?

g) Is this a classification, ranking, or probability estimation problem?

h) What are the features? Provide a list of at least 5 features that you think (a) you can get and (b) you think might be useful.

i) What exactly would be your training data?

4 Hands on (WEKA version). This is a first simple hands-on modeling task using Weka. Your task is to experiment with the classification tree induction algorithm in Weka. The data is available on NYU Classes in the data section under Resources->Datasets->Mailing (mailing_train.arff and mailing_test.arff). Build a classification tree using the J48 algorithm. If our classroom Weka demonstration was not enough, please consult the Weka tutorial (available under Resources->Weka >Weka_tutorial. It is useful to try to figure things out on your own, but if you get frustrated trying to figure out how to do something, please post a question to the discussion forum.

HINTS: A quick guide to the required commands: start Weka; select Explorer; use ‘Open file' to load a dataset; go to the Classify tab and use the Choose button to pick J48 from the trees. Scroll around in the ‘Classifier output' and try to understand what you see there.

I) Explore the evaluation options (test options in the Classify tab, on the left under the Choose button). Understand what they do in light of Chapter 5 (it is fairly straightforward, but you can also consult the Weka documentation or Google). Build/evaluate a tree under each of the 4 options (use the default whenever there is a parameter). Report the "accuracy" for each option and write a sentence or two about your observations (look at the summary in the Classifier output and identify the accuracy as the percent ‘Correctly Classified Instances' - you can ignore all the other stuff for now).

II) Figure out how to get predictions out of Weka (try the "More Options" button in the Test options) and copy a dozen of them from the ‘Classifier output' window here.

III) Identify the most INFORMATIVE attribute (according to the tree induction) and explain how you found it.

IV) Examine the parameters of the tree induction by clicking on J48 in the box just to the right of the ‘Choose' button. Set "unpruned" to True. Now, try changing the values for ‘minNumObj' and see (i) how it affects in-sample accuracy by evaluating on the training set, and (ii) how it affects the generalization accuracy using the test set. Explain the results. Use the concepts from the readings where appropria

Programming Language, Programming

  • Category:- Programming Language
  • Reference No.:- M91036089
  • Price:- $70

Priced at Now at $70, Verified Solution

Have any Question?


Related Questions in Programming Language

Assignment - haskell program for regular expression

Assignment - Haskell Program for Regular Expression Matching Your assignment is to modify the slowgrep.hs Haskell program presented in class and the online notes, according to the instructions below. You may carry out th ...

Assignment task -q1 a the fibonacci numbers are the numbers

Assignment Task - Q1. (a) The Fibonacci numbers are the numbers in the following integer sequence, called the Fibonacci sequence, and are characterised by the fact that every number after the first two is the sum of the ...

Question - create a microsoft word macro using vba visual

Question - Create a Microsoft Word macro using VBA (Visual Basic for Applications). Name the macro "highlight." The macro should highlight every third line of text in a document. (Imagine creating highlighting that will ...

Assignmentquestion onegiving the following code snippet

Assignment Question One Giving the following code snippet. What kind of errors you will get and how can you correct it. A. public class HelloJava { public static void main(String args[]) { int x=10; int y=2; System.out.p ...

Assignment - proposal literature review research method1

Assignment - Proposal, Literature Review, Research Method 1. Abstract - Summary of the knowledge gap: problems of the existing research - Aim of the research, summary of what this project is to achieve - Summary of the a ...

1 write a function named check that has three parameters

1. Write a function named check () that has three parameters. The first parameter should accept an integer number, andthe second and third parameters should accept a double-precision number. The function body should just ...

Assignment - horse race meetingthe assignment will assess

Assignment - Horse Race Meeting The Assignment will assess competencies for ICTPRG524 Develop high level object-oriented class specifications. Summary The assignment is to design the classes that are necessary for the ad ...

Task silly name testeroverviewcontrol flow allows us to

Task: Silly Name Tester Overview Control flow allows us to alter the order in which our programs execute. Building on our knowledge of variables, we can now use control flow to create programs that perform more than just ...

Structs and enumsoverviewin this task you will create a

Structs and Enums Overview In this task you will create a knight database to help Camelot keep track of all of their knights. Instructions Lets get started. 1. What the topic 5 videos, these will guide you through buildi ...

Task working with arraysoverviewin this task you will

Task: Working with Arrays Overview In this task you will create a simple program which will create and work with an array of strings. This array will then be populated with values, printed out to the console, and then, w ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As