Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Statistics and Probability Expert

Question 1) Define each of the following terms.

Business Intelligence

Business Analytics

Data Mining.

Question 2) How is business analytics different to data mining?

How is business analytics different from Business Intelligence?

Question 3) Suppose a credit-card company wants to create a model to predict whether a customer will default. After some investigation they decided that they will only use two attributes: Age and Income. They also decided to use a decision tree to build the model. Discuss the pros and cons of this decision. You can create a small data set to justify your answer.

Question 4.

A winery maintains a dataset containing information about customers who subscribe to its tasting events and special offers for wine cases. The winery occasionally mails tasting samples of new wines in an effort to increase sales. The chief marketing officer is aiming to send samples of a newly produced wine to customers who are NOT likely to place an order for the new wine (when probability of purchase is less than 0.5). Based on a survey conducted amongst its customers and their willingness to buy a case of the new wine before tasting it, the firm collected tree attributes - whether customers prefer dry, or have preference for red wine and their ages. The classification problem is to predict whether customers will buy or not buy wine, with certain confidence/probability. The following classification tree was induced:

1915_How is business analytics different to data mining.png

4.1) After running a business analytics software (example SAS Enterprise Miner), suppose the above tree is generated. If you have to choose a single attribute to predict which customer is likely to place an order or not, which attribute would you use (check one and briefly justify your choice)?

A. Preference for red wine

B. Age (whether the customer is older or younger than 50 years old)

C. Preference for dry wine

D. Impossible to determine given the information provided.

4.2) The company would like to have a few, simple, English language rules that embody the decisions represented by the tree. Write out succinctly the rule you would suggest from the highlighted path on the picture above.

4.3) The Winery manager, Mary Sue, wants to consult with you if she should send a sample to a new customer named Thomas. She tells you that Thomas prefers dry wine and strongly prefers red wine. What's your recommendation (on whether to send a sample to Thomas)? Justify. (Note: The chief marketing officer is aiming to send samples of a newly produced wine to customers who are NOT likely to place an order for the new wine (when probability of purchase is less than 0.5).)

4.4) Assume that the cost of producing and shipping a wine sample to a customer is $8 and the profit from each order is $100. Assume the company ascertained that the probability of purchase will become 17% for a customer who receives a sample of a new wine. Would you suggest shipping a sample to customers preferring dry and red wines? Explain your answer.

5. Tom, the new manager of supermarket Einstein is trying to understand association rule with your help. He presented you the following information. In April 2005, 80 transactions contain eggs; 40 transactions contain milk and 15 transactions contain sugar. Moreover, 10 transactions contain all three items (egg, milk and sugar) altogether; 20 transactions contain egg and milk together; 15 transactions contain egg and sugar together and 10 transactions contain milk and sugar together. (Note: these transactions may overlap, e.g. the 15 transactions with both egg and sugar are among the 80 transactions with egg.) Tom got a rule as follows: egg ? milk with support 20%. Now he asks you to show him the support, confidence and lift for the rule: {egg, milk} ? sugar. (Note: You may not need all the numbers provided here. This question is used to test your understanding about the concept of support, confidence and lift. You should be able to get the answer with a few simple calculations).

Question 5a and 5b are based on the following scenario. A professor in the department of Business conducted an experiment for eBay to examine customer satisfaction with eBay. He used two measures - ease-of-use of eBay's auction mechanism and usefulness of online reviews - to segment eBay's customers. Suppose he identified the following two clusters, each one with 3 customers.

Customer ID

Ease-of-use

Usefulness

 

Customer ID

Ease-of-use

Usefulness

A

8

9

 

D

6

5

B

6

8

 

E

4

5

C

7

10

 

F

2

8

Cluster A - satisfied customers Cluster B - Unsatisfied customers

Question 5a. What are the centers for cluster A and cluster B?

Question 5b. Now there is a 7th customer, and you want to identify which cluster she belongs to. You are given the following information - her rating of ease-of-use is 5 and usefulness is 7. Which cluster do you think she belongs to?

Question 6:

Take the bank-data-final.text file posted on blackboard (in the Final Exam folder), and perform association rule analysis using SAS Enterprise Miner to answer the following questions.

6.1). What type of customers have a higher chance of buying a personal equity plan (pep=YES)? What types of customers have a lower chance of buying a personal equity plan?

6.2). Please examine the results and identify two interesting rules and explain why you think they are interesting.

In addition to writing down the answers to the questions, please also explain the steps you've taken to reach the answers (i.e. how you set the parameters in SAS Enterprise Miner, and how you read the results to get the conclusion). You may need your own judgment when deciding how to set the parameters (e.g. thresholds for support and confidence).

Below is the description of the attributes in the data.

age

age of customer in years

sex

MALE / FEMALE

region

inner_city/rural/suburban/town

income

income of customer

married

is the customer married (YES/NO)

children

number of children

car

does the customer own a car (YES/NO)

save_acct

does the customer have a saving account (YES/NO)

current_acct

does the customer have a current account (YES/NO)

mortgage

does the customer have a mortgage (YES/NO)

pep

did the customer buy a PEP (Personal Equity Plan) after the last mailing (YES/NO)

Question 7:

Consider the following scenario. A baseball manager wants to identify and group players on the team who are very similar with respect to several statistics of interest. The manager simply wants to identify different groups of players. The manager also wants to learn what differentiates players in one group from players in a different group.

The statistics of the players is provided in the Baseball.text file posted on blackboard (in the Final Exam folder). Please use SAS Enterprise Miner to help the manager understand the player groups better. When you cluster, please specify the number of clusters to be 2, 3, 4 respectively. Please describe the differences in results for k=2, 3, 4 with respect to the quality of the clusters you get and whether the clusters you obtained make sense.

When writing your report for this question, please be as specific as possible. Please also copy the results generated by SAS Enterprise Miner in the report, and point out which part of the results you used to reach certain conclusion.

Note: Please make sure you exclude Name from the attributes you use for clustering. There are also a few missing values from the data, just let SAS Enterprise Miner takes care of the missing values automatically.

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M91576898

Have any Question?


Related Questions in Statistics and Probability

Complete parts a and b using the probability distribution

Complete parts (a) and (b) using the probability distribution below.  The number of overtime hours worked in one week per employee Overtime hours 0 1 2 3 4 5 6 Probability 0.016 0.063 0.178 0.283 0.220 0.173 0.067 Find t ...

Let x be a random variable with range rx -1 0 1 and let px

Let X be a random variable with range RX = {-1, 0, 1} and let P(X = 1) = P(X = -1) = p/2 for some p ∈ [0, 1]. a) Compute P(X = 0). b) Compute the expectation E[X] and variance Var(X) of X as a function of p, and determin ...

The weights of ice cream cartons are normally distributed

The weights of ice cream cartons are normally distributed with a mean weight of 20 ounces and a standard deviation of 0.5 ounces. You randomly select 25 cartons. What is the probability that their mean weight is greater ...

Data are collected on the relationship between the number

Data are collected on the relationship between the number of hours per week practicing a musical instrument and scores on a math test. The line of best fit is as follows: y = 72.5 + 2.8x. What would you predict the score ...

Suppose a soccer team wins each of its games with

Suppose a soccer team wins each of its games with probability 0.4, loses with probability 0.4, and ties with probability 0.2, with each game's outcome independent. Their season starts, and they play a game every day.Let  ...

The service manager for anbspcar dealership reviewed sales

The service manager for a car dealership reviewed sales records of the past 25 sales of new cars to determine the number of warranty repairs he will be called on to perform in the next 90 days. Corporate reports indicate ...

How would i calculate this anystate auto insurance company

How would I calculate this: Anystate Auto Insurance Company took a random sample of 382 insurance claims paid out during a 1-year period. The average claim paid was $1590. Assume  σ  = $234. Find a 0.90 confidence interv ...

In an intro psychology class all 83 students have completed

In an Intro Psychology class, all 83 students have completed an aggression rating scale. The possible ratings range from 1-7 with low scores indicating low aggression. Results showed that the average rating was 4.23 with ...

Penney wishes to borrow 75000 today for the purchase of

Penney wishes to borrow $75,000 today for the purchase of bakery equipment. She have an agreement with their commercial banker that she can borrow money at an annual rate of 7.75%. How much will she owe if she repay the ...

If you want to be 95 confident of estimating the population

If you want to be 95?% confident of estimating the population mean to within a sampling error of ±25 and the standard deviation is assumed to be 125?, what sample size is? required?

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As