Ask Question, Ask an Expert


Ask DBMS Expert

Home >> DBMS

problem 1: Consider the following frequency counts of some item sets in a transaction database r:
freq({A}, r) = 0.405
freq({B}, r) = 0.510
freq({C}, r) = 0.303
freq({A,B}, r) = 0.380
freq({A,C}, r) = 0.256
freq({B,C}, r) = 0.197
freq({A,B,C}, r) = 0.095
Based on this information, you can use them to compute the following probabilities. (ex: P(X) = freq(X, r)).
a) What is the joint probability P(A and B)? What is P(A) ? P(B)? 
b) What are the confidence and  lift ratio of the association rules generated from the following itemsets: {A, B, C} and {A‾, B}?
c) Compare the results of a) and b) with  the frequencies  freq({A}, r) and freq({B}, r). What observations can you made about the relationship of A and B in terms of independence and possible causal relationship? Please describe your answer.
problem 2: Your friend owns a computer store in Yuen Long, selling Desktop and Notebook PCs and other computer peripherals. Having been instead successful with his business there, he decided to venture to the infamous Mongkok Computer Center and has already been there for three months. As expected, compared to his Yuen Long store, his new store has been recording much larger revenue however when it comes to profit, he is not so sure. He requires to pay some times more in rent! In order to stimulate sales, your friend feels that he requires to understand his customers in Mongkok more. To help him do so, you have asked for a sample of the transactional data he collected and they are shown in table below.

1851_transaction data.jpg

a) Set the Minimum Support to 18% and Minimum Confidence to 80%, find all frequent  large  itemsets  (for product items)  and all interesting rules using the Apriori algorithm.

b) By setting the Lift Ratio to 2, which rules you discovered  in Part (a) are still interesting?

c) How many possible association rules (even though both the support and confidence are 0) would be generated from the following itemsets: {Case, Desktop, Maintenance,  Mouse, Speaker, Webcam} and {Computer, Printer, Peripherals, Notebook_PC}.  Compare the results, what you can conclude?
problem 3: You are working for the ABC Telecom and are given some customer records for data mining. You are asked to discover, from the data, patterns that characterize low-, medium- and high-usage customers. He would like to make sure that newly recruited salespersons be able to recommend the right service plans (500-free-mins (low-usage), 2500-free-mins  (medium-usage), and 5000-free-mins  (high-usage)) to the right customers. 
a) Show how you can make use of the ID3 algorithm to discover in a sample of customer records (shown in Table below) what best  plan to make to which kind of customers.

1837_training data set.jpg

b) You are given a testing data set (shown in Table above) as follows, how much should you trust the recommendations made according to the rules discovered by ID3 algorithm?
c) Use the Naïve Bayesian Approach to check the recommendations against the testing data set. How many recommendation(s) is/are trustful?
d) Given a choice between the  Naïve  Bayesian Approach  and the  ID3 algorithm for this task, which one would you choose? Why?

DBMS, Programming

  • Category:- DBMS
  • Reference No.:- M91845

Have any Question? 

Related Questions in DBMS

Project database and programming designthis assignment

Project: Database and Programming Design This assignment consists of two sections: a design document and a revised Gantt chart or project plan. You must submit both sections as separate files for the completion of this a ...

Consider a typical sales invoice that would include the

Consider a typical sales invoice that would include the following information. Design a single table to hold all of the information required to store an invoice including this information. Next, apply normalization to re ...

Organizations want information organizations need

Organizations want information. Organizations need information. However, information must be in an organized format that supports the creation of business intelligence. Otherwise, according to Rebecca Wettemann, vice pre ...

Homeworkunlimited pickers is a group of workers who have

Homework Unlimited Pickers is a group of workers who have joined together to provide harvesting services to farmers who need to have their crops brought in. The organization has many teams of workers who travel from loca ...

Database design discusion 150 wordsnormalizationplease

Database Design Discusion (150 words) Normalization Please respond to BOTH of the following questions: Question A In your own words discuss the benefits of normalization. Question B Do you think we should normalize our d ...

Taskoverview of business casealive amp boating aampb is a

Task Overview of business case: Alive & Boating (A&B) is a small start­up company that sells small boats in Wagga. A&B keeps its models in several showrooms across the city. At this stage customers cannot view the availa ...

Data analysis projectyou have contracted with a local

Data Analysis Project You have contracted with a local school district to help them decide whether to use an interactive computer program or a standard chapter from a textbook to teach students to use fractions. The scho ...

Assignmentyou are to design a program that will serve as a

Assignment You are to design a program that will serve as a database for keeping track of video games and various statistics for the games. This application will allow for the storing of the name of a video game, its gen ...

Assignment - data mining and machine learning in the real

Assignment - Data Mining and Machine Learning in the Real World OBJECTIVE: Learn about some of the things going on in the real-world with Machine Learning and Data Mining. Your answers to the questions in the cells right ...

Data analysis assignment- apriori analysis and cluster

Data Analysis assignment- ApriorI analysis and Cluster analysis CLUSTER ANALYSIS The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis. Review the "k-MEANS CLUSTERING ALGORITHM" s ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Section onea in an atwood machine suppose two objects of

SECTION ONE (a) In an Atwood Machine, suppose two objects of unequal mass are hung vertically over a frictionless

Part 1you work in hr for a company that operates a factory

Part 1: You work in HR for a company that operates a factory manufacturing fiberglass. There are several hundred empl

Details on advanced accounting paperthis paper is intended

DETAILS ON ADVANCED ACCOUNTING PAPER This paper is intended for students to apply the theoretical knowledge around ac

Create a provider database and related reports and queries

Create a provider database and related reports and queries to capture contact information for potential PC component pro

Describe what you learned about the impact of economic

Describe what you learned about the impact of economic, social, and demographic trends affecting the US labor environmen