Ask DBMS Expert


Home >> DBMS

Universal Bank is a young bank growing rapidly in overall customer acquisition. The majority of these customers are depositors with varying sizes of relationship with the bank. The customer base of borrowers is quite small, and the bank is interested in expanding this base to bring in more loan business. In particular, it wants to explore ways of converting depositors to borrowers while retaining them as depositors.

A campaign that the bank ran last year for depositors showed a healthy conversion rate of over 9% success. This has encouraged the retail marketing department to devise smarter campaigns with better targeted marketing. The goal of this assignment is to model the previous campaign's customer behavior to analyze what combination of factors make a customer more likely to accept a personal loan. This will serve as the basis for the design of a new campaign.

The file UniversalBank contains data on 5000 customers. The data include customer demographic information, account information, and the customer response to the last personal loan campaign. The layout of the file is described below. The last five columns are Yes/No responses. 0 = No; 1 = Yes.

Data Description:

ID

Customer ID

Age

Customer's age in completed years

Experience

#years of professional experience

Income

Annual income of the customer ($000)

Family

Family size of the ustomer

CCAvg

Avg. spending on credit cards per month ($000)

Mortgage

Value of house mortgage if any. ($000)

Securities Account

Does the customer have a securities account with the bank?

CD Account

Does the customer have a certificate of deposit (CD) account with the bank?

Online

Does the customer use internet banking facilities?

CreditCard

Does the customer use a credit card issued by UniversalBank?

Personal Loan

Did this customer accept the personal loan offered in the last campaign?


In this assignment, you will use a set of R scripts that I wrote to train and test a K nearest neighbors (KNN) classifier for the UniversalBank data set.

The script UB_tr_vl_ts.R partitions UniversalBank into a training set (50% of the cases), a validation set (30% of the cases) and a test set (20% of the cases). This process corresponds to slides 6 and 7 in this week's slide deck.

With the script BillsKNNtrain.R, you supply k, the number of neighbors to use in the analysis, and R calculates the training error and the validation set results including the confusion matrix, the error rate, the true positive rate and the true negative rate.

With the script BillsKNNtest.R, you supply k, the number of neighbors to use in the analysis, and R calculates the test set results including the confusion matrix, the error rate, the true positive rate and the true negative rate.

To complete this assignment, answer the questions below in a Word document and submit the document by the due date.

1) Produce a table similar to the one shown in this week's slide 15. Investigate k values from 1 through 20 and report the training error and the validation error.

2) From your results in question 1, choose the best value of k for this analysis and explain your choice..

3) Run BillsKNNtest.R for your chosen value of k.

4) From your results in questions 1, 2 and 3, what error rate can you expect on new data if you use your chosen value of k? Explain how you arrived at your answer.

5) For your chosen value of k, explain why the Validation Confusion Percentages and the Test Confusion Percentages are different.

6) Explain how we avoid overfitting in the development of this knn classifier.

7) Explain why the training error for a 1 nearest neighbor classifier is always 0.

8) What do the True Positive Rate and the True Negative Rate tell us about the performance of the classifier? Why might this information be useful to someone using the classifier?

9) Evaluate the following statement. Since every student uses the same UniversalBank.txt file, every student's confusion matrices should be exactly the same.

http://wikisend.com/download/761982/BillsKNNtrain.R

[URL=http://wikisend.com/download/761982/BillsKNNtrain.R]BillsKNNtrain.R[/URL]

http://wikisend.com/download/616584/UB_tr_vl_ts.R

[URL=http://wikisend.com/download/616584/UB_tr_vl_ts.R]UB_tr_vl_ts.R[/URL]
http://wikisend.com/download/139866/UniversalBank.txt.docx

[URL=http://wikisend.com/download/139866/UniversalBank.txt.docx]UniversalBank.txt.docx[/URL]

http://wikisend.com/download/640962/BillsKNNtest.R

[URL=http://wikisend.com/download/640962/BillsKNNtest.R]BillsKNNtest.R[/URL]

DBMS, Programming

  • Category:- DBMS
  • Reference No.:- M91592191
  • Price:- $120

Guranteed 48 Hours Delivery, In Price:- $120

Have any Question?


Related Questions in DBMS

Data mining assignment -in this assignment you are asked to

Data Mining Assignment - In this assignment you are asked to explore the use of neural networks for classification and numeric prediction. You are also asked to carry out a data mining investigation on a real-world data ...

Sql query assignment -for this assignment you are to write

SQL Query Assignment - For this assignment you are to write your answers in a word document. This assignment is in three parts: Part A (reporting queries), Part B (query performance), Part C (query design). For this assi ...

The groceries datasetimagine 10000 receipts sitting on your

The groceries Dataset Imagine 10000 receipts sitting on your table. Each receipt represents a transaction with items that were purchased. The receipt is a representation of stuff that went into a customer's basket. That ...

You are in a real estate business renting apartments to

You are in a real estate business renting apartments to customers. Your job is to define an appropriate schema using SQL DDL in MySQL. The relations are Property(Id, Address, NumberOfUnits), Unit(ApartmentNumber, Propert ...

Objectivethe objective of this lab is to be familiar with a

OBJECTIVE: The objective of this lab is to be familiar with a process in big data modeling. You're required to produce three big data models using the MS PowerPoint software. This tool is available on UMUC Virtual Deskto ...

The relation memberstudentid organizationid roleid stores

The relation Member(StudentId, OrganizationId, RoleId) stores the membership information of student joining organization. For example, ('S1', 'O2', 'R3') indicates that student with Id 'S1' joined the organization with i ...

Relational database exerciseyou have been assigned to a new

Relational Database Exercise: You have been assigned to a new development team. A client is requesting a relational database system to manage their present store with the anticipation of adding more stores in the future. ...

Relational database design a given the following business

Relational Database Design A) Given the following business rules, identify entity types, attributes (at least two attributes for each entity, including the primary key) and relationships, and then draw an Entity-Relation ...

We can represent a data set as a collection of object nodes

We can represent a data set as a collection of object nodes and a collection of attribute nodes, where there is a link between each object and each attribute, and where the weight of that link is the value of the object ...

Data model development and implementationpurpose of the

Data model development and implementation Purpose of the assessment (with ULO Mapping) The purpose of this assignment is to develop data models and map Database System into a standard development environment to gain unde ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As