There are 300 rows of data for training and the number of, Ask an Expert

Programming

problem 1) Categorical variable tread Pattern = {A, B} and numerical variable tread Depth are used to classify Handling = {good, poor} of a vehicle. Depth is normally distributed with mean 9 and standard deviation 3 when Handling is good and mean 16 and standard deviation 4 when Handling is poor.

There are 300 rows of data for training and the number of rows of data for each combination of Pattern and Handling are given in the table below.

1228_pattern and handling.jpg

Given row x₀ = (Pattern = A, Depth=12). Use a naive Bayes’ classifier to compute the class of Handling assigned to x₀. Show your work for full credit.

problem 2) Consider the following neural network for two predictors Thickness and Alignment and two classes Print Quality High and Low. Some weights are shown in the table, including weights for constant (threshold) nodes that link to each output node.

1751_neural network.jpg

Suppose input data row x₀ produces z₁ = 0.5 at node H₁ and z₂ = 0.4 at node H2. Assume the softmax function is used at the outputs. Compute the output at each output node. To what Print Quality class is row x₀ assigned?

problem 3) In either the design or training of a neural network, provide a list of several ways to reduce the complexity of the model.

problem 4) Consider a support vector machine with three inputs x₁, x₂, and x₃. Assume there are only two support vectors with values and parameters as follows:

405_vector machine.jpg

problem 5) Consider one versus the rest voting used for classifier with three classes {a, b, c}. Given a row of data denoted as x₀ suppose that the classifier for a versus the rest predicts the rest, b versus the rest predicts the rest, and c versus the rest predicts c. What are the votes for each class for x₀?

problem 6) Consider the data with categorical predictor x₁ = {green or red} and numerical predictor x₂and the class variable y shown in the following table. The weights for a round of boosting are also shown in the table. Suppose the classifier built from this round of boosting assigns y = 1 for x₂ < 0.45 and y = -1 for x₂ > 0.45. Compute the coefficient α for this classifier in the boost algorithm.

425_categorical predictor.jpg

problem 7) A data set with 1000 rows is input to a neural network in Weka. The test option is set to 10-fold cross validation and the neural network option validationSetSize = 20%. How many rows of data are used in a validation set in a fold?

A support vector machine in Weka uses a Gaussian (radial basis function) kernel with parameter gamma. Gamma should be selected less than or equal to one. Circle True or False.

A support vector machine in Weka uses a Gaussian (radial basis function) kernel with parameter gamma = 0.2. This corresponds to the parameter σ² equal to what value?

problem 8) Given a dataset with 1000 rows and 25 predictors labeled x₁, x₂, …,x₂₅ to classify into two classes {a, b}. Consider the small random forest with 3 trees and one split in each tree as shown below. Here 5 predictors are selected randomly at each node. The class assigned to each leaf node is also shown.

a) Given a row of data x₀ with x1= green, x₅ = 4, x₉ = 9, predict the class label for the row.

b) For Tree1, it can be concluded that the best split among all 25 predictors is obtained from x1. Circle True or False.

c) Approximately 368 rows are expected to be out of bag for Tree 1. Circle True or False.

d) Categorical variables are coded to {0, 1} indicator variables in a random forest. Circle True or False.

problem 9) Suppose the random sample used for each tree in the previous random forest is decreased from 1000 rows to 500 rows. Circle ALL that are true for the ensemble classifier.

a) Variance of predictions is expected to decrease.

b) Bias of predictions is expected to decrease.

c) Each base classifier is expected to have more out of bag rows of data.

d) Generalization error is expected to decrease.

problem 10) A Bayesian network is shown for the variables paper Thickness, paper Alignment and Print Quality. The conditional probabilities are provided in the tables beside the nodes. Here, Thickness = {high or low}, Alignment = {straight or angled}, and Print Quality = {good or bad}.

1930_bayesian network.jpg

find out the following quantities from the information in the Bayesian network.

a) P(Alignment = straight)

b) P(Alignment = straight | Print Quality = good)

View complete question

DBMS, Programming

Category:- DBMS
Reference No.:- M9377

Have any Question?Write your Review or question?

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Recent Questions

Ask DBMS Expert

Programming

Related Questions in DBMS

Data mining assignment -in this assignment you are asked to

Sql query assignment -for this assignment you are to write

The groceries datasetimagine 10000 receipts sitting on your

You are in a real estate business renting apartments to

Objectivethe objective of this lab is to be familiar with a

The relation memberstudentid organizationid roleid stores

Relational database exerciseyou have been assigned to a new

Relational database design a given the following business

We can represent a data set as a collection of object nodes

Data model development and implementationpurpose of the

Ask Experts for help!!

Looking for Assignment Help?

Why might a bank avoid the use of interest rate swaps even

Describe the difference between zero coupon bonds and

Compute the present value of an annuity of 880 per year

Compute the present value of an 1150 payment made in ten

Compute the present value of an annuity of 699 per year

Follow Us