9.1 Competitive Auctions on eBay.com. The eBayAuctions file contains information on 1972 auctions transacted on eBay.com during May-June 2004. The goal is to use these data to build a model that will distinguish competitive auctions from noncompetitive ones.

A competitive auction is defined as an auction with at least two bids placed on the item auctioned. The data include variables that describe the item (auction category), the seller (his/her eBay rating), and the auction terms that the seller selected (auction duration, opening price, currency, day-of-week of auction close). In addition, we have the price at which the auction closed. The goal is to predict whether or not the auction will be competitive.

Data Preprocessing. Create dummy variables for the categorical predictors. These include Category (18 categories), Currency (USD, GBP, Euro), EndDay (Monday-Sunday), and Duration (1, 3, 5, 7, or 10 days). Split the data into training and validation datasets using a 60% : 40% ratio.
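
Outside XLMiner, the preprocessing step could be sketched in plain Python roughly as follows. This is only an illustration under assumptions: the helper names `one_hot` and `partition`, the random seed, and the three sample records are invented here and are not taken from the eBayAuctions data.

```python
import random

def one_hot(rows, column, drop_first=False):
    """Replace a categorical column with one 0/1 dummy column per level."""
    levels = sorted({row[column] for row in rows})
    kept = levels[1:] if drop_first else levels
    encoded = []
    for row in rows:
        new_row = {k: v for k, v in row.items() if k != column}
        for level in kept:
            new_row[f"{column}_{level}"] = 1 if row[column] == level else 0
        encoded.append(new_row)
    return encoded

def partition(rows, train_fraction=0.6, seed=1):
    """Shuffle a copy of the rows and cut it into training and validation parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# Made-up records, for illustration only (not real auction data).
auctions = [
    {"Currency": "USD", "Duration": 7, "OpenPrice": 0.99},
    {"Currency": "GBP", "Duration": 5, "OpenPrice": 4.50},
    {"Currency": "Euro", "Duration": 10, "OpenPrice": 12.00},
]
encoded = one_hot(one_hot(auctions, "Currency"), "Duration")
train, valid = partition(encoded)
print(len(train), len(valid))
```

Fixing the seed keeps the 60% : 40% partition reproducible, so trees fitted in different parts of the exercise see the same training rows.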

a. Fit a classification tree using all predictors, using the best pruned tree. To avoid overfitting, set the minimum number of observations in a leaf node to 50. Also, set the maximum number of levels to be displayed at seven (the maximum allowed in XLMiner). To remain within the limitation of 30 predictors, combine some of the categories of categorical predictors. Write down the results in terms of rules.

b. Is this model practical for predicting the outcome of a new auction?

c. Describe the interesting and uninteresting information that these rules provide.

d. Fit another classification tree (using the best-pruned tree, with a minimum number of observations per leaf node = 50 and the maximum allowed number of displayed levels), this time only with predictors that can be used for predicting the outcome of a new auction. Describe the resulting tree in terms of rules. Make sure to report the smallest set of rules required for classification.

e. Plot the resulting tree on a scatterplot: Use the two axes for the two best (quantitative) predictors. Each auction will appear as a point, with coordinates corresponding to its values on those two predictors. Use different colors or symbols to separate competitive and noncompetitive auctions. Draw lines (you can sketch these by hand or use Excel) at the values that create splits. Does this splitting seem reasonable with respect to the meaning of the two predictors? Does it seem to do a good job of separating the two classes?

f. Examine the lift chart and the classification table for the tree. What can you say about the predictive performance of this model?

g. Based on this last tree, what can you conclude from these data about the chances of an auction obtaining at least two bids and its relationship to the auction settings set by the seller (duration, opening price, ending day, currency)? What would you recommend for a seller as the strategy that will most likely lead to a competitive auction?
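
For parts a and d, reading rules off a fitted tree is mechanical: each path from the root to a leaf becomes one IF-THEN rule, and sibling leaves that predict the same class can be merged to get the smallest rule set. The sketch below assumes a tree stored as nested dictionaries. The representation, the function names, and above all the thresholds in `toy_tree` are invented for illustration and are not the answer to the exercise.

```python
def tree_to_rules(node, conditions=()):
    """Turn every root-to-leaf path into one IF-THEN rule."""
    if "label" in node:
        premise = " AND ".join(conditions) or "TRUE"
        return [f"IF {premise} THEN class = {node['label']}"]
    name, cut = node["feature"], node["threshold"]
    left = tree_to_rules(node["left"], conditions + (f"{name} < {cut}",))
    right = tree_to_rules(node["right"], conditions + (f"{name} >= {cut}",))
    return left + right

def collapse(node):
    """Merge sibling leaves that predict the same class (fewer rules, same predictions)."""
    if "label" in node:
        return node
    left, right = collapse(node["left"]), collapse(node["right"])
    if "label" in left and "label" in right and left["label"] == right["label"]:
        return {"label": left["label"]}
    return {**node, "left": left, "right": right}

# Hypothetical tree: the feature names echo the exercise, the thresholds are made up.
toy_tree = {
    "feature": "OpenPrice", "threshold": 3.5,
    "left": {
        "feature": "sellerRating", "threshold": 500,
        "left": {"label": "competitive"},
        "right": {"label": "competitive"},
    },
    "right": {"label": "noncompetitive"},
}

for rule in tree_to_rules(collapse(toy_tree)):
    print(rule)
```

In this toy case the split on `sellerRating` does not change the predicted class, so collapsing it reduces three rules to two without altering any classification.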

9.2 Predicting Delayed Flights. The file FlightDelays.xls contains information on all commercial flights departing the Washington, D.C., area and arriving at New York during January 2004. For each flight there is information on the departure and arrival airports, the distance of the route, the scheduled time and date of the flight, and so on. The variable that we are trying to predict is whether or not a flight is delayed. A delay is defined as an arrival that is at least 15 minutes later than scheduled.

Data Preprocessing. Create dummies for day of week, carrier, departure airport, and arrival airport. This will give you 17 dummies. Bin the scheduled departure time into 2-hour bins (in XLMiner use Data Utilities > Bin Continuous Data and select 8 bins with equal width). After binning DEP_TIME into 8 bins, this new variable should be broken down into 7 dummies (because the effect will not be linear due to the morning and afternoon rush hours). This will avoid treating the departure time as a continuous predictor because it is reasonable that delays are related to rush-hour times. Partition the data into training and validation sets.
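
The binning step can be sketched in plain Python as below. This is an assumption-laden illustration rather than XLMiner's actual procedure: the function names are invented, the sample times are made up, and the times are assumed to be stored as hhmm integers (so 1455 means 2:55 PM).

```python
def equal_width_bins(values, n_bins=8):
    """Assign each value a bin index 0..n_bins-1 using equal-width intervals.

    Assumes the values are not all identical (otherwise the width is zero).
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    # The maximum value would fall just past the last interval, so clamp it.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]

def bin_dummies(bin_ids, n_bins=8):
    """Encode bin indices as n_bins - 1 dummies; bin 0 is the reference level."""
    return [[1 if b == k else 0 for k in range(1, n_bins)] for b in bin_ids]

# Made-up scheduled departure times in hhmm form, for illustration only.
sched_dep = [600, 700, 1455, 2200]
bins = equal_width_bins(sched_dep)
for time, b, row in zip(sched_dep, bins, bin_dummies(bins)):
    print(time, b, row)
```

With times running from 600 to 2200, eight equal-width bins are each 200 units wide, which corresponds to the 2-hour bins the exercise asks for. Dropping one bin as the reference level is what leaves 7 dummies.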

a. Fit a classification tree to the flight delay variable using all the relevant predictors. Do not include DEP_TIME (actual departure time) in the model because it is unknown at the time of prediction (unless we are doing our predicting of delays after the plane takes off, which is unlikely). In the third step of the classification tree menu, choose:

• "Maximum number levels to be displayed = 6".
• Use the best pruned tree without a limitation on the minimum number of observations in the final nodes.

Express the resulting tree as a set of rules.

b. If you needed to fly between DCA and EWR on a Monday at 7 AM, would you be able to use this tree? What other information would you need? Is it available in practice? What information is redundant?

c. Fit another tree, this time excluding the day-of-month predictor. (Why?) Select the option of seeing both the full tree and the best pruned tree. You will find that the best pruned tree contains a single terminal node.

i. How is this tree used for classification? (What is the rule for classifying?)
ii. To what is this rule equivalent?
iii. Examine the full tree. What are the top three predictors according to this tree?
iv. Why, technically, does the pruned tree result in a tree with a single node?
v. What is the disadvantage of using the top levels of the full tree as opposed to the best pruned tree?
vi. Compare this general result to that from logistic regression in the example in Chapter 10. What are possible reasons for the classification tree's failure to find a good predictive model?
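
A tree with a single terminal node assigns every record to the same class, namely the most common class in the training data. The minimal sketch below shows that rule and its error rate. The function names and the 80/20 label mix are invented for illustration and are not the actual proportions in FlightDelays.xls.

```python
from collections import Counter

def majority_class(train_labels):
    """Return the most frequent class, which a single-node tree predicts for every record."""
    return Counter(train_labels).most_common(1)[0][0]

def error_rate(actual, predicted):
    """Share of records whose predicted class differs from the actual class."""
    wrong = sum(1 for a, p in zip(actual, predicted) if a != p)
    return wrong / len(actual)

# Invented labels, for illustration only.
train_labels = ["ontime"] * 8 + ["delayed"] * 2
rule = majority_class(train_labels)
predictions = [rule] * len(train_labels)
print(rule, error_rate(train_labels, predictions))
```

Any tree kept after pruning has to beat this error rate on the validation data. When no subtree does, the pruning procedure falls back to the single node.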

9.3 Predicting Prices of Used Cars (Regression Trees). The file ToyotaCorolla.xls contains the data on used cars (Toyota Corolla) on sale during late summer of 2004 in The Netherlands. It has 1436 observations containing details on 38 attributes, including Price, Age, KM, HP, and other specifications. The goal is to predict the price of a used Toyota Corolla based on its specifications. (The example in Section 9.8 is a subset of this dataset.)

Data Preprocessing. Create dummy variables for the categorical predictors (Fuel Type and Color). Split the data into training (50%), validation (30%), and test (20%) datasets.
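
A three-way partition can be sketched as follows. The function name, the seed, and the use of row indices in place of real records are assumptions made for this illustration.

```python
import random

def three_way_split(rows, fractions=(0.5, 0.3, 0.2), seed=1):
    """Shuffle a copy of the rows and cut it into training, validation, and test parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    first = round(n * fractions[0])
    second = first + round(n * fractions[1])
    # The test set takes whatever remains, so no row is lost to rounding.
    return shuffled[:first], shuffled[first:second], shuffled[second:]

# Row indices stand in for the 1436 records described in the exercise.
train, valid, test = three_way_split(range(1436))
print(len(train), len(valid), len(test))
```

The test partition is set aside untouched until the final tree has been chosen using the training and validation partitions.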

a. Run a regression tree (RT) using the prediction menu in XLMiner with the output variable Price and input variables Age_08_04, KM, Fuel_Type, HP, Automatic, Doors, Quarterly_Tax, Mfg_Guarantee, Guarantee_Period, Airco, Automatic_Airco, CD_Player, Powered_Windows, Sport_Model, and Tow_Bar. Normalize the variables. Keep the minimum number of observations in a terminal node to 1 and the scoring option to Full Tree, to make the run least restrictive.

b. Which appear to be the three or four most important car specifications for predicting the car's price?
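
One common way a regression tree chooses a split is to pick the cut point that most reduces the sum of squared errors (SSE) of the output variable. Predictors that give large reductions near the top of the tree are the ones usually read as most important. The sketch below illustrates that idea for one split at a time. It is not XLMiner's implementation, and the six cars and their prices are invented for illustration.

```python
def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def best_split(xs, ys):
    """Return (cut, remaining_sse) for the best binary split on one predictor.

    Candidate cuts are midpoints between consecutive distinct values.
    Returns None if the predictor has fewer than two distinct values.
    """
    best = None
    distinct = sorted(set(xs))
    for lo, hi in zip(distinct, distinct[1:]):
        cut = (lo + hi) / 2
        left = [y for x, y in zip(xs, ys) if x < cut]
        right = [y for x, y in zip(xs, ys) if x >= cut]
        total = sse(left) + sse(right)
        if best is None or total < best[1]:
            best = (cut, total)
    return best

# Invented data, for illustration only (not from ToyotaCorolla.xls).
cars = {
    "Age": [10, 20, 30, 60, 70, 80],
    "Doors": [3, 5, 3, 5, 3, 5],
}
prices = [20000, 19000, 18500, 9000, 8500, 8000]

baseline = sse(prices)
for name, xs in cars.items():
    cut, remaining = best_split(xs, prices)
    print(name, cut, round(baseline - remaining))
```

In this made-up sample the split on `Age` removes far more of the price variation than the split on `Doors`, which is the pattern to look for when ranking predictors in the fitted tree.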
