Assignment: Advanced Analytics

Objectives: The assignment relies on a series of workshops in which students learn how to use SAS Enterprise Miner (EM) to combine analytics of structured and unstructured (text) data to make business predictions. The assignment assumes a good working knowledge of all previously studied methods.

Mini Case Study: This mini case study will be used in all workshops of the module. All amendments, extensions and assumptions should be recorded in the final submission.

Business Problem - Early Warning System

The client is the US National Highway Traffic Safety Administration (NHTSA, pronounced "NITS-uh"). NHTSA is an agency of the Executive Branch of the U.S. government, part of the Department of Transportation. It is responsible for reducing deaths, injuries and economic losses resulting from motor vehicle crashes. NHTSA requires an Early Warning System for potential safety issues in automotive vehicles caused by manufacturing problems. In particular, it requires an analytic model capable of predicting the likelihood of a vehicle crash from publicly available vehicle safety complaints. Where the likelihood of crashes is high, NHTSA will initiate a recall of all vehicles likely to be affected.

The sample data, covering recreational vehicles (e.g. pick-up trucks, minivans and SUVs), comes from the NHTSA. Each record was filed by an individual who had experienced a problem with a specific vehicle component that may or may not have resulted in a vehicle crash.

Two data tables have been provided:

- trucks1.sas7bdat (20.5 MB of complaints data - download from CloudDeakin)

- stoplst2.sas7bdat (word stop list - download from CloudDeakin)

There are 56,601 observations in this sample of NHTSA data, where each observation is a document (record) representing a single complaint filed with the NHTSA through their survey instrument.

Target Variable: "CRASH". Approximately 30% of the complaints (documents) describe a situation in which a vehicle crash resulted from the failure or malfunction of the specified vehicle component.
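To make the class balance concrete, the figures quoted above can be turned into rough counts. The following is a minimal sketch in Python (not part of the SAS EM workshops); the 30% share is approximate in the brief, so the counts are estimates rather than values read from trucks1.sas7bdat.

```python
# Figures quoted in the brief: 56,601 complaints, roughly 30% flagged CRASH.
N_COMPLAINTS = 56_601
CRASH_RATE = 0.30  # approximate share stated in the brief, not a measured value

# Estimated class counts (rounded, because the 30% figure is approximate).
est_crash = round(N_COMPLAINTS * CRASH_RATE)
est_no_crash = N_COMPLAINTS - est_crash

# A model that always predicts "no crash" sets the accuracy floor to beat.
baseline_accuracy = est_no_crash / N_COMPLAINTS

print(f"estimated crash complaints:     {est_crash}")
print(f"estimated non-crash complaints: {est_no_crash}")
print(f"always-'no crash' accuracy:     {baseline_accuracy:.3f}")
```

Any predictive model built on this sample should therefore be judged against a naive accuracy of about 70%, which is why assessment statistics other than raw accuracy are worth reporting.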

The NHTSA collects consumer complaints about safety-related issues with motor vehicles and motor vehicle equipment, recorded by make, model and year and including the vehicle component.

Consumers are directed to a web site that guides them through submitting their complaint via a survey instrument. The complaint information is entered into NHTSA's vehicle owner's complaint database and used with other complaints to determine whether a safety-related defect trend exists. (Consumers without web access may call NHTSA directly; an operator will collect their information and enter it into the database.)

- If a safety-related defect exists in a motor vehicle or item of motor vehicle equipment, the manufacturer must fix it at no cost to the owner. The complaint is the first step in the process.

- Government engineers analyse the problem. If warranted, the manufacturer is asked to conduct a recall. If the manufacturer does not initiate a recall, the government can order the manufacturer to initiate a recall.

- The NHTSA does not have to receive a specific number of complaints before they look into a problem. They gather all available information on a problem. Each complaint is important to them.

Mini Study Predictive Analytics Workshop and Assignment

Q1. Describe the business problem and the potential value of the predictive model to the client. Propose an analytic solution to the problem and support your recommendation with references to the data and text analytics conducted.

Q2. Provide a summary of the sample data using descriptive statistics and frequency distributions. Specifically identify any anomalous or inconsistent data characteristics, explaining the potential impact.

Q3. Describe any treatments or transformations undertaken to resolve the anomalous or inconsistent data characteristics from question 2.

Q4. Perform text analytics on the "CSUMMARY" data item, generating at least 5 topic clusters. Provide a description for each of the clusters generated.

Q5. Develop at least three predictive models for each of the following input characteristics combinations:

a. Using only the structured data (all columns excluding: CSUMMARY and the text topic clusters)

b. Using only the unstructured data (using only the generated text topic clusters)

c. Using both structured and unstructured data

Q6. For all models, provide a summary of the model assessment statistics over the training and validation partitions.

Q7. Select the best predictive model and provide a summary of the model and its performance.
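The comparison asked for in Q6 and Q7 can be sketched outside SAS EM as follows. This is only an illustration in Python, not the workshop method: the function names `assess` and `select_best`, the model labels, and the toy label vectors are all hypothetical, and the choice of validation misclassification rate as the selection criterion is an assumption rather than a requirement of the brief.

```python
def assess(actual, predicted):
    """Return basic assessment statistics for binary labels (1 = crash)."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    return {
        "misclassification": (fp + fn) / len(pairs),
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
    }


def select_best(results):
    """Pick the model with the lowest validation misclassification rate."""
    return min(
        results,
        key=lambda name: results[name]["validation"]["misclassification"],
    )


# Invented toy labels for illustration only (NOT NHTSA data or real results).
actual_valid = [1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
toy_predictions = {
    "structured_only": [0, 0, 0, 1, 0, 1, 0, 0, 0, 0],
    "text_only":       [1, 0, 0, 1, 0, 0, 0, 0, 1, 0],
    "combined":        [1, 0, 0, 1, 0, 0, 1, 0, 1, 0],
}

# Only the validation partition is scored here; the same call applies to training.
results = {
    name: {"validation": assess(actual_valid, preds)}
    for name, preds in toy_predictions.items()
}
best = select_best(results)

for name, parts in results.items():
    print(name, parts["validation"])
print("selected:", best)
```

Reporting the same statistics for the training partition alongside the validation figures makes overfitting visible: a model whose training misclassification is much lower than its validation misclassification is a weaker candidate for Q7.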
