Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Management Information System Expert

Allstate Claim Prediction Challenge

A key part of insurance is charging each customer the appropriate price for the risk they represent.

Risk varies widely from customer to customer, and a deep understanding of different risk factors helps predict the likelihood and cost of insurance claims. The goal of this competition is to better predict Bodily Injury Liability Insurance claim payments based on the characteristics of the insured customer's vehicle.

Many factors contribute to the frequency and severity of car accidents including how, where and under what conditions people drive, as well as what they are driving.

Bodily Injury Liability Insurance covers other people's bodily injury or death for which the insured is responsible. The goal of this competition is to predict Bodily Injury Liability Insurance claim payments based on the characteristics of the insured's vehicle.

Files

Train.cvs

Test.cvs

Data Description

Each row contains one year's worth information for insured vehicles. Since the goal of this competition is to improve the ability to use vehicle characteristics to accurately predict insurance claim payments, the response variable (dollar amount of claims experienced for that vehicle in that year) has been adjusted to control for known non-vehicle effects.

Some non-vehicle characteristics (labeled as such in the data dictionary) are included in the set of independent variables. It is expected that no "main effects" corresponding will be found for these non-vehicle variables, but there may be interesting interactions with the vehicle variables.

Calendar_Year is the year that the vehicle was insured. Household_ID is a household identification number that allows year-to-year tracking of each household. Since a customer may insure multiple vehicles in one household, there may be multiple vehicles associated with each household identification number.

"Vehicle" identifies these vehicles (but the same "Vehicle" number may not apply to the same vehicle from year to year). You also have the vehicle's model year and a coded form of make (manufacturer), model, and submodel. The remaining columns contain miscellaneous vehicle characteristics, as well as other characteristics associated with the insurance policy. See the "data dictionary" (data_dictionary.txt) for additional information.

Our dataset naturally contained some missing values. Records containing missing values have been removed from the test data set but not from the training dataset. You can make use of the records with missing values, or completely ignore them if you wish. They are coded as "?".

There are two datasets to download: training data and test data. You will use the training dataset to build your model, and will submit predictions for the test dataset. The training data has information from 2005-2007, while the test data has information from 2008 and 2009. Submissions should consist of a CSV file. Records from 2008 will be used to score the leaderboard, and records from 2009 will be used to determine the final winner.

Missing feature values have been kept as is, so that the competing teams can really use the maximum data available, implementing a strategy to fill the gaps if desired. Note that some variables may be categorical (e.g. f776 and f777).

The competition sponsor has worked to remove time-dimensionality from the data. However, the observations are still listed in order from old to new in the training set. In the test set they are in random order.

Attachment:- Walmart-Instructions.rar

Management Information System, Management Studies

  • Category:- Management Information System
  • Reference No.:- M92489075

Have any Question?


Related Questions in Management Information System

Write a 2 page paper that discusses what policies were

Write a 2 page paper that discusses what policies were missing in the particular case. Do additional research than what was provided in the text. Use APA format Cite your sources. •Private Sector •Target Corporation •1,7 ...

Question how are authentication and authorization alike and

Question : How are authentication and authorization alike and how are they different? What is the relationship, if any, between the two? Please write minimum of 200 words with 0 % plagiarism.

Part 11 what are the practical implications in the

Part 1 1) What are the practical implications in the theoretical framework? 2) What are the complexities seen in the organization? Write 300 Words. Include in-text citations and references in APA style formatting. Must h ...

Confidentiality integrity and availability or the cia triad

Confidentiality, integrity, and availability or the CIA triad of security is introduced in this session. These three dimensions of security may often conflict. Confidentiality and integrity often limit availability. So, ...

Question in roughly 200 words - explain what social

Question : In roughly 200 words - Explain what social engineering is, and discuss how organizations can recognize social engineering and reduce the risks associated with it. The response must be typed, single spaced, mus ...

Instructionsyou have just been hired as a compliance

Instructions You have just been hired as a compliance officer for your healthcare organization, and you have discovered that the food services department of the organization is not in compliance with state food safety re ...

Find a systems analyst position not a business systems

Find a systems analyst position (not a business systems analyst) currently advertised near you. Dice, GlassDoor, Indeed, and Monster are good sources but Do not repeat a listing that has been used by another student. Ple ...

Research histogram and its use then post a description of

Research Histogram, and its use, then post a description of Histogram, and describe it in detail-explaining its purpose and how it works. Give your opinion on when it would be appropriate to use this tool in an IT settin ...

Part 1 200-250 words with referencesprovide an example of a

Part 1: 200-250 words with references Provide an example of a data warehouse model defining the grain, dimensions and facts of the data warehouse. Part 2: 200-250 words with references Identify the importance of selectin ...

The records are ordered by the date of entry into the union

The records are ordered by the date of entry into the union. The first three lines of the file are: Delaware, DE, 12/7/1787, 1954, 941875 Pennsylvania, PA, 12/12/1787, 44817, 12856989 New Jersey, NJ, 12/18/1787, 7417, 89 ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As