Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Statistics and Probability Expert

Statistical Modelling Assignment

OVERVIEW OF THE ASSIGNMENT

This assignment will test your skills of collecting and analysing data to answer a specific business problem. It also gives you the opportunity to apply the theories you have learned in this course such as finding numerical summaries, displaying with appropriate graphs and using statistical inferences to solve business problems, including constructing hypotheses, test them and interpret the findings. You may have to use two Data sets. One Data set will be sent to you via KOI student email individually and you need to find or collect another dataset.

Suppose you are working for an agency who analyse NSW transport system data to make a recommendation to improve public transport system. You will be given series of research questions. Use your knowledge that you gain from this course to answer these questions by displaying appropriate outputs of Excel, StatKey or Wolfram alpha. Use these answers to write an executive summary which might be a valuable recommendation to Transport NSW.

TASK DESCRIPTION: WRITTEN REPORT

There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below.

Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of a data Opal Tap on and Tap Off Location - 8th to 14th August 2016 individual sample file, provided by the Transport for NSW Open Data and has been edited to only include a subset of the cases and variables.

The original dataset can be obtained and it is under the license of Creative Commons Attribution 3.0 Australia. Data dictionary of the edited dataset is given in the following table.

Variable

Description

Values

mode

Type of the public transport

Bus, Train, Ferry and Light Rail

date

Date of the tap on/off held

Date/month/year

tap

It is a tap on or off

On and Off

loc

Locations of stops. For bus

postcodes and others name of the stations

Postcodes and names of the stations

count

Total number tap on or off on the certain location and

the certain date

Number

Dataset 2: Collect data (e.g. via a survey) that will answer research question given in section 3. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below).

Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or Statkey.

Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure:

1. Section 1: Introduction
a. Give a brief introduction about the assignment and search related article and write a paragraph of summary which supports your assignment. You need to give the full citation of the article.
b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What are types of variables involved? Explain briefly what are the possible cases used in this study.
c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What is/are the type(s) of variable(s) involved? Give a description of cases you consider for this data set.

2. Section 2: Analysis of single variable in Dataset 1
a. To answer research question "Which type of public transport was most used by the NSW people during 8th to 14th of August 2016?", provide a suitable numerical summary and graphical display for the variables mode of Dataset 1. Give a detailed comment to answer the research question.
b. Now to answer research question "Are there more than 50% of public transport users in NSW use the particular mode of transport found in Part a?" setup an appropriate hypotheses, perform hypotheses test and answer the research question by writing the conclusion of the test.

3. Section 3: Analysis of two variables in Dataset 1
NSW Government need to decide on whether they have to build an underground Railway line from either Parramatta, Bankstown or Gosford to central. To prepare a recommendation for this;
a. Give a numerical summary and an appropriate graphical display for the variables location, by only considering those three stations; and the variable count by considering the data with trains only.
b. Perform a suitable hypothesis test at a 5% level of significance to test whether there is difference between mean counts of taps on and off.
c. Use the conclusion of the test in part b and the outputs in part a to write a recommendation to NSW government.

4. Section 4: Collect and analysis Dataset2
You are interested in finding whether there is a difference in preference between different gender in terms of their transport mode (Bus, Train, Ferry and Light Rail). by considering appropriate number of cases and variable, give a proper graphical display and use it to write a comments.

Section 5: Discussion & Conclusion

Write an executive summary by combining all your findings in the previous sections which must be a valuable recommendation for NSW Transport. Give a suggestion for further research

TASK DESCRIPTION: PRESENTATION/INTERVIEW

A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial.

You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey).

Attachment:- Data 15.rar

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M93099988
  • Price:- $35

Priced at Now at $35, Verified Solution

Have any Question?


Related Questions in Statistics and Probability

According to national data about 14 of american college

According to national data, about 14% of American college students earn a graduate degree. Using this estimate, what is the probability that exactly 24 undergraduates in a random sample of 200 students will earn a colleg ...

Help me study by answering this question a stock is just

Help me study by answering this question. A stock is just paid a dividend of $0.91 and is growing at a constant rate of 10 percent per year. If the required rate of return is 15 percent, what is the stock's expected pric ...

Suppose a stock has just paid a 44 per share dividend d0

Suppose a stock has just paid a $4.4 per share dividend (D 0 ). The dividend is projected to grow at 15% for the  next three  years, then 6% thereafter indefinitely. What should be the amount of  dividend  in  four  year ...

The statistical and regression questions are listed

The statistical and regression questions are listed below: What is the value of the simple correlation between TICKETS and COST? If you are able to determine the answer, is showing the film in more theatres associated wi ...

A researcher calculates a 90 confidence interval of 3397

A researcher calculates a 90% confidence interval of (3397, 3421) to estimate the population birth weight using a random sample of 84 babies born to mothers with gestational diabetes. She wants to compare this range to t ...

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. Assume that the first cash flow will occur one year from today (that is, at t = 1). (Round your answer ...

A system consists of five independent components in

A system consists of five independent components in parallel. The system will work if at least one of the five components works. Let Ci represent the event that component i works, i=1,...,5. P(Ci)=0.97 for all i. What is ...

Youve finally decided to retire at the ripe old age of 50

You've finally decided to retire at the ripe old age of 50, and due to some fancy investing, you have accumulated $750,000 in mutual funds. Based upon genetics, you're likely to live until you're 80. Since you've taken t ...

In a single season an average of 24 home runs were hit per

In a single? season, an average of 2.4 home runs were hit per game. Assume the number of home runs per game follows the Poisson distribution. ?a) What is the probability that 6 home runs will be hit in a randomly selecte ...

1 if there are 150 values in a data set how many classes

1) If there are 150 values in a data set, how many classes should be created for a frequency histogram?  A. 5 B. 6 C. 7 D. 8 E. 9 2) . If x is a binomial random variable where n = 100 and p = 0.2, find the probability th ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As