Ask Question, Ask an Expert

+61-413 786 465

Ask PHP Expert

To begin this assignment, download Data Generator script (v2.3.9) and activate it as your own web service which runs on MySQL, PHP, and HTTP server. Data Generator is a free PHP and JavaScript package. It facilitates a generation of the typical variation of data values which can be subsequently used to test-populate a relational database. In this assignment, you required to extend its data generation capabilities. By doing so, you can populate a realistic test data set which will also allow you to conduct analysis experiments in the remaining part of the course project. Activation of the software components as well as the Data Generator may require your hands-on work, sometimes a futile cycle of try-and-errors, depending on your skills and experience. Indeed, this is the exact mission and objective of the assignment – learning through practice!

Your first task is to realize two additional data generation functions. Firstly, extend the system to generate random integral numbers based on normal distribution. You require to study Data Generator's structure and extend number generation type to activate normal distribution. The interface needs to obtain both mean and sigma as shown in first Figure. Consider the code found here which is reproduced below for your convenience:

function gauss() {
  // N(0,1)
  // returns random number with normal distribution:
  // mean=0
  // std dev=1

  // auxiliary vars
  // two independent variables with normal distribution N(0,1)
  // i will return only one, couse only one needed
    return $u;
  function gauss_ms($m=0.0,$s=1.0) {
  // N(m,s)
  // returns random number with normal distribution:
  // mean=m
  // std dev=s
    return gauss()*$s+$m;
  function random_0_1() {
  // auxiliary function
  // returns random number with flat distribution from 0 to 1
    return (float)rand()/(float)getrandmax();

Notice that the return value of the above code is a floating value. You could round it to nearest integer by adding a "rounding option" to the interface.

396_Data generator.jpg

Next, realize one form of skewed distribution that approximates Pareto Principle. Consider a skewed access pattern often evidenced in data applications such that s percent of accesses would go to (100 - s) percent of data items. For instance, a typical "80-20 rule" for 1000 accesses over 500 data items means that about 800 accesses (80% of accesses) go to a specific set of about 100 items (20% of data items). In our case, data generation should be based on independent repeated trials, not as all trials once in a batch. Therefore, implementing strict Pareto Principle is difficult. Instead, we can approximate access pattern generation by the following method:

• skew generation function receives a range r and a skew factor s as parameter, both of which are integers and r must be larger 1 while s must be between 50 and 100.
• data elements are considered to have unique IDs in the range [1, r], in which elements are listed in an increasing order of IDs such as 1, 2, 3, ..., r.
• skew generation function produces an integer value between 1 and r representing a data access in the following manner:
1) skewed access will go to the top portion of the elements, that is, those between 1 and t = r × (100 - s) / 100.
2) draw a random number p from uniform distribution between 0 and 99.
3) if p falls in less than s, i.e., [0, s - 1], the top portion of elements [1, t] is accessed.
4) otherwise the access goes to [t + 1, r].

Above illustration should be sufficient to provide you with the concrete requirement for the two frequently utilized data generation. Figure 1 and 2 shows interface and sample output respectively. In these figures, rounding to integer is applied automatically. A checkbox should be added to the interface so that users can choose whether values generated are rounded or not. Notice that this development is not from scratch, but is "reverse engineering" of already developed product. Addition of the above functions to Data Generator is easily done. You should look into the contents of docs/data_types.php.

1313_Data generated.jpg

The second task is to add more generation capabilities and/or tailor the built-in functionality of generator. This enables you to produce "all" the test data for the MySQL tables you produced in the previous assignment. You do not need to populate data to the tables in this assignment (which will be the task in the next assignment) but you must become able to generate all the realistic data required for the proposed information analysis. You are not allowed to use other software tools (such as Microsoft Excel) to generate data. Any non-trivial extension from the statistical viewpoint concerning the data nature (like the above assigned functionality).

PHP , Programming

  • Category:- PHP
  • Reference No.:- M9633

Have any Question? 

Related Questions in PHP

Question continue to build on the skills of providing web

Question: Continue to build on the skills of providing Web page content and structure with HTML and Web page style and layout using CSS. Demonstrate the skill of creating a dynamic Web page that includes JavaScript clien ...

Question using the course software and week one template as

Question: Using the course software and Week One Template as a starting point, create a new HTML webpage and include the following: • Write the code to display your name in the largest-size heading element. • Write the m ...

In php write a simple addtion calculator that reads in two

In PHP write a simple addtion calculator that reads in two values and then add them together and display the output. Be sure to validate both inputs and ensure that it can add 0+0 = Sum: 0. The user should be able to inp ...

Final project assignment -requirements specifications -1

FINAL PROJECT ASSIGNMENT - REQUIREMENTS / SPECIFICATIONS - 1. For the final project, you will have to make good use of your CMS using PHP and MySQL for the content delivery. You will have to be able to deliver the conten ...

Question develop a 5-6 page word document that describes

Question: Develop a 5-6 page Word document that describes the design for a small Web site that meets the following specifications. 1. The Web site includes four or more Web pages designed according to current usability g ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As