Using Unix extract the various ethnic populations in your file.
2. Your PERL program should produce two sets of data that can be used to produce figure1 and table 1 and table 2 for your final paper.
a. Table 1: Number of participants that subscribe to each ethnic group
b. Table 2: Number of each genotyped result by ethnic group.
Example : Table 2
Genotype
|
African-American
|
Asian
|
Caucasian
|
*1/*1
|
1094
|
276
|
7174
|
*1/*10
|
0
|
0
|
2
|
c. Figure 1: Using the data in Table 2 create a figure that can easily highlight the differences in genotype distributions across ethnic populations. Figure should be publication ready with titles, axis labels, legend, and description. Can be created using any statistics program or Excel.
Perl program requirements:
- Code should follow good programming practices
- The program should accept the path of the input file and the path of the output file from the command line.
- Keep in mind that *1/*2 is the same as *2/*1 Your code should consider these the same and eliminate any duplicate reporting.
- If a particular genotype is not included in an ethnic group, your code must put a zero value.
- You will need to include at least 2 subroutines in your program and pass any arrays or hashes as references to and from subroutines.
- You will need to use a hash as the data structure to determine the ethnic distribution.