Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Computer Network & Security Expert

Genome4U is a scientific research project at a large university in the United States. Genome4U has recently started a large-scale project to sequence the genomes of 250,000 volunteers with a goal of preparing a set of publicly accessible databases with human genomic, trait, and medical data.

The project's founder, a brilliant man with several talents and interests, tells you that the public databases will give information to the world's scientific community in general, not just those interested in medical research. Genome4U is trying not to prejudge how the data will be used because there can be opportunities for interconnections and correlations that computers will find that people might have missed.

The founder envisions clusters of servers that may be accessible by researchers all over the world. The databases will be used by end users to study their own genetic heritage, with the help of their doctors and genetic counselors. In addition, the data will be used by computer scientists, mathematicians, social scientists, physicists, and other researchers.

The genome for a single human consists of complementary DNA strands wound together in a double helix. The strands hold about 6 billion base pairs of nucleotides connected by hydrogen bonds. To store the research data, 1 byte of capacity is used for each base pair.

As a result, 6 Giga-Bytes of data capacity is required to store the genetic information of just one person. The project plans to use network-attached Storage (NAS) clusters. A system has been prototyped using the current version of FreeNas software. Production software is expected to migrate to HBase and Hadoop cloud computing infrastructure.

Genome4U has prepared new techniques to sequence a person's genome, quickly, accurately and most importantly at low cost. The research group is a contestant for the $10,000,000 X-Prize offered by Archon-Genomics. With their current funding they expect to complete the sequencing of 25,000 individuals by December 2012 and will sequence 5,000 individuals every month thereafter with the equipment that they are currently using.

In addition to genetic information, the project will ask volunteers to provide detailed information about their traits so that researchers can find correlations between traits and genes. Volunteers will also give their medical records. Storage will be needed for these data sets and the raw nucleotide data. This detailed medical information is expected to require not more than 100 Mega-Bytes of storage for each individual.

Since the data is to be publically shared, an initial community of 25,000 active users are expected, and this community expected to double every 18 months. Active users are expected to access 10% of the entire database daily which is expected to make huge demand on the networking infrastructure.

You have been brought in as a network design consultant to help the Genome4U project and the management team has asked you to help them organize their needs.

They would appreciate your analysis to answer the subsequent questions:

1. List the major user communities.

2. List the major data stores and the user communities for each data store.

3. Prepare a graph of the storage requirements for the project monthly for the next 3 years.

4. Based on the size of the database, and the demands of the active users, what is the expected network capacity required to support the growing community of users? Add this capacity demand to the storage graph you drew above.

5. Can you evaluate the relationship between the storage size, number of genomes, number of users and network capacity requirements? If possible express this as an equation.

6. Review the capabilities of FreeNAS software. Will the FreeNAS software scale to the projected requirements of this application? If you find limits to its scalability what other solutions are possible?

7. Characterize the network traffic in terms of flow, load, behavior, and QoS requirements. You will not be able to precisely characterize the traffic but provide some theories about it and document the types of tests you would conduct to prove your theories right or wrong.

8. What additional questions would you ask Genome4U's founder about this project? Who besides the founder would you talk to and what questions would you ask them?

Computer Network & Security, Computer Science

  • Category:- Computer Network & Security
  • Reference No.:- M9132252

Have any Question?


Related Questions in Computer Network & Security

Question what is active threat in terms of network security

Question: What is active threat in terms of network security? Provide an example. The response must be typed, single spaced, must be in times new roman font (size 12) and must follow the APA format. Note: minimum 300 wor ...

Lab activity investigate system backup and restore

Lab Activity: Investigate System Backup and Restore Tools Purpose: Assess and Document Tools to Backup and Restore the System Hard Drive for a Windows 8.1 Workstation. - Assess and document the use of a system backup too ...

Overviewthis assignment has three major aims- to help

Overview This assignment has three major aims: - To help students gain good understanding of theoretical and practical material. - To encourage students to use content analysis summaries to prepare for tests, examination ...

Suppose after collecting data on an existing firms actual

Suppose, after collecting data on an existing firm's actual short-run ouput, the following production function is found to match the data: TP = Q = 5*L + 0.6*L2 - 0.01*L3 1. Using the equation above, find the following e ...

Fiona told her friend that she is very fortunate as the

Fiona told her friend that she is very fortunate as the slow-down in the economy has not decreased sales in her grocery store by much compared to sales of new cars in his car dealership. Explain what Fiona meant using th ...

Suppose that third national bank has reserves of 20000 and

Suppose that Third National Bank has reserves of $20,000 and check able deposits of $200,000. The reserve ratio is 10 percent. The bank sells $20,000 in securities to the Federal Reserve Bank in its district, receiving a ...

What is the difference between a positive economic

What is the difference between a positive economic statement and a normative one.

You need to prepare packet tracer fileattached pdf contains

You need to prepare packet tracer file attached pdf contains topology and required configurations and assigned ip address. In packet tacer file you need to include banner, router and switches. 1. VLSM Design a) As first ...

How would you explain the concept of a quality adjusted

How would you explain the concept of a quality adjusted life year? When is it appropriate to use "QALYs" instead of simply improved life expectancy as the outcome measure in an economic evaluation?

Suppose that serendipity bank has excess reserves of 12000

Suppose that Serendipity Bank has excess reserves of $12,000 and check able deposits of $150,000. If the reserve ratio is 20 percent, what is the size of the bank's actual reserves?

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As