Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Biology Expert

Bioinformatics Assignment -

In this assignment should check the following sequence and test whether it has the following restriction cut sites. This searching should be done globally, that is, it should check for all possible restriction sites. If the restriction sites are present, print out the regex, the pattern that matched the regex, and the position of where the cut beings.

Hint: the pos function gets the position of the last matched substring. Play around with it to see how it works.

Construct regular expressions for the two restriction enzyme motifs. Each restriction enzyme motif should be represented by one regular expression:

CACNNN/GTG (so CACNNN or CACGTG) where N represents A,C,T, or G

GCCWGG, where W represents A or T

The DNA sequence you will be searching in is this one, which you will paste into your program:

$dna = 'AACAGCACGGCAACGCTGTGCCTTGGGCACCATGCAGTACCAAACGGAACGATAGTGAAAACAATCACGA

ATGACCAAATTGAAGTTACTAATGCTACTGAGCTGGTTCAGAGTTCCTCAACAGGTGAAATATGCGACAG

TCCTCATCAGATCCTTGATGGAGAAAACTGCACACTAATAGATGCTCTATTGGGAGACCCTCAGTGTGAT

GGCTTCCAAAATAAGAAATGGGACCTTTTTGTTGAACGCAGCAAAGCCTACAGCAACTGTTACCCTTATG

ATGTGCCGGATTATGCCTCCCTTAGGTCACTAGTTGCCTCATCCGGCACACTGGAATTTAACAATGAAAG

CTTCAATTGGACTGGAGTCACTCAAAATGGAATCAGCTCTGCTTGCAAAAGGAGATCTAATAACAGTTTC

TTTAGTAGATTGAATTGGTTGACCCACTTAAAATTCAAATACCCAGCATTGAACGTGACTATGCCAAACA

ATGAAAAATTTGACAAATTGTACATTTGGGGGGTTCACCACCCGGGTACGGACAATGACCAAATCTTCCT

GTATGCTCAAGCATCAGGAAGAATCACAGTCTCTACCAAAAGAAGCCAACAGACTGTAATCCCGAATATC

GGATCTAGACCCAGAGTAAGGAATATCCCCAGCAGAATAAGCATCTATTGGACAATAGTAAAACCGGGAG

ACATACTTTTGATTAACAGCACAGGGAATTTAATTGCTCCTAGGGGTTACTTCAAAATACGAAGTGGGAA

AAGCTCAATAATGAGATCAGATGCACCCATTGGCAAATGCAATTCTGAATGCATCACTCCAAATGGAAGC

ATTCCCAATGACAAACCATTTCAAAATGTAAACAGGATCACATATGGGGCCTGGCCCAGATATGTTAAGC

AAAACACTCTGAAATTGGCAACAGGGATGCGAAATGTACCAGAGAAACAAACTAGAGGCATATTTGGCGC

AATCGCGGGTTTCATAGAAAATGGTTGGGAAGGAATGGTGGATGGTTGGTACGGTTT'

If you print out $dna, you may notice that the sequence is wrapped around some 70 characters or so. This means that $dna currently contains some \n characters in it, which will affect how regex matches against the string. In order to correctly identify all possible restriction sites, you would need to first remove those newline characters. This can be done by including the substitution operator after the variable declaration (similar to what we did in the vim writing exercises):

$dna =~ s/\s//g; # What would happen if the 'g' modifier is removed?

The following is the expected output. Instead of "$pattern1" and "$pattern2", you should be printing out the actual regular expression that you used to match the restriction enzyme motif. I did not print it out because that would give you part of the answer.

The program should include:

two regular expressions, one for each enzyme

one variable that contains the DNA sequence

optional if you would like to challenge yourself, include some code that will accept one command-line argument. If one is given, replace $dna above with the sequence provided by the user. Ensure that the provided sequence is a DNA sequence; otherwise, end the program and print a helpful message back to the user.

One subroutine called find_cut_sites that will accept 2 parameters: a DNA sequence and a regular expression. The subroutine should match the regular expression against the sequence and print the positions of all found cut sites. The position printed should be the starting position of where the site was found. Nothing should be explicitly returned by this subroutine. Whenever a subroutine does not explicitly return anything, it is known to be a void subroutine (void because a result is not provided back to the caller).

(There should be two subroutine calls for find_cut_sites(): one for each regular expression.)

Comments describing your subroutine (what it accepts, what it returns, what it does) and any other ambiguous code.

Attachment:- Assignment File.rar

Biology, Academics

  • Category:- Biology
  • Reference No.:- M93137338

Have any Question?


Related Questions in Biology

A filamentous organism has been isolated from decomposing

A filamentous organism has been isolated from decomposing organic matter. This organism has a cell wall but no chloroplast. why would you classify this organism into "domain eukarya, kingdom of fungi"

Effect of light intensityfor the effect of light intensity

Effect of Light Intensity For the Effect of Light Intensity Experiment, calculate the rate of oxygen production for the light intensity of 6. Subtract the original volume at 00:00:00 from the final reading at 00:02:00 to ...

What is the process of turning polynucleotides into

What is the process of turning polynucleotides into proteins?

Soap and detergent molecules have a long hydrophobic tail

Soap and detergent molecules have a long, hydrophobic tail and a polar, hydrophilic head. They are sometimes referred to as  bridge molecules  because they allow oils and fats to be suspended and dissolved in water (whic ...

What is the difference between systole and diastole and how

What is the difference between systole and diastole, and how do they relate to Systolic blood pressure and diastolic blood pressure?

Case study question -case study - mary 21 years old

Case Study Question - Case Study - Mary, 21 years old, presented to the hospital emergency department with an infected laceration on her left foot. Mary was at a beach resort four days ago, when she trod on a broken glas ...

Question overview although it may sound like science 2520

Question: Overview: Although it may sound like science 2520, DNA and RNA technologies are already used in products today. In this discussion you will examine one specific product, organism, or technology in your initial ...

Write the null hypothesis for the effect of temperature on

Write the null hypothesis for the effect of temperature on catechol oxidase activity?

What is the complementary strand for 5-atgcatgcatgccc-3how

What is the complementary strand for: 5'-ATGCATGCATGCCC-3' How many turn(s) will this strand have? Are Eukaryotic cells always diploid during S phase whereas bacteria are only haploid at the end of DNA replication?

What are the equations for molarity molality density ppm

What are the equations for Molarity, molality, density, ppm, ppb, mass percent, Volume percent, and mole percent?

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As