Ask Biology Expert

Bioinformatics Assignment -

In this assignment should check the following sequence and test whether it has the following restriction cut sites. This searching should be done globally, that is, it should check for all possible restriction sites. If the restriction sites are present, print out the regex, the pattern that matched the regex, and the position of where the cut beings.

Hint: the pos function gets the position of the last matched substring. Play around with it to see how it works.

Construct regular expressions for the two restriction enzyme motifs. Each restriction enzyme motif should be represented by one regular expression:

CACNNN/GTG (so CACNNN or CACGTG) where N represents A,C,T, or G

GCCWGG, where W represents A or T

The DNA sequence you will be searching in is this one, which you will paste into your program:

$dna = 'AACAGCACGGCAACGCTGTGCCTTGGGCACCATGCAGTACCAAACGGAACGATAGTGAAAACAATCACGA

ATGACCAAATTGAAGTTACTAATGCTACTGAGCTGGTTCAGAGTTCCTCAACAGGTGAAATATGCGACAG

TCCTCATCAGATCCTTGATGGAGAAAACTGCACACTAATAGATGCTCTATTGGGAGACCCTCAGTGTGAT

GGCTTCCAAAATAAGAAATGGGACCTTTTTGTTGAACGCAGCAAAGCCTACAGCAACTGTTACCCTTATG

ATGTGCCGGATTATGCCTCCCTTAGGTCACTAGTTGCCTCATCCGGCACACTGGAATTTAACAATGAAAG

CTTCAATTGGACTGGAGTCACTCAAAATGGAATCAGCTCTGCTTGCAAAAGGAGATCTAATAACAGTTTC

TTTAGTAGATTGAATTGGTTGACCCACTTAAAATTCAAATACCCAGCATTGAACGTGACTATGCCAAACA

ATGAAAAATTTGACAAATTGTACATTTGGGGGGTTCACCACCCGGGTACGGACAATGACCAAATCTTCCT

GTATGCTCAAGCATCAGGAAGAATCACAGTCTCTACCAAAAGAAGCCAACAGACTGTAATCCCGAATATC

GGATCTAGACCCAGAGTAAGGAATATCCCCAGCAGAATAAGCATCTATTGGACAATAGTAAAACCGGGAG

ACATACTTTTGATTAACAGCACAGGGAATTTAATTGCTCCTAGGGGTTACTTCAAAATACGAAGTGGGAA

AAGCTCAATAATGAGATCAGATGCACCCATTGGCAAATGCAATTCTGAATGCATCACTCCAAATGGAAGC

ATTCCCAATGACAAACCATTTCAAAATGTAAACAGGATCACATATGGGGCCTGGCCCAGATATGTTAAGC

AAAACACTCTGAAATTGGCAACAGGGATGCGAAATGTACCAGAGAAACAAACTAGAGGCATATTTGGCGC

AATCGCGGGTTTCATAGAAAATGGTTGGGAAGGAATGGTGGATGGTTGGTACGGTTT'

If you print out $dna, you may notice that the sequence is wrapped around some 70 characters or so. This means that $dna currently contains some \n characters in it, which will affect how regex matches against the string. In order to correctly identify all possible restriction sites, you would need to first remove those newline characters. This can be done by including the substitution operator after the variable declaration (similar to what we did in the vim writing exercises):

$dna =~ s/\s//g; # What would happen if the 'g' modifier is removed?

The following is the expected output. Instead of "$pattern1" and "$pattern2", you should be printing out the actual regular expression that you used to match the restriction enzyme motif. I did not print it out because that would give you part of the answer.

The program should include:

two regular expressions, one for each enzyme

one variable that contains the DNA sequence

optional if you would like to challenge yourself, include some code that will accept one command-line argument. If one is given, replace $dna above with the sequence provided by the user. Ensure that the provided sequence is a DNA sequence; otherwise, end the program and print a helpful message back to the user.

One subroutine called find_cut_sites that will accept 2 parameters: a DNA sequence and a regular expression. The subroutine should match the regular expression against the sequence and print the positions of all found cut sites. The position printed should be the starting position of where the site was found. Nothing should be explicitly returned by this subroutine. Whenever a subroutine does not explicitly return anything, it is known to be a void subroutine (void because a result is not provided back to the caller).

(There should be two subroutine calls for find_cut_sites(): one for each regular expression.)

Comments describing your subroutine (what it accepts, what it returns, what it does) and any other ambiguous code.

Attachment:- Assignment File.rar

Biology, Academics

  • Category:- Biology
  • Reference No.:- M93137338

Have any Question?


Related Questions in Biology

Case study question -case study - mary 21 years old

Case Study Question - Case Study - Mary, 21 years old, presented to the hospital emergency department with an infected laceration on her left foot. Mary was at a beach resort four days ago, when she trod on a broken glas ...

Assignment -the upper-case blue letters are the 14th exon

Assignment - The upper-case, blue letters are the 14th exon (of 20) in the Hephl1 gene in mice. The lower-case (black) letters are from the flanking introns.  The highlighted bases indicate primers that may be used to ge ...

Question - a pure strain of mendels peas dominant for all

Question - A pure strain of mendel's peas, dominant for all seven of his independently assorting genes, was testcrossed. How many different kinds of gametes could the F1 PRODUCE?

Igfbp2 rbp4 and factor d post bariatric surgeryigfbp2 what

IGFBP2/ RBP4 and Factor D Post Bariatric Surgery IGFBP2 ( what the normal physiological action in the body? And how it affectedby obesity? andpost bariatric surgery?) RBP4 (what the normal physiological action in the bod ...

Assignment on nutrition - q1 task you need to select 2

Assignment on Nutrition - Q1. Task: You need to select 2 different age groups of your choice. You will need to plan balanced meals with snacks for a day. Once you have laid out the meal plan you need to: Explain why the ...

Question - gene cloning a please write the steps to clone

Question - Gene Cloning a) Please write the steps to clone the protease gene from Bacillus strain whose genome sequence is not known. b) Express the protease gene to obtain the enzyme in high yield, please plan your prot ...

Instructions address each question below as it relates to

Instructions: Address each question below as it relates to the caw study given. A patient was brought to the Emergency Department by ambulance with two arrow wounds. One arrow is still in the patient on the left side; en ...

Use of molecular tools and bioinforrnatics in the diagnosis

Use of Molecular Tools and Bioinforrnatics in the Diagnosis Characterization of Enteric Pathogens from a Case Study Purpose: The purpose of this project is to familiarize the student with modern molecular tools and bioin ...

Experiment 1 staining video1 open the media player by

Experiment 1: Staining Video 1. Open the Media Player by clicking on the film-strip button in the lower left of the lab's window frame, as shown below. The Media Player is a repository of images, videos, saved snapshots, ...

Chosen dr jan nolta- stem cell researcher head of uc davis

Chosen Dr. Jan Nolta- Stem Cell Researcher Head of UC Davis Stem Cell Program Director Topic Background: early Stem cells have the ability to develop into many different types of cells. Stem Cell Research is not without ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As