Deoxy ribonucleic acid (DNA) is a molecule that carries the genetic instructions used in the growth, development, functioning and reproduction of all known living organisms and many viruses.
computer science
Description
Background Information
Deoxyribonucleic acid (DNA) is a molecule that carries the genetic
instructions used in the growth, development, functioning and
reproduction of all known living organisms and many viruses.
The two DNA strands composed of simpler units called
nucleotides. Each nucleotide is composed of one of four nitrogencontaining nucleobases — cytosine (C), guanine (G), adenine (A),
or thymine (T). During DNA transcription, DNA is divided into two
sections, coding regions (exons) are separated by noncoding regions
(introns).
Problem Statement
In this assignment, students will investigate how to find introns and exons in DNA
sequences.
Instructions
IMPORTANT: Complete the steps below in the order they are given. Completing the
steps out of order may complicate the assignment or result in an incorrect result.
1. Download and extract the provided Data Files ZIP file into the same directory
as your source code. It contains the following file for use in this assignment:
a. dna.txt – Sample DNA sequence [1].
2. Create a new Python script named lastname_firstname_hw3_dsp.py.
3. At the beginning of your script, insert a comment block containing the following
information:
# Firstname Lastname
# Date
# Computer Science 293B Section YY
# Homework #3: DNA Sequencing Problem
4. We wish to read data from the text file.
a. Create a function named read_data(). It should take the following
parameter:
Parameter Name Type Description
filename String Name of file to read.
Optional parameter with a default value
of dna.txt