Python for Biologists: A complete programming course for beginners by Dr Martin Jones

Python for Biologists: A complete programming course for beginners by Dr Martin Jones

Author:Dr Martin Jones [Jones, Dr Martin]
Language: eng
Format: epub
Publisher: UNKNOWN
Published: 2018-06-12T23:00:00+00:00


We'll get an error: NameError: name 'a_count' is not defined 1 Indeed, it doesn't actually exist when it's defined, only when it runs.

Calling and improving our function

Let's write a small program that uses our new function, to see how it works. We'll try both storing the result in a variable before printing it (lines 8 and 9) and printing it directly (lines 10 and 11):

1 def get_at_content(dna):

2 length = len(dna)

3 a_count = dna.count('A')

4 t_count = dna.count('T')

5 at_content = (a_count + t_count) / length

6 return at_content

7

8 my_at_content = get_at_content("ATGCGCGATCGATCGAATCG")

9 print(str(my_at_content))

10 print(get_at_content("ATGCATGCAACTGTAGC"))

11 print(get_at_content("aactgtagctagctagcagcgta"))

Looking at the output, we can see that the first function call works fine – the AT content is calculated to be 0.45, is stored in the variable my_at_content, then printed. However, the output for the next two calls is not so great. The call at line 10 produces a number with way too many figures after the decimal point, and the call at line 11, with the input sequence in lower case, gives a result of 0.0, which is definitely not correct:

0.45

0.5294117647058824

0.0

We'll fix these problems by making a couple of changes to the get_at_content function. We can add a rounding step in order to limit the number of significant figures in the result. Python has a built-inround function that takes two arguments – the number we want to round, and the number of significant figures. We'll call theround function on the result before we return it. And we can fix the lower case problem by converting the input sequence to upper case before starting the calculation. Here's the new version of the function, with the same three function calls:

def get_at_content(dna):

length = len(dna)

a_count = dna.upper().count('A') t_count = dna.upper().count('T') at_content = (a_count + t_count) / length return round(at_content, 2)

my_at_content = get_at_content("ATGCGCGATCGATCGAATCG") print(str(my_at_content))

print(get_at_content("ATGCATGCAACTGTAGC"))

print(get_at_content("aactgtagctagctagcagcgta"))



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.