ࡱ > C E B [ bjbj 4$ ΐ ΐ 8 ' C _ u f f f ` b b b b b b $ { n f f f f f f ` f ` @=N# * L 0 * r f f f f f f f f f f f f f f f f f f f f f f f : 4Y03 Genomes and Evolution - Assignment 1 Total number of marks 50. Worth 15% of total course 1. Database Awareness Find the following biological databases on the web. Have a look at them and familiarize yourself with them. For each one, write a short paragraph describing the purpose of the database and the kind of information that it contains. In what situations might a biochemist might want to look up information in the database? (a) GenBank; (b) OMIM; (c) UniProt; (d) PDB; (e) KEGG; (f) Stanford Microarray Database (1 mark for each part above) (g) Any other biological database of your choice - explain why you chose it and why you think it is interesting. (4 marks) 2. Counting substitutions The sequences on the following page are part of the Cytochrome Oxidase Subunit I gene sequence (known as COX1 or CO1) from the mitochondrial genome of Gorilla, Human and Dog. There are no insertions and deletions in this region. The gaps have been put into the alignment to indicate the positions of the codons. There are 60 codons shown here, but the full gene is much longer. (a) Find out about this gene and briefly describe the role of the corresponding protein. (2 marks) (b) Mitochondrial genomes use a slightly different genetic code from the standard code that is used by almost all eukaryotic nuclear genomes and almost all prokaryotic genomes. Find out about alternative genetic codes (e.g. HYPERLINK "http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c" http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c ). What are the differences between the vertebrate mitochondrial code (used for the genes above) and the standard code? (2 marks) (c) Consider, firstly, the human/gorilla pair, and secondly, the human/dog pair. For each pair, how many substitutions have there been at 1st, 2nd and 3rd positions? Comment as fully as you can on what these numbers tell us about the process of molecular evolution. (4 marks) (d) For both human/gorilla and human/dog, how many non-synonymous substitutions have there been? For each non-synonymous substitution, list the pair of amino acids that have been exchanged. What can you say about the physico-chemical properties of these pairs of amino acids? Is there any evidence that amino acid substitutions are conservative? (4 marks) (e) For both human/gorilla and human/dog, what proportion of substitutions are non-synonymous (as percentages of the total number of substitutions for that pair). Which of these two percentages is higher? Can you give an argument to explain why this should be so? (4 marks) (f) How many sites are there where human and gorilla have the same base but dog is different? How many sites are there where human and dog are the same but gorilla is different? What do these numbers tell you about the relatedness of the three species? (This is linked to the Maximum Parsimony method of constructing phylogenies that we will discuss later). (4 marks). 3. Human Origins These questions refer to the paper Mitochondrial genome variation and the origin of modern humans by Ingman et al. - available on the course website. You may have to do some background reading to answer some of the questions. (a) Explain what is meant by the recent African origin hypothesis, and describe the main reasons why the data in this paper support this hypothesis. (5 marks) (b) What is linkage disequilibrium? How would recombination affect linkage disequilibrium? Comment on what is shown by Figure 1, and explain why this is relevant for the present paper. (5 marks) (c) What quantities are being plotted in Figures 3a and 3b? At the top of p711 it says that the distribution of pairwise differences for the Africans is ragged, whereas for the non-Africans, it is bellshaped. Can you explain why this should be so? (5 marks) (d) What are the D-loop and control region? In what ways are these different from the rest of the mitochondrial genome? (5 marks) PAGE 1 * + A B C D F G ` b e w x y ` a s y / 0 C D E Z i 꺲 hn hyjT j hyjT UhJ h ! hyjT h!~ hZ >*hZ hiD h~\ h#w hY hHL h(T[ h!~ hHL >*hK h h*R hc3 hp' h h h 5>*8 * + G a b x y / 0 B C E F G $a$gdZ $a$gd#w gdp' B z F P V v ! ( ) . $ + 4 Z f h$C j h$C UhK hJ hiD h; h h; hiD 6hyjT hiD >*hiD hc3 hJ h h h!~ h hp' h) hZ 0J j hZ Uh!~ hZ hZ j hZ U 5 ) * 5 6 $a$gd#w hK hJ h*R 0J mH nH u hP: 0J j hP: 0J UhP: h$C j h$C U $a$gd#w 2 1h:pY / =!"#$% D d -PCC 0 # A b ^ 0o88O} : D n2 0o88O}PNG IHDR sRGB@} 0PLTE { IDATxK(ف[2e^@pȷK:9 H~_"tih6#mF@یm3fh6#mF@یm3fh6#mF@' ;Ѹ4kǣ/unu' ln;3soޡq8&Sޥ8-p??Ϛ~5g @zutN=Nߍ[OK[~ǣI3fh ۰ }y,R2)նMF=`@#pH;@p6$"r~K7 ?aG $s {#iP#"cRCS~A]}²,f|}~_~K)`.uEcxG_1Cض9PhPu@ @:ts/?Aߥ|cv=G.YT09ն5>90$>}uBz|;81~#>|X}|^|vŏqtRYuI1>Lm1L57:& tOX8Dw:pbY"lazÆr߆ ƜʰmH+a|P.'tzCk|,_r xa Ñs ˠ~!${".-v B^~%s90"P'3pȮPHO|Iu 8v+}Cy?y]bX_0X%W@bkO|sB.Dzrt M7ٗ2qQBmP/^tjې嫃ݻP|U.<ɽKuIƲK|_zozE ..TpK-?|˰bYN#Oii"[{0'C#`X NH MjHQ,q~>OjF\Cjd?'/<=b[#aK#sFVzF&Yjdi$.;&5p`Yc/kdL;% _adnQ#k[ѥjg52,z_FFt@2&4@,kdh42B ܯa\#¸id zjdCCL">'42V6ȼ0.OB`05r|`Q,1zg5
Ff/+45FFB8B|ǒkdC{>U?EC١FO `ejd l[ׁU,)JT( 52yvD#Xqkdu{sY{YR