Digital Analysis of the Genome

  1. A major shock to emerge from the completion  of the human genome sequence was the discovery of only about ______ genes. Of these genes, roughly 21,000-23,000 encode proteins, while the remainder are transcribed into _____ that are not __________, such as _____ and _____ that participate in translation, and the _______ that function in spliceosomes
    • 25,000
    • RNAs
    • translated
    • rRNAs and tRNAs
    • snRNAs
  2. These numbers are much lower than expected. Estimates made before the initiation of the Human Genome Project had suggested that there might be _______ or more genes. Many of these estimates anticipated that because humans have much ______ biological complexity than simpler model organisms such as bacteria, yeasts, nematodes, and fruit flies, our genomes must have many more ____
    • 100,000
    • greater
    • genes
  3. The total length of genomes varies much more markedly over the course of evolution than the number of genes (compare the genomes of human, mice vs other simple euks like worms and flies). Why does this generalization hold?
    because the exome (the part of the genome corresponding to the exons of all known genes) constitutes only a small proportion of genomes, roughly 1.5-2.0% of the total in humans
  4. The vast majority of DNA sequences instead are found in _______, in the spaces between genes (______ regions), in ________ ______ that can move from one chromosomal position to another, and in structural features like ______ and _________
    • introns
    • intergenic
    • transposable elements
    • centromeres and telomeres
  5. The tremendous variation seen in the size of genomes of different species is thus mostly due to expansions and contractions of ________ DNA outside of the _____, rather than changes in gene ______ and ______
    • noncoding DNA
    • exome 
    • number or gene sizes
    • *exampleson pg 321
  6. Students sometimes assume that all the genes on a chromosome are transcribed in the same direction, always using the same strand of double-stranded genomic DNA as the template. Why is this wrong?
    Neighboring genes can be transcribed either in the same or in opposite directions with respect to each other, and either toward the centromere or toward the telomere with respect to the chromosome as a whole
  7. Why does RNA polymerase use the chromosome's "Watson" strand as the template for some genes, while for other genes, the template is the "Crick" strand?
    neighboring genes can be transcribed in opposite directions
  8. On average, the density of a gene in the human genome is slightly less than 1 gene in every 100 kb of DNA (25,000 genes in 3,000,000 kb genome); however, this rough value obscures the fact that the packing of genes can be very different in various parts of the _______. Some regions on some chromosomes are _____-rich, with little space between densely packed genes
    • genome 
    • gene-rich
  9. The most gene-rich region of the human genome is a 700 kb stretch of chromosome 6 that contains 60 genes encoding _________ proteins with diverse functions
    • histocompatibility
    • Image Upload 2
  10. Chromosomal regions that have many more genes than expected from average gene _______ over entire genome are gene-rich regions. Example in human genome is the class III region of major ________ complex. ______ ______ are regions of >1 Mb that have no identifiable genes and they make up 3% of human genome. Do they exist simply because the genes are hard to identify (e.g. big genes)?
    • density
    • histocompatibility 
    • Gene deserts
    • Some deserts are gene poor because they contain big genes whose nuclear transcripts spans 500kb or more of chromosomal DNA
  11. Genomes continually undergo many different kinds of DNA sequence changes that provide the raw material for ______ _______
    natural selection
  12. Genome annotation has revealed that exons often encode discrete _________ domains, each of which is a linear sequence of ______ _____ that folds up in ______-dimensional space so as to act as a _____ ______ ______
    • protein domains
    • amino acids
    • three-dimensional 
    • single functional unit
  13. Genes with multiple exons often encode proteins with multiple _______ analogous to the cars of a train. Each train is composed of multiple different cars, and each kind of car has a discrete function.
    domains
  14. Similarly, many genes are composed of multiple ______ that encode discrete protein domains. The shuffling, addition or deletion of exons during evolution can create new genes whose protein products have novel ____ architectures (different numbers and kinds of ____ in different orders) and thus can assume ____ roles in cells and organisms
    • exons
    • domain architectures
    • domains
    • new
  15. The fig. shows domains associated with various transcription factors, proteins that bind to regions of DNA such as enhancers that control the _______ of nearby genes. Exon ______ over evolution has produced different transcription factors with differing domains that enable these proteins to ______ particular DNA sequences and also to interact uniquely with _______ such as other proteins
    Image Upload 4
    • transcription 
    • shuffling
    • recognize
    • cofactors
  16. The mechanism of RNA splicing facilities this kind of exon rearrangement in euk genomes (aka creation of new genes) because _______ does not have to be precise. 
    Image Upload 6
    • reshuffling 
    • pg 323 for example
  17. Multigene families
    aka gene families, are groups of genes closely related in sequence and function; such gene families are abundant throughout genomes
  18. Examples of gene families include the genes that encode the hemoglobin that allow us to transport _______ in our blood, the _______ (antibodies) that help us ward off infections, and the ________ receptors critical for our sense of smell
    Image Upload 8
    • oxygen
    • immunoglobins
    • olfactory receptors
  19. With the use of bioinformatics, researcher can see that each gene family evolved by a process of ________ and _______ from an ________ gene. The two DNA sequence products of a duplication event, which start out _______, eventually _______ as they accumulate different ________
    Image Upload 10
    • duplication and divergence
    • ancestral 
    • identical
    • diverge
    • mutations
  20. The duplication and divergence process is critical for the creation of new raw material. Once a gene has duplicated, divergence allows either or both of the copies to assume new specialized but _______ functions, as long as one or both of the copies still fulfills the role of the _______ gene
    • related
    • original
  21. The genes in such families may be _______ on one chromosome or _______ on several chromosomes. In the case of the hemoglobin family, the α-globin gene cluster, aka the ______ _____, on chromosome 16 contains five functional genes, while the β-globin (aka _____ _____)  cluster on chromosome 11 also has five genes
    Image Upload 12
    • clustered
    • dispresed
    • α-globin locus
    • β-globin locus
  22. The sequences of all α-like genes are ______ similar to each other than they are to the β-like sequences, and vice versa. The β-like genes are exactly the same ______, and the five β-like genes have two _____ at exactly the same position; in fact the α-like genes also have two _______ at the same position
    Image Upload 14
    • more
    • length
    • introns
    • introns
  23. These comparisons suggest that all of the globin genes can be traced back to a single ______ DNA sequence. Hundreds of millions of yrs ago, this ancestral globin gene _____, and one copy moved to another ________. With time, one of the two copies gave rise to the ______  and the other to the ________
    • ancestral 
    • duplicated
    • chromosome
    • α-lineage
    • β-lineage
  24. Each lineage then underwent further ______ to generate the present array of α-like and β-like genes in humans. By comparing genomes of different organisms, it is possible to estimate when these various _____ events occured
    • duplications
    • duplication 
    • pg 324 for Chimp vs human example
  25. Orthologous genes (define) usually retain the same _______.
    Image Upload 16
    • genes in two different species taht arose from the same gene in the species' common ancestor
    • function
  26. The genes for the ε-globin in humans and chimps are orthologs (why?)
    The ε gene already existed in their last common ancestor
  27. By contrast paralogous genes (define) arise by _______
    Image Upload 18
    • paralogous gene: usually denotes the different members of a gene family
    • duplication
  28. The genes for δ-globin and ε-globin in the human β-globin locus are close _______, and both are more distant ________ of genes in the α-globin cluster
    Image Upload 20
    • paralogs
    • paralogs
  29. Homology (define)
    is a blanket term for all related sequences
  30. All hemoglobin genes in all species are thus _______, and all these genes share _____ homologies with myoglobin genes encoding more ______ related oxygen-carrying proteins in muscle tissues rather than red blood cells
    • homologous 
    • weaker
    • distantly
  31. The duplications that gave rise to multiple functional hemoglobin genes also produced genes that eventually lost the ability to ______. Molecular geneticists made this last deduction from data showing two additional α-like sequences within the α locus and one β-like sequence within the β locus that no longer have the capacity for proper ________
    • function
    • expression
    • Image Upload 22
  32. Pseudogenes
    sequences that look like genes, but do not function as genes (they occur in many gene families in all higher euks)
  33. Why do pseudogenes accumulate mutations at a faster rate than coding or regulatory sequences of a functional gene?
    They serve no function, so they are subject to mutation without selection
  34. Eventually, nearly all pseudogene sequences mutate past a boundary beyond which it is no longer possible to identify the ______ genes from which they have been derived. Continuous mutation can thus turn a once functional sequence into an essentially ______ sequence of DNA
    • functional
    • random
  35. The mouse and human genomes, which diverged 85 million years ago, exhibit striking similarities and differences not only in the _______ of individual genes, but also in the ______ of those genes on the chromosomes
    • sequences
    • order
  36. Syntenic blocks
    blocks of linked loci within a genome
  37. Within syntenic blocks, the order of genes is very ______  in humans and mice. However, the orders of these blocks along the chromosmes are totally ______ in the two organisms. It is as if one genome had been cut into 180 pieces of _______ size and then ______ assembled into the other genome
    • similar
    • different
    • varying 
    • randomly
  38. Conserved synteny (define) also exists between the human and puffer fish genomes , which diverged more than 400 million yrs ago. In this case the _____ _____ are relatively small
    • the same two or more loci are linked in different species
    • syntenic blocks
  39. What causes the apparent cutting and reassembling of chromosomal blocks accompanying evolution?
    translocations 
    inversions
    • Chromosomal rearrangements (two types):
    • translocations: connecct part of one chromosome to parts of a different, nonhomologous chromosome
    • inversions: flip a region of a chromosome 180° with respect to the rest of the chromosome
  40. The farther back in time two species last shared a common ancestor, the ______ chromosomal rearrangements accumulate in each separate lineage that will alter the order of genes. As a result, the avg size of syntenic blocks becomes ______ with _______ evolutionary distances between species
    • more
    • smaller
    • increasing
  41. Combinatorial amplification (define) can increase info and generate diversity. State two strategies
    • Combinatorial amplification: results  from potential for combining a set of basic elements in many different ways
    • Can be done by deletion of specific introns bringing certain exons together or by splicing together RNA exons in different orders
  42. T cells undergo ________ of DNA coding for antibodies. By _______ the V, D and J segments, there can be increased ________. Approximately a thousand combinations are possible
    Image Upload 24
    • rearrangements
    • recombining
    • variation
  43. Three neurexin genes, which encode proteins that help bind neurons at _______ illustrate ______ combinatorial strategies. Each gene has two _______ regions (producing α and β class mRNAs) and five sites at which ______ _____ can occur. Together, these three genes can probably generate more than 2000 ________ ______ forms of mRNA. Not all of the combos are ________
    Image Upload 26
    • synapses
    • both
    • promoters 
    • alternative splicing
    • alternatively spliced
    • functional
  44. GenBank database
    RefSeq
    • GenBank—
    • Database established by the NIH in 1982
    • Still the most widely used repository for sequence data

    • RefSeq—
    • Single, complete, annotated version of a species’ genome
    • Agreed upon standard for comparison
    • Maintained by the NCBI (http://www.ncbi.nlm.nih.gov)
  45. Suppose you have identified a gene (from Drosophila) and would like to know whether the human genome contains a homolog of this fly gene. Nam a tool you would use and what does it do?
    BLAST: (Basic Local Alignment Search Tool) allows you to find nucleotide or amino acid sequences related to any given nucleotide or amino acid sequence
  46. In this typical output of a BLAST search, we are looking for a human _____ that share ______ with the fly _______ of interest. The Query is the sequence you already know; here, the ______ ____ sequence of the fly ______ is written in the one letter code
    Image Upload 28
    • proteins 
    • homology
    • proteins
    • amino acid
    • protein
  47. The subject is the ________ sequence found by the BLAST program; in this case, the related human ______. The row between the Query and the Subject indicates the conserved amino acids, with a + symbol denoting ______ amino acid replacements (missense substitutions in which an amino acid is replaced by a different amino acid with similar chemical props)
    Image Upload 30
    • homologous 
    • protein
    • conservative
Author
chikeokjr
ID
335671
Card Set
Digital Analysis of the Genome
Description
Ch 9.6-end
Updated