BIOL5420 - Molecular Biology
- professor
Week 1
Lecture 1
Course Organization
Two components
Analytical approach
Experimental system
Genetic analysis modules 1-5
Mutations & screens
Molecular & atomic genetics
Recombination & transposition
Suppression, synthetic lethality & epistasis
Genomic approaches
Experimental systems 1-5
lac repressor
Phage λ
Transcription
Translation
Signaling & Network
Molecular Biology
Lecture 2
A Brief Overview of Genetics Molecular & Cloning Techniques
Genes
Heredity
Phenotype
Alleles
How do you study gene control?
Genetic Terms
Week 2 - Mutations
Reading 2.1 - [cite:@De_Lucia_1969]
E. coli strain with a mutation affecting DNA polymerase was a temperature sensitive strain.
False, it was UV sensitive.
What was the likely purpose of N-methyl-N’-nitro-N-nitrosoguanidine (MNNG)?
Induce mutations across the E. coli genome
What might be the best conclusion of the paper?
Fully viable and replication competent E. coli strain was identified and was shown display less than 1% activity of Kornberg’s DNA polymerase activity found in wildtype
TODO Lecture 3
Sperm mutations
slide 19
synthetic lethal genetic screen
Discovery of mutagens
Ames test
Genetic-biochemical analysis of DNA Polymerase discovered by Kornberg
Why were several thousand colonies needed to be screened?
What if no clone had been identified?
What if there are no other mutations in the mutant?
*
Reading 2.2 - [cite:@Pausch_2020]
The article reports a discovery of Casϕ with a significant sequence identity to type V CRISPR-Cas proteins that have been documented thus far.
False
Processing of pre-crRNA for the CRISPR-Casϕ system is most likely performed by an endogenous RNase III in the host cells.
False
Elution profile from a size exclusion column was used to provide evidence for which of the following?
RuvC-inactivating mutations did not mis-form Casϕ
TODO Lecture 4
homologous recombination repair
RAD(radiation repair genes)
zinc fingers(ZFN)
CRISPR
Week 3 - Lac Repressor
Reading 3.1 - [cite:@Pardee_1959]
Which chemical was used for measuring beta-galactosidase activity in the studies?
ONPG
What was the purpose of streptomycin and T6 phage in mating D that generated the data for Figure 4?
To inhibit protein synthesis and remating in the ♂ bacteria
The “inducer” model, in contrast to the “repressor” model, described by the authors is most accurately described by the which of the following:
Presence of an inducer in the constitutive and inducible organism that is destroyed by the i+ gene product
Lecture 5
genetic screens - basic structure of genetic screens
genetic approaches
How to do a genetic screen
Model organisms
People have already streamlined the use of these that's why we use them.
Thomas Hunt Morgan
Fly Boys
Nobel Prize in Physiology or Medicine 1933
recombination
Hermann Müller
When do screens fail
Type I - False Positives
How do you know the phenotype is caused by that gene?
Type II - False Positives
redundancy - multiple genes responsible for the phenotype
penetrance - incomplete penetrance
pleiotropy - one gene can affect multiple phenotypes
crosses and recombinants – E. coli
TODO Bacterial genetics terms
39:00
plasmids
TODO Blender Technique
gene mapping
bacterial transcription and control
Transcription In E. coli
Steps
TODO E. coli RNA polymerase - ~ 55 minutes
Sigma factor - Transcription Regulator
constitutive and inducible systems
adaptation, propagation, evolution
Reading 3.2 - [cite:@gilbert_muller-hill_1966]
The is mutant is used as a positive control for the experiments to show that the i gene encodes a repressor that binds the inducer.
False
The it mutant was identified in a screen for cells that is able to produce beta-galactosidase with a very low IPTG concentration but grow in TONPG which is toxic to cells that are constitutively expressing beta-galactosidase.
False
Which of the following does not support the conclusion that the lac repressor is a protein?
Material that binds IPTG sediments at 10S
Lecture 6 - Lac Repressor
lactose metabolism
IPTG
LacZ
Jacques Monod
Francois Jacob
[cite:@Pardee_1959]
z- and i- Means their missing the gene
FIG. 4. Enzyme formation in mating D
[cite:@gilbert_muller-hill_1966]
TABLE 1
Shows the concentration of C14 IPTG
FIG. 1
Got the IT alelle
To demonstrate the binding
FIG. 3
it
The isus is a negative control because it no longer makes the inducer
The Figure is showing that it no longer produces that
TABLE 2
Able to compare different inducers
IPTG binds extremely well
Nobel Prize in Chemistry 1980
Negative control of lac operon by the respressor
Postive control of lac operon
Week 4 - Lambda Repressor
Reading 4.1 - [cite:@Ptashne_1967]
Which of the following was not an approach employed by the author to label the lambda repressor specifically?
Use temperature sensitive mutant C1ts857 to label the repressor
What might be the reason for C1tsU32 and C1tst2 mutants not showing an H3 labeled peak in DEAE chromatography?
Both are more insoluble than c1ts857
What is the purpose of the amber mutant C1sus10 in the experiments?
It is used as a negative control for repressor synthesis
Lecture 7
[cite:@Ptashne_1967]
Mark Ptashne
Experiment Explanation
λ Phage
cI
cro
System Used to Study
Positive control mutants define activating region whose basic function is to recruit RNA polymerase (RNAP)
cI dimers cooperatively bind to OR1 and OR2
How does cI regulate lysogen
TODO cI Mechanism of action
cro Promotes Lytic fate
cII Controls the lytic-lysogen decision stabilization of cII results in the turning off of cro
Reading 4.2 - [cite:@Cui_2013]
What is not true about the enhancers?
Needs CAT protein to function over distance
Which statement is true based on the data presented in Fig. 2(C)?
UP element increased PRM activity in the presence of C1
UP element likely functions by
Interacting with the alpha subunit of RNAP
screenshot
TODO Lecture 8
enhancer
Models
[cite:@Cui_2013]
reporters
Phage DNA injection can be slow and heterogeneous
Week 5
Reading 5.1 - [cite:@Keegan_1986]
Based on the results in Table 1, which of the following is not supported by the data?
All GAL4 fusion with Beta-gal exhibited some basal beta-galatosidase activity in addition to the GAL1 reporter activity
What is the purpose of the lane b in Figure 4? Select the best answer.
A negative control to show that lambda repressor does not binds UASG
GAL4(147-881) is not required for binding to UASG but is required for activation of GAL1.
True
TODO Lecture 9
TODO Nature of Mutations
TODO Basic Types of Mutations
TODO Mutations Induced by Common Mutagens
Molecular genetics
TODO Molecular analysis - domain bashing
TODO [cite:@Keegan_1986]
TODO
Reading 5.2 - [cite:@Hecht_1983]
Lambda virulent phages used in the study are different from wildtype lambda phages in that
They harbor mutations in various operator sites for lambda repressor
Which of the following is not true about thermolysin analysis of lambda cI mutants?
Identification of enhanced denaturation of weak solvent exposed cI mutants
Asn77 to Thr77 mutant is a pseudo-revertant, because threonine is similar in chemical property as serine which is the residue in wildtype cI.
True
TODO Lecture 10
Week 6
Reading 6.1 - [cite:@Gaiano_1996]
TODO Lecture 11
Transponsons
Barbra McClintok
LINE-1 retrotransposons
Transposons as genetic tools
inverse PCR to identify integrated/transposed site
[cite:@Gaiano_1996]
Nancy Hopkins
Conditional degrons
Exam 1
Following section is for Q1-Q5: The diagram below illustrates an experiment in which a lysogenic ‘male' donor strain of E. coli transfers its chromosome to a ‘female' recipient by the process called conjugation. The outcome is that the lambda prophage is induced as soon as it enters the recipient cell (a phenomenon termed zygotic induction, originally called ‘erotic induction' by its French discoverers). The lambda repressor cI is illustrated as a dumbbell.
file: media/image1.jpeg
Q1. What is the molecular explanation for zygotic induction?
When the DNA is transferred, the recipient cell lacks the lambda repressor(unlike the lysogenic donor), so the cro protein is immediately transcribed due to the lack of the repressor.
Q2. What is the outcome if the cross is reversed (that is, a non-lysogenic donor transfers its chromosome to a lysogenic recipient)?
The recipient cell would not undergo lysis because it has only gained a normal chromosome and that won't induce the lambda prophage because it already has sufficent cI built up.
Q3. What is the outcome if both the donor and the recipient are lysogens?
The donor would continue to be lysogenic. The recipient would also be lysogenic but might be more susceptible to undergo lysis because it does not have sufficent cI to repress the new copy of the PR and PL promoters.
Q4. What is the outcome if the recipient expresses lambda repressor cI from a plasmid constitutively(at a constant rate)?
The recipient would not become lytic because the excess cI will be able to repress the new copy.
Q5 What is the outcome if the recipient expresses cro from a plasmid constitutively?
The recipient will become lytic because the cro will ensure the lytic cycle occurs by block the PRM promoter.
Following section is for Q6-Q8: In an activation complex such as the one shown below, it is typically the case that only one monomer of the activator, and only one C-terminal domain of the alpha subunit of RNA polymerase is required for activation.
file: media/image2.png
Q6. Since one activator monomer is sufficient for contact with RNAP, what is a possible reason for the activator being dimeric?
While only one activator monomer is sufficient, the dimeric could create a stronger bond with the RNAP, a positive cooperativity. If one of the monomers binds to RNAP, it could cause the second to have a stronger bond
Q7. In the diagrammed scenario, a DNA sequence near the activator binding site interacts with the alpha subunits of RNAP and is required for an optimal level of expression. From your readings, which element does this sequence most similar in function?
Sigma factor, the bacterial Transcription Factor (TF) that that neables specific binding of RNAP to gene promoters in bacteria.
Q8. Describe a genetic approach that would allow you to identify amino acids in the activator that make contact with the RNAP alpha subunit.
Create mutants of the activator with different deletion mutations that cause changes in the amino acids.
Test the activator mutants ability to activate a reporter gene such as lacZ
The mutants that have a lower β-galactosidase activity are those with the amino acids that are necessary for activator activity
Following section is for Q9-Q12: There is a gene y that is controlled by a gene x in a newly discovered fungal species. The gene x encodes a 100 amino acids long transcriptional regulator. To study x better, you make several deletion mutants that can be expressed constitutively from a plasmid. The plasmid is pX with one of the following deletions:
file: media/image3.wmf
pX-FL is capable of expressing a full length protein x, while pX-A through pX-G express various deletion mutants of x.
You generate a strain of yeast carrying a reporter plasmid with the promoter of the gene y fused to the lacZ gene (pPr-Y-LacZ). To this strain, you introduce various versions of pX deletions and determine the β-galactosidase activity (in Miller units).
x deletion mutants | β-gal activity |
---|---|
no x | 200 |
FL | 2,000 |
A | 20 |
B | 200 |
C | 200 |
D | 200 |
E | 10,000 |
F | 20 |
G | 200 |
Q9. What can you infer about the functional domain organization of the transcription factor x? Domain A, and F is very vital(Perhaps they code for specific DNA binding sites) Domains B, C, D are vital(Perhaps they reduce the confirmation of the protein) Domain E is inhibitory to the functioning and the lack of it increases the effectiveness
To investigate the function of each potential functional domains better, you generate x fusion protein to the GAL4 DNA binding domain (GAL4DBD) and use UASG-LacZ reporter to assess the activity GAL4DBD-x fusion proteins. “1-20” indicates amino acid residues included in the fusion for x(1-20), similarly for other x(##-##) fusions.
x fusion | β-gal activity |
---|---|
GAL4DBD | 20 |
GAL4DBD-x(1-20) | 20 |
GAL4DBD-x(21-40) | 0 |
GAL4DBD-x(41-60) | 20 |
GAL4DBD-x(61-80) | 20 |
GAL4DBD-x(81-100) | 1,000 |
You also generate x fusion protein to GAL4 activation domain (GAL4AD) and use the reporter plasmid pPr-Y-LacZ to determine the activity of the GAL4AD-x fusion proteins.
x fusion | β-gal activity |
---|---|
GAL4AD | 20 |
GAL4AD-x(1-20) | 20 |
GAL4AD-x(21-40) | 20 |
GAL4AD-x(41-60) | 20 |
GAL4AD-x(61-80) | 1,000 |
GAL4AD-x(81-100) | 20 |
Q10. Which region might be an activation domain of x?
61-80 because it has the highest B-gal activity with GAL4AD
Q11. Which region might be a DNA binding domain of x?
81-100 because it has the highest B-gal activity with GAL4BD
Q12. What might be the function of the region corresponding to 21-40 amino acid residues of protein x?
It has no binding activity, as seen by the GAL4BD experiment. It is a weak activation domain because of the GAL4AD
Week 7
Reading 7.1 - [cite:@Young_2019]
Which gene was used to regulate the expression of potentially deleterious proteins from the library?
lacI gene was expressed
How was the library constructed to express potential RNA polymerase subunits?
Yeast genomic DNA was fragmented with EcoRI restriction enzyme and inserted into lambda gt11 DNA
How was genes for RNA polymerase II identified from the lambda gt11 library?
By identifying clones that expressed proteins that bound the available RNA polymerase II antibody
What might be the benefit of using an expression library constructed from genomic DNA rather than cDNA?
wrong? genomic DNA contains introns
each gene is more likely to remain functionally intact in genomic DNA library
TODO Lecture 12
Reading 7.2 - [cite:@Berger_1992]
How was the genetic screen for ADA (alteration/deficiency in activation) genes performed?
Screening for mutants that can grow despite high toxic levels of GAL4-VP16
ADA2 is an essential adaptor protein for all acidic transcription activation domains.
False
We conclude that ADA2 potentiates the activity of one class of acidic activation domain but not a second class.
ADA1 was not studied further, because ADA1 mutant was found to reduce the level of GAL4-VP16
True
The ada 1 mutant was considered a poor candidate to encode the adaptor, because it apparently relieved inhibition by reducing the level of synthesis of GAL4-VP16.
Which of the following is not true about the SPT genes?
SPT genes work like ADA2 as an adaptor to link activators to general Transcription Factor (TF)
The properties of the ada mutant are different from spt mutants, including GALll.
TODO Lecture 13
Week 8 - Epistasis
Reading 8.1 - [cite:@Thompson_1993]
Cells lacking both SRB2 and SRB5 have a more severe phenotype than cells lacking only SRB2 or SRB5 alone.
False
Of the 85 independent suppressor mutants identified, what fraction were dominant mutants?
1/3
Both SRB2 and SRB5 are required for activated transcription, but not basal transcription in vitro.
False
In transcription assays using extracts from cells lacking SRB5, basal and activated transcription required addition of recombinant SRB5 and SRB2 together.
True
What is not a G-less cassette?
A Part of DNA template used for in vitro transcription assays
Two different versions were used – one long and one short
Template does not encode Glycine
Template is engineered so that guanosine triphosphate is not required for generation of a full length transcript
TODO Lecture 14
carboxyl-terminal domain (CTD)
[cite:@Thompson_1993]
Both SRB2 and SRB5 are required for preinitiation complex formation
Mediator Complex
RNAP-Mediator complex
Transcription activation requires many distinct steps
Genetic Analysis IV - suppression, synthetic lethality, epistasis
Questions to ask about mutants
Epistasis
Reading 8.2 - [cite:@Cooper_2006]
Which of the following is not a known mutation in the alpha-synuclein gene that have been linked to Parkinson’s disease?
Deletion of the alpha-synuclein gene
Which of the following was not a rationale for measuring degradation rates of CPY* and Sec61-2p?
WRONG Sec61-2p and CYP* degradation are differentially blocked by alpha-synuclein overexpression.
CYP* degradation was faster in cells overexpressing A53T alpha-synuclein mutant than in cell overexpressing wildtype alpha-synuclein.
GYP8 is a suppressor of toxicity due to alpha-synuclein by activating the GTPase activity of Ypt1p.
WRONG True False
TODO Lecture 15
Possible double mutant phenotypes
Classes of second site suppressors
Suppression by a complementary mutation in the second gene product at the interaction interface
Bypass Supression
Intragenic supression
Supressors provide insight into gene function
Week 9 - Translation
Reading 9.1 - [cite:@Lee_1993]
lin-4(e912) mutation is epistatic to lin-14 null.
lit/papers/biology/lee1993.org
True
False other way around
C to T transition at nucleotide position 517 found in ma161 mutation was experimentally validated to disrupt basepairing between lin-4 and lin-14.
True
False
Which of the following is not one of the possible mechanisms listed by authors for how lin-4 may regulate lin-14?
lin-4 could modify lin-14 mRNA to indirectly inhibit translation
lin-4 can bind to lin-14 in cytoplasm to inhibit lin-14 translation
If lin-4 RNA inhibits lin-74 translation by an antisense inter- action with the/in-74 3’UTR,
lin-4 RNA may bind to lin-14 mRNA in the cytoplasm and inhibit its translation
lin-4 can sequester lin-14 within nucleus to inhibit translation
lin-4 could inhibit LIN-14 protein synthesis indirectly
lin-4 likely control the stability of lin-14 mRNA
lin-4 RNA probably does not control the stability of lin-14 mRNA
How long is lin4S?
lin-4S, is approximately 22 nt in length.
WAIT Lecture 16
Nobel Prize in Chemistry 2020
Nobel Prize in Physiology or Medicine 2009
[cite:@Findlay_2018]
Translation Control
mRNA maturation
Steady State mRNA levels vs transcription rate
Synthesis(transcription) vs Degradation(miRNA, mRNA stability)
Translation
substrate
pre-mRNA processing and quality control and mRNA export
mRNA stability and half life
translation control
miRNA
RNA binding proteins
protein localization
split genes: fundamental eukaryotic specialization
Introns
Nobel Prize in Physiology or Medicine 1993
Caping and other modifications
RNA processing takes place during, not after synthesis.
Therefore, it is co-transcriptional rather than post-transcriptional.
Pre-mRNA processing is coupled to transcription and involves the carboxyl-terminal domain (CTD) of the larger subunit of RNA polymerase II
co-transcriptional processing of RNA
Splicing
Spliceosome
TODO RNA sequences and factors in regulated splicing
exon definition
variable rate of transcription elongation across the genome
kinetic coupling of RNA processing with transcription (Kornbhlitt)
nucleosomes are known barrier to transcription, as well as transcription factors
RNA polymerase pauses at exons
exons exhibit enrichment of histone and DNA methylation - modifying enzymes migrate on RNAP
strong splicing sites are enriched in exons that deviate from 147bp average
most exons are efficiently spliced co-transcriptionally
some regulated splicing of exons can be very slow and are spliced post-transcriptionally
TODO local Global Run-on Sequencing (GRO-seq) pattern around exon
TODO regulated splicing in the context of chromatin
Polyadenylation
Functional consequences of alternative Polyadenylation
nonsense mediated decay - post nuclear export RNA surveillance
Nobel Prize in Physiology or Medicine 2006
[cite:@Lee_1993]
Victor Ambros
Question that Computational Biologist never talk about
Without knowledge of mRNA and sequences, we have no idea where the genes are.
No way to predict exons from introns
Reading 9.2 - [cite:@Gerber_2006]
A reason for not using TAP-tagged full length Pum construct was
TAP-tagged RNA binding domain of Pum was sufficient to rescue pum mutant phenotypes
Expression of TAP-tagged full length Pum was toxic to Drosophila cells Toxic to e coli
TAP-HumHD can be expressed in a tissue specific manner using the GAL4-UAS system
Sequence of TAP-tagged full length Pum construct was not obtained in E. coli
Which of the following is not true about the tetranucleotide UGUA?
UGUA is a part of the core motif found in Pum mRNA targets
UGUA ubiquitously found in many mRNAs – 67% of all mRNAs analyzed
UGUA is enriched more than two fold in Pum mRNA targets
Core motif containing the UGUA is moderately enriched in Pum mRNA targets
The assay presented is unable to uncover all relevant Pum mRNA targets because targets and nontarget mRNAs may associate and disassociate during the affinity purification steps.
True
TODO Lecture 17
Extensive Association of Functionally and Cytotopically Related mRNAswith Puf Family RNA-Binding Proteins in Yeast
Pat Brown
[cite:@Gerber_2006]
Nobel Prize in Chemistry 2009
Week 10 - Signal Transduction
Reading 10.1 - [cite:@Velazquez_1992]
IGSF3 is dimerizes in response to interferon association with the receptor and sufficient to activate IFN-inducible genes.
False
Which of the following is not true about the 2fTGH cell line used?
It is derived from a hypoxanthine phosphoribosyltransferase negative cell line, HT1080
It is viable in HAT media
It is not viable in 6TG media that has been supplemented with interferon
It contains a gpt gene that is under the control of an upstream region of the human Interferon /β inducible gene 6-16.
How was the mutagenized 2fTGH cells screened to isolate 11,1 cells?
Screened for their ability to grow in HAT media
Screened for their ability to grow in HAT media supplemented with interferon
Screened for their ability to grow 6TG media supplemented with interferon
Screened for their ability to grow 6TG
Screened for their ability to grow in HAT media supplemented with interferon
How was the complementation screen for the defect in interferon response in 11,1 cells carried out?
By transfecting human genomic DNA into 11,1 cells and screening for transfected 11,1 cells that grew in HAT media plus interferon
Which of the following is not true about the K6 clone from DDD3-1,5?
K6 clone restored interferon response in 11,1 cells
Lecture 18
Signal Transduction
critical for cells to adapt to its environment
critical for organisms to coordinate development and differentiation
Two-component regulatory system
Basic Components of Signaling
Nuclear Receptor Signaling
JAK-STAT Pathways
1992 - First IFN response mutant identified as Tyk2 by molecular complementation
[cite:@Velazquez_1992]
Interferon Signaling
various mutations in pathways
what would happen to interferon response, if you introduced a mutation in tyk2 that rendered it constitutively active?
would it be recessive or dominant to wildtype?
what would happened to interferon response, if you introduced a mutation in the catalytic residue within the kinase domain?
what would happen if you over expressed (100 fold excess) this mutant in the presence of wild type tyk2? would this over expression mutant be dominant or recessive to the wildtype?
Y701 in STAT1 is phosphorylated by JAK1/TYK2
what would you predict if Y701 is mutated to alanine?
Similarly Y690 in STAT2 is phosphorylated by JAK1/TYK2
what would you predict if Y690 is mutated to alanine?
G-Protein coupled receptor signaling
Nobel Prize in Physiology or Medicine 1994
Nobel Prize in Chemistry 2012
G-proteins-coupled receptors (GPCRs)
Receptor Tyrosine Kinase
Nobel Prize in Physiology or Medicine 1986
Recurrent and Universal RTK Activation in Cancers
ERBB2, EGFR amplifications in breast and lung cancers and other
amplification leads to overexpression of these oncogenes
overexpression of these oncogenes drive tumorigenesis
targets for cancer therapy
Notch-Delta
Sonic Hedgehog
& Hedgehog (Hh) signaling pathway
Cyclopamine
[cite:@Visel_2009]
Reading 10.2 - [cite:@Tong_2004]
Synthetic Genetic Array analysis identified an equivalent number of genetic interactions as determined for protein-protein interactions
False
132 SGA screens involved performed by analyzing double mutants carrying one of 132 mutants in combination with 4700 other mutant genes (or about 620,000 double mutants).
True
Authors were able to generate a complete genetic network representing the global map of functional relationships between genes in the yeast genome.
True
Which of the following human diseases was not noted by the authors for relevant models for synthetic genetic interactions?
Cystic fibrosis
Retinitis pigmentosa
Bardet-Biedl syndrome
Retinoblastoma
Lecture 19
Networks
Biochemical and Molecular approaches for inferring gene-gene interactions
protein-protein interactions:
proteomic or mass spectrometric approaches
2 hybrid assay
Co-immunoprecipitation assay
protein-DNA interactions
gel shift/mobility shift assays
1 hybrid assay
reporter assays
protein-RNA interactions
immunoprecipitation (CLIP-seq - crosslinking is now standard)
3 hybrid assay
RNA-RNA interactions
RNA antisense precipitation (RAP - crosslinking is now standard)
genetic validation
mutation of the binding site
suppressor mutation on corresponding binding domain
Network Analysis
the goal of genetic analysis is to elucidate all genes underlying a given phenotype (disease)
by identifying and linking all genes in the process, you are building a network (pathways) of genes responsible for the phenotype
genetic approach has the been the traditional way to build networks
big data in biology is enabling network inference from massive biological information
massive data across many conditions, tissues and individuals/organisms
genomic data
transcriptomic data
proteomic data
functional genomic data
Networks are composed of nodes and edges
In biological networks (graphs)
Nodes (vertices) represent genes
Network can be undirected, directed or weighted
Thickness of the edges is the strength of the interaction
Benefit of network analysis
disease phenotype is a result of complex interaction of genes and environment
identify all genes that are involved in disease
designing drugs need to consider the network of genes that are linked to the potential targets
disease genes do not function alone and can be affected by other disease genes
Systematic Genetic Analysiswith Ordered Arrays of YeastDeletion Mutants
Budding yeast deletion library
PCR amplify homologous arms for the targeting cassette
transform the targeting cassette into yeast
select for positive recombinants (G418, Geneticin)
array the strain into microtitre plates for manipulation, storage and distribution
http://www-sequence.stanford.edu/group/yeast_deletion_project/deletions3.html
[cite:@Tong_2004]
notable observations
genetic interactions are 4x more abundant than protein interactions
degree of separation between any two genes - 3.3
small world network
essential genes are connected to many genes and are hubs
disease genes tend to be peripheral genes
Gene Ontology (GO)
GWAS
studies have identified many disease genes through association of common sequence variants with disease genes
for many complex disease, associations have yielded many common variants with low effect
Crohn’s disease associated variants are analyzed with network visualization
TODO An Expanded View of Complex Traits: From Polygenic to Omnigenic
Evan A. Boyle
Week 11 - Genomics
Reading 11.1 - [cite:@Findlay_2018]
Lecture 20
[cite:@Findlay_2018]
Exam 2
WAIT means internet
1. What might be the differences between genomic and cDNA sequence of a gene in yeast?
cDNA would not have all the necessary noncoding regions, the introns and UTR regions. That might make the cDNA not able to create spliced RNAs using alternative splicing.
2. If you want to express a human gene in E. coli to produce a recombinant version of the protein, how would you accomplish this?
Through DNA cloning. The target gene would be inserted into a circular plasmid, and the plasmid introduced through transformation, and selected using an antibiotic screen. The protein could then produced by inducing the expression of the gene, depending on the plasmid.
3. Which region of the eukaryotic RNA polymerase is critical for pre-mRNA processing?
The carboxyl-terminal domain (CTD)
4. You are studying a neuronal gene whose product is localized to the synapse. You delete a part of the last exon of the gene that corresponds to the most of 3' UTR of the mRNA in the neuronal cells. You stain the mutant and wildtype cells for the protein and found that mutant cells show the protein localized to neuronal cell body, while the wildtype cells show the expected distribution with the protein localized to synapses. What can you infer about the mutant that you have generated?
The deleted region is important for the protein transmission. This could be caused by the lack of polyadenylation, and the stability of the mRNA, preventing it from being translated effectively.
Questions 5-9 is based on your reading [cite:@Velazquez_1992].
5. The authors isolated the Cosmid 3D clone that restored the interferon response in 11,1 cells. It contained the gene for TYK2. This clone was recovered from a Cosmid library containing fragmented genomic DNA from DDD3-1,5 cells. A typical Cosmid plasmid has a cargo limit of 40-45 Kb of DNA sequence. What might be the size of the TYK2 gene in the human genome? What might be a result for the screen if the TYK2 gene was 100Kb long?
The gene would be around 40-45 Kb in length because that's the size that could fit in the Cosmid plasmid. If it were 100Kb long the gene wouldn't be able to fit in the Cosmid plasmid so it wouldn't have been able to restore the interferon response in 11,1 cells. So the screen would have been the attempted rescued 11,1 cells would grow in 6TG and 6TG+IFN but not in HAT, HAT+IFN.
6. What would happen to interferon response in 11,1 cells, if you introduced the Cosmid 3D with a mutation in the TYK2 gene that rendered it constitutively active (specifically what would be the grow pattern in HAT and 6TG media with or without interferon)? Fill in the table with “growth” or “no growth”.
Growth HAT | Growth HAT+IFN |
---|---|
no growth 6TG | no growth 6TG+IFN |
The TYK2 would be unregulated by Interferon Receptor so it will cause the formation of ISGF3, which will lead to the expression of interferon stimulated genes(i.e. gpt)
7. Continuing on the previous question 5. You introduce the constitutively active TYK2 mutant into 2fTGH cells that have an intact wildtype TYK2. What would be the grow pattern in HAT and 6TG media with or without interferon for the 2fTGH cells with constitutively active TYK2? Would this mutant TYK2 be recessive or dominant to wildtype TYK2?
HAT No growth | No growth HAT+IFN |
---|---|
6TG Growth | Growth 6TG+IFN |
It wouldn't matter if there's a wild type because it's constitutively active so it would be dominant. The wild type would not render the cells to become normal. It will become unresponsive to interferon and not produce gpt in the presence of interferon
8. What would happen to interferon response in 2fTGH cells, if you introduced the Cosmid 3D with a damaging mutation in the catalytic residue within the kinase domain and expressed this mutant 100 fold excess of the wild type TYK2? Would this over expression mutant be dominant or recessive to the wildtype?
Same phenotype as the 11,1 cells, it won't be able to respond to Interferon Stimulation. It would be recessive though because the wild type copy will be able to compliment it.
9. In their screen to isolate 11,1 cells, they also identified 11,2 cells. However, multiple attempts to rescue the interferon response in the 11,2 cells have failed. Sequencing of the 6-16gpt reporter in the 11,2 cells found a nonsense mutation in the gpt gene. What can you infer about the 11,2 cells?
The interferon pathway would be functioning but the gpt gene would not be producing gpt. So it would grow in 6TG and 6+IFN because the functional gpt could not be produced regardless. The researchers were interested in the IFN pathway however, so the 11,2 cells wouldn’t be useful.
For questions 10 -- 12. Genes A, B, C and D define a metabolic pathway that you are investigating and the corresponding mutants a, b, c and d are null mutants. A series of enzymatic reactions convert the metabolite S to the metabolite P through intermediate alpha, intermediate beta and intermediate gamma.
Analysis of single and double mutants yielded following observations:
mutant | intermediate accumulated |
---|---|
a | alpha |
b | gamma |
c | beta |
d | S |
ab | gamma |
ac | beta |
ad | S |
bc | beta |
bd | S |
cd | S |
10. Complete the table below by identifying which mutation is epistatic for double mutants.
mutant | intermediate accumulated | mutation epistatic |
---|---|---|
ab | gamma | b is epistatic to a |
ac | beta | c is epistatic to a |
ad | S | d is epistatic to a |
bc | beta | c is epistatic to b |
bd | S | d is epistatic to b |
cd | S | d is epistatic to c |
11. "Order the genes in the metabolic pathway using the double and single mutant analysis data."
ab gamma means b before a ac beta means c before a bc beta means c before b cd S means d before c
d -> c -> b -> a
12. Order the intermediates in the metabolic pathway.
beta accumulates if C is mutated gamma accumulates if B is mutated alpha accumulates if A is mutated
S -> beta -> Gamma -> Alpha -> P