Previous Article | Next Article ![]()
Eukaryotic Cell, January 2006, p. 62-76, Vol. 5, No. 1
1535-9778/06/$08.00+0 doi:10.1128/EC.5.1.62-76.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Pharmacology and Toxicology, Indiana University School of Medicine, Indianapolis, Indiana 46202,1 Department of Genetics, University of Georgia, Athens, Georgia 306022
Received 26 September 2005/ Accepted 25 October 2005
|
|
|---|
|
|
|---|
Histone acetylation is linked to the activation of gene expression either by attenuating the DNA-histone interaction and/or contributing to a "histone code," which proposes that patterns of histone modifications constitute a language read by other proteins to bring about a cellular response (43). A variety of histone acetyltransferases (HATs) have been identified in eukaryotes (40). The best characterized is GCN5 (general control nonrepressed), previously identified as a putative transcriptional adaptor in a yeast mutant screen for amino acid biosynthesis deficiencies (6). GCN5 HATs operate in multisubunit complexes (e.g., SAGA and ADA) to acetylate lysine residues in histone tails, preferentially those of histone H3 (40). GCN5 and ADA2, an adaptor protein that binds GCN5 as well as the acidic activation domain of transcriptional activators like GCN4 (2), are nonessential in yeast. However, each is critical for growth under starvation conditions (26, 36) or during temperature stress (31). Consequently, GCN5 has been linked to activating GCN4-dependent genes important to the cellular stress response, controlling
10% of yeast genes (3, 21).
The phylum Apicomplexa is home to protozoan parasites of tremendous medical and economical importance, including Plasmodium (malaria), Cryptosporidium (cryptosporidiosis), and Eimeria (coccidiosis). Toxoplasma gondii is an apicomplexan that parasitizes warm-blooded vertebrates, causing congenital birth defects and serious complications in immunocompromised patients (5). GCN5 HATs have been cloned in T. gondii and Plasmodium falciparum (16, 18, 47). Histone H3 is well conserved in these parasites, including each lysine targeted by GCN5 HATs in other species (45). However, in contrast to other GCN5s, the T. gondii homologue (TgGCN5) exhibits an unusual bias to acetylate only lysine 18 of histone H3 (H3 [K18]) (39). Whether another HAT existed in T. gondii that could acetylate the additional lysines in H3 remained unresolved.
The tachyzoite stage of the life cycle causes acute toxoplasmosis as the parasites rapidly invade and replicate inside nucleated cells. To address the role TgGCN5 plays in the context of acute infection, we attempted to disrupt the TgGCN5 locus in the virulent (type I) RH strain of T. gondii, which has largely lost its ability to convert to the bradyzoite cyst form (13). Here we report the successful generation of a TgGCN5 "knockout" clone that proved amenable to the identification of genes that may be regulated by this HAT. Using a mouse model, we also establish that the loss of TgGCN5 does not attenuate virulence. Finally, we identify a second GCN5 in T. gondii (TgGCN5-B) that targets multiple lysines in the H3 tail and clone two distinct ADA2 homologues that interact differently with the two TgGCN5s.
|
|
|---|
Generation of
TgGCN5.
Two
DASH-II clones used to map the TgGCN5 genomic locus (47) provided ample contiguous sequence for the construction of a knockout vector (Fig. 1A). An
6.0-kb fragment from p
GCN5Xho was cloned into pminHXGPRT (AIDS reagent no. 2855) at the XhoI site. An
6.0-kb SpeI-SmaI fragment from p
GCN5Bgl2 was cloned into the resulting vector at SpeI and a blunted XbaI site (lost). The resulting construct (
TgGCN5::HX) contains
6.0 kb of continuous genomic sequence on each side of a cDNA-derived HXGPRT minigene, allowing selection in RH
HXGPRT parasites (AIDS reagent no. 2859). Twenty-five micrograms of the
TgGCN5::HX vector was linearized with ScaI for transfection into 2 x 107 parasites by electroporation. Methods of transfection, selection in mycophenolic acid plus xanthine, and cloning by limiting dilution was performed as previously described (11). Clones were prescreened by PCR of genomic DNA using primers designed to amplify the TgGCN5 HAT domain; Southern analysis was used to confirm allelic replacement (Fig. 1B).
![]() View larger version (31K): [in a new window] |
FIG. 1. Generation of TgGCN5 parasite clone. (A) Schematic diagram comparing the endogenous genomic locus to the replacement allele contained on the TgGCN5::HX "knockout" construct. Black boxes indicate exons, mottled boxes indicate introns, and gray boxes indicate UTRs. In a perfect allelic replacement via a double crossover event, virtually the entire TgGCN5 coding sequence would be replaced by the HXGPRT minigene cassette. Striped boxes denote the dihydrofolate reductase promoter and 3' UTR regions. B, BglII; S, SpeI; Sm, SmaI; X, XhoI; Xb, XbaI. (B) Southern analysis verifies allelic replacement. Probe 1, a cDNA-derived portion of the HAT domain hybridizes to the expected size fragments in wild-type (WT) genomic DNA digested with indicated restriction enzymes. However, probe 1 does not hybridize to genomic DNA from the TgGCN5 knockout clone (KO). Probe 2, designed to a portion of the final exon plus 3' UTR, hybridizes to DNA fragments of the expected size in both WT and KO parasites. H, HindIII; X, XhoI. (C) TgGCN5 transcript is absent from TgGCN5. Primers were designed to amplify a portion of the HAT domain of TgGCN5 using total RNA harvested from either wild-type (WT) or "knockout" (KO) parasites. Primers designed to amplify a control transcript (adenosine kinase) validate the RNA purification from each sample. CNTL, control.
|
108 freshly lysed filter-purified tachyzoites from wild-type RH or
TgGCN5. After normalizing samples by Bradford assay, first-dimension resolving of proteins was conducted using the Bio-Rad PROTEAN isoelectric focusing cell and ReadyStrips (pH 3 to 10) for 120,000 Vh. Strips were then equilibrated for 20 min in buffer containing 6 M urea, 375 mM Tris-HCl (pH 8.8), 20% glycerol, 2% SDS, and 2% dithiothreitol (DTT). Strips were alkylated with 2.5% iodoacetamide in the same buffer minus DTT for 20 min, washed twice in 1x morpholinepropanesulfonic acid (MOPS), and resolved on Bio-Rad Criterion gels for 3 h at 120 V. Each sample treatment was performed in triplicate. Gels were fixed and stained with Coomassie brilliant blue R250 and scanned using the Fluor-S MAX MultiImager system and analyzed with PDQuest software (version 7.1.1; Bio-Rad). The background values for the gels were subtracted, and the spot peaks were located and quantified. Both the total optical density and the total number of spots correlate to the total concentration of each protein expressed. As a result, each protein's quantity was defined as parts per million of the total integrated optical density. After a reference pattern was prepared, the image analysis software was used to create a match set encompassing all gels of interest and to perform gel matching. Each gel in the match set was matched to the reference pattern, with landmark proteins (those uniformly expressed in all of the gels) used to quickly and accurately match all gels. The abundance of individual proteins was calculated using a quantitative analysis set within the PDQuest software, which allows for the construction of a relative change analysis comparison chart for any or all expressed proteins. Protein spots of interest were excised from the gel robotically using the PROTEAN 2D Spot Cutter (Bio-Rad), destained in 50% acetonitrile-50 mM ammonium bicarbonate, reduced with 10 mM DTT, and alkylated with 55 mM iodoacetamide. Following overnight digestion with trypsin (6 ng/µl) at 37°C,
-cyano-4-hydroxycinnamic acid was used as a matrix and matrix-assisted laser desorption ionization mass spectra were recorded in positive reflectron mode of the matrix-assisted laser desorption ionization-time of flight mass spectrometer (Micromass, Manchester, United Kingdom). Peptide mass profiles were compared to theoretical peptide masses using MS-Fit (http://prospector.ucsf.edu/ucsfhtml4.0/msfit.htm). At least 5 peptides and 15% coverage had to be observed for the identification of a peptide mass profile to be considered unambiguous (30). Two-dimensional gels and mass spectrometry work were performed in conjunction with the Protein Analysis Research Center (Indiana School of Medicine, Indianapolis, IN). Molecular methods. Rapid amplification of cDNA ends reactions were performed with the GeneRacer kit (Invitrogen) using T. gondii tachyzoite mRNA as a template for reverse transcription (RT) with random primers. RT-PCRs were performed using the SuperScript one-step kit from Invitrogen. Northern blots of parasite mRNA were produced using the NorthernMax system (Ambion). Probes for Northern blots were cDNA derived and intercalated with psoralen-biotin to allow detection by chemiluminescence following hybridization. All DNA sequencing was carried out by the IUSM Biochemistry Biotechnology Facility (Indianapolis, IN).
Recombinant protein expression and purification.
Full-length FLAGTgGCN5-B was amplified from cDNA using the following primers: sense, 5'-CATATGAAAATGGACTACAAGGACGACGACGACAAGGCGCCTTCAGAGTGTCCCAGCG; antisense, 5'-CCTAGGCTAGAAAAATGTCGGATGCTTCGCGCCC. FLAG
528TgGCN5-B was generated using the sense primer 5'-CATATGAAAATGGACTACAAGGACGACGACGACAAGCTCGACAAAGCCCCAACAGGAGCC with the antisense primer above. Both sense primers encode an N-terminal FLAG epitope tag (sequence shown in italics). Inserts were ligated into ptubXFLAG::HX (4) at the NdeI and AvrII sites (underlined above). Plasmid DNA (
50 µg) was linearized using NotI prior to transfection of RH
HXGPRT tachyzoites by electroporation as described previously (38). Stably transfected parasites were selected for by using mycophenolic acid and xanthine and cloned by limiting dilution.
To purify FLAG-tagged protein from stable clones,
1 x 108 filter-purified parasites were spun at 800 x g at 4°C for 10 min and washed in phosphate-buffered saline. The parasite pellet was resuspended in 1.0 ml of lysis buffer (50 mM Tris-HCl [pH 7.4], 150 mM NaCl, 1 mM EDTA, and 1% Triton X-100) plus protease inhibitors, and sonicated three times on ice. Lysates were centrifuged at maximum speed for 10 min to remove insoluble material. Preclearing of lysates was performed for 30 min with 60 µl protein A affinity gel (Sigma) and then transferred to a new tube with 60 µl anti-FLAG M2 affinity gel (Sigma) and incubated overnight at 4°C with gentle agitation. Affinity gel was washed three times with wash buffer (50 mM Tris-HCl [pH 7.4r], 300 mM NaCl, and 1 mM EDTA), followed by two washes with 1x HAT assay buffer. After the last wash,
10 µl packed resin was transferred into a separate tube for Western blot analysis to insure that immunoprecipitation was successful. The remaining 20 µl packed resin was used in a HAT assay.
Radioactive HAT assay.
Each HAT assay (6) used
20 µl of packed resin from a FLAG affinity purification mixed with 2 µl of acetyl [3H]coenzyme A (acetyl-[3H]CoA) (Amersham), 4 µg chicken erythrocyte core histones (Upstate), 6 µl 5x HAT assay buffer (16), and 18 µl of distilled H2O. Polyhistidine-tagged Saccharomyces cerevisiae GCN5 (ScGCN5) was generated in CodonPlus Escherichia coli BL21 (Stratagene) and purified over nickel for use as a positive control (1 µg per HAT assay reaction). HAT reaction mixtures were incubated at 30°C for 60 min and stopped by the addition of 15 µl of 2x NuPAGE loading dye (Invitrogen) containing 2.5 µl of ß-mercaptoethanol. After 10 min of incubation at 70°C, the reaction mixture was resolved on a NuPAGE 10% Bis-Tris gel (Invitrogen) using morpholineethanesulfonic acid (MES) buffer. Gel was fixed in an aqueous solution of 5% isopropanol and 5% acetic acid for 2 h followed by gentle washing in distilled water for 1 h. The gel was incubated in Autoflour reagent (National Diagnostic) for 2 h, dried, and processed for autoradiography.
Nonradioactive HAT assay and Western blotting. The HAT assay was performed as described above, with the following differences: 1 µl of acetyl-CoA (1 mM in 10 mM sodium acetate [pH 4.5]), 1 µg histone H3 (Upstate), and 22 µl of distilled H2O. Gels were transferred to a polyvinylidene difluoride membrane and blocked in 5% nonfat dry milk in Tris-buffered saline-Tween (TBST) overnight at 4°C. Membranes were incubated with anti-AcH3 [K14] or anti-AcH3 [K9] (Upstate) or anti-AcH3 [K18] (Abcam) (all at 1:1,000) in TBST with 5% milk. After three washes with TBST, anti-rabbit antibody conjugated to horseradish peroxidase (Amersham) was incubated with membrane at a 1:2,500 dilution in 5% milk-TBST for 1 h. Results were detected by chemiluminescence using the Amersham ECL detection reagent.
Yeast two-hybrid testing.
950TgGCN5-A was amplified by PCR using the following primers: sense, 5'-CATATGCGTCTCAGCACCCGAATAAATTACC; antisense, 5'-CCCGGGTCAGAAACTCCCGAGAGCCTCGACCTTGG. The insert was ligated in frame to the GAL4 DNA-binding domain of pGBKT7 (Clontech) at the NdeI and XmaI sites (underlined).
781TgGCN5-B was ligated into pGBKT7 at the NdeI and BamHI sites following PCR amplification using the following primers: sense, 5'-CATATGCACATCAATCCGCGCATCAATTACC; antisense, 5'-GGATCCCTAGAAAAATGTCGGATGCTTCGCGCCC. Portions of TgADA2-A and TgADA2-B corresponding to the putative GCN5-binding domains were ligated in frame to the GAL4 activation domain of pGADT7 (Clontech) using NdeI and BamHI sites. The Insert was generated using primers 5'-CATATGGAACGGAGAGCGAAACGCGCG (sense) and 5'-GGATCCTTAGCCCATGAGCAGTGCCG (antisense) to amplify TgADA2-A cDNA encoding amino acids 158 to 410 and primers 5'-CATATGCCTGAGGACGGCGGGACTCCTCC (sense) and 5'-GGATCCTTACTCGGGTGGGCTGTGAACTTTTGG (antisense) to amplify TgADA2-B cDNA encoding amino acids 461 to 761. Competent AH109 Saccharomyces cerevisiae cells (Clontech) were cotransformed with 0.25 µg of each vector. Each transformation was plated on double dropout (lacking leucine and tryptophan) and triple dropout (TDO, lacking leucine, tryptophan, and histidine) media. Plates were incubated at 30°C for 3 days. Eight colonies from the TDO plates were spotted onto quadruple dropout plates (QDO, lacking leucine, tryptophan, histidine, and adenine) containing X-
-Gal and incubated as before. Blue colonies larger than 3 mm were considered positive for protein-protein interaction.
Bioinformatics. Preliminary genomic sequence data was accessed via http://ToxoDB.org. Genomic data were provided by The Institute for Genomic Research (supported by the NIH grant no. AI05093) and by the Sanger Center (Wellcome Trust). Sequence analysis tools included BLAST (http://www.ncbi.nih.gov/BLAST/), SMART (http://smart.embl-heidelberg.de/), and Pfam (http://www.sanger.ac.uk/Software/Pfam/search.shtml). Alignments and phylogenetic tree constructions were performed with Vector NTI 9.0 (Informax).
For the analysis of TgGCN5-dependent promoters, the data set was prepared by extracting
1,500 bp upstream of the translational start for most of the gene products in Table 1 (promoter sequences for accession no. 21214754 and 6003679 were disregarded, since the translational start site for these genes is unclear). BLASTX was performed against T. gondii coding sequences to eliminate the inclusion of coding regions within the data set. MEME (1) was used to identify conserved sequence motifs present in the upstream regions for these genes. MEME was run using three different parameters to specify the mode of occurrence of the motif(s), and a motif width of 6 to 20 bp was specified. Sequence logos to represent the motifs were created at http://weblogo.berkeley.edu/.
|
View this table: [in a new window] |
TABLE 1. Gene products reported to be down-regulated in GCN5 parasites
|
|
|
|---|
HX tachyzoites (haploid) was replaced with an HXGPRT selectable marker by homologous recombination (11). As diagrammed in Fig. 1A, the knockout construct (
TgGCN5::HX) would remove the bulk of the TgGCN5 locus, including portions encoding the entire catalytic (HAT) domain and bromodomain. Figure 1B shows a Southern analysis of one clone representing a true allelic replacement, showing that the targeted locus is no longer present in the parasite genome. Moreover, mRNA encoding TgGCN5 is no longer detected by RT-PCR in the knockout clone, termed
TgGCN5 (Fig. 1C).
Similar to yeast lacking GCN5, no readily observable phenotype was evident in the
TgGCN5 clone. No appreciable difference in parasite growth rates between the wild type and
TgGCN5 were detected in vitro using parasite doubling or uracil incorporation assays (data not shown). We examined whether the loss of TgGCN5 impaired virulence in vivo in collaboration with David Sibley (Washington University, St. Louis, MO). An equal number of
TgGCN5 or wild-type tachyzoites were inoculated (intraperitoneally) into type CD-1 outbred mice. In addition, a T. gondii transgenic clone overexpressing recombinant TgGCN5 (4) was analyzed in the mouse model. Neither the loss nor overexpression of TgGCN5 alters the kinetics of infection in vivo (data not shown), revealing that TgGCN5 is not likely to play a significant role during the acute stage of toxoplasmosis.
Gene products differentially expressed in
TgGCN5.
Despite having no discernible phenotype under normal growth conditions, yeast lacking GCN5 were successfully employed to find genes regulated by this factor (20). A viable knockout of TgGCN5 presented a unique opportunity to elucidate the genes that may be controlled by this HAT in protozoa. Thus, differential expression analysis between wild-type and
TgGCN5 parasites was performed. As comprehensive microarrays for T. gondii are still in development, a proteomics approach was pursued. Two-dimensional (2D) gels were used to compare the proteomes of wild-type versus
TgGCN5 tachyzoites. Protein spots reported to be differentially expressed that could be unambiguously identified are listed in Tables 1 (down-regulated in
TgGCN5) and 2 (up-regulated in
TgGCN5). In some instances, multiple independent protein spots were assigned the same identity (e.g., GRA2), possibly reflecting differential splice variants or variability in posttranslational processing. As TgGCN5 is associated with transcriptional activation, gene products reported to be up-regulated in
TgGCN5 are likely due to an indirect effect caused by the lack of TgGCN5; similar observations were reported for yeast lacking GCN5 (20). Some of the down-regulated products are involved in metabolism and cellular signaling, but most are parasite specific and include microneme (MIC) and rhoptry proteins associated with host cell invasion (Table 1).
|
View this table: [in a new window] |
TABLE 2. Gene products reported to be up-regulated in GCN5 parasites
|
TgGCN5 parasites (Fig. 2A). Consistent with the proteomics analysis, SAG1, MIC4, MIC5, and MIC11 appear to be down-regulated in the
TgGCN5, roughly corresponding to the levels observed on the 2D gels. For example, the 2D gel study suggested that MIC5 and SAG1 are down-regulated
60% in
TgGCN5, and a difference in protein levels is clear on the immunoblot. In contrast, MIC4 and MIC11 were only reported to be decreased
30%, and only modest decreases are observed by immunoblot. MIC2 was included as a control as it was not indicated to be differentially expressed.
![]() View larger version (28K): [in a new window] |
FIG. 2. (A) Western analysis comparing levels of indicated protein between wild-type (WT) and TgGCN5 parasites. Samples were normalized to protein concentration. Antibody dilutions applied were as follows (dilutions are indicated in parentheses): anti-SAG1 (1:500), anti-MIC2 (1:10,000), anti-MIC4 (1:5,000), anti-MIC5 (1:5,000), anti-MIC11 (1:1,000). (B) Immunoblot for MIC2 following a microneme secretion assay. IM, invasion media (control); EtOH, ethanol, used at a final concentration of 1%. Parasite lysate displays the size of unprocessed MIC2 for comparison.
|
TgGCN5, we examined whether there was any functional defect in microneme secretion, which can be stimulated in vitro by ethanol. As shown in Fig. 2B,
TgGCN5 exhibits a considerable defect in microneme secretion, as assayed by processed MIC2 levels. Despite the apparent reductions in the expression and secretion of some microneme proteins, we were unable to detect any difference in parasite attachment or invasion in vitro using the red-green invasion assay (22). Either the observed deficiencies are not profound enough to alter the process or other invasion proteins are compensating. It is also possible that ethanol, being an artificial stimulus, is working differently than the native signals that induce microneme secretion.
Analysis of candidate TgGCN5-dependent promoters.
Gene expression facilitated by GCN5 is largely promoter specific (25). GCN5 does not bind DNA directly; rather, it is recruited by a transcriptional activator that is translationally up-regulated during stress. For example, yeast GCN4 recruits the GCN5 complex after binding the general control response element found upstream of target genes. We hypothesized that a similar scenario occurs in Apicomplexa, but bioinformatics analysis of ToxoDB does not reveal any hits for a GCN4 homologue or the human orthologue, ATF4. To assess whether any commonalities exist in the promoters of genes purportedly regulated by TgGCN5, we retrieved
1,500 bp of sequence upstream of the translational start from most of the gene products in Table 1. We were unable to detect a palindromic motif akin to the yeast general control response element consensus sequence in the set of T. gondii sequences examined. We could identify the (A/T)GAGACG motif in 16 of these sequences (Fig. 3A), previously reported to be a cis-acting enhancer of T. gondii genes (33). Another significant motif identified with an E-value of 8.1 x 102 was an 11-bp T-rich element, (A/G)(C/T)TTGTTTT(T/C)T (Fig. 3B). This motif represents a fairly well conserved sequence present on either strand in all 21 sequences analyzed. To ensure that the T-rich element is not commonplace in T. gondii promoter regions, 16 genes that did not show a change in protein levels in
TgGCN5 were analyzed in a similar manner. Importantly, the T-rich motif was not identified in this control set of genes (not shown).
![]() View larger version (20K): [in a new window] |
FIG. 3. Promoter analysis of candidate GCN5-dependent genes. (A) A known cis-acting enhancer of T. gondii genes (33) was identified by MEME in some of the promoters of genes reportedly down-regulated in TgGCN5. The letter size depicts the relative abundance of each nucleotide conserved in the sequence. (B) T-rich element present in GCN5-dependent promoters analyzed in Table 1. (C) Schematic diagram of promoter sequences showing approximate location of the T-rich element. All positions are indicated with respect to the translational start site. A triangle indicates the position of the motif; gray triangles signify that the motif was found on the opposite strand. PDI, protein disulfide isomerase; cAMP, cyclic AMP; PK, protein kinase; dep, dependent; reg, regulatory.
|
T. gondii possesses a second GCN5 family HAT. Mammals possess a second GCN5 family member called p300/CBP-associating factor (PCAF), but there are no reports of multiple GCN5 HATs in other animals or plants. While TgGCN5 may simply be nonessential, as in yeast, we monitored the T. gondii genome project (http://ToxoDB.org) to determine whether additional GCN5 homologues are present that could be playing a compensatory role. Database entry TGG_994635 contains significant homology to the nucleic acid sequence of the TgGCN5 previously reported, henceforth referred to as TgGCN5-A. Primers were designed based on genomic sequence data, and the products from multiple overlapping RT-PCRs allowed the verification and construction of the 3,099-bp coding sequence for this second GCN5 homologue in T. gondii (termed TgGCN5-B, accession no. AY875982 [GenBank] ). 5' and 3' rapid amplification of cDNA ends were employed to delineate the corresponding UTRs, which consist of 905 and 818 bp, respectively. The proposed start codon fits Kozak rules and is preceded by an in-frame stop codon 42 bp upstream (24). A comparative diagram of the loci for TgGCN5-A (8.4 kb) and TgGCN5-B (8.7 kb) is shown in Fig. 4A. The structural organization of each genomic locus is most similar in the region spanning the HAT catalytic domain; exons 3 to 5 of TgGCN5-A are highly homologous and identical in length to exons 4 to 6 of TgGCN5-B. According to the ToxoDB, TgGCN5-A is located on chromosome III and TgGCN5-B on chromosome LG14.
![]() View larger version (40K): [in a new window] |
FIG. 4. A pair of GCN5 HATs in T. gondii. (A) Schematic diagram of the genomic loci for TgGCN5-A and -B. TgGCN5-A is comprised of 7 exons (black) and 6 introns (white), while TgGCN5-B contains 8 exons. The approximate location of key domains in the encoded protein are indicated. HAT, histone acetyltransferase domain; ADA2, ADA2-binding domain; Br, bromodomain. Gray boxes represent untranslated regions. (B) Northern blots probed for TgGCN5-A and -B transcripts, respectively. (C) Dual-probing of a single Northern blot with both TgGCN5s. Two different exposure times are shown. Lower panel shows that equivalent amounts of nonisotopically labeled probes emit approximately equal chemiluminescent signals, so the differences observed on the blot are not due to a bias in probe labeling. (D) Equal amounts of cDNA generated from wild-type RNA were used in PCRs to amplify either TgGCN5-A or -B.
|
4.8 kb for TgGCN5-B (Fig. 4B). Similar to TgGCN5-A, there do not appear to be any alternatively spliced messages for TgGCN5-B (4). Hybridizing equivalent levels of probe specific for each GCN5 to the same lane on a Northern blot shows that TgGCN5-A message levels are considerably higher than those for TgGCN5-B in tachyzoites (Fig. 4C). RT-PCR data are consistent with this result (Fig. 4D). Alignments of the deduced amino acid sequences for TgGCN5-A and -B are shown in Fig. 5A. The TgGCN5-B open reading frame encodes a predicted protein of 1,032 amino acids, 137 residues shorter than TgGCN5-A. The HAT domains are nearly identical, and the putative ADA2-binding domain and bromodomains are less conserved. Most striking is the lack of homology in the lengthy regions upstream of the HAT domain (794 residues in TgGCN5-A; 625 residues in TgGCN5-B). Neither one of these N-terminal extensions has homology to protein sequences in other species, nor are any known protein motifs evident.
![]() View larger version (69K): [in a new window] |
FIG. 5. TgGCN5-B protein analysis. (A) Pairwise alignment of the deduced protein sequences for TgGCN5-A and -B. Dark gray highlights show identical residues; light gray denotes conserved differences. Locations of introns in the genomic sequence corresponding to the protein are indicated by vertical lines with diamondheads at junction sites. HAT and bromodomains are boxed and labeled. The conserved glutamic acid residue critical for HAT activity is marked with an asterisk, and conserved bromodomain residues important for interacting with acetylated lysines are designated with plus signs. (B) Cartoon depicting protein structures of GCN5 family proteins from Apicomplexa (top 5) and other eukaryotes (lower 5). Black boxes indicate HAT domains, gray boxes indicate bromodomains, and mottled boxes indicate PCAF homology domains. ADA2-binding domains are located between HAT and bromodomains. Numerical values represent the percent identities between TgGCN5-B and the indicated protein. Tg, Toxoplasma gondii; Pf, Plasmodium falciparum; Tp, Theileria parvum; Cp, Cryptosporidium parvum; Sc, Saccharomyces cerevisiae; Hs, Homo sapiens; Tt, Tetrahymena thermophila; At, Arabidopsis thaliana. It should be noted that structures for C. parvum and T. parvum were drawn based on predicted open reading frames in genome sequencing databases.
|
2 and -
4, both of which are present in the T. gondii GCN5 homologues (37). The structure of the bromodomain has also been delineated and demonstrated to be a module that can interact with acetylated lysine residues (10). Both TgGCN5 bromodomains contain the 6 residues that form the acetyl lysine binding pocket (10), suggesting a conserved function in the parasite (Fig. 5A). Not as well conserved is the region between the HAT and bromodomain, which in other GCN5 family members has been shown to interact with the SAGA component ADA2. We conducted surveys of completed apicomplexan genomes to check if others possessed a pair of GCN5 homologues. Plasmodium (16), Cryptosporidium (CpIOWA_EAK89017), and Theileria (TP01_0465) harbor only one GCN5 each, making T. gondii unique in being the only apicomplexan reported to possess two GCN5s. Figure 5B compares the structural features of apicomplexan GCN5s to those of other eukaryotic organisms. Curiously, apicomplexans universally possess N-terminal extensions of various lengths and compositions. N-terminal regions of GCN5 in other lower eukaryotes such as yeast and fellow alveolate Tetrahymena thermophila are greatly abbreviated.
TgGCN5-B localizes to the nucleus via a different signal.
To begin delineating the functions of each TgGCN5 in T. gondii, we examined whether TgGCN5-B was cytosolic (type B HAT) or nuclear (type A HAT). We have previously shown that TgGCN5-A is imported into the parasite nucleus by virtue of a nuclear localization signal (NLS) capable of associating with T. gondii importin-
(4). To test the subcellular localization of TgGCN5-B, we fused a recombinant version of the full-length protein to the FLAG peptide for expression in T. gondii. Immunofluorescent assays reveal that TgGCN5-B completely localizes to the parasite nucleus (Fig. 6). The TgGCN5-A NLS (RKRVKR) is a part of the unusual N-terminal extension, but this NLS is not present in TgGCN5-B. However, removal of the N-terminal extension of TgGCN5-B causes it to accumulate in the cytoplasm (Fig. 6). Precisely how TgGCN5-B gains entry into the nucleus remains unresolved, but like TgGCN5-A, its N-terminal extension plays a critical role.
![]() View larger version (16K): [in a new window] |
FIG. 6. TgGCN5-B localizes to the parasite nucleus via the N-terminal extension. Recombinant protein in parasites expressing either full-length FLAGTgGCN5-B or FLAG 528TgGCN5-B was detected by immunofluorescence assay. Antibody recognizing the FLAG epitope is shown in green. Infected cells were also stained with 4',6'-diamidino-2-phenylindole (DAPI), shown in red. hN, host cell nucleus; TgN, parasite nucleus.
|
![]() View larger version (26K): [in a new window] |
FIG. 7. TgGCN5-B preferentially acetylates histone H3 at multiple lysine residues. (A) Autoradiogram of HAT assays using core histones and either recombinant yeast GCN5 (Saccharomyces cerevisiae, Sc) or FLAGTgGCN5-B (Tg-B). An equivalent amount of untransfected parasite lysate was used as a negative control. The right panel is a Coomassie-stained SDS-polyacrylamide gel showing the positions of the core histones. (B) Western blot of similar HAT assays performed on recombinant histone H3, probed with antibodies ( ) to specific acetylated (Ac) H3 lysine residues. (C) Immunofluorescence assay showing acetylation of H3 [K18] in TgGCN5 and wild-type (RH) parasites (green). 4',6'-Diamidino-2-phenylindole (DAPI) was applied to stain nuclei (red). hN, host cell nucleus; TgN, parasite nucleus.
|
TgGCN5 parasites using an antibody specific to acetylated H3 [K18] in an immunofluorescence assay. As shown in Fig. 7C, H3 [K18] is still being acetylated despite the lack of TgGCN5-A. Since TgGCN5-B is capable of targeting H3 [K18], it therefore has the potential to compensate for the loss of TgGCN5-A.
Two homologues of ADA2 exist in T. gondii.
In other species, GCN5 HATs associate with the transcriptional coactivator ADA2. Lower eukaryotes are not known to possess more than one ADA2, but given that two GCN5 HATs exist in T. gondii, we sought to determine whether two ADA2 proteins were present. Drawing on data harnessed from the T. gondii genome sequence, we were able to clone two distinct ADA2 homologues and map the genomic loci (Fig. 8A). TgADA2-A refers to the shorter one (DQ112184
[GenBank]
), a
4.0-kb transcript as determined by Northern analysis, predicted to give rise to a 958-amino-acid protein (Fig. 8B). The TgADA2-A genomic locus is
8 kb, comprised of 9 exons interrupted by 8 introns. In contrast, TgADA2-B is much longer (DQ112185
[GenBank]
), hybridizing to a barely detectable
10-kb transcript (Fig. 8B). The open reading frame is 8,094 kb and encodes a protein of 2,697 amino acids. The
13-kb TgADA2-B genomic locus consists of 7 exons and 6 introns. According to ToxoDB, TgADA2-A resides on chromosome XI and TgADA2-B on chromosome VI. As observed for the ADA2 homologue in Plasmodium falciparum (15), TgADA2s are considerably larger than those in other species (yeast and metazoan ADA2s are comprised of 400 to 550 amino acids). Regions flanking the ADA2 domain in these apicomplexan ADA2s are not conserved with other species or one another.
![]() View larger version (26K): [in a new window] |
FIG. 8. A pair of ADA2s in T. gondii. (A) Schematic diagram of the genomic loci for TgADA2-A and -B. Black boxes represent exons, and the lines in between are introns. Gray boxes represent untranslated regions. (B) Northern blots probed for TgADA2-A and -B transcripts, 4 kb and 10 kb, respectively. Bands at 1.7 and 3.5 kb overlap with rRNA contamination. (C) Dual probing of a single Northern blot with both TgADA2s. The lower panel shows that equivalent amounts of nonisotopically labeled probes emit approximately equal chemiluminescent signals. (D) Equal amounts of cDNA generated from wild-type RNA were used in PCRs to amplify either TgADA2-A or -B.
|
![]() View larger version (88K): [in a new window] |
FIG. 9. Comparative analysis of ADA2 domains. (A) ADA2 domains from TgADA2-A and -B are aligned for comparison to those found in other species. Black boxes denote identity of residues across species; gray boxes denote residues that are chemically similar. The ZZ, SANT, and ADA3-binding domains are boxed. Numerals in parentheses refer to amino acid numbers. Tg, Toxoplasma gondii; Pf, Plasmodium falciparum; Sc, Saccharomyces cerevisiae; Hs, Homo sapiens; Dm, Drosophila melanogaster. (B) Phylogenic analysis of ADA2 orthologues from representative species with the calculated distance values in parentheses.
|
Interactions between TgGCN5s and TgADA2s.
It has been noted above that the putative ADA2-binding domain is not very conserved between the two TgGCN5s. Conventional biochemistry is difficult to perform on this organism, so to test whether either TgADA2 binds to either TgGCN5 we employed a directed yeast two-hybrid test. Based on well-characterized corresponding regions elucidated in other species, the putative GCN5-binding domains of TgADA2-A (amino acids 158 to 410) and -B (amino acids 461 to 761) were amplified and fused to the GAL4 activation domain (7, 15, 41). C-terminal portions of each TgGCN5, starting with the putative ADA2-binding domain (
950TgGCN5-A and
781TgGCN5-B, respectively), were fused to the GAL4 DNA-binding domain. Combinations of TgGCN5 plus TgADA2 were cotransformed into AH109 yeast and plated onto QDO media with a colorimetric substrate to test for protein-protein interactions. This system utilizes three independent reporters (two nutritional genes plus galactosidase) to minimize false-positive results; no growth argues that the test proteins do not interact. The growth of blue colonies on QDO plates indicates that all three independent reporter genes were activated by virtue of a strong interaction between the test proteins. As shown in Fig. 10A, TgGCN5-B has the ability to interact with either TgADA2-A or -B, while TgGCN5-A only interacts with TgADA2-B. These results suggest that three GCN5-containing complexes may exist in T. gondii: TgGCN5-A plus TgADA2-B, TgGCN5-B plus TgADA2-A, and TgGCN5-B plus TgADA2-B. It is worth noting that TgGCN5-B can associate with the only TgADA2 capable of interacting with TgGCN5-A. Considered with its enzymatic abilities, it is plausible that TgGCN5-B can partially complement the function of TgGCN5-A. This complementation may not be complete, as reflected in the proteomics study comparing
TgGCN5-A versus wild-type tachyzoites. In contrast, the more exclusive nature of TgADA2-binding and substrate preference makes it unlikely that TgGCN5-A could compensate for a loss of TgGCN5-B.
![]() View larger version (25K): [in a new window] |
FIG. 10. Protein-protein interactions between TgGCN5s and TgADA2s. (A) Top panels: AH109 yeast were cotransformed with plasmids expressing simian virus 40 large-T antigen (TAg) and murine p53 (positive control) or TAg and human lamin C (cLam) (negative control). Four independent colonies initially plated on TDO plates were spotted onto QDO plates containing X- -gal and grown for 3 days at 30°C. Growth of blue colonies represents protein-protein interaction. Bottom panels: AH109 yeast were cotransformed with the plasmids indicated and plated for incubation as described above. All colonies shown are blue in color. (B) Working model for TgGCN5 HAT complexes. Based on yeast two-hybrid data, at least three possible complexes containing GCN5 may exist in T. gondii. The lowest panel depicts a complex comprised of TgGCN5-B and TgADA2-B that could compensate for TgGCN5-A. Based on what is observed in other eukaryotes, a question mark refers to a putative DNA-binding transcription factor with the capacity to recruit the GCN5 complexes by virtue of binding ADA2. Note that the position of the T-rich motif is approximate, as its actual location varies as shown in Fig. 3C.
|
|
|
|---|
Genes under control of GCN5 family HATs.
The nature of the genes controlled by GCN5 HATs is incompletely characterized. Studies with yeast show that GCN5 modulates the expression of tightly regulated genes governing the stress response (21). By virtue of being recruited to the yeast master regulator GCN4, GCN5 activates expression of genes for amino acid biosynthesis and transport and for vitamin and purine biosynthesis as well as peroxisomal genes (19). In plants, similar evidence argues that GCN5 regulates stress remediation genes (42). Serving to ameliorate stress would also explain why GCN5 is nonessential for organisms free of environmental insult. GCN5 plays a vital role in development in some plants (50); but in mammals, which have two GCN5s, only one seems essential for development (51, 52). Our data show that the subset of genes under TgGCN5-A control is remarkably different than other eukaryotes. Our proteomic analysis of
TgGCN5-A shows most of the affected products to be parasite specific, including surface antigens and micronemes, proteins that participate in attachment and invasion into host cells. Despite the differential expression of these genes in
TgGCN5, the phenotypic impact in vitro and in vivo appears to be inconsequential. Considering the critical nature of parasite invasion, it is conceivable that redundancies exist that adequately compensate for the reduced expression of such gene products. It is also possible that there is a basal level of transcription of the GCN5-dependent genes that provides adequate amounts of the respective gene products.
Our aim in this study was to examine the impact of TgGCN5-A on the tachyzoites responsible for the acute stage of infection, for which type I strains like RH are ideal. The discovery of TgGCN5-B creates the possibility that it is more important in regulating gene expression during acute infection. Moreover, histone acetylation was recently found to correlate with stage-specific changes in gene expression in T. gondii (39). To study the potential roles of TgGCN5s in this process, we are attempting to disrupt each locus in an avirulent type II strain, which is better suited for differentiation studies. It is also possible that TgGCN5-A and/or -B governs gene expression changes that are pertinent to transcriptional changes occurring while in the definitive (feline) host or during sporulation.
Multiple GCN5 HATs in T. gondii. We have previously shown that T. gondii has a GCN5-type HAT that exhibits an uncharacteristic proclivity to acetylate strictly H3 [K18] (39, 47). Now we show that T. gondii is also in possession of a more prototypical GCN5 family HAT, TgGCN5-B, capable of acetylating multiple lysine residues in the H3 tail. This is significant for several reasons. First, T. gondii is unique in being the only invertebrate described to date harboring two GCN5 HATs. Second, unlike the dual GCN5 HATs in mammalian cells, TgGCN5-A and -B have different histone acetylation profiles. Third, the two TgGCN5s contain lengthy N-terminal extensions that are not similar to one another. In contrast, mammalian GCN5 and PCAF possess N-terminal domains that are highly analogous and associate with another HAT, p300/CBP (CREB-binding protein). p300/CBP is not present in apicomplexan genomes or other lower eukaryotes, but it remains possible that the N-terminal extensions of TgGCN5s are involved in protein-protein interactions. We have discerned that the N-terminal extensions of each TgGCN5 are dispensable for HAT activity in vitro but are required for nuclear localization.
Why two HATs?
T. gondii appears to be unique among its fellow apicomplexans in harboring more than one GCN5 family HAT. We speculate that the two TgGCN5s are paralogues that probably arose from a gene duplication event occurring exclusively in T. gondii prior to their segregation into three different clonal lineages
10,000 years ago (44). TgGCN5-B resembles archetypal GCN5s with regard to its sequence homology as well as its behavior in HAT assays, making it more likely to be the original gene. The vastly different N-terminal extensions between TgGCN5-A and -B might suggest that only the HAT domain through the bromodomain was copied. Notably, removal of the N-terminal extensions does not alter in vitro substrate specificity of the TgGCN5s. The more restrictive nature of TgGCN5-A HAT activity may involve one or more of the four residues previously noted to be conserved in all other GCN5s except TgGCN5-A (47). Further study will have to be performed to confirm this hypothesis.
Mammalian GCN5 and PCAF are much more similar to one another in both sequence and substrate preference than the two TgGCN5 HATs. Our inability to discern a clear phenotype for
TgGCN5-A mirrors the results seen in mice that showed that GCN5 could compensate for the loss of PCAF (52). Interestingly, PCAF could not compensate for the loss of GCN5, making it of interest to investigate whether a knockout of TgGCN5-B is possible in T. gondii. We imagine that TgGCN5-A, given its more restricted enzymatic ability and exclusivity for one ADA2, will not be able to adequately compensate for the loss of TgGCN5-B.
It is unclear at present why T. gondii has two GCN5 HATs. It is possible that the two GCN5s regulate genes in different stages of the parasite life cycle. The availability of comprehensive microarrays for T. gondii will permit gene expression profiling experiments designed to identify the complement of genes associated with each TgGCN5 in different life cycle stages.
TgGCN5 complexes: simple or complex? ADA2 proteins associate with GCN5 and act as coactivators bridging the GCN5 complex to the acidic activation domains of DNA-binding transcription factors and TATA-binding protein (2, 12). As is the case for GCN5, it was unexpected to find two novel ADA2 homologues in a lower eukaryotic cell. While most metazoans and land plants have two ADA2s, our search results indicate that most protozoa, like yeast, have only one. T. gondii appears to be an exception to this observation, possibly because it contains two GCN5s, but precisely how and why a second ADA2 locus arose remains an intriguing issue. Moreover, like PfADA2 (15), TgADA2-A and -B are much longer than other species, containing large regions flanking the ADA2 domain that have no similarity to other protein sequences or known motifs.
Our studies using the yeast two-hybrid assay determined that either TgADA2 can interact with TgGCN5-B. However, TgGCN5-A can only interact with TgADA2-B. It remains possible that TgGCN5-A may interact with TgADA2-A in vivo, but the data at present suggest that at least three potential GCN5-containing complexes may exist in T. gondii (Fig. 10B). It should be mentioned that yeast two-hybrid results do not always reflect what truly occurs in vivo in the parasite, and further studies are under way to confirm the result suggested by the yeast two-hybrid test. It is also possible that the TgADA2 homologues operate independently of TgGCN5. Drosophila has two ADA2 homologues, but only one (dADA2b) is incorporated into the SAGA complex (27). Data argue that dADA2a may have novel roles in transcription that are independent of GCN5 (27, 34).
ADA3 is another component found in all GCN5 complexes, but no homologue of it is readily apparent in apicomplexan databases. Either the ADA3 homologue is very divergent in these parasites or they have evolved an unprecedented means to nucleate the GCN5 complex. If the latter is true, the lengthy extensions on TgGCN5 and TgADA2 proteins may mimic the role of ADA3 in other eukaryotes.
One of the most striking features of apicomplexan genomes is their lack of conventional transcription factors (48). The elucidation of a T-rich motif may assist the effort to identify a DNA-binding factor that subsequently recruits chromatin remodeling complexes to the target promoters. Given the paucity of conventional transcription factors but robust repertoire of chromatin remodeling enzymes (39, 46, 47), a plausible case can be made that T. gondii has streamlined gene expression regulation, relying more heavily on epigenetics. On a related note, in addition to the lack of a conserved ADA3, we have also noted a lack of other SAGA components in the Apicomplexa (W. J. Sullivan, Jr., unpublished data), arguing further that these ancient eukaryotic cells have simplified GCN5 complexes. The unique attributes of their gene regulatory machinery, as well as the potential for pharmacological applications (9), makes T. gondii an increasingly attractive organism for the study of these phenomena.
We are grateful to David Sibley (Washington University, St. Louis, MO) and Vern Carruthers (Johns Hopkins Malaria Research Institute, Baltimore, MD) for MIC antibodies and Blima Fux (Sibley lab) for assistance with the virulence study. We thank Tony Sinai (University of Kentucky, Lexington, KY) and M. A. Hakimi (Joseph Fourier University, France) for a critical reading of the manuscript and Chunling Jiang (Sullivan lab) for technical assistance. Preliminary genomic sequence data were accessed via http://ToxoDB.org. Genomic data were provided by The Institute for Genomic Research (supported by NIH grant no. AI05093) and by the Sanger Center (Wellcome Trust).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»