Previous Article | Next Article ![]()
Eukaryotic Cell, June 2007, p. 1018-1029, Vol. 6, No. 6
1535-9778/07/$08.00+0 doi:10.1128/EC.00078-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, California 94720-3102,1 Oregon Health and Science University, Beaverton, Oregon 97006-89212
Received 13 March 2007/ Accepted 16 April 2007
|
|
|---|
|
|
|---|
1,000) in response to amino acid starvation (34, 45).
The production of Gcn4p itself is induced during amino acid starvation via translational regulation; four short upstream open reading frames (uORFs) encoding products of 2 to 4 amino acids affect the ribosome scanning of GCN4 mRNA under conditions of stress (24, 44, 61). The translational regulation of GCN4 is associated with the kinase activity of Gcn2p, which is stimulated by the binding of deacylated tRNAs that accumulate under conditions of amino acid starvation. The phosphorylation target of Gcn2p is the
subunit of eukaryotic translation initiation factor 2 (eIF2). In addition to amino acid starvation, the Gcn4p pathway is induced by starvation for purines (50), glucose limitation (71), and exposure to DNA-damaging agents, such as methyl methanesulfonate (45) and rapamycin (67); it is not clear whether the patterns of translational regulation of GCN4 are identical under all these stress conditions.
Gcn4p binds to 5'TGACTCA3' sequences in the promoters of target genes, such as HIS4, HIS3, ILV1, and ILV2 (1). By chromatin immunoprecipitation (ChIP) experiments, the promoters of 207 genes were found to be bound by Gcn4p (P < 0.001) (22, 36); the transcriptional profiling and ChIP data sets show an overlap of 95 genes. Gcn4p was also shown to interact physically with numerous other proteins that are predicted to affect Gcn4p DNA binding and function in vivo (22, 33, 60).
The filamentous fungus Neurospora crassa diverged from S. cerevisiae approximately 350 million years ago (16). Previously, it was shown by genetic analyses that cross pathway or general amino acid control is evolutionarily conserved between S. cerevisiae and N. crassa (reviewed in reference 53). A mutation identified in N. crassa confers a phenotype similar to that of gcn4 mutants of S. cerevisiae; the cross pathway control (cpc-1) mutant failed to induce the expression of a number of amino acid biosynthetic genes upon amino acid starvation (3, 4, 18, 37, 56). CPC1 is required for the global regulatory response observed during starvation for amino acids (9-11, 46). The cpc-1 locus encodes a bZIP transcription factor with sequence similarity to Gcn4 (46). The cpc-1 mRNA comprises a long leader segment with two uORFs of 2 and 41 codons (53). As that of GCN4, the translation of cpc-1 mRNA is regulated by amino acid starvation; the uORFs play a regulatory role (53). An ortholog of the GCN2 eIF2
kinase gene, cpc-3, is important for the regulation of cpc-1 function in response to amino acid starvation (55).
The level of amino acid identity between CPC1 and Gcn4p is only 25% (similarity, 42%). The carboxy terminus of CPC1 contains the DNA binding and dimerization domain (57 amino acid residues) (46), and regions associated with DNA binding and transcriptional activation in Gcn4p and CPC1 are more similar in primary sequence (30 amino acid residues involved in DNA binding are 70% identical) than other regions. The introduction of GCN4 into a cpc-1 mutant of N. crassa did not restore cross pathway control (48). Only constructs in which the DNA binding domain of cpc-1 was replaced by that of GCN4 complemented the cpc-1 mutant phenotype.
In N. crassa, cpc-1 is regulated throughout the asexual life cycle (14, 31, 54). The relative levels of DNA binding activity in cell extracts correlate with CPC1 binding activity (14). Binding studies of the trp-3 and arg-2 promoter regions show that CPC1 binds to a DNA sequence identical to that bound by Gcn4p (5'TGACTC3'). However, the size of the bound complex is larger than the predicted CPC1 homodimer, indicating that additional unknown proteins also bind to these regions. As many as 20% of the total detectable mRNA species in N. crassa are influenced directly or indirectly by the presence of a functional cpc-1 gene during amino acid starvation (17), including the cpc-1 mRNA itself; CPC1 regulates it own expression (14). The synthesis of many polypeptides also increases in response to arginine limitation, which is dependent on functional CPC1 (17, 58). These data indicate that CPC1 controls a large number of N. crassa genes, similar in magnitude to the number reported to be controlled by Gcn4p in S. cerevisiae (45).
In this study, we determined the regulon of CPC1 by profiling transcription in a wild-type (WT) strain and a cpc-1 mutant under normal growth conditions and under amino acid starvation. As with the GCN4 regulon, the transcription of a large number of genes was affected by conditions of amino acid starvation and was also dependent upon functional CPC1. At least 443 genes were predicted to be either direct or indirect CPC1 targets, including 67 amino acid biosynthesis genes, 16 tRNA synthetase genes, and 13 vitamin metabolism-related genes. In addition, a large number of genes showed CPC1-dependent repression. We compared our results to those obtained in a similar study of both WT strains and gcn4 mutants of S. cerevisiae and Candida albicans (62). We identified conserved and divergent elements in the response to amino acid starvation among these three fungi. A comparison of these regulons showed that the regulation of only 32 orthologous genes is conserved among all three fungi. Of these 32 genes, only 10 retain the conserved cis element bound by Gcn4p/CPC1.
|
|
|---|
Strains and culture conditions. The cpc-1 mutant (FGSC 4264) and the WT sequenced strain (FGSC 2489) were obtained from the Fungal Genetics Stock Center. The cpc-1 CD15 allele mutation is a single-base-pair deletion in codon 93 resulting in a nonfunctional, truncated polypeptide (47). Strains were inoculated onto slants containing Vogel's (69) minimal medium and grown at 30°C for 2 days, followed by incubation at 25°C under constant light for 7 days. Conidia were harvested with water and inoculated into 50 ml of Bird's medium (43) in 250-ml Erlenmeyer flasks at a final concentration of 106 conidia/ml. The cells were grown under constant light at 30°C for 12 h in a gyratory shaker at 300 rpm. An inhibitor of imidazole glycerol phosphate dehydratase, 3-aminotriazole (3-AT), was then added to the cultures of the WT and cpc-1 strains (FGSC 2489 and FGSC 4264, respectively) to a final concentration of 6 or 30 mM. The addition of 3-AT to N. crassa cultures inhibits histidine biosynthesis and causes imidazole glycerol phosphate to accumulate. Control cultures received no 3-AT. All cultures were incubated under identical conditions for a further 2 h. Each sample was represented by duplicate cultures. The mycelium was harvested by filtration, immediately immersed in liquid nitrogen, and stored at 70°C.
In N. crassa, amino acid starvation has been experimentally induced by the addition of 6 mM 3-AT to hyphal cultures (14, 54). To determine the appropriate concentration for a transcriptional profiling analysis of amino acid starvation in N. crassa, we performed profiling experiments with 12-h-old hyphal cultures grown in liquid medium and exposed or not to 6 or 30 mM 3-AT for 30 min or 2 h. Approximately 300 genes showed differential regulation after a 2-h exposure to 3-AT at both concentrations (6 and 30 mM); expression levels were more affected by the duration of 3-AT treatment than by the 3-AT concentration. To minimize the possible side effects of using too high a concentration of 3-AT, for subsequent analyses we exposed 12-h-old hyphae to 6 mM 3-AT for 2 h.
RNA isolation and cDNA labeling. RNA isolation was accomplished using TRIzol (Invitrogen Life Technologies), and RNA was subsequently purified using the RNAeasy kit (QIAGEN) according to the manufacturer's protocols. For cDNA synthesis and labeling, the Pronto kit (catalog no. 40076; Corning) was used according to the manufacturer's specifications. Briefly, cDNA was synthesized from a mixture containing 10 µg total RNA and oligo(dT) primer, ChipShot reverse transcriptase, and aminoallyl-deoxynucleoside triphosphate and incubated at 42°C for 2 h. The cDNA was purified by using a ChipShot membrane column. The dyes Cy3 and Cy5 (Amersham; catalog no. RPN5661) were incorporated into cDNA by adding Cy3 or Cy5 monofunctional N-hydroxysuccinimide ester dye to the cDNA solution for 1 h at 22°C. The cDNA was subsequently cleaned by using a ChipShot membrane column and dried under vacuum and was subsequently used for hybridization.
Hybridization and image acquisition. Slides were prehybridized and hybridized at 42°C according to the instructions of the manufacturer of the Pronto kit (Corning; catalog no. 40076). Briefly, the presoak solution was prewarmed (42°C) for 30 min, and then a 1% volume of sodium borohydride was added to the presoak solution and the solution was mixed well. The microarray slide was then added to the solution and incubated at 42°C for 20 min. Slides were washed, transferred to prewarmed prehybridization solution (42°C) for 15 min, and then rewashed. The presoak and prehybridization steps reduced slide background. Following slide prehybridization, labeled cDNA was resuspended in 30 µl of hybridization solution (Pronto kit) and the suspension was heated at 95°C for 5 min and subsequently pipetted into the space between a microarray slide and a LifterSlip cover glass (Erie Scientific, Portsmouth, NH). Hybridization was carried out for 16 h at 42°C, and unbound DNA was washed off according to the manufacturer's instructions. A GenePix 4000B scanner (Axon Instruments, CA) was used to acquire images, and GenePix Pro6 software was used to quantify hybridization signals. Low-quality spots were flagged automatically by GenePix software, and subsequently each slide was inspected manually.
Experimental design and data analysis. For the transcriptional profiling of the WT and the cpc-1 mutant under conditions of amino acid starvation, we chose to use a closed-circuit design for microarray comparisons (Fig. S1 in the supplemental material). Circuit designs for microarrays are statistically robust and improve resolution in identifying differentially regulated genes compared to designs for microarrays that use a universal reference (32, 63, 64, 68, 72).
Hybridized spots with a mean fluorescence intensity for at least one of the Cy3 and Cy5 dyes that was greater than the mean background intensity plus three standard deviations were scored for further analysis if less than 0.02% of the pixels were saturated. Normalized ratio data were analyzed using Bayesian Analysis of Gene Expression Levels software, with which we inferred a relative expression level and a credible interval for each gene in each sample (65). These inferred levels of gene expression were then clustered (15) using Hierarchical Clustering Explorer 2.0 (57), in which similarity in expression patterns between genes is measured as Pearson's correlation coefficient and the closest two genes or clusters are successively joined. The functional catalog FunCat created by MIPS (19, 52) was mined to associate functional annotations with Neurospora genes (http://mips.gsf.de/genre/proj/ncrassa/Search/Catalogs/catalog.jsp). The statistically significant overrepresentation of gene groups in functional categories relative to the whole genome was determined with the MIPS online FunCat system using hypergeometric distribution for P value calculation (http://mips.gsf.de/proj/funcatDB/help_p-value.html).
cis-element discovery and enrichment analysis. Enrichment of motifs (P < 0.001) was assessed using Fisher's exact test performed by the Fisher test function implemented in the R 1.9 program (http://bioconductor.org). A motif logo illustrating the consensus sequence was obtained using the WebLogo program (http://weblogo.berkeley.edu). For the identification of the CPC1 DNA binding motif, the total number of putative targets was evaluated by MIPS functional category analysis. Motif searches were conducted using three programs, BioProspector (42), MDscan (38), and MEME (2), on segments of 500 bases upstream of predicted translational start sites, which were downloaded from the Broad Institute N. crassa database release version 7 (http://www.broad.mit.edu/annotation/fungi/neurospora_crassa_7/). The different predictions were compared and inspected manually. The CPC1 DNA binding cis-element matrix was built based on the MDscan prediction with the default parameter. With the results from the initial MDscan step, the conserved 7-bp region in the middle of the element was used to make the initial matrix from motif 1 (10 bp in width). By using the initial matrix, the promoter regions of the 67 amino acid genes were evaluated by the PATSER program (23). Fifty-seven genes were retained when known CPC1 targets were used to determine the cutoff value. The predicted cis-elements in the promoter regions of these 57 genes were used to build the CPC1 DNA binding matrix and logo. To identify additional CPC1 target genes in the N. crassa and S. cerevisiae genomes, predicted promoter regions were scanned using the CPC1/Gcn4p matrix (23). Enrichment analysis results were binned into a group of genes with perfect matches to the CPC1/Gcn4p consensus and a second bin reflecting a match score of 70% of the full score.
Identification of orthologs and regulogs in N. crassa and other fungi. The sequences of predicted ORFs in the N. crassa genome were downloaded from Broad Institute N. crassa database release version 7 (http://www.broad.mit.edu/annotation/genome/neurospora/Downloads.html). Sequences of S. cerevisiae ORFs and upstream 500-bp promoter regions were downloaded from the Saccharomyces Genome Database (ftp://genome-ftp.stanford.edu/pub/yeast/data_download/sequence/genomic_sequence/orf_dna/archive/), and sequences of C. albicans ORFs and upstream regions were downloaded from http://www.candidagenome.org/download/sequence/genomic_sequence/. Orthologous genes were identified as best bidirectional hits by using BLASTp with a cutoff value of 1e10. If orthologs were CPC1, Gcn4p, or CaGcn4p targets, they were defined as regulogs in a common regulon. The term regulog extends the concept of a protein-DNA interolog as defined by Yu et al. (73). Targets can be direct (genes involved in Gcn4p/CPC1-DNA interactions) or indirect (genes exhibiting a transcriptional response affected by mutations in cpc-1/gcn4/Cagcn4 that are not direct DNA targets). The Gcn4p target gene data set includes ChIP data (http://jura.wi.mit.edu/young_public/regulatory_code/GWLD.html; 22, 36) and transcriptional profiling data (34, 45) that were obtained from haploid and diploid cells, respectively. Transcriptional profiling data for WT strains and gcn4 mutants of S. cerevisiae and C. albicans were kindly provided by Alan G. Hinnebusch (NIH NICHD) and Alistair Brown (University of Aberdeen), respectively. We reanalyzed the CaGcn4 profiling data using the Bayesian Analysis of Gene Expression Levels software (65) and normalized expression data. Genes were defined as CaGcn4p targets if they were induced in the WT by 3-AT treatment, showed a statistically significant change in expression level upon 3-AT exposure, and were repressed in the Cagcn4 mutant but not in the WT. Sequences of promoter regions and predicted ORFs from the Candida tropicalis, Aspergillus nidulans, and Magnaporthe grisea genomes were downloaded from the Broad Institute (http://www.broad.mit.edu/annotation/fgi/).
Supplemental material and the Neurospora functional genomics microarray database. The complete data set generated in this study is available in the supplemental material and in the Neurospora functional genomics microarray database (http://www.yale.edu/townsend/Links/ffdatabase/introduction.htm).
|
|
|---|
A large number of genes showed differential expression levels in cultures of WT N. crassa exposed to 6 mM 3-AT for 2 h relative to those in unexposed control cultures; the expression levels of 334 genes showed a statistically significant increase in the 3-AT-exposed culture, while the expression levels of 280 genes decreased in the 3-AT-exposed culture relative to those in the unexposed WT culture (Table 1). The genes identified in the increased- and decreased-expression-level data sets were subjected to functional category representation analysis (FunCat; http://mips.gsf.de/genre/proj/ncrassa/Search/Catalogs/catalog.jsp) (Table S1 in the supplemental material). Of the 334 genes in the WT whose expression levels were significantly increased by 3-AT treatment, 120 genes were classified in the metabolism functional category. Of these, 54 genes were involved in amino acid metabolism (P = 1.44e34) and included known targets of CPC1, such as trp-1 (NCU00200.2, encoding anthranilate synthase component II), arg-12 (NCU01667.2, encoding ornithine carbamoyltransferase), his-3 (NCU03139.2, encoding a histidine biosynthesis trifunctional protein), trp-3 (NCU08409.2, encoding tryptophan synthase), and leu-6 (NCU09463.2, encoding leucine-tRNA synthetase) as well as cpc-1 itself (NCU04050.2). Another known target of CPC1, arg-2 (NCU07732.2, encoding the arginine-specific carbamoylphosphate synthetase small subunit), was slightly induced, consistent with previously published data (54), but did not pass the statistical significance test. The relative level of transcripts of the for gene (NCU02274.2, encoding cytosolic serine hydroxymethyltransferase) was not elevated after 2 h of exposure to 3-AT, although previously published Northern analysis data indicated that for gene expression increases after a 1-h exposure to 3-AT (41). We assessed for gene expression levels from both the 30-min and 2-h microarray data sets (see Materials and Methods); for expression levels were transient (detectable in the 30-min but not the 2-h data set).
|
View this table: [in a new window] |
TABLE 1. Summary of transcriptional profiling results
|
Transcriptional profile of the N. crassa cpc-1 mutant. The cpc-1 mutant grows like the WT in minimal medium (3), and basal levels of transcripts of the amino acid biosynthesis genes, such as his-3, trp-3, cpc-1, and arg-2, are similar between the WT and the cpc-1 mutant (FGSC 4264) (53). However, enzyme assay data showed that mutations in cpc-1 affect basal levels of activity of some amino acid biosynthetic enzymes (3). When the cpc-1 mutant was grown in minimal medium, 290 genes showed higher levels of expression than those in the WT under identical conditions, while 317 genes showed lower levels of expression (Table 1). Functional category analysis of the up-regulated-gene set revealed a statistically significant overrepresentation of genes annotated to be involved in protein synthesis (Table S1 in the supplemental material). Fifty genes (P = 1.24e37) were predicted to be involved in ribosome biosynthesis; 37 of these 50 genes encode ribosomal proteins (P = 1.53e30), and 7 are predicted rRNA genes (P = 7.00e-06). This gene set was also enriched with energy-related genes, including 45 genes with mitochondrial functions (P = 1.41e19), 8 genes predicted to be involved in aerobic respiration (P = 1e06), and 12 electron transport genes (P = 6e8). The set was not enriched with genes predicted to be involved in amino acid biosynthesis, although five genes involved in amino acid transport overrepresented this functional category (P = 0.0004) (Table S1 in the supplemental material).
In contrast to the set of genes showing increased expression in the cpc-1 mutant, the set of down-regulated genes revealed no significant enrichment with genes of any specific functional category upon a similar analysis of gene functions (Table S1 in the supplemental material). Only 12 genes identified in the down-regulated-gene set were related to amino acid metabolism, and among these, only 3 were predicted to be involved in amino acid biosynthesis: a 2-methylcitrate dehydratase gene (NCU00680.2) and a gene for a homoaconitase precursor (NCU08898.2), both involved in lysine biosynthesis, and a methionine synthase gene (NCU06512.2).
Comparison of transcriptional profiles of the WT and the cpc-1 mutant exposed to 3-AT. When the cpc-1 mutant was exposed to 3-AT for 2 h, it showed significant differences in the levels of expression of a large number of genes in comparison to the cpc-1 mutant grown in minimal medium; 669 genes showed statistically significantly increased expression levels, while 510 genes showed decreased expression levels (Table 1). Changes in the levels of expression of a large number of genes in the cpc-1 mutant may reflect increased stress associated with amino acid limitation. A similar transcriptional response occurs in the S. cerevisiae gcn4 mutant when it is exposed to 3-AT (45). To identify genes whose responses to amino acid starvation are dependent upon CPC1, we compared expression profiles of the WT versus the cpc-1 mutant in response to 3-AT; 255 genes showed statistically significant increases in expression levels, while 346 genes showed decreases in expression levels (Table 1). A Venn diagram showing the overlap of data sets of up-regulated genes from the WT and the cpc-1 mutant grown in minimal medium and minimal medium plus 3-AT is presented in Fig. 1A. A total of 443 genes were identified as potential CPC1 targets. This data set included 121 genes in set B, whose expression levels in the WT but not in the cpc-1 mutant were increased by treatment with 3-AT; 25 genes in set A, whose expression levels in both the WT and the cpc-1 mutant, but to a greater extent in the WT, were increased by treatment with 3-AT; 109 genes in set F, which required functional cpc-1 for appropriate expression levels in the WT; and 101 genes in set D, whose expression levels in the WT only increased upon exposure to 3-AT (Fig. S2A in the supplemental material). In addition, the 87 genes within set C were included in the CPC1 target gene set because the expression levels of 37 of these genes increased to a greater degree in the WT than in the cpc-1 mutant upon exposure to 3-AT. Genes within set E (557) were induced only in the cpc-1 mutant upon exposure to 3-AT and were therefore considered not to be CPC1 targets.
![]() View larger version (24K): [in a new window] |
FIG. 1. Venn diagram of genes with differences in expression levels between the WT and the cpc-1 mutant. (A) Venn diagram showing overlap among genes that exhibit statistically significantly increased or altered expression levels in the WT (FGSC 2489) versus the cpc-1 mutant (FGSC 4264) in minimal medium after exposure to 3-AT. The 443 genes comprising set B, set D, set A, set F, and set C are defined as the CPC1 target gene set (regulon) (see the text for details). A functional category analysis of these 443 genes is given in Table S1 in the supplemental material. (B) Venn diagram showing overlap among genes whose expression levels in the WT (FGSC 2489) and the cpc-1 (FGSC 4264) mutant were reduced in minimal medium upon exposure to 3-AT. A functional category analysis of genes that showed reduced expression levels is given in Table S1 in the supplemental material.
|
![]() View larger version (16K): [in a new window] |
FIG. 2. Computational identification of cis-element enrichment in predicted promoter regions of genes within the CPC1 regulon. The 500-bp upstream regions of the predicted translational start sites of 67 amino acid metabolism genes from the CPC1 target gene set were retrieved and evaluated by MDscan (38), BioProspector (42), and MEME (2) for motif prediction. By using the initial CPC1 matrix, the promoter regions of the 67 amino acid genes were evaluated by the PATSER program (23). To identify additional CPC1 target genes in the N. crassa target gene set and the genome, predicted promoter regions were scanned using the CPC1 matrix. The logo using the CPC1 matrix was built using WebLogo (http://weblogo.berkeley.edu/logo.cgi).
|
|
View this table: [in a new window] |
TABLE 2. Analysis of cis-element enrichment within CPC1 targets
|
Description of genes within functional categories overrepresented in the 443-gene set of putative CPC1 targets. Our profiling data indicated that, similar to Gcn4p, CPC1 is a major transcriptional regulator in N. crassa. The 443 CPC1 target genes were evaluated to identify functional categories by using the MIPS FunCat system (52) (Fig. 3 and Table S1 in the supplemental material). Sixty-seven of the predicted 195 amino acid metabolism genes in FunCat were present in the 443-gene CPC1 target set, and these 67 genes represented the largest FunCat category (P = 3.71e41); 50 genes encoding amino acid biosynthetic enzymes were identified (Table S2 in the supplemental material). Increased or WT expression levels of genes corresponding to 19 of the 20 predicted amino acid biosynthetic pathways were dependent upon functional cpc-1 in response to amino acid starvation; only genes in the alanine biosynthetic pathway were absent (Table S2 in the supplemental material). The CPC1 target gene set was also enriched with other functional categories, in particular, cytosolic aa-tRNA synthetase genes (P = 7e13); nitrogen and sulfur metabolism genes (P = 7e8); nucleotide metabolism genes (P = 7e6); genes involved in the metabolism of vitamins, cofactors, and prosthetic groups (P = 4e5); alcohol fermentation-related genes (P = 2e5); protein degradation genes (P = 8e6); and oxygen response and free-radical detoxification genes (P = 5.80e06) (Fig. 3 and Table S2 in the supplemental material). Slightly more than 50% (65 of 122) of the genes listed in Table S2 in the supplemental material have N. crassa gene designations; the remaining genes have not been characterized.
![]() View larger version (43K): [in a new window] |
FIG. 3. Functional category analysis of Gcn4p, CaGcn4p, and CPC1 targets. Target gene sets derived from S. cerevisiae Gcn4p transcriptional profiling and ChIP data (22, 34, 36, 45) were combined to generate a set of 747 Gcn4p target genes. A statistical analysis of C. albicans Gcn4p transcriptional profiling data (62) retrieved a set of 483 CaGcn4p target genes. The S. cerevisiae Gcn4p and C. albicans Gcn4p data sets were subjected to functional category analysis using FunCat (52). The functional category analyses of the S. cerevisiae Gcn4p and C. albicans Gcn4p data sets were then compared to the functional category analysis of the set of 443 CPC1 target genes.
|
subunit (NCU01468.2) and the eIF2B
subunit (NCU02414.2). The promoters of the two latter genes contain a perfect match to the CPC1 cis-element consensus. In S. cerevisiae, GCN2 (YDR283C) and the eIF2B
and
subunit genes (YGR083c and YDR211w) are not induced by treatment with 3-AT (45), nor do they contain a Gcn4p consensus cis element in their promoter regions.
In cross pathway control, deacylated tRNA is the signal for amino acid starvation (26, 28, 53). Analogous to the S. cerevisiae regulatory system, N. crassa CPC3, a protein kinase (S. cerevisiae GCN2 ortholog), is believed to be activated directly by deacylated tRNA to phosphorylate the
subunit of eIF2, reducing translation at uORFs in the cpc-1 mRNA and thus increasing the translation of CPC1 (55). Sixteen of 21 predicted aa-tRNA synthetase genes (corresponding to 20 aa-tRNA synthetases, as phenylalanyl-tRNA synthetase is composed of two subunits) were identified in the CPC1 target gene set (Fig. S2B in the supplemental material). Among the 21 predicted aa-tRNA synthetase genes, 19 have a perfect cis-element match to the CPC1 consensus, including the phenylalanyl-tRNA synthetase ß subunit gene (Table 3). The one aa-tRNA synthetase gene in this set that lacks a perfect cis-element, NCU07755.2 (encoding tyrosyl-tRNA synthetase), has a related element TGACTCT; this variant element can bind weakly to CPC1 in vitro (14). In contrast to the cytosolic tRNA synthetase genes, none of the nine genes annotated as mitochondrial tRNA synthetases were identified as CPC1 targets. In addition, five genes annotated as tRNA synthetase related, NCU00466.2 (related to NCU08894.2, encoding glutamyl-tRNA synthetase), NCU00920.2 (related to NCU03575.2, encoding isoleucyl-tRNA synthetase), NCU00931.2 (related to NCU04020.2, encoding lysine-tRNA ligase), NCU07082.2 (related to NCU00915.2, encoding aspartyl-tRNA synthetase), and NCU09892.2 (hypothetical; related to NCU04449.2, encoding prolyl-tRNA synthetase), were not identified as CPC1 targets, and all lack a CPC1 consensus cis-element.
|
View this table: [in a new window] |
TABLE 3. cis-element analysis of predicted promoter regions of aa-tRNA synthetase genes in N. crassa, C. albicans, and S. cerevisiae
|
70 in the genome), was identified in the CPC1 data set.
Within the 443-gene CPC1 data set, 169 genes (
40%) encode hypothetical or conserved hypothetical proteins (Table S1 in the supplemental material). Of these 169 genes, 27 (16%) have a perfect CPC1 cis-element match (TGACTCA) in their 5' regions, a value similar to that of genes in the 443-gene target set whose functions have been annotated (20%). Of these 169 genes of unknown function, 27 have orthologs in S. cerevisiae. However, only 4 among these 27 have been identified as part of the Gcn4 regulon (45) (YIL164C, a predicted nitrilase gene, orthologous to NCU05387.2; YCR023C, orthologous to NCU03107.2; YIR035C, orthologous to NCU02018.2; and YBR147W, orthologous to NCU09195.2).
Functional categorization of genes dependent upon functional CPC1 for reduced expression levels. Consistent with our findings that CPC1 is a major regulator in N. crassa, a large number of genes in the WT showed reduced expression levels upon exposure to 3-AT (Table 1), many (119) of which required functional CPC1 (Fig. 1B; Table S1 in the supplemental material). However, only 2 of the 119 genes contain a CPC1 consensus cis element in their predicted promoter regions. Among these 119 genes, 53 specify proteins involved in ribosomal biosynthesis, with 37 of these specifying ribosomal proteins (Fig. S2C in the supplemental material). Analysis of the 5' regions of the 37 ribosomal protein genes led to the identification of a conserved a cis-element, AGCCCTAA, which is identical to that previously identified as a potential regulatory site (21, 31).
Characterization of CPC1-independent genes. The expression levels of a large number of genes (557) increased only in the cpc-1 mutant when it was exposed to 3-AT; changes in the levels of expression of these genes were therefore CPC1 independent (Fig. 1A, set E). In contrast to the CPC1 target gene set, the set of CPC1-independent genes was not significantly enriched with any functional category. The slight enrichment included a group of genes encoding nucleic acid binding proteins (P = 5e6), including predicted transcription factors and chromosomal remodeling proteins, plus eight predicted protein kinases (Table S1 in the supplemental material).
Comparative analysis of N. crassa CPC1, S. cerevisiae Gcn4, and C. albicans Gcn4 pathways. In S. cerevisiae, Gcn4p influences the transcriptional response of many genes (635) to amino acid starvation (34, 45). Based on ChIP data, 207 genes are directly bound by Gcn4p (binding P value, <0.001) (22, 36). The Gcn4p transcriptional profiling and ChIP data sets have an overlap of 95 genes. We combined the S. cerevisiae transcriptional profiling and ChIP data to define a set of 747 S. cerevisiae genes as the Gcn4 regulon. Functional categorization of these 747 genes (Fig. 3 and Table S3 in the supplemental material) showed enrichment with the same functional categories as the CPC1 regulon, including genes involved in amino acid biosynthesis, nitrogen and sulfur metabolism, and the metabolism of vitamins. However, the Gcn4 and CPC1 regulons also showed differences, most notably in aa-tRNA synthetase genes, of which only the CPC1 data set was enriched, and genes involved in complex cofactor (NAD/NADP) binding, of which only the Gcn4 data set was enriched. Since CPC1 and Gcn4p show similarity in their DNA binding domains and the cis-element sequences bound by CPC1 and Gcn4p are identical (TGACTCA), these data suggest that cis-elements in the promoter regions of some genes within the CPC1 and Gcn4 regulons have diverged.
To further evaluate the evolution of the Gcn4 and CPC1 regulons, we analyzed an available expression profile data set for C. albicans WT strains and gcn4 mutants exposed or not to 40 mM 3-AT (62). We identified genes in the C. albicans data set whose expression levels increased or decreased to statistically significant degrees to enable comparison to our analyses of the N. crassa and S. cerevisiae data sets (see Materials and Methods). A total of 483 genes were defined as the CaGcn4 regulon, including 399 genes whose increased expression levels and 84 genes whose reduced expression levels upon exposure to 3-AT were dependent upon functional CaGCN4. Functional category analyses showed that the N. crassa, S. cerevisiae, and C. albicans data sets showed enrichment with genes within identical functional categories, including amino acid, sulfur, and vitamin metabolism (Fig. 3 and Table 3), although in some cases the data set of one or two species was specifically enriched with a category. For example, the S. cerevisiae and C. albicans data sets, but not the N. crassa data set, were significantly enriched with the complex cofactor (NAD/NADP) functional category. The C. albicans and N. crassa data sets, but not the S. cerevisiae data set, were enriched with genes within the protein degradation and oxygen detoxification functional categories, and the N. crassa and S. cerevisiae data sets, but not the C. albicans data set, were enriched with genes within the fermentation functional category.
In N. crassa, many cytosolic aa-tRNA synthetase genes were members of the CPC1 regulon and contain a perfect CPC1 consensus site. In S. cerevisiae, only three aa-tRNA synthetase genes have a perfect match to the Gcn4p consensus TGAC/GTCA (Table 3). Gcn4p ChIP data support the cis-element function in these three aa-tRNA synthetase genes (YHR020W, encoding a product related to prolyl-tRNA synthetase; YHR019C, encoding asparaginyl-tRNA synthetase; and YDR341C, encoding arginyl-tRNA synthetase) (22, 36). Of the predicted 20 cytosolic aa-tRNA synthetase genes in C. albicans, 10 have at least a 70% match to the Gcn4p cis-element in the 500-bp region upstream of the coding sequence, including 8 with a full match. Although the CaGcn4 data set was not significantly enriched with aa-tRNA synthetase genes, six of eight of the genes with a full cis-element match were induced from 1.2- to 2.4-fold upon exposure to 3-AT (62); their increase in expression levels was dependent upon functional CaGCN4 (Table 3). These data suggest a loss of regulation of aa-tRNA synthetase genes by Gcn4 orthologs in the evolution of the hemiascomycete clade.
Computational comparison of levels of ortholog and regulog identity among the CPC1, Gcn4, and CaGcn4 data sets. Regulogs consist of orthologous genes that have maintained their regulatory network through evolution (73). Some regulogs reflect direct interaction via an orthologous regulator (direct targets). We wished to compare the identification of Gcn4p/CaGcn4p/CPC1 targets using purely computational methods to the identification of predicted Gcn4p/CaGcn4p/CPC1 targets by transcriptional profiling methods. For the computational approach, we used the 351 genes in the N. crassa genome recovered using the CPC1 cis-element matrix. Of these 351 genes, 103 corresponded to orthologs identified in S. cerevisiae, although only 21 of the orthologs contain a Gcn4p cis-element (Fig. 4A). Similarly, for the 351 N. crassa genes, 116 orthologous genes in C. albicans were identified, but only 30 have a CaGcn4p cis-element. To compare S. cerevisiae and C. albicans, we used the CPC1 cis-element matrix to scan the promoters of S. cerevisiae genes to identify 278 that contain a perfect match to the Gcn4p cis-element; 123 of these were detected as Gcn4p targets in both ChIP and profiling data (22, 36). Out of these 278 genes, 155 had an ortholog in C. albicans, but only 34 of the C. albicans orthologs have a TGACTCA cis-element in the promoter region (Fig. 4A). When all three of these data sets were compared (S. cerevisiae to N. crassa, S. cerevisiae to C. albicans, and N. crassa to C. albicans), the promoters of only 17 orthologous genes were found to contain the Gcn4p/CaGcn4p/CPC1 cis-element (Fig. 4A; Table 4). Twelve of these genes are involved in amino acid biosynthesis, three encode aa-tRNA synthetases, one gene is required for adenine biosynthesis, and the last gene, SNZ, is involved in vitamin B6 biosynthesis.
![]() View larger version (21K): [in a new window] |
FIG. 4. Identification of common regulogs in the S. cerevisiae (sc), C. albicans (ca), and N. crassa (nc) genomes by computational analysis using cis-element scans compared to identification of common regulogs by transcriptional profiling. (A) In N. crassa, predicted promoter regions of 351 genes that contain the CPC1 cis-element were identified. Of these 351 genes, 103 have orthologs in S. cerevisiae, and of these 103 orthologs, 21 have a Gcn4p cis-element consensus in the promoter region (regulogs). Orthologs of 116 of the 351 N. crassa genes were identified in C. albicans. Of these 116 C. albicans genes, 30 have a CaGcn4p consensus cis-element in the predicted promoter region. In S. cerevisiae, 278 genes that contain a Gcn4p consensus cis-element in their promoter regions were identified. Of these 278 genes, 155 had orthologs identified in C. albicans, and of the 155 orthologs, 34 have a predicted CaGcn4p cis-element in predicted promoter regions. The overlap of all three data sets identified 17 orthologous gene pairs that maintained the conserved Gcn4p/CaGcn4p/CPC1 cis-element. (B) By transcriptional profiling, 443 genes were identified as CPC1 targets. The overlap between the 747-gene S. cerevisiae and N. crassa CPC1 data sets revealed 73 orthologous gene pairs (regulogs). The overlap between the N. crassa and C. albicans data sets identified 64 orthologous gene pairs, and the overlap between the S. cerevisiae and C. albicans data sets revealed 94 orthologous gene pairs. The overlap of all three data sets identified a conserved regulon of 32 orthologous genes present in all three species as determined by transcriptional profiling and ChIP.
|
|
View this table: [in a new window] |
TABLE 4. Direct regulogs among N. crassa, C. albicans, and S. cerevisiae identified by cis element computational prediction
|
Of the 443 genes in the N. crassa CPC1 regulon, 201 had orthologs in S. cerevisiae, a higher percentage than that obtained in a total-genome comparison (26%). However, only 73 of these 201 genes were included in the Gcn4 regulon (Fig. 4B); 34 of these 73 genes contain the Gcn4p consensus cis-element. The binding of Gcn4p to the promoter regions of 25 of these 34 genes has been observed previously (22, 36). However, only 18 of the 73 orthologous gene pairs in S. cerevisiae and N. crassa have a full match to the Gcn4p/CPC1 cis-element consensus sequence. In C. albicans, although 211 genes were orthologs of genes in the 443-gene CPC1 target set, only 64 of these were within the CaGcn4 regulon. Only 14 of the 64 orthologous gene pairs in C. albicans and N. crassa have a predicted CaGcn4p/CPC1 cis-element.
We also compared the S. cerevisiae Gcn4 data set to the C. albicans Gcn4 data set. For the 747 genes in the Gcn4 regulon, 403 orthologs in C. albicans were identified. However, only 94 of the 403 genes in C. albicans were identified as part of the CaGcn4 regulon, and of these 94 genes, only 20 contain the CaGcn4p cis-element. A comparison among all three data sets showed that 32 regulogs are conserved among all three fungi (Fig. 4B), which includes both direct and indirect targets. Of these 32 orthologous genes, 22 encode amino acid biosynthetic enzymes, one encodes a tRNA synthetase, two encode enzymes involved in adenine biosynthesis, two encode enzymes involved in oxygen/radical detoxification, one is predicted to be involved in vitamin B6 biosynthesis, three are involved in carbon metabolism, and one produces an enzyme of unknown function which localizes to purified mitochondria (Table 5). Of these 32 genes, 10 contain a Gcn4p/CPC1/CaGcn4p consensus sequence and 18 have been confirmed by ChIP experiments to be bound by Gen4p in S. cerevisiae (22, 36).
|
View this table: [in a new window] |
TABLE 5. Common regulogs among N. crassa, C. albicans, and S. cerevisiae identified by transcriptional profiling data
|
|
|
|---|
40% encode hypothetical or conserved hypothetical proteins. Genes that encode proteins involved in a common cellular pathway are often coregulated, and the patterns of expression of clusters of genes that perform related cellular functions are often correlated (15, 29). Our profiling data provide a guide for future phenotypic screening of mutants with knockouts of these genes (13). Conservation and divergence of the Gcn4/CaGcn4/CPC1 regulons: identification of a core regulon. Comparative microarray studies have usually analyzed evolutionarily distant species for which data were obtained under diverse experimental conditions (5). Our examination of expression patterns in these three diverse fungal species under a common set of experimental conditions (exposure to 3-AT in liquid medium) and in strains with a mutation in an orthologous transcriptional regulator (GCN4/CaGCN4/cpc-1) revealed a conserved regulon of direct and indirect targets. Thirty-two orthologous gene pairs maintained their regulation patterns in these three fungi under these conditions, and 10 of these genes are predicted to be directly regulated by Gcn4p/CaGcn4p/CPC1 based on computational analyses. These 32 orthologous genes form the core of the amino acid starvation response; 22 encode amino acid biosynthetic enzymes, and 1 encodes an aa-tRNA synthetase.
Although more than 15% of the Gcn4/CaGcn4/CPC1 targets were within the amino acid synthesis functional category, the direct regulation of most of these genes by Gcn4p/CaGcn4p/CPC1, as evaluated by the presence of cis-elements, was not conserved. These data indicate that, in some cases, orthologous genes in S. cerevisiae, C. albicans, and N. crassa maintain their regulation by either Gcn4, CaGcn4 or CPC1 but that regulation may change from direct to indirect or vice versa. For example, LYS2 was identified as a Gcn4p target in S. cerevisiae both by transcriptional profiling and by ChIP (22, 36, 45). However, its ortholog in N. crassa, NCU03010.2, was identified as a target by transcriptional profiling but lacks the CPC1 cis-element, and the score for matching to the consensus was less than 50%. Similarly, NCU07982.2 has a full match to the CPC1 consensus and was identified as a target by transcriptional profiling, but its ortholog in S. cerevisiae, ILV2, is an indirect target of Gcn4p (detected by transcriptional profiling but not by ChIP).
Regulatory network evolution by cis-element gain or loss. The evolution of gene expression patterns can affect phenotypic plasticity and is thought to play an important role in the adaptation of species to a particular environmental niche (5, 59). In C. albicans, responses to amino acid availability are linked with pseudohyphal development, biofilm formation, and phagocytosis by human neutrophils (20, 51, 66). Similarly, the GCN4 ortholog in Aspergillus fumigatus, cpcA, is required for virulence in human pulmonary infections (7, 35). These observations suggest that the CaGcn4/CpcA regulons in these two species have evolved additional functions. It is not clear how multiple promoters of coordinately expressed genes coordinately evolve their cis-elements to become members of a regulon (39, 70). For example, the promoters of ribosomal protein genes in N. crassa contain a cis-element that is an exact match to a cis-element identified in C. albicans (21); a similar cis-element is not found in ribosomal protein genes in S. cerevisiae. However, as shown in this study, the coordinate repression of ribosomal protein gene expression upon amino acid starvation is a conserved process in N. crassa, S. cerevisiae, and C. albicans and is dependent upon functional GCN4/CaGCN4/cpc-1.
In the hemiascomycete clade, which includes S. cerevisiae and C. albicans, the loss of the coordinate regulation of cytoplasmic ribosomal protein genes or rRNA processing genes with mitochondrial ribosomal protein genes is correlated with a whole-genome duplication event and the massive loss of a conserved cis-regulatory element (30). Promoter evolution can also be more gradual, as observed for the tRNA synthetase genes described in this study. In N. crassa, 19 of the 20 cytosolic tRNA synthetase genes have an exact CPC1 cis-element located in their promoter regions; 16 of these genes were identified in the CPC1 regulon. In the filamentous ascomycete species Magnaporthe grisea and Aspergillus nidulans, the predicted aa-tRNA synthetase genes also contain the predicted CPC1 cis-element (21). In Candida tropicalis, similar to C. albicans, 8 of 16 aa-tRNA synthetase orthologs have the TGACTCA consensus. Profiling data and ChIP data show that most of the aa-tRNA synthetase genes in S. cerevisiae are not regulated by Gcn4p, nor are they part of the Gcn4 regulon. This loss of regulation of aa-tRNA synthetase genes in the hemiascomycete clade is not associated with any known genome rearrangements but may be associated with adaptation and speciation.
Regulatory network evolution. Species genomes evolve over time, especially via gene or genome duplication and gene or segment loss, thus making it difficult to maintain the regulation of individual components of an entire regulon. However, networks show a buffering capacity whereby the identities of networks can be conserved through evolution, although mechanistic aspects of individual gene regulation may diverge. For example, although the patterns of temporal regulation of eve stripe expression are almost identical among species of Drosophila, this similarity in overall regulation patterns is not reflected by patterns of sequence conservation in regulatory regions (39, 40). Similarly, a comparison of expression data from six species (S. cerevisiae, Escherichia coli, Caenorhabditis elegans, Arabidopsis thaliana, humans, and Drosophila melanogaster) showed that while functionally related genes are often coexpressed and coregulated, the modular components of each transcriptional program vary significantly among organisms (5). Although our analysis showed that the number of conserved Gcn4/CaGcn4/CPC1 regulogs was only a fraction of the total number of target genes identified by transcriptional profiling, the functional categorization analyses of Gcn4/CaGcn4/CPC1 targets were remarkably similar, with the conservation of overall pathways of response to amino acid starvation.
This work was funded by a National Institutes of Health multi-institutional program project grant (GM068087) to N.L.G. (Core III: Transcriptional Profiling) and M.S.S. (Core II: Functional Annotation).
Published ahead of print on 20 April 2007. ![]()
Supplemental material for this article may be found at http://ec.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»