| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Previous Article | Next Article ![]()
Eukaryotic Cell, September 2006, p. 1517-1531, Vol. 5, No. 9
1535-9778/06/$08.00+0 doi:10.1128/EC.00106-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Joel B. Dacks,2
Samantha A. Campbell,3,
Jeffrey L. Blanchard,4
Peter G. Foster,1
Rima McLeod,5 and
Craig W. Roberts3*
Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom,1 Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary T2N 1N4, Canada,2 Department of Immunology, Strathclyde Institute for Biomedical Sciences, University of Strathclyde, 27 Taylor Street, Scotland G4 ONR, United Kingdom,3 Department of Microbiology, University of Massachusetts, Amherst, Massachusetts 01003,4 Departments of Ophthalmology and Visual Sciences, Pediatrics (Infectious Diseases), and Pathology, Committees on Genetics, Molecular Medicine, and Immunology, University of Chicago, Chicago, Illinois 606375
Received 13 April 2006/ Accepted 27 June 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The pathway consists of seven steps, with several isoenzymes and multiple naming conventions. For simplicity, we adopted the Escherichia coli (Aro-) naming system where appropriate but have used other names where equivalent orthologous genes are absent from the E. coli genome. Step one is performed by 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase, of which there are two nonhomologous types or isoenzymes named DAHP synthase class I (DAHP I) and class II (DAHP II). Step 2 is catalyzed by 3-dehydroquinate synthase, or AroB. Step 3 is also catalyzed by two isoenzymes, 3-dehydroquinate dehydratase types I and II (named here AroD and DHQase II, respectively). Steps 4 to 7 are catalyzed by shikimate 5-dehydrogenase, shikimate kinase, 5-enolpyruvylshikimate-3-phosphate synthase, and chorismate synthase, or AroE, AroL/K, AroA, and AroC, respectively.
The molecular organization and structure of the shikimate pathway enzymes varies considerably between taxonomic groups (8), with a differential distribution of shikimate fusion genes throughout both prokaryotes and eukaryotes. In most prokaryotes examined to date, the seven shikimate pathway enzymes are encoded as separate polypeptides. The plant enzymes are also encoded in separate polypeptides, with the exception of AroD and AroE, which are fused and encode a bifunctional polypeptide (31). Plant shikimate pathway enzymes are nucleus encoded, but many possess N-terminal plastid targeting peptides. Thus, they are assumed to function within the plastid, although this has only been experimentally demonstrated for a selection of shikimate pathway enzymes (32, 42). In contrast, all ascomycetes sampled and T. gondii possess monofunctional DAHP II and AroC genes and a pentafunctional supergene, the arom pathway (6, 9) which includes conserved domains homologous to the genes (in fusion order) AroB, AroA, AroL/K, AroD, and AroE (details of the pathway and naming convention used are also summarized in Fig. 8B).
|
Herein we report the evolutionary relationships of the shikimate pathway genes to address their origin and evolution in eukaryotes. These analyses include sampling from recently complete prokaryotic and eukaryotic genomes, notably expanding the sampling to include seven additional divergent eukaryotic taxa: the oomycete Phytophthora ramorum, the green alga Chlamydomonas reinhardtii, the red alga Cyanidoschyzon merolae, the basidiomycete fungi Ustilago maydis and Cryptococcus neoformans, the zygomycete Rhizopus oryzae, the diatom Thalassiosira pseudonana, and the ciliate Tetrahymena thermophila. We confirm the presence of a fused arom supergene in T. thermophila by cloning and sequencing from cDNA. The majority of the phylogenetic analysis reveals both a unified fungal-chromalveolate clade and a pattern of multiple prokaryote-to-eukaryote gene transfers in both the red and green Plantae lineages by both HGT and endosymbiotic gene transfer (EGT).
| MATERIALS AND METHODS |
|---|
|
|
|---|
Sequencing of shikimate pathway genes from T. thermophila. To determine if the five putative T. thermophila components of the arom pathway were transcribed as a true pentafusion gene rather than a gene cluster, we amplified the appropriate regions from cDNA. T. thermophila (a kind gift from Aaron Turkewitz, University of Chicago) were grown at 30°C with gentle rotation in SPP media (1% Proteose peptone, 0.2% dextrose, 0.1% yeast extract, 0.009% ferric EDTA). RNA was extracted from T. thermophila cells using Trizol reagent (Invitrogen) and used to generate cDNA using Superscript III (Invitrogen) according to the manufacturer's instructions. A series of overlapping clones were amplified by PCR, and the oligonucleotide primers are listed in Table S1 in the supplemental material. Amplicons were cloned into the pDRIVE vector by using the QIAGEN PCR cloning kit (QIAGEN) according to manufacturer's instructions. Clones were sequenced by MWG Biotech, Milton Keynes, United Kingdom, and assembled using Sequencher (Gene Codes, Ann Arbor, MI). This revealed a single open reading frame of 4,734 bp encoding a polypeptide of 1,578 amino acids.
Alignment and phylogenetic analyses of shikimate pathway genes. Nine alignments were made for all shikimate pathway enzyme domains using CLUSTAL-X (52) and then manually corrected using SE-AL (http://evolve.zoo.ox.ac.uk/software.html?id=seal). These alignments include homologues from a wide diversity of prokaryote taxa with consistent taxon sampling between alignments where possible. Nonhomologous insertions and sequence characters that could not be aligned with confidence were removed from the alignments. The AroD enzyme domain was, in our opinion, too divergent to permit confident alignment and phylogenetic analyses across a broad sampling of taxa. We also excluded the hypothetical Plasmodium 5-enolpyruvylshikimate-3-phosphate synthase-shikimate kinase gene (aroA-K/L) (27) because we could not align this sequence for phylogenetic analyses with confidence. Since so many genes were included in the proceeding analyses, a table with GenBank accession numbers is not provided. However, all masked alignments, which contain GenBank accession numbers, are available from the corresponding author upon request.
Prior to tree reconstruction, each alignment was analyzed using MODELGENERATOR (http://bioinf.nuim.ie/software/modelgenerator) to find the best model for each alignment (see Table S2 in the supplemental material). MRBAYES 3.1.2 (41) was run with two separate MCMCMC analyses for 1,000,000 generations at a sampling frequency of 100 generations. Each MCMCMC run had four MCMC chains (three heated and one cold; heat parameter, 0.2). Comparisons of likelihood score and model parameter values and topologies within and between the two independent runs confirmed that the tree log-likelihood scores and parameters had reached a plateau and converged by 110,000 generations at the latest. Consequently, a maximum of 1,100 samples were excluded as a burnin.
Maximum likelihood (ML) distance bootstrap values (from 1,000 replicates) were obtained using Tree-Puzzle 5.1 (43) for parameter estimation (substitution model, eight multivariant plus invariant sites or only eight multivariant dependent on MODELGENERATOR analyses; see Table S2 in the supplemental material) and in coordination with Puzzleboot (21) to obtain distance matrices. Programs from the PHYLIP package (13) were used to create pseudoreplicate data sets (SEQBOOT), calculate distance trees (NEIGHBOR), and assemble a bootstrap consensus tree (CONSENSE). In addition, 100 fast ML (PHYML) (17) bootstrap replicates were run for each alignment with the model as before. In the case of the AroDE fusion gene, the analyses demonstrated contrasting results with moderate support. Consequently, we completed 1,000 parsimony bootstrap analyses using PROTPARS (13) with 3x jumbling to offer an additional investigation of topology support.
The phylogeny results of the AroDE fusion gene analyses and the phylogeny of AroE on its own produced ambiguous results, with the branching order between the plants, red algae, and Rhodopirellula baltica. Consequently, we took the MRBAYES tree and flipped the red algae-R. baltica branching order and, enforcing Plantae monophyly, recalculated the branch lengths using ML in P4 (14), using the model specified using the MODELGENERATOR results (see Table S2 in the supplemental material). This process was repeated for both the large taxa AroE data sets (see Fig. 4A) and the AroDE fusion gene data set (see Fig. 4B). The alternative topologies were evaluated using Shimodaira-Hasegawa (SH) and approximately unbiased (AU) (44) using the program CONSEL (45) invoked through P4 (14).
|
Ribosomal operon structural comparisons. Comparisons of ribosomal operon structure had previously indicated a close evolutionary relationship between Chlamydia, cyanobacteria, and the plastid progenitor, consistent with past analyses of the AroDE fusion gene (5). Given that our analysis does not support the previous findings for this particular data set (5), we extended the ribosomal operon structure analysis previously used to support the Chlamydia-plastid sisterhood to include other prokaryotes, such as R. baltica. This was done to test if a different prokaryote may also be closely related to the eubacterial plastid progenitor under the same criteria used by Brinkman et al. (5). To compare ribosomal operon structures, initial tables were made using the genome annotation reported to GenBank. If a gene was missing from the operon and not present in the remaining genome annotation, we used BLASTp to determine if the protein was correctly predicted but has not been given a standard annotation. If we could not identify the protein with a BLASTp search, we then used a tBLASTn search against the genome nucleotide sequence to see if the protein was incorrectly predicted or classified as a pseudogene. If we could not find the gene with either of these methods, we reported the gene as missing.
Nucleotide sequence accession number. The sequences of arom and DAHP II have been deposited in GenBank under the accession numbers AY601281 [GenBank] and AY787836 [GenBank] , respectively.
| RESULTS |
|---|
|
|
|---|
|
Genes of the alternative shikimate pathway enzymes DAHP I and DHQase II were detected in a number of different eukaryote lineages. DAHP I genes were found in the honeybee Apis melifera, all three fungus groups surveyed, and P. ramorum. The honeybee DAHP I sequence was very similar to the ß-proteobacteria sequences (Fig. 1a), but no other shikimate pathway genes could be detected from metazoan genome projects; consequently, we suspect that this gene maybe a genome project contamination artifact. DHQase II was detected in the ascomycetes and the basidiomycetes only.
|
|
|
|
Eukaryote-prokaryote HGT. The complicated shikimate evolutionary history is due in part to HGT between eukaryotic and prokaryotic groups. The DAHP II phylogeny (Fig. 1B) revealed two evolutionary complications: (i) the acidobacterium Solibacter usitatus grouped within the fungus-chromalveolate clade with at least one robust node (0.92/90/63), suggesting horizontal gene transfer from a eukaryote into the S. usitatus lineage; (ii) C. merolae possesses two DAHP II homologues, one that groups with the fungus-Chromalveolate clade with 0.96/61/52 support, while the second groups within the proteobacteria clade with 0.99/95/75 and 1/53/84 support (Fig. 1B). These data are consistent with the red alga lineage possessing two paralogues of DAHP II, one derived from a HGT event arising from a proteobacterial genome into the C. merolae lineage, while the second appears to be an anciently derived character of many eukaryote groups. Additionally, the DHQase II gene is found only in the ascomycetes, basidiomycetes, and a diverse selection of eubacteria. Its apparent absence in all other eukaryotic lineages suggests an HGT from a eubacterial lineage into the fungi prior to the division of the basidiomycete and ascomycete lineages. The phylogeny of DHQase II produced no resolution of the fungal group with any given prokaryote group, so it is not possible to pinpoint the source of the putative HGT (Fig. 3). However, this analysis demonstrates that the evolutionary history of the shikimate pathway in the fungi as well as the Plantae appears to have been influenced by prokaryote-to-eukaryote gene transfer events.
|
/ß group, and (iii) a proteobacterium that groups within or near the
-proteobacteria. In these five examples, the genes from the diatom T. pseudonana group with the Plantae sequences, suggestive of a secondary EGT from the engulfed Plantae cell which became the diatom's plastid endosymbiont (Fig. 1B, 2, 5, 6, and 7).
First, the Plantae AroK and AroC genes grouped with a clade of cyanobacteria with 1/87/50 and 0.96/83/74 support, respectively (Fig. 5 and 7), consistent with a gene origin from the progenitor of the plastid. Second, the AroB and AroA phylogenies (Fig. 2 and 6) demonstrate that the Plantae taxa group with the ß/
-proteobacteria with 1/69/81 and 1/100/100 support, respectively. In both of these cases, the
-proteobacteria sampled branch elsewhere on the phylogeny, suggesting that the Plantae AroB and AroA genes were acquired from separate gene transfers from a proteobacterial lineage and not from the progenitor genome of the mitochondria. The AroA phylogeny (Fig. 6) specifically demonstrates strong support for Plantae monophyly in the form of a shared derived HGT.
The DAHP II (Fig. 1B) phylogeny again shows the Viridaeplantae grouping separately from the other eukaryotes but within a clade of proteobacteria supported by 0.99/95/75 bootstrap support. The red alga C. merolae branches elsewhere in the proteobacterial clade with 1/53/84 support (Fig. 1B). These phylogenetic relationships suggest that the Plantae DAHP II genes were acquired from two separate horizontal gene transfers from proteobacteria. The absence of any other eukaryote orthologues suggests that these transfers were recent HGT events rather than anciently acquired as an endosymbiotic gene transfer from the genome of the mitochondrial progenitor.
AroDE in plants is an HGT and is not indicative of a planctomycete-plastid progenitor sisterhood. The final possible example of HGT in the Plantae shikimate pathway comes from the phylogeny of AroE domain which showed the Plantae forming a weakly supported clade with R. baltica, with the land plants grouping with R. baltica with 1/74/76 support (Fig. 4A). All members of this clade possess an AroDE fusion gene consistent with the monophyly of the Plantae and R. baltica and an HGT between these unrelated taxa. Relationships between the Plantae and the R. baltica branches were weakly resolved within this AroDE clade (Fig. 4A). Interestingly, the AroDE fusion characteristic and phylogeny, along with many other data sets, was initially used to demonstrate a close sisterhood of the Chlamydiales and the cyanobacterial progenitor of the chloroplast (5). Our analyses did not show a sisterhood relationship for the Plantae and the Chlamydiales for the AroDE fusion gene (Fig. 4A); consequently, we investigated its evolutionary history more closely. To pinpoint the branching relationship between the Plantae and R. baltica, we created an alignment of all AroDE fusion genes and included prokaryote AroDE fusion genes that branched elsewhere in the tree but were not separated from the Plantae-R. baltica clade with bootstrap support above 50%. While analyses of these fusion genes continued to resolve a clade of R. baltica plus Plantae sequences with strong support (1/97/91/77), it did not resolve the internal relationships of this clade (Fig. 4B). The Bayesian search and PHYML searches demonstrated R. baltica grouping within the Plantae (0.72 and 43% topology support). However, two other bootstrap analyses (ML distance and parsimony) found the Plantae to be monophyletic with 77% and 62% support, respectively (Fig. 4B). This inconsistency is important, as it makes polarizing the direction of the proposed eukaryote-prokaryote HGT difficult and suggests four possible evolutionary scenarios: (i) a planctomycete-to-Plantae HGT, (ii) a plastid-progenitor-to-Plantae EGT followed by an HGT to the planctomycetes, (iii) the planctomycetes may be closely related to the primary plastid progenitor and represent a surrogate branch for the cyanobacteria, as suggested for Chlamydia (5), or (iv) an HGT from the eubacterial plastid progenitor genome to the planctomycetes consistent with frequent HGT events between prokaryotes (4).
Analyses of the diversity of the shikimate pathway genes in cyanobacteria showed patterns of gene loss and putative HGT events in the cyanobacteria (as illustrated by the resolved separation of cyanobacterial representatives in Fig. 1A, 6, and 7) consistent with instability in the vertical evolution of this pathway in the eubacteria most closely related to the plastid progenitor (25). Consequently, it is possible that other prokaryote genomes represent appropriate surrogates for the plastid progenitor genome, as Brinkman et al. (5) hypothesized for the Chlamydia. To test the third hypothesis, that "the planctomycetes may be closely related to the primary plastid progenitor and represent a surrogate branch for the cyanobacteria," we analyzed the planctomycete R. baltica genome (15) for the genetic characteristics previously used to infer sisterhood of the Chlamydia, cyanobacteria, and the plant plastid progenitor. While the Chlamydiales, chloroplasts, and mitochondria possess an unspliced intron in the 23S rRNA gene (11), R. baltica does not. Analysis of the ribosomal operon structures had showed three further characters unique to Chlamydiales, cyanobacteria, and chloroplasts: (i) loss of S10 protein from the ribosomal superoperon, (ii) S10 found as part of a suboperon with S7/S12 ribosomal proteins, and (iii) loss of S14 protein (5). None of these three synapomorphies are present in the R. baltica genome (15). Although Chlamydiales, cyanobacteria, and R. baltica all possess L30 and S4 deletion/rearrangement, these characteristics are not unique to these three eubacterial groups.
Consequently, our data, along with published phylogenetic analyses (for examples, see references 15 and 22), suggests that the Chlamydiales and cyanobacteria do not form a closely related group with the planctomycetes. The close relatedness of the Plantae and R. baltica AroDE gene is therefore likely to be a product of planctomycete-Plantae or planctomycete-plastid progenitor HGT rather than vertical ancestry through the plastid endosymbiosis coupled with loss in sampled cyanobacteria. It is currently not possible to polarize the direction of transfer with complete confidence. Given that comparative topology tests strongly support the separate branching of the Plantae clade from the other eukaryotes and the placement of the Plantae clade among the eubacteria, we favor a HGT from R. baltica, or a close relative into the ancestral Plantae lineage.
| DISCUSSION |
|---|
|
|
|---|
The shikimate pathway is ancient in eukaryotes. The description of the T. gondii arom structure, only previously demonstrated in ascomycetes, has been interpreted as evidence that arom was present in the last common eukaryotic ancestor (6). However, an alternative explanation is possible involving the acquisition of arom in either the fungal or chromalveolate lineage followed by intraeukaryotic gene transfer. In all four phylogenies of the arom domains, the relationship between the fungi and chromalveolates, within the arom clade, was weakly supported. However, in the three phylogenies with a monophyletic arom clade (Fig. 4A, 5, and 6), the chromalveolates or the fungi were rendered paraphyletic. This, in our opinion, does not constitute support for chromalveolate-fungus HGT because all of these branching relationships were supported by less than 50% bootstrap values. It is, of course, possible that the arom gene was conveyed by a single transfer predating either the diversification of the chromalveolates (7, 12, 18, 56) or the fungi sampled. Particularly, the chromalveolate branch is considered an ancient eukaryotic event that occurred shortly after the divergence of extant eukaryotic supergroups (55), making arom an ancient innovation. However, we prefer the alternative hypothesis that the arom, AroC, and DAHP II genes represent an ancestral form of the eukaryote pathway, present in the last common eukaryote ancestor and inherited vertically in both the fungi and the chromalveolates (47). We suggest that this particular form of the shikimate pathway is ancestral to all eukaryotes, since the last common ancestor of chromalveolates and fungi is thought to represent a close approximation of the last common eukaryotic ancestor (3, 24, 40, 48). We prefer the vertical origin hypothesis because it is more parsimonious than the horizontal hypothesis requiring one less transfer event. However, ultimately, increased genome sampling, specifically among the Excavata, Cercozoa/Rhizaria, and the Amoebozoa supergroups (47), will enable a thorough test of these two alternative hypotheses.
The presence of DAHP I homologue in fungi and the chromalveolates and their monophyly on the trees reported here (Fig. 1A) also suggests that the DAHP I in addition to DAHP II, arom, and AroC could have been present in the last common ancestor of the eukaryotes (Fig. 8A). This hypothesis infers that all of the Plantae shikimate pathway genes that do not group with the fungus-chromalveolate genes represent evolutionary replacements. However, the replacement hypothesis relies on the theory that the Plantae branch groups above the fungus-chromalveolate bifurcation in the eukaryote tree (37, 48, 49). The only possible alternative is if the Plantae branched away from the eukaryotic trunk prior to the last common fungus-chromalveolate ancestor, rendering the bikonts paraphyletic (37, 48, 49). Even so, two of the seven shikimate pathway genes are likely to represent gene replacements or later additions, as they have been derived from the plastid progenitor genome (Fig. 5 and 7), which has to have occurred after the Plantae branched from the ancestral trunk of the eukaryote tree (37, 40, 48) (Fig. 8A).
It follows that the shikimate pathway, along with arom, has been lost in many eukaryotic lineages examined to date and that the seven gene domains of the Plantae shikimate pathway have been serially acquired by EGT and HGT, with the notable exception of DAHP II in the red algae, which possesses two paralogues, one of which is placed with weak bootstrap support as a putatively ancestral eukaryotic paralogue (0.96/61/52 topology support) (Fig. 1B). Since the excavate (46) Euglena is known to possess both a cytosolic and plastid-associated shikimate pathway (36), investigation into the shikimate gene arrangement in this taxon would be valuable in determining the evolution of the pathway.
Selection for gene fusions. Comparative analyses demonstrated six different protein domain fusion arrangements for the five functional domains contained within arom. This includes the arom arrangement (i.e., AroB-AroA-AroK/L-AroD-AroE) and five other fusion arrangements: (i) the AroK/L-B fusion found in a number of prokaryotes (Fig. 2 and 5), (ii) the AroDE fusion found in the Plantae, some eubacteria, and the diatom (Fig. 4A and B), (iii) the AroDEA fusion found in Parachlamydia (Fig. 4A and B and 6), (iv) the AroBD fusion found in Rhizopus (Fig. 2), and (v) the AroA-K/L putative fusion gene found in Plasmodium (6, 27), not analyzed here. The data reported here present some evidence for the separate fusion events of the shikimate pathway genes. The independent and convergent occurrence of these fusion genes can be confirmed in two ways: (i) they represent different fusion patterns with distinct domain compositions and orders or (ii) one example of each fusion order is polyphyletic in at least one tree supported by moderate to strong bootstrap support. Examples of the first type are shown in the cases of the AroK/L-B fusion (Fig. 2), the AroDEA fusion (Fig. 4A), and fusion of non-shikimate pathway genes to AroE and AroK/L (Fig. 4A and 5). In several cases, there is also moderate to strong bootstrap support for the separate branching of fusion genes. This includes 1/90/95 support for the separation of the Parachlamydia trifusion and arom (Fig. 6) and 1/87/96 support for the grouping of the Gluconobacter AroK/L-B fusion gene with other nonfused prokaryotic versions separate from other AroK/L-B fusion genes (Fig. 5).
There is also evidence of arom reduction in two lineages. The diatom possesses an AroDE fusion gene, exactly the same order as other photosynthetic eukaryotes. However, in this case, the diatom AroDE fusion is orthologous to arom (1/74/86) (Fig. 4A). Therefore, the diatom is likely to have acquired a bifunctional arrangement as a reduction of the ancestral arom, while the Plantae acquired the same bifunctional arrangement separately (polyphyly of eukaryotes confirmed by AU and SH tests of constrained alternative tree topologies) (Fig. 4A). In addition, Rhizopus has an AroBD fusion gene orthologous to the arom clade (1/93/99) and grouping specifically with the full Rhizopus arom, suggesting an arom duplication followed by an additional incidence of arom reduction (Fig. 2) in the zygomycetes.
Interestingly, it is only the five genes found in the arom that are found in different gene fusion arrangements. This suggests that the functional/genetic linking of these five shikimate pathway enzymes may be selectively favorable (54) and has occurred on numerous occasions by patterns of convergent evolution. A number of prokaryotic organisms have clustering of shikimate pathway genes, which in some cases function as operons (33, 34). We hypothesize that clustering of genes in operons would make the independent occurrence of gene fusions more likely as the genes are placed in close genomic proximity over evolutionary time. In other data sets, gene fusions have been used as characters for pinpointing ancient evolutionary relationships (48, 49). The analysis of the shikimate pathway suggests that patterns of gene fusion may be affected by convergent evolution and fission, and so it is prudent to apply phylogenetic analysis to ensure monophyly of the fusion forms when using gene fusions as evolutionary informative characters.
Serial gene transfer between prokaryotes and the Plantae beyond cyanobacterial endosymbiosis. Our data suggest that the shikimate pathway genes in the Plantae and some other photosynthetic eukaryotes were acquired from a combination of EGT and HGT events. The presence of a non-arom pathway in photosynthetic eukaryotes, which appears to be plastid located in the land plants (32) would suggest some advantage to a non-arom and putatively plastid-targeted pathway in photosynthetic lineages. This may include increased availability of precursors and cofactors, including erythrose-4-phosphate, phosphoenolpyruvate, NADH, and ATP, which are produced in chloroplasts from photosynthesis. As a likely consequence of this, the shikimate pathway has been reported to be light regulated in some plants (19, 53). Furthermore, Euglena gracilis has been reported to have a cytosolic pathway used when grown in darkness but a plastidic pathway used when grown in light (36). Interestingly, some photosynthetic chromalveolates (Thallasiosira) appear to have acquired five of the seven component genes of this pathway by EGT, presumably concordantly with the acquisition of the photosynthetic plastid organelle by secondary endosymbiosis, while the nonphotosynthetic chromalveolates have retained their arom cytosolic pathway. For example, T. gondii, which is nonphotosynthetic, has retained the arom and putatively cytosolic pathways. This assumes (i) that the chromalveolates are a monophyletic clade (7, 12, 18, 35, 56) and (ii) that both the arom and non-arom pathways were retained during the diversification of the chromalveolates. The second assumption is supported by the observation that the diatom Thallasiosira possesses a chimera of the arom and non-arom pathways, possessing five Plantae non-arom-like genes and an AroDE fusion gene orthologous to a two-domain section of the arom pathway found in the fungi and the alveolates (Fig. 4A).
The number of primary endosymbiotic acquisitions of plastids has been a topic of debate (30, 50, 29). Although nuclear gene phylogenies and conservation of plastid machinery (29, 30) support monophyly of the Plantae group, discrete evolutionary events such as HGT events (1, 20) and gene replacements (for examples, see references 18 and 35) also provide crucial evidence in two ways: shared derived HGT between lineages from nonplastid sources strongly supports the holophyly of the Plantae clade, while shared plastid transfers confirm the monophyly of the primary plastid. Our analysis of shikimate pathway genes has revealed gene transfer events of both types. Three Plantae gene phylogenies (AroB, AroDE, and AroA) support shared derived prokaryote-to-Plantae HGT events (Fig. 2, 4A and B, and 6), while the AroC gene phylogeny supports a shared derived cyanobacterium-to-Plantae EGT (Fig. 7).
There are at least two scenarios that could explain Plantae shikimate gene relationships: (i) a combination of endosymbiotic and HGT gene acquisition and (ii) endosymbiotic gene acquisition followed by replacement in cyanobacteria. In the first scenario, the primary plastid endosymbiosis equipped the ancestral Plantae genome with cyanobacterial shikimate genes. Then genes from at least two other eubacteria have replaced cyanobacterium-derived genes prior to the divergence of the Plantae into the red algae and the Viridaeplantae. The alternative hypothesis is that the HGT events occurred in the common ancestor of extant and sampled cyanobacteria, essentially leaving the Plantae genes to resemble the ancestral cyanobacterial genes while the cyanobacterial genes thus far sequenced resemble other eubacterial genes.
Irrespective of the above alternative explanations, shared prokaryote-to-Plantae HGTs which also demonstrate Plantae monophyly (AroA represents the best example) (Fig. 6), strongly implies holophyly of the red algae and the Viridaeplantae. Although, as a word of caution, we found possible cases of separate origins of shikimate pathway genes in the red algae and the Viridaeplantae (DAHP II [Fig. 1B], AroB [Fig. 2], and AroK/L [Fig. 5]). However, only in the AroK/L phylogeny was the bootstrap support for Plantae polyphyly even moderate (1/58/78) (Fig. 5). These are unlikely to count as robust evidence against Plantae monophyly and, in the case of AroK/L, may simply be a further case of HGT and/or hidden parology and therefore represent additional evolutionary noise.
It is possible that intraeukaryotic HGT may have occurred between the red alga and Viridaeplantae lineages, invalidating support for monophyly of the Plantae lineages. However, Archibald et al. suggest that such intereukaryotic HGT between photosynthetic eukaryotes is rare (2). In addition, one of the shikimate pathway genes demonstrates shared derived origin from the plastid progenitor genome (Fig. 7). Each shared transfer into the common ancestor of the red and green photosynthetic eukaryotes implies monophyly of the Plantae nuclear genome and, in the case of plastid transfers, represents a unique evolutionary event from a single shared plastid ancestor.
The present study demonstrates that the shikimate pathway is an ancient eukaryotic pathway that has been subject to diverse evolutionary pressures. This has resulted in its entire loss from a number of lineages, including the Metazoa (38), that derive their aromatic compounds from diet. Furthermore, the entire pathway has been transferred to the Plantae from a range of different eubacterial sources. In fungi and the red algae, additional discrete HGT of shikimate pathway genes have occurred.
We present a candidate ancestral pathway (Fig. 8A) given the bikont/unikont eukaryote root (37, 48, 49) and demonstrate that the shikimate pathway has been retained in many eukaryote lineages, now known to comprise the zygomycete, basidiomycete, and ascomycete fungi, apicomplexa, ciliates, and the oomycetes. Interestingly, the photosynthetic chromalveolate Thalassiosira and the red alga Cyanidioschyzon possess a chimeric pathway in part composed of the Plantae non-arom pathway and in other parts composed of component of the arom type pathway. Our model of the shikimate pathway evolution in the eukaryotes (Fig. 8A) demonstrates, at a conservative estimate, over 50 independent loss events in the evolution of the eukaryotic shikimate pathway. With its implications as both a practical antimicrobial therapeutic target and as critical biochemical pathway in diverse eukaryotic organisms, the shikimate pathway represents a rich data set for studying the evolution and regulation of biochemical pathways, endosymbiosis, and processes of gene loss, gene fusion, replacement, and gene transfer.
| ACKNOWLEDGMENTS |
|---|
Preliminary sequence data were obtained from The Institute for Genomic Research website at http://www.tigr.org and Department of Energy Joint Genome Institute http://www.jgi.doe.gov. We thank TIGR and DOE JGI for making their data publicly available. We also thank Aaron Turkewitz for the Tetrahymena cultures.
| FOOTNOTES |
|---|
Supplemental material for this article may be found at http://ec.asm.org/. ![]()
Present address: School of Biosciences, University of Exeter, Exeter, EX4 4QD, United Kingdom. ![]()
Present address: School of Life Sciences, Napier University, 10 Colinton Road, Edinburgh, EH10 5DT, United Kingdom. ![]()
| REFERENCES |
|---|
|
|
|---|