Previous Article | Next Article ![]()
Eukaryotic Cell, October 2003, p. 978-994, Vol. 2, No. 5
1535-9778/03/$08.00+0 DOI: 10.1128/EC.2.5.978-994.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Lehrstuhl für Pflanzenphysiologie, Friedrich-Schiller-Universität Jena, 07743 Jena,1 Molekulare Zellphysiologie, Universität Bielefeld, 33501 Bielefeld,2 Institut für Pflanzenphysiologie, Martin-Luther-Universität Halle-Wittenberg, 06120 Halle, Germany3
Received 28 February 2003/ Accepted 21 June 2003
|
|
|---|
|
|
|---|
Although C. reinhardtii has been used as a model system for elucidating the assembly and regulatory processes of the photosynthetic machinery, no thorough analysis of LHC protein composition has been performed for this alga. What is known about the LHC protein composition of C. reinhardtii? Through searching of Chlamydomonas databases, 10 genes that potentially encode Lhcb polypeptides that are associated with the major trimeric PSII antenna (Lhcbm) have been described (11). In addition, the products of two other genes (lhcb4 and lhcb5) that correspond to minor LHC proteins (CP29 and CP26) have been identified (42). At least seven distinct Lhca subunits have been proposed (3, 42). For six Lhca proteins, N-terminal amino acid sequences have been obtained by Edman amino acid sequencing (3). Separation of isolated PSI complexes from C. reinhardtii by two-dimensional gel electrophoresis has revealed the presence of about 18 LHCI protein (LHC protein of PSI) spots, thereby suggesting an even more extensive variability of Lhca proteins (20). To characterize the compositions of the lhca and lhcb gene products at the protein level and to monitor the posttranslational modifications of LHC proteins of C. reinhardtii, we performed a detailed proteomic study.
The genome project for the photosynthetic eukaryotic model organism, the green alga C. reinhardtii (more than 180,000 expressed sequence tag [EST] sequences have been obtained, and ninefold whole-genome shotgun coverage was released by the Department of Energy Joint Genome Institute in February 2003) provides excellent opportunities for applying systematic or functional proteomic approaches. Proteomic analysis has already become a powerful tool for the analysis of profiles of protein expression in vascular plants as well as Chlamydomonas (21, 44). Plant cell organelles, such as chloroplasts, offer excellent model systems for proteomic analysis, since an organelle or its subcompartments can be isolated easily in large amounts and at high purities. The standard technique for quantitative proteomic analysis combines protein separation by high-resolution (isoelectric focusing combined with sodium dodecyl sulfate [SDS]-polyacrylamide gel electrophoresis)two-dimensional gel electrophoresis (2-DE) with mass spectrometry (MS) or tandem MS (MS-MS) identification of selected protein spots.
The separation of hydrophobic intrinsic membrane proteins by 2-DE has long been a difficult task (40). However, with the recent development of procedures for the analysis of transmembrane thylakoid proteins by 2-DE, proteomic analysis of membrane proteins has become feasible (20). This procedure allowed the separation and identification of hydrophobic transmembrane proteins, such as the light-harvesting proteins. The technique was used to create 2-DE protein maps of thylakoid membrane proteins from wild-type and mutant strains of C. reinhardtii (20) as well as to monitor changes in the protein compositions of the thylakoid membranes during the physiological adaptation to iron deficiency (33).
In this study, we performed detailed 2-DE protein mapping of light-harvesting proteins associated with PSII (Lhcb) and PSI (Lhca) of Chlamydomonas. These polypeptides are membrane proteins that span the lipid bilayer with three
-helical transmembrane domains (28). This study represents the first detailed analysis of LHC proteins by 2-DE and MS-MS. In addition to protein separation by 2-DE, intact LHC proteins were extracted with organic solvents and separated by reversed-phase liquid chromatography (LC). Identification of vascular plant chloroplast transmembrane LHC proteins has been achieved by coupling LC with electrospray ionization MS and MS-MS (9, 16, 47).
The proteomic analysis performed in this study predicts the expression of at least nine different Lhca proteins and eight distinct Lhcb proteins at the protein level and supports the previous indications of the extensive variability of LHC proteins of C. reinhardtii. The finding that the N-terminal protein processing of Lhcbm3 leads to the removal of a phosphorylation site indicates a novel regulatory mechanism for state transitions.
|
|
|---|
Construction of recombinant strains and genetic analysis. The recombinant strain Lep2 was generated by inserting a hemagglutinin (HA) epitope-tagged form of lhcbm6 (lhcII-1.1) (23) randomly into C. reinhardtii strain CC2454. Plasmid pBQ2.4, carrying a 2.4-kb EcoRI/HindIII fragment with exons 2 to 4 and 3'-flanking sequences of the lhcbm6 gene (25), was linearized with SacI, which cuts the plasmid at a single site within exon 2. This site was used to introduce the double-stranded oligonucleotide ocabhem1 with SacI-compatible 3' overhangs, obtained by annealing single-stranded oligonucleotides ocabhem1a (5'-ACCCCTACGACGTCCCCGACTACGCTAGCT-3') and ocabhem1b (5'-AGCGTAGTCGGGGACGTCGTAGGGGTAGCT-3'). The resulting plasmid, pAL1, carrying the insert, was sequenced to verify its correct orientation and in-frame insertion of the HA epitope. With this procedure, the peptide sequence YPYDVPDYAS was introduced between Ser31 and Ser32 of the deduced precursor protein sequence encoded by the lhcbm6 gene. To reconstruct the complete HA epitope-tagged lhcbm6 gene, the 3.0-kb HindIII fragment with exon 1 and the lhcbm6 promoter region was inserted into pAL1 that had been linearized with HindIII. The resulting plasmid, pAL2, was analyzed for correct insertion and orientation of the 3.0-kb fragment by DNA sequencing. Standard protocols were used for all DNA manipulations (39).
Cotransformation of strain CC2454 with plasmids pMN24 (13) and pAL2 was carried out by using particle gun transformation (26). After transformation, cells were placed on ammonium-free Sager-Granick 2 medium to select transformants containing intact nitrate reductase encoded by pMN24. These colonies were then screened for the expression of the tagged protein by Western blot analysis with the HA-specific mouse monoclonal antibody 12CA5 (BabCO, Richmond, Va.).
Generation of gametes, matings, and zygote analysis were performed as described previously (17).
Chlorophyll determination. Precipitated protein and cell debris were removed by centrifugation, and the chlorophyll concentration was estimated by the method of Porra et al. (36).
Isolation of PSI complexes. PSI particles were isolated as described previously (18). Determination of the protein concentration in solution was done by the bicinchoninic acid method according to the manufacturer's instructions (Sigma, Taufkirchen, Germany).
In vitro phosphorylation of thylakoid membranes and green gel electrophoresis. In vitro phosphorylation of thylakoid membranes was performed as described previously (14), and green gel electrophoresis was conducted as described by Peter and Thornber (34).
2-DE, immunodetection, and trypsin digestion. Preparation of samples for 2-DE, immunodetection, and trypsin digestion of excised protein spots were performed as described previously (20).
Tryptic hydrolysis of PSI complexes. An aliquot of the sucrose gradient enriched in PSI particles, which corresponded to 10 µg of chlorophyll with a 0.05% (wt/vol) concentration of ß-dodecyl maltoside, was heated for 10 min at 95°C. The liquid was evaporated by using a vacuum centrifuge, and the pellet was resuspended in 50 µl of a 50 mM hydrogen bicarbonate solution and incubated at 30°C for 15 min and then at 37°C for 16 h with 6 µg of trypsin (sequencing grade modified; Promega, Madison, Wis.). After hydrolysis with trypsin, the reaction mixture was lyophilized, and the pellet was resuspended in 40 µl of a 5:90:5 (vol/vol) methanol-water-formic acid solution (Lichrosolv [grade]; E. Merck AG, Darmstadt, Germany) and vortexed for 20 min. To prepare a hydrophobic column, approximately 10 µl of dry POROS-R2 (Applied Biosystems, Foster City, Calif.) material was resuspended in a 60% (vol/vol) methanol solution and loaded into the top of a C18 Zip-Tip column (Millipore, Eschborn, Germany) with a 5-µl bed volume. The column was packed by brief centrifugation in a tabletop microcentrifuge with an insert meant for holding nanospray needles. Alternatively, a 0.6-ml microcentrifuge tube with a small hole in the bottom and placed inside a 1.5-ml microcentrifuge tube could be used to hold the POROS-R2-containing Zip-Tip column. The liquid phase was then replaced by an aqueous phase by washing of the column packing twice with 40 µl of methanol-water-formic acid (5:90:5, vol/vol/vol). The sample was centrifuged in a tabletop centrifuge for 5 min, and the supernatant was removed and applied to the column packing by brief centrifugation. The column was washed twice with 40 µl of methanol-water-formic acid (5:90:5, vol/vol/vol), and the sample was eluted into a microcentrifuge tube with two 40-µl aliquots of a solution containing methanol-water-formic acid (60:35:5 [vol/vol]) by centrifugation in a tabletop centrifuge at 10,000 rpm (Hermle Z 160 M). The sample was lyophilized and resuspended in 10 µl of LC buffer A (see below). Typically, a 1:10 or 1:100 dilution was analyzed by LC-MS.
LC-MS. Analyte sampling, chromatography, and production and acquisition of MS and MS-MS data were performed online by using fully automated instrument functions. A Famos 48-well plate autosampler (LC-Packings, Amsterdam, The Netherlands) was used to sample the tryptic digests in batch mode with a user-defined program to withdraw 1 µl from each sample. Solvents were purchased from Merck and were of LC grade. The aqueous phase (A) (0.1% [vol/vol] formic acid in 5:95 acetonitrile-water) and the organic phase (B) (0.1% [vol/vol] formic acid in 80:20 acetonitrile-water) were delivered by an Ultimate (LC-Packings) micropump in most instances as follows: 5% B in the first 8 min, 5 to 50% B from 8 to 38 min, 50 to 95% B from 38 to 39 min, 95% B from 39 to 49 min, 95 to 5% B from 49 to 50 min, and 5% B from 50 to 75 min. Solvents delivered by the micropump were split before reaching the column by a splitter at a flow ratio of 1:1,250, with the postsplitter flow rate set at 250 nl/min. The peptide mixture was fractioned on an LC-Packings PepMap C18 column (75-µm [inner diameter] by 150 mm) with a 3-µm particle size and a 100-Å-pore diameter (NAN-75-15-03-PM). The column eluent was monitored at 214 and 280 nm with a rapid-scanning spectrophotometer equipped with a 3-nl flow cell (LC-Packings UZ-N10 160015). The interface between the liquid phase and the gas phase was provided by a fused silica needle that had a 75-µm inner diameter, a length of approximately 12 cm, and a 30-µm aperture (FS 360-75-30; New Objective, Woburn, Mass.) and that was mounted on a nano-electrospray ionization source (New Objective). Atmospheric pressure electrospray ionization was accomplished by applying a voltage difference of +1.3 kV across the fused silica needle. The aperture of the fused silica needle was positioned 1 to 2 mm from the opening of the ion transfer capillary and slightly off axis to minimize the entrance of nonvaporized solvent into the LCQ Deca XP ion trap mass spectrometer (Thermofinnigan, San Jose, Calif.). No sheath gas was used.
Most mass spectra were acquired by using the repetitive "triple-play" sequence as recommended by the manufacturer, consisting of a full scan event for ions with mass-to-charge ratios of 400 to 1,200, a zoom scan event acquired within a mass-to-charge ratio window of 10 units centered around the chosen ion, and an MS-MS scan event. Ions were selected for the zoom scan and for the MS-MS scan automatically by using instrument software in the data-dependent manner, whereby ions of sequentially decreasing abundance were chosen and two scan events were allowed for any given ion in a 3-min time window. For complex samples, the zoom scan event was omitted and replaced with two MS-MS scan events. The tolerance for the selection of the MS-MS precursor ranged from 1.5 to 3.0 m/z (low to high m/z).
Sequest. The measured MS-MS spectra were matched with tryptic peptide amino acid sequences from a small C. reinhardtii database consisting primarily of PSI or PSII polypeptide sequences (about 90 entries), from the translated Kazusa EST database (1, 2), from the EST database of the Chlamydomonas genome project (A. Grossman, J. Davies, N. Federspiel, E. Harris, P. Lefebvre, C. Silflow, D. Stern, and R. Surzycki, unpublished data), or from a genomic database (version 1.0 assembly). In addition, the data sets were partially compared with the genomic sequences for the presence of new LHC polypeptides. All databases were in the FASTA format. Cys modification by carbamidomethylation (+57 Da) was taken into account. Raw MS-MS data files that had a minimum total ion current of 105 and contained 15 or more fragment ions were selected. Known contaminants were filtered out. Computational analysis was done on a Dell personal computer. The tolerance window set for the grouping of raw MS-MS data files into input files for the Finnigan Sequest/Turbo Sequest software (revision 2.0; ThermoQuest, San Jose, Calif.) was set to 1.4 atomic mass units.
The Sequest algorithm was used to quickly identify and retrieve database sequences having at least one tryptic end, having a theoretical mass within ±1.25 atomic mass units of that measured for the precursor ion, and having a theoretical y- and b-ion fragmentation profile with a high degree of similarity to the experimentally measured MS-MS spectrum. The similarity between a measured MS-MS spectrum and a theoretical MS-MS spectrum, reported as the cross-correlation factor (Xcorr), and the difference between the unit-normalized Xcorr values of the first- and second-ranked sequences (
Corr) provided the preliminary criteria for assigning amino acid sequences to experimental MS-MS spectra. Sequences are reported here if the Xcorr values calculated for the measured and theoretical MS-MS spectra were equal to or above 1.5, 2.25, or 3.5 for singly, doubly, or triply charged precursor ions, respectively, and if the
Corr value exceeded 0.1 within the background of the Kazusa EST database (1, 2) or the genomic database.
Since the N termini of the mature proteins are not yet clear, a database subset containing sequences of the lhca or lhcb gene products from Chlamydomonas was created. This database contains amino acid sequences where the first N-terminal amino acids of respective gene products were excised in a stepwise manner. This step was repeated 45 times to create different N-terminal sequences. This database, containing different N termini, allowed us to search for N-terminal peptides. It was also checked for carbamylation.
In order to identify C. reinhardtii contigs, the translated ESTs were subjected to a WU-BLAST 2 search of different EST assemblies at http://www.biology.duke.edu/chlamy_genome/blast/blast_form.html.
Phylogenetic analyses. A CLUSTAL W alignment of the Lhca protein sequences was performed by using the European Molecular Biology Laboratory-European Bioinformatics Institute computer server (http://www.ebi.ac.uk /clustalw/) with the default settings. Phylogenetic trees were inferred by using PAUP algorithms (PAUP version 4.0b8; Sinauer Associates, Sunderland, Mass.). The confidence of branching was assessed by using 1,000 bootstrap resamplings for distance analysis with the neighbor-joining method.
|
|
|---|
![]() View larger version (38K): [in a new window] |
FIG. 1. 2-DE protein maps of major C. reinhardtii LHC proteins after separation of thylakoid membranes (200 µg of protein). The upper panel shows a 2-DE map of thylakoid membranes stained with silver. The inset in the upper panel indicates the region presented in the lower panels. Each of the numbered spots on the Coomassie blue-stained 2-DE map (lower panels) was excised and digested in a gel. The subsequent MS-MS analysis resulted in the identification of LHC proteins. The labeled protein spots indicate proteins that were unambiguously identified by the MS data. Lhcbm3 + 6, Lhcbm3 and Lhcbm6; Lhcbm2/8, Lhcbm2 or Lhcbm8.
|
![]() View larger version (111K): [in a new window] |
FIG. 4. PSI 2-DE protein map. Coomassie blue-stained 2-DE-separated proteins (corresponding to 22 µg of chlorophyll) that copurified with PSI particles during sucrose gradient ultracentrifugation are shown. Each of the numbered spots was excised and digested in a gel. The resulting tryptic peptides were analyzed by using LC-MS and database searching. The Coomassie blue-stained protein spots are labeled according to the lhc gene products detected from the tryptic digests; the relative SDS-PAGE mobility identifiers are indicated in parentheses. Most of the peptides identified corresponded to lhca gene products. Spots 1, 5, 11, 25, 29, 29+, 30, and 31 were found to contain peptides corresponding to lhca gene products only. Previously unidentified lhca gene products are labeled as follows: Lhca9 (p22.2), Lhca2 (p19), and Lhca7 (p15).
|
|
View this table: [in a new window] |
TABLE 1. Lhcb proteins identified by MS-MS data from protein spots excised from thylakoid 2-DE mapsa
|
|
View this table: [in a new window] |
TABLE 2. Lhca proteins identified by MS-MS data from protein spots excised from PSI 2-DE mapsa
|
To test the significance of our data gathering and processing procedures, in an independent experiment we analyzed protein spots 12 to 24 derived from 2-DE-separated thylakoid membrane proteins isolated from iron-deficient cells (grown for 5 days in iron-deficient medium). It was shown recently that in the process of adaptation to iron deficiency, Lhcb proteins remain stable whereas, the Lhca3 protein and other Lhca proteins disappear (33). While the Lhcbm proteins identified from protein spots 12 to 24 were almost identical to those found under iron-sufficient conditions, no lhca-related gene products could be identified from protein spots 12 and 13 derived from iron-deficient thylakoid membrane proteins (Table 1). In contrast to the findings under iron-sufficient conditions, one Lhcbm1 peptide was detected from spot 24 under iron-deficient conditions. Although this result might suggest differential protein processing for this polypeptide, we cannot exclude the possibility that it was found because of horizontal streaking of the protein due to electrophoresis in the second dimension. Overall, these data indicate that the peptides identified by the MS-MS and subsequent Sequest analyses are significant and specific for the single spots processed. Interestingly, we identified from spots 23 and 24 under iron-sufficient as well as iron-deficient conditions several peptides from PsbO, the oxgen-evolving enhancer protein 1 (30). Although PSII polypeptides are also degraded in the process of adaptation to iron deficiency, it appears that Lhca3 (p14.1) and Lhca7 (p15) are more severely affected and thus represent primary targets of iron deficiency, as already demonstrated (33).
Beside PsbO, several other peptides deduced from the EST data could be identified from the MS-MS data. From spot 23, two peptides corresponding to a Lon protease homologue (AV397248) could be recognized. However, no other significant hits (more than one peptide for the same putative protein) were obtained.
Interestingly, lhcbm6-related gene products with different molecular masses and different isoelectric points were found. The MS-MS data identified peptide WAMLGALGCQTPELLAK, which is specific for Lhcbm6, in spots 22 and 23 as well as in spots 16 and 17 with a high level of significance (Fig. 2A). A previous MS analysis of 2-DE-separated Lhcbm6 had already indicated that this LHCII protein might contain two alternative N-terminal protein cleavage sites (20). An HA tag was introduced into the lhcbm6 gene in such a way that it was localized behind the second suggested N-terminal processing site [28APKS(HA tag)SGVE35] in the expressed gene product (23). Anti-HA monoclonal antibodies recognized two protein bands with different molecular masses (mass difference, 1 to 2 kDa) after SDS-PAGE fractionation of thylakoid membranes from the Lhcbm6-HA-tagged algal strain. Immunoblot experiments with 2-DE-separated Lhcbm6-HA-tagged thylakoid membranes, anti-Lhcbm6 peptide antibodies (directed against a sequence upstream of the second processing site), and anti-HA antibodies revealed that the lower-molecular-mass spots recognized by the anti-HA antibodies are not recognized by the anti-Lhcbm6 antibodies; these data demonstrate that processing occurs at the N terminus (Fig. 2B). Thus, the Lhcbm 2-DE mapping results and the tagging experiment independently confirmed that two differentially N-terminally processed Lhcbm6 forms exist in vivo. It should be noted that the lower-molecular-mass protein spots detected by the anti-HA antibodies are considerably shifted in their isoelectric points toward more acidic values compared to the higher-molecular-mass HA-tagged protein spots. This finding is also indicative of N-terminal protein processing, since the N-terminal part of the Lhcbm6 protein is enriched with positively charged amino acids. Fractionation of whole HA-tagged cells by one-dimensional SDS-PAGE and subsequent immunoblot analyses with anti-HA antibodies revealed two protein bands with different molecular masses, indicating that N-terminal processing also occurs in whole cells and is not an artifact due to the membrane preparation.
![]() View larger version (34K): [in a new window] |
FIG. 2. Two differentially N-terminally processed forms of Lhcbm6. (A) MS-MS spectra of peptide WAMLGALGCQTPELLAK from spot 22 (without Fe) and spot 17 (with Fe). The CID mass spectra of the doubly protonated precursor ion peaks measured at m/z 930.5 and 930.4 for spots 22 and 17, respectively, are shown. (B) To identify differentially processed Lhcbm6 forms, an immunoblot of 2-DE-separated thylakoid membranes of Lhcbm6-HA-tagged strain Lep2 was probed with an anti-Lhcbm6 peptide antibody (recognizes the N terminus) (3) and an antibody recognizing the HA epitope. Boxed areas indicate the positions of the processed forms of HA-tagged Lhcbm6.
|
A closer analysis of the fragmentation spectrum (Fig. 3) revealed a series of y- and b-type ions, including those that exhibit a shift in molecular mass of 80 Da; these data indicate that the fragmentation spectrum represents the CID pattern of the respective phosphorylated peptide. It is known that phoshorylated peptides do not fragment well in MS-MS analyses; therefore, we interpret the Sequest results as being significant, although the correlation factor is slightly under the cutoff value used throughout this study. In this analysis, we identified several peptides already found in the 2-DE analysis as well as three Lhcbm5-specific peptides and two Lhcbm4- or Lhcbm9-specific peptides (two triply charged ions with correlation factors of 5.2 and 3.8, respectively: AKWLGPYSENSTPAYLTGEFPGDYGWDTAGLSADPETFK and WLGPYSENSTPAYLTGEFPGDYGWDTAGLSADPETFKR). The latter finding indicates that Lhcbm4 or Lhcbm9 is expressed at the protein level; thus, another Lhcbm protein is added to the total number of Lhcb proteins found in this study. These large peptides were not found in the 2-DE analysis, probably because of their size. Thus, we identified eight distinct Lhcb proteins. We also detected a singly charged ion with a molecular mass of 1,394.7 Da (deviation from theoretical mass, 0.2 Da). Sequest analysis of the MS-MS data revealed a significant match, at a correlation factor of 2.00, with an Lhcbm3-specific peptide (33PASSGIEFYGPNR45). This peptide represents another putative N terminus for Lhcmb3 (see above). It should be noted that the phosphorylation site that we determined would be removed by this protein processing. Interestingly, such N-terminal protein processing can be also described for Lhcbm6 (see above).
![]() View larger version (14K): [in a new window] |
FIG. 3. MS-MS spectrum of phosphorylated Lhcbm3 peptide GTGKTAAKQAPASSGIEFYGPNR. The data indicate that either the first or the second Thr (in bold type) becomes phosphorylated. The CID mass spectrum of the triply protonated precursor ion peak measured at m/z 797.65 was identified from a tryptic digest of a green LHC band obtained after fractionation of in vitro-phosphorylated thylakoid membranes by native gel electrophoresis. The b-type ions b5, b6, b7, b8, and b10 represent phophorylated ions, indicating phosphorylation of either of the two Thr residues.
|
Peptides related to lhca gene products could be identified in 18 protein spots. In eight protein spots (1, 5, 11, 25, 29, 29+, 30, and 31), peptides that could be correlated only with one lhca gene product were found. In the other spots, the MS-MS data revealed peptides deduced from more than one lhca gene product. At least nine different lhca-related gene products were predicted by searching the Chlamydomonas EST database with MS data. For nine Lhca proteins, the respective genes were found as assembled EST contigs (Table 2), and seven are represented completely in the first release of the genomic database assembly (Table 3). Of these nine, three are identified for the first time at the protein level; Fig. 5 shows selected MS-MS spectra. In accordance with the nomenclature of Bassi et al. (3), the numbers in parentheses in Fig. 4 represent the relative mobility identifiers of the respective Lhca proteins in SDS-PAGE (see also Table 2). The N-terminal amino acid sequences of six Lhca proteins (p14, p14.1, p15.1, p18, p18.1, and p22.1) were reported by Bassi et al. (3). The respective contigs of the newly identified lhca gene products are 20020630.7235.1, 20020630.8317.1, and 200206304096.2. For the last contig sequence, a corresponding gene has been already annotated (Merchan and Fernandez, 2000).
|
View this table: [in a new window] |
TABLE 3. C. reinhardtii lhca and lhcb genes and gene products as determined by EST, cDNA, or genomic sequences in comparison with tryptic peptide sequences obtained by MS
|
![]() View larger version (26K): [in a new window] |
FIG. 5. MS-MS spectra of newly identified lhca gene products. The CID mass spectra of the doubly protonated precursor ion peaks measured at m/z 966.3, 500.9, and 980.6 for Lhca7 (p15), Lhca2 (p19), and Lhca9 (p22.2), respectively, are shown. The amino acid sequence retrieved by the Sequest program for the MS-MS data is shown in the upper right-hand corner of each spectrum. The ion peaks that corresponded to theoretical b-type (N-terminal) or y-type (C-terminal) fragments of the precursors are labeled. Peaks identified as doubly charged are labeled only for high intensities. The mass-to-charge ratios of the measured fragment ion peaks are shown below the x axis, and the relative intensities of the total ion current measured are shown along the y axis.
|
From spots excised from the PSI 2-DE map, we also identified peptides from proteins other than Lhc. However, in none of these instances could more than one peptide corresponding to a respective polypeptide sequence be recognized (data not shown); these data thus were not regarded as significant.
To verify the different lhca gene products by MS-MS, in an independent experiment we digested the entire isolated PSI complex with trypsin, separated the resulting peptides by reversed-phase LC, and analyzed them by MS-MS. Analysis of the MS-MS data with Sequest confirmed the presence of nine distinct lhca gene products, which were also identified by the 2-DE approach (data not shown). We could identify several peptides for each of the nine Lhca proteins. In contrast, no peptide for putative Lhca10 (p18.2) could be identified. Interestingly, we identified by database searches an assembled EST contig (20020630.8279.1) that is highly homologous to the p18 protein sequence deduced from contig 20021010.4209.1. This sequence differs at only one position, which would alter a putative tryptic peptide, KPGSQDEPGSFLGFEASLK, deduced from the sequence of contig 20021010.4209.1 (Table 2). This peptide has an Asp instead of a Gly at position 6. This exchange is not supported by the genomic database.
Phylogenetic analysis of Lhca proteins. Phylogenetic analysis of C. reinhardtii Lhca precursor proteins Lhca1 (p22.1), Lhca3 (p14.1), Lhca4 (p14), Lhca5 (p15.1), Lhca6 (p18.1), and Lhca9 (p22.2) has already indicated that p14.1 and p22.1 are related to Lhca3 and Lhca1 proteins from vascular plants, respectively (42) (Table 3), whereas the other proteins could not be assigned to a distinct vascular plant homologue. Proteins p14, p15.1, p18, and p18.1 were therefore assigned to vascular plant Lhca2 or Lhca4 protein (42) (Table 3). Reanalysis of these proteins by BLAST similarity searches showed that p18 was most similar to Lhca5 from Arabidopsis (24) (Table 3). When we investigated the novel precursor proteins p15, p19, and p22.2 by BLAST similarity searches, it appeared that p15 was highly similar to Lhca5, while p19 and p22.2 were most similar to Lhca2 from Arabidopsis (24) (Table 3).
To cluster the different lhca gene products, we performed phylogenetic analysis of Lhca precursor protein sequences (aligned in Fig. 6A by the CLUSTAL W program) (43) by using the neighbor-joining method (38). This analysis resulted in three major clusters, which represent the Lhca1 homologue p22.1; the Lhca3, Lhca2 or Lhca4, and Lhca2-like homologues p14.1, p15.1, p18.1, p14, p19, p22.2 and p18.2 (p18.2 is not included in Fig. 6); and the Lhca5-like homologues p15 and p18. The second cluster can be further differentiated. The Lhca3 protein seems to be distinct from the Lhca2 or Lhca4 and Lhca2-like proteins. It appears that the Lhca2 or Lhca4 homologues p15.1 and p18.1 are more closely related to each other than to p14. It is likely that polypeptides p19 and p18.2 as well as polypeptides p15.1 and p18.1 were generated by more recent gene duplications. In summary, phylogenetic analysis indicates that Lhca proteins from C. reinhardtii can be divided into five distinct classes, where the Lhca1 and Lhca3 homologues are represented by only one gene product and the Lhca2 or Lhca4, Lhca2-like, and Lhca5-like homologues are represented by more than one gene product (Fig. 6B).
![]() ![]() View larger version (109K): [in a new window] |
FIG. 6. Phylogenetic analyses. (A) CLUSTAL W alignment of Lhca protein sequences. The CLUSTAL W alignment of full-length protein sequences was performed by using the European Molecular Biology Laboratory-European Bioinformatics Institute computer server with default values. Sequences for the tryptic peptides identified by LC-MS and database searching are shown in bold type; overlapping sequences are indicated in grey. The identities of the protein sequences are as follows: Lhca1, p22.1; Lhca3, p14.1; Lhca7, p15; Lhca8, p18; Lhca5, p15.1; Lhca6, p18.1; Lhca4, p14; Lhca2, p19; and Lhca9, p22.2. The putative Lhca10 (p18.2) sequence is not shown, since a complete contig assembly is not available in the database. (B) Neighbor-joining tree of the CLUSTAL W alignment of Lhca protein sequences. The C. reinhardtii Lhca protein sequences group into three clusters. One of the clusters is split into three groups. The tree supports the assignment of Lhca8 (p18) as an Lhca5-like protein, in contrast to the previously suggested annotation as a vascular plant Lhca2- or Lhca4-like protein.
|
Modifications of Lhcb and Lhca proteins. For Lhcb proteins as well as for Lhca proteins, it is apparent that bands of spots that are represented by individual gene products appear across the 2-DE maps. A possible explanation for this finding could be modification of the peptide amino terminus or the side chains of lysine or arginine by isocyanic acid. Such protein carbamylation would result in so-called carbamylation trains. For Lhcbm6, such trains are evident on the 2-DE maps (Fig. 1 and 2). Therefore, we specifically searched the MS-MS data of all Lhcbm proteins for protein carbamylation, but the results were negative. These data indicate that the Lhcbm polypeptides produce natural trains which could be caused by posttranslational modifications, such as phosphorylation (see above), or charge heterogeneity. Most of the Lhca proteins (p22.2, p22.1, p19, p18.1, p15.1, p15, and p14.1) can be found at different positions on the 2-DE maps. The exact reason for these data is unknown. We could not identify differentially N-terminally processed Lhca proteins by MS-MS, although we specifically searched the MS-MS data for such processing events (see Materials and Methods). However, it is tempting to speculate that besides posttranslational modification, protein processing may account for the fact that particular Lhca proteins are found at different positions on the 2-DE maps.
|
|
|---|
The MS and succeeding bioinformative analyses revealed a high level of complexity of the light-harvesting proteins in C. reinhardtii. In addition to the large number of different LHC proteins, we also identified N-terminal protein processing of the lhcbm3 and lhcbm6 gene products. This finding, which was evident from the MS data as well as from the Lhcbm6-HA tagging experiment (Fig. 2), demonstrates that two differentially N-terminally processed forms of Lhcbm3 and Lhcbm6 exist in thylakoid membranes. What could be the cause for the differential processing? Differential processing of pre-LHC proteins has already been reported in in vitro studies for wheat, pea, tobacco, tomato, and corn (5, 6, 8, 27, 35). A secondary processing site in the wheat sequence is believed to be within the mature protein (7) and is, interestingly, beyond Thr3, which has been found to be phosphorylated in thylakoid membranes from spinach and Arabidopsis (31, 45). It has been shown that the recombinant stromal processing peptidase can process pre-Lhcb1, which would be homologous to Chlamydomonas Lhcbm3 or Lhcbm6, at the secondary site (37). Thus, different stromal processing peptidases that process the transit sequence at different positions could be the cause for the different N-terminal protein products observed for Lhcbm3 or Lhcbm6. What could be the function of such processing? Such a processing event would remove the phosphorylation site at Thr3 in Lhcb1 of vascular plants and would thereby inhibit the ability of the respective Lhcb1 protein to migrate to the stromal membranes and participate in the process of state transitions. Our data indicate that Lhcbm3 is phosphorylated in Chlamydomonas. In the 2-DE map, Lhcbm3 is found in the upper row of Lhcbm protein spots, corresponding to the position of protein P11, which is one of the major phosphorylated protein bands seen in SDS-PAGE analysis under state II conditions in Chlamydomonas (46). Interestingly, phosphorylation of the second Thr residue in the respective Lhcbm3 peptide [(K)GT*GKT*AAKQAPASSGIEFYGPNR] corresponds exactly to the phosphorylation of Thr3 in vascular plants (RKT*AAKAKQ...) (asterisks indicate phosphorylated residues) (31, 45). The MS data identify two N termini of Lhcbm3 in which, due to N-terminal protein processing, both phosphorylation sites are removed. Therefore, we suggest that such N-terminal protein processing could be a regulatory event in the process of state transitions in Chlamydomonas as well as vascular plants.
Our proteomic study reveals that among the 10 distinct lhca gene products (including the putative lhca10 gene product), 5 different phylogenetic types of Lhca proteins are expressed in Chlamydomonas. Chlamydomonas is the first plant where the expression of Lhca5-like proteins can be demonstrated. The lhca gene family in Arabidopsis is composed of six nuclear genes (lhca1 to lhca6); of these, genes lhca5 and lhca6 are expressed only at very low RNA levels (24). The Lhca proteins identified in 2-DE of enriched PSI particles are copurified with the PSI complex and therefore belong to the antenna that is functionally connected to PSI. The PSI (LHCI) protein complex in Chlamydomonas may contain as many as 14 light-harvesting proteins per reaction center, as revealed by electron microscopy (15). Such structural data indicate eight Lhca (Lhca1 to Lhca4) proteins per PSI complex for PSI (LHCI) proteins in vascular plants (4). These data suggest that the LHCI antenna in Chlamydomomas is larger than that in vascular plants. Our proteomic data show, as already suggested (3, 20), that the variability of Lhca proteins also is greater in Chlamydomonas than in vascular plants. These data suggest that the variability in light harvesting due to the high level of diversity of Lhca proteins could be an additional important regulatory parameter in response to changing physiological conditions. Such a dynamic adaptation of the Lhca antenna has been shown for Chlamydomonas in response to iron deficiency (33), which induces the remodeling of the Lhca antenna, including subunit degradation and replacement.
The fact that several Lhca proteins are found at different positions in the 2-DE map makes an exact determination of the stoichiometry of individual Lhca polypeptides difficult. However, an intriguing question is how and in which stoichiometry these subunits are organized within the complex. It appears that Lhca protein expression is realized on a rather dynamic scale in Chlamydomonas. In future experiments, it will be important to determine the exact stoichiometry of individual Lhca proteins. This goal could be achieved by immunotitrations with specific antibodies directed against distinct Lhca proteins or by MS with synthetic peptides for the quantification of respective tryptic Lhca peptides. Additionally, it will be interesting to monitor how the relative abundances of individual Lhca and Lhcb proteins change under different physiological conditions and in different genetic backgrounds.
We are grateful to M. Stäger for excellent technical assistance. We thank A. Romualdi for bioinformative assistance.
This work was supported by grant Hi739/1-3 from the Deutsche Forschungsgemeinschaft and Freistaat Thüringen (Nachwuchsgruppe: die Plastidäre Proteinausstattung bei Differenzierungs- und Regulationsvorgängen) to M.H.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»