Occurrence and Biological Significance of Antigenic Variation
G. lamblia trophozoites undergo antigenic variation of a family of immunodominant cysteine-rich surface antigens in vitro and in vivo (
6,
12,
243). The initial studies of the surface antigens of
G. lamblia showed differences among strains by crossed immunoelectrophoresis and enzyme-linked immunosorbent assay (
306) and marked differences in the molecular masses of “excretory-secretory products” from different surface-iodinated
G. lamblia isolates (
247,
250). These surface antigens varied in number and in size from approximately 50 to 200 kDa in a study of 19 isolates (
250). A monoclonal antibody (MAb 6E7) for a 170-kDa surface antigen (initially called CRP170 but now called VSPA6) from the WB isolate was cytotoxic for WB trophozoites but not for isolates expressing other surface antigens (
242). WB trophozoites were doubly cloned by limiting dilution and incubated with MAb 6E7; some organisms survived and were totally resistant to the cytotoxic effect of MAb 6E7 (
243). One of these cloned lines of organisms (WB1269) had a 68-kDa surface-labeled antigen (initially called CRP68 but now called VSP1269), while another (WB1267) had a 64-kDa surface antigen (VSP1267). A monoclonal antibody (MAb 5C1) reactive with the surface antigen of WB1267 was cytotoxic for WB1267 trophozoites. These trophozoites were incubated with MAb 5C1, and organisms surviving the cytotoxic antibody reacted with neither of the initial MAbs and expressed surface antigens in the 60- to 100-kDa size range. Subsequent data have confirmed that individual organisms express only one VSP at a time (
245); the detection of multiple surface-labeled bands in some of the studies resulted from subpopulations of trophozoites expressing several different VSP types. In axenic culture, variation occurs approximately once every 6 to 12 generations for a frequency of 10
−3 to 10
−4(
244).
Antigenic variation has subsequently been confirmed in animal models (
12,
118,
119) and in infected human volunteers (
248). Cloned WB trophozoites expressing the VSPA6 were inoculated into gerbils, and trophozoites collected from their intestines 7 days later demonstrated populations of trophozoites expressing multiple VSPs ranging in size from 50 to 170 kDa (
12). Trophozoites collected 28 days after infection were similar to those collected at 7 days. The lack of apparent change from 7 to 28 days argued against a role for acquired immunity in selecting antigenic variants. However, the results were somewhat different in a mouse model of
G. lamblia infection.
G. lambliatrophozoites in athymic nude mice and in heterozygous
nu/
+ mice changed VSP type coincident with the development of a humoral anti-
vsp antibody response (
119). In contrast, scid mice did not develop an antibody response and trophozoites did not undergo antigenic variation. Thus, in the mouse model, the antibody response to the VSP may have been the selective force for antigenic variants.
When human volunteers were inoculated with cloned GS trophozoites by duodenal intubation, four of four became infected with trophozoites expressing VSPH7 (72 kDa). Only 1 of 13 became infected with organisms expressing VSPB6 (200 kDa), suggesting that the ability to survive in the human intestine is greater with one VSP than with another. Whether this is due to selection against one VSP or positive selection for the other has not been determined. The subsequent pattern of change during 22 days after infection showed gradual disappearance of VSPH7 followed by the appearance of other VSPs with different immunoreactivity. The loss of VSPH7 was accompanied by the development of serum antibodies to VSPH7, suggesting the possibility that the immune response led to the antigenic variation.
An alternative or additional possible biological reason for antigenic variation might be adaptation to different intestinal environments. Some evidence for this possibility is provided by the difference in protease susceptibility for different VSP antigen types (
252). Trypsin and chymotrypsin were toxic to WB trophozoites expressing VSPA6 (reactive with MAb 6E7). Some organisms survived after exposure to these proteases, and when they were subsequently grown in the presence of trypsin and chymotrypsin, they were no longer susceptible to the cytotoxic effects and expressed alternative VSPs that were not reactive with MAb 6E7.
Antigenic variation has also been documented during the cycle of encystation and excystation (
217,
320). In vitro encystation followed by excystation of trophozoites expressing the VSP TSA417 resulted in the loss of TSA417 from the plasma membrane followed by its appearance in the lysosome-like peripheral vacuoles (
211,
320). During excystation, the TSA417 transcript was replaced by a variety of other
vsp transcripts. This occurred too rapidly to be the result of selection of antigenic variants surviving excystation and may have represented an induced antigenic shift. This antigenic variation during excystation could promote survival by evasion of an intestinal immune response or by adaptation to other important intestinal factors.
Thus, the two major hypotheses regarding the purpose of antigenic variation are (i) evasion of the host immune defense and (ii) enabling of the organisms to survive in different intestinal environments. It should be emphasized that these hypotheses are not mutually exclusive. A better understanding of the biological reasons for antigenic variation would be facilitated if the role of the VSPs was known. It would also be informative to know if the free-living diplomonads such as Hexamita have proteins homologous to the VSPs and if they undergo antigenic variation.
Structure and Biochemistry of the VSPs
The initial biochemical studies of the VSPs (then called ES products) indicated that they were protease susceptible. They did not bind to a series of lectins, suggesting that they were not glycosylated (
247). A portion of the gene for one of these surface antigens was subsequently cloned from an expression library using MAb 6E7. Sequence analysis revealed a cysteine-rich (12%) protein with frequent CXXC motifs (
6). Subsequent sequences of a number of
vsp genes have shown a number of characteristics common to all
vsp genes. They are all cysteine rich (12%), and most cysteines are present in CXXC motifs. When trophozoites are metabolically labeled with radiolabeled cysteine, most of the label is incorporated into the VSPs (
6,
11). It has been suggested that these cysteines are present in the form of disulfide bonds, since free thiol groups have not been detected in purified VSPs (
14,
266). However, the presence of thiol groups on the trophozoite surfaces suggests that the VSP cysteines may not all be disulfide bonded (
116). Whether the level of disulfide bonding changes in vivo with different oxygenation levels of the surrounding environment has not been determined.
After synthesis, the VSPs are transported through the ER (
211) to the membrane, where they diffusely coat the membrane (
112,
275). Smaller amounts of VSP can also be detected in the lysosome-like peripheral vacuoles, suggesting the possibility that these vacuoles are involved in recycling of the VSPs (
211). The VSPs have an approximately 14- to 17-amino-acid signal peptide that presumably represents a signal peptide for its transport through the ER. The signal peptide is cleaved, 14 amino acids from VSPH7 of the GS isolate (
201) and 17 amino acids from TSA417 of the WB isolate (
14).
The 38 C-terminal amino acids of the VSPs are >90% conserved at the amino acid and nucleotide levels. Each VSP reported to date ends with the amino acid motif CRGKA in both genotype A and B organisms. The conserved C terminus may represent a membrane-anchoring domain. Evidence for this proposal has been reported in work showing a difference in the C termini of membrane-bound and secreted VSP (
267). A 3,500-kDa fragment is cleaved from the C terminus of the membrane-associated VSP prior to secretion. The suggested cleavage site is between the K and S of the highly conserved NKSGLS motif (
267), which comprises the amino acids 33 to 38 from the C terminus (
235). This NKSGLS motif is generally conserved in the genotype A organisms (
88) but is not found in the
vspH7 gene from GS, a genotype B organism (
253).
The
vsp 3′ UTRs have the putative polyadenylation signal common to all
G. lamblia strains, AGTPuAAPy (
3,
273), immediately preceded by the sequence ACTPyAGPuT, which sometimes begins within the termination codon (
88,
320) and (Y. M. Yang and R. Adam, unpublished observations).
On the basis of sequences in
vsp genes similar to those shown in zinc binding by nuclear proteins, it has been proposed that the VSP genes may encode zinc binding proteins. The ability to bind zinc was shown for VSPH7 and other VSPs but was also found in regions of the VSP that did not contain the “zinc binding” motifs (
254). In addition to zinc, VSPH7 was able to bind other metals including iron. Another laboratory, using VSP4A1 (CRISP90) from a genotype A isolate, demonstrated the ability of reduced but not oxidized VSP to bind zinc. However, the degree of binding was substantially lower than an equimolar zinc:protein amount and was thought to be biologically insignificant (
266).
All of the VPSs reported have potential N-linked glycosylation sites (
201); biochemical analysis of VSP4A1 demonstrated O-linked GlcNAc rather than the more commonly found N-linked glycosylation (
268). VSP4A1 was also palmitoylated in the membrane-anchored portion of the C-terminal region (
267,
268). Subsequent work has extended the finding of glycosylation and palmitoylation to other VSPs, including VSPA6 and VSPH7 (
131).
Genomic Organization of the vsp Genes and Potential Mechanisms of Antigenic Variation
In addition to their marked 3′ similarity, the
vsp genes demonstrate different degrees of identity in other regions. In some cases, there is a high degree of similarity throughout the entire gene. For example, there are two identical convergent copies of the
vsp1267 gene approximately 3 kb apart (
235) and there is greater than 90% identity in the nonrepeat regions of
vspA6 and
vspA6-S1(
357) as well as CRP136 and CRP65 (
52). These examples suggest that the
vsp gene repertoire has been expanded by gene duplication followed by divergence. The sequences for one of these
vsp genes (
vspA6-S1 or
vspG3M) from two geographically separated isolates (WB from Afghanistan and G3M from Peru) were identical over the regions of comparison (
237,
357), suggesting that this divergence is not an extremely rapid process. A number of
vsp genes related to TSA417 (
vsp417) have been identified from genotype A-1 and A-2 isolates. Comparisons have shown that the differences between genotypes are smaller than the differences between members of this
vsp gene family, suggesting that the divergence among the various members of the
vsp417 gene family occurred before the divergence between genotypes A-1 and A-2. The
vsp genes from the genotype B isolate GS also appear to exist in families (
246,
253) but demonstrate similarity to the
vsp genes of genotype A only in the conserved 38-amino-acid C terminus and in the CXXC motif.
There are also examples of near identity followed by abrupt divergence, suggesting that in some cases, the
vsp repertoire has been expanded by recombination among the
vsp genes. For example,
vspC5 demonstrates near identity to
vspC5-S2 from approximately −150 to +94 and to
vspC5-S1 from −150 to +130 (
355). A more complete understanding of the
vsp repertoire and gene organization will be available when the sequences obtained as part of the
Giardia genome project have been assembled.
The
vsp genes are present on most if not all the chromosomes and are reasonably dispersed throughout the genome. There does appear to be some clustering of genes as demonstrated by the linkage of some
vsp genes (e.g.,
vsp1267) (Table
5) and by the proximity of
vspgenes on some cosmid clones (Yang and Adam, unpublished).
Antigenic variation in African trypanosomes is frequently associated with genome rearrangements, such as duplicative transposition to a telomeric location (
32,
290). Therefore, it is of interest to know whether the
vsp genes are telomeric and whether there are genomic rearrangements associated with antigenic variation. Current data indicate that most of the
vsp genes are not telomere associated (
355-357). The
vspA6 gene is not found in the 150- and 240-kb
NotI telomeric fragments of chromosome 4 (
356), and the
vsp conserved region does not hybridize to these regions (Adam, unpublished). Genome rearrangements are not directly associated with antigenic variation of
vspA6 (
356) or
vspC5(
358).
Giardia trophozoites have a ploidy of at least 4, so there are at least four alleles of each gene. Since the usual definitions of allele cannot be used in a presumably asexual organism, genes have been considered to be allelic when they map to the same genomic location. For most genes, the alleles are identical or nearly identical to each other, sometimes containing heterozygosities at one or two sites (
17). However, in the case of the repeat-containing
vsp genes, such as
vspA6 and
vspC5, the different alleles can be distinguished by different repeat copy numbers (
356,
358) (Fig.
13). Three distinct alleles of the
vspA6 gene have been documented which differ in the copy number (i.e., 8, 9, or 23) of the 195-bp repeat (
356). In addition to the differences in repeat copy number, the allele with the greatest repeat copy number contains 8 nucleotide substitutions within the coding region. Some of these substitutions change the amino acid composition, but none change the reading frame. The flanking regions of the three alleles are identical.
Despite the open reading frames and identical flanking regions, a steady-state transcript is expressed only from the allele with the largest repeat copy number as documented by the correct size of band on a Northern blot in addition to direct RNA sequencing, which identified the nucleotide substitutions specific to the allele with the largest repeat copy number. Thus, a transcript is detectable from only one of the alleles despite the absence of apparent sequence alterations that might explain the difference in expression.
The
vspC5 gene is another repeat-containing gene, containing 17 to 26 copies of a 105-bp repeat. In this case, the WBA6 genome contains four distinct alleles of the
vspC5 gene (17, 20, 21, or 26 repeats) while the WBC5 genome (derived from WBA6) contains only three distinct alleles (
26,
21, or
20 repeats). There are no sequence differences among the different alleles, either in the coding or in the flanking regions. However, Northern analysis has identified only a single band that is most consistent with a transcript from the allele containing 26 repeats. (Because of the short 5′ and 3′ UTRs, the transcript size closely matches that of the coding region.)
Initial studies suggested significant recombination involving the
vspA6 (CRP170) gene (
6) and deletion of an expression-linked copy of
vspA6 in an isolate no longer expressing
vspA6 (
9). However, subsequent data have shown that these recombination events can be explained by changes in repeat copy number for the repeat-containing
vsp genes (
355-358) Other than changes in repeat copy number, chromosome-mapping studies of the
vspA6 and
vspC5genes have revealed no changes in genomic location and organization in isolates where a particular
vsp gene is expressed or not expressed (
356,
358). This indicates that antigenic variation does not occur by moving the
vsp genes into expression sites.
The allele-specific expression of the
vsp genes and the absence of sequence alteration or DNA rearrangements associated with antigenic variation suggest the possibility of an epigenetic form of control for
vsp genes expression. There is no methylated or “J” DNA in
Giardia (
335), arguing against altered DNA as a means of controlling
vsp gene expression. Whether changes in chromosome structure or histone acetylation status are involved in the control of
vsp gene expression has not been studied.