CAZypedia needs your help!
We have many unassigned pages in need of Authors and Responsible Curators. See a page that's out-of-date and just needs a touch-up? - You are also welcome to become a CAZypedian. Here's how.
Scientists at all career stages, including students, are welcome to contribute.
Learn more about CAZypedia's misson here and in this article.
Totally new to the CAZy classification? Read this first.

Difference between revisions of "Carbohydrate Binding Module Family 92"

From CAZypedia
Jump to navigation Jump to search
m (→‎Family Firsts: Edited for brevity)
 
(5 intermediate revisions by 2 users not shown)
Line 19: Line 19:
  
 
== Ligand specificities ==
 
== Ligand specificities ==
The first published member in the CBM92 family is the Cgk16A-CBM92 from the marine bacterium ''Wenyingzhuangia aestuarii'' OF219 <cite>Mei2022</cite>. The CBM92 bound specifically to the red algal polysaccharide carrageenan, a substrate of the appended [[GH16]] enzyme domain. It was incapable of binding to other polysaccharide components in red algae including agarose, porphyran, and funoran <cite>Mei2022</cite>. Meanwhile, the CBM92 displayed no affinity to several anionic polysaccharides, namely pectin, chondroitin sulfates, dermatan sulfate, and sulfated fucans <cite>Mei2022</cite>. The Cgk16A-CBM92 showed no significant difference in the affinity to κ- and ι-carrageenan.
+
The first published CBM92 domain, and founding member of the CBM92 family, is the Cgk16A-CBM92 from the marine bacterium ''Wenyingzhuangia aestuarii'' OF219 <cite>Mei2022</cite>. The CBM92 bound specifically to the red algal polysaccharide carrageenan, a substrate of the appended [[GH16]] enzyme domain. It was incapable of binding to other polysaccharide components in red algae including agarose, porphyran, and funoran <cite>Mei2022</cite>. Meanwhile, the CBM92 displayed no affinity to several anionic polysaccharides, namely pectin, chondroitin sulfates, dermatan sulfate, and sulfated fucans <cite>Mei2022</cite>. The Cgk16A-CBM92 showed no significant difference in the affinity to κ- and ι-carrageenan.
  
 
An earlier paper had demonstrated carbohydrate binding by a so-called Bacterial Fascin-like Domain (BFLD) within the [[GH16]] β−1,3-glucanase LamC from a myxobacterial ''Corallococcus'' species. Affinity gel electrophoresis showed that the domain, now recognized as a [http://www.cazy.org/CBM92.html CBM92 domain] could bind to β−1,3-glucans <cite>Zhou2017</cite>. Later, Lu et al showed binding to β-1,6-glucan by a CBM92 domain found within ''Cp''Glu30A, a [[GH30]] β-1,6-glucanase <cite>Lu2023</cite>. Binding to the Glc- β-1,6-Glc structure found in pustulan, laminarin, scleroglucan, and yeast β-glucan has since been demonstrated for 12 additional members of family CBM92, none of which were able to bind carrageenan <cite>Hao2024</cite>.
 
An earlier paper had demonstrated carbohydrate binding by a so-called Bacterial Fascin-like Domain (BFLD) within the [[GH16]] β−1,3-glucanase LamC from a myxobacterial ''Corallococcus'' species. Affinity gel electrophoresis showed that the domain, now recognized as a [http://www.cazy.org/CBM92.html CBM92 domain] could bind to β−1,3-glucans <cite>Zhou2017</cite>. Later, Lu et al showed binding to β-1,6-glucan by a CBM92 domain found within ''Cp''Glu30A, a [[GH30]] β-1,6-glucanase <cite>Lu2023</cite>. Binding to the Glc- β-1,6-Glc structure found in pustulan, laminarin, scleroglucan, and yeast β-glucan has since been demonstrated for 12 additional members of family CBM92, none of which were able to bind carrageenan <cite>Hao2024</cite>.
Line 28: Line 28:
 
Several conserved residues (e.g., Phe-70, Arg-72, and Phe-75) were discovered through the multiple sequence alignments of Cgk16A-CBM92 and its close homologs <cite>Mei2022</cite>, which were suggested to be critical for the ligand binding of that CBM. Phylogenetic analysis by Hao et al later showed that the homologs of Cgk16A-CBM92 represent a small sub-group within the CBM92 family (Fig. 1) <cite>Hao2024</cite>. In most other members of CBM92, there are three apparent binding sites identified by a conserved WExF sequence motif. The Trp of this motif is the principal sugar-binding residue, as evidenced by a site-directed mutagenesis study that could abolish binding by altering these residues to Ala <cite>Hao2024</cite>. The same paper also showed that wild-type proteins with fewer Trp-containing binding sites generally showed weaker binding to polysaccharide ligand <cite>Hao2024</cite>. A conserved CNR motif is found just to the N-terminal of the WExF sequence (Fig. 1). Structural analysis revealed that the Arg of this trio contributes to ligand binding, although it is absent in some binding sites in some proteins <cite>Hao2024</cite>.  
 
Several conserved residues (e.g., Phe-70, Arg-72, and Phe-75) were discovered through the multiple sequence alignments of Cgk16A-CBM92 and its close homologs <cite>Mei2022</cite>, which were suggested to be critical for the ligand binding of that CBM. Phylogenetic analysis by Hao et al later showed that the homologs of Cgk16A-CBM92 represent a small sub-group within the CBM92 family (Fig. 1) <cite>Hao2024</cite>. In most other members of CBM92, there are three apparent binding sites identified by a conserved WExF sequence motif. The Trp of this motif is the principal sugar-binding residue, as evidenced by a site-directed mutagenesis study that could abolish binding by altering these residues to Ala <cite>Hao2024</cite>. The same paper also showed that wild-type proteins with fewer Trp-containing binding sites generally showed weaker binding to polysaccharide ligand <cite>Hao2024</cite>. A conserved CNR motif is found just to the N-terminal of the WExF sequence (Fig. 1). Structural analysis revealed that the Arg of this trio contributes to ligand binding, although it is absent in some binding sites in some proteins <cite>Hao2024</cite>.  
  
[[File:CBM92structure.png|thumb|400px|right|'''Figure 2'''. '''a''' Overall structures of (left) ''Cp''CBM92A and (right) ''Cp''CBM92B with their subdomains distinctly coloured and their ligand binding Trp and Glu residues shown as sticks. '''b''' The β-subdomain of ''Cp''CBM92B in complex with glucose. '''c''' Overlay of the ''Cp''CBM92A and -B subdomains showing sequence conservation within all putative binding sites. Single letter residue codes are coloured based on the subdomains shown in panel '''a''' , and are labelled for subdomains ⍺/β/γ, in that order, with the ''Cp''CBM92A codes shown above those for ''Cp''CBM92B. Figure generated by [[User:Scott Mazurkewich| Scott Mazurkewich]] ]]
+
[[File:CBM92structure.png|thumb|400px|right|'''Figure 2'''. '''a''' Overall structure of ''Cp''CBM92B with the subdomains distinctly coloured and its ligand binding Trp and Glu residues shown as sticks. '''b''' The β-subdomain of ''Cp''CBM92B in complex with glucose. '''c''' Overlay of the ''Cp''CBM92A and -B subdomains showing sequence conservation within all putative binding sites. Single letter residue codes are colored based on the subdomains shown in panel '''a''' , and are labelled for subdomains ⍺/β/γ, in that order, with the ''Cp''CBM92A codes shown above those for ''Cp''CBM92B. Figure generated by [[User:Scott Mazurkewich| Scott Mazurkewich]] ]]
  
 
The crystal structures of ''Cp''CBM92A and ''Cp''CBM92B reveal a β-trefoil structure comprised of 12 β-strands arranged into 3 subdomains (⍺, β, and γ), like the β-trefoil domains found in Fascin and [[CBM13]] proteins (Fig.2). Soaking experiments of ''Cp''CBM92B crystals with glucose, gentiobiose, and sophorose revealed a binding cleft within each subdomain comprising a Trp-Glu binding pair, contributed by the WExF motifs <cite>Hao2024</cite>.  Ligand-bound structures suggest the potential for end-on binding to glucose and glucan oligo- or polysaccharides of potentially any linkage, but also reveal the possibility for extensions from both the O1 and O6 positions, enabling the observed mid-chain binding to a β−1,6-glucan such as pustulan, or to β-1,6-linked glucosyl substitutions in scleroglucan or laminarin.  
 
The crystal structures of ''Cp''CBM92A and ''Cp''CBM92B reveal a β-trefoil structure comprised of 12 β-strands arranged into 3 subdomains (⍺, β, and γ), like the β-trefoil domains found in Fascin and [[CBM13]] proteins (Fig.2). Soaking experiments of ''Cp''CBM92B crystals with glucose, gentiobiose, and sophorose revealed a binding cleft within each subdomain comprising a Trp-Glu binding pair, contributed by the WExF motifs <cite>Hao2024</cite>.  Ligand-bound structures suggest the potential for end-on binding to glucose and glucan oligo- or polysaccharides of potentially any linkage, but also reveal the possibility for extensions from both the O1 and O6 positions, enabling the observed mid-chain binding to a β−1,6-glucan such as pustulan, or to β-1,6-linked glucosyl substitutions in scleroglucan or laminarin.  
  
 
== Functionalities ==  
 
== Functionalities ==  
[[File:Figure 1.png|thumb|300px|right|'''Figure 3. Domain architecture of the κ-carrageenase Cgk16A. '''The enzyme consists of a signal peptide (1-20 amino acids), a GH16 domain (21-347 amino acids), a CBM92 domain (viz., Cgk16A-CBM92; 378-490 amino acids) and a C-terminal Sorting domain (516-581 amino acids).''' ]]
+
[[File:Figure 1.png|thumb|400px|right|'''Figure 3. Domain architecture of the κ-carrageenase Cgk16A. '''The enzyme consists of a signal peptide (1-20 amino acids), a GH16 domain (21-347 amino acids), a CBM92 domain (viz., Cgk16A-CBM92; 378-490 amino acids) and a C-terminal Sorting domain (516-581 amino acids).''' ]]
  
In the natural context, Cgk16A-CBM92 is a component of the κ-carrageenase Cgk16A <cite>Shen2018</cite> (Fig. 3). It thus might maintain the enzyme near its substrate to improve the enzymatic activity via the proximity effect. The same observation was made for a β-1,6-glucan-binding CBM92 found appended to a [[GH30]] β-1,6-glucanase enzyme from the environmental bacterium “Chitinophaga pinensis" <cite>Lu2023</cite>.  
+
In the natural context, Cgk16A-CBM92 is a component of the κ-carrageenase Cgk16A <cite>Shen2018</cite> (Fig. 3). It thus might maintain the enzyme near its substrate to improve the enzymatic activity via the proximity effect. The same observation was made for a β-1,6-glucan-binding CBM92 found appended to a [[GH30]] β-1,6-glucanase enzyme from the environmental bacterium ''Chitinophaga pinensis'' <cite>Lu2023</cite>.  
  
Members of the CBM92 family are present in different glycoside hydrolase (GH) family sequences, e.g., [[GH16]]_17, [[GH5]]_46, [[GH5]]_54, [[GH18]], [[GH19]], [[GH30]], and [[GH95]]. According to the [http://www.cazy.org/CBM92.html CAZy database], these GH families comprise enzymes with various substrate specificities, including κ-carrageenase ([[GH16]]_17), chitinase ([[GH19]] and [GH18]]), fucosidase ([[GH95]]), β-1,6-glucanase ([[GH30]]), β-1,3-glucanase ([[GH16]]), and galactosidase ([[GH95]]).   
+
Members of the CBM92 family are present in different glycoside hydrolase (GH) family sequences, e.g., [[GH16]]_17, [[GH5]]_46, [[GH5]]_54, [[GH18]], [[GH19]], [[GH30]], and [[GH95]]. According to the [http://www.cazy.org/CBM92.html CAZy database], these GH families comprise enzymes with various substrate specificities, including κ-carrageenase ([[GH16]]_17), chitinase ([[GH19]] and [[GH18]]), fucosidase ([[GH95]]), β-1,6-glucanase ([[GH30]]), β-1,3-glucanase ([[GH16]]), and galactosidase ([[GH95]]).   
  
 
To evaluate the feasibility of Cgk16A-CBM92 as a tool in the ''in situ'' investigation of carrageenan, a fluorescent probe was constructed by fusing Cgk16A-CBM92 with a green fluorescent protein. The ''in situ'' visualization of carrageenan in red alga ''Kappaphycus alvarezii'' was realized by utilizing the fluorescent probe <cite>Mei2022</cite>.
 
To evaluate the feasibility of Cgk16A-CBM92 as a tool in the ''in situ'' investigation of carrageenan, a fluorescent probe was constructed by fusing Cgk16A-CBM92 with a green fluorescent protein. The ''in situ'' visualization of carrageenan in red alga ''Kappaphycus alvarezii'' was realized by utilizing the fluorescent probe <cite>Mei2022</cite>.
Line 43: Line 43:
 
== Family Firsts ==
 
== Family Firsts ==
 
;First Identified: The first published CBM92 member <cite>Mei2022</cite> is a component of the κ-carrageenase Cgk16A <cite>Shen2018</cite>, which was discovered from a marine bacterium ''Wenyingzhuangia aestuarii'' OF219.  
 
;First Identified: The first published CBM92 member <cite>Mei2022</cite> is a component of the κ-carrageenase Cgk16A <cite>Shen2018</cite>, which was discovered from a marine bacterium ''Wenyingzhuangia aestuarii'' OF219.  
;First Structural Characterization: The structures of ''Cp''CBM92A and ''Cp''CBM92B were solved by [[User:Scott Mazurkewich| Scott Mazurkewich]] using X-ray crystallography <cite>Hao2024</cite>. These CBM92 domains flank a [[GH18]] chitinase <cite>Li2023</cite> within a large multi-modular enzyme that also contains a weakly-acting [[GH5]]_46 β-1,6-glucanase <cite>Lu2023</cite>. The modular enzyme is encoded by the genome of “Chitinophaga pinensis".
+
;First Structural Characterization: The structures of ''Cp''CBM92A and ''Cp''CBM92B were solved by [[User:Scott Mazurkewich| Scott Mazurkewich]] using X-ray crystallography <cite>Hao2024</cite>. These CBM92 domains flank a [[GH18]] chitinase <cite>Li2023</cite> within a large multi-modular enzyme from ''Chitinophaga pinensis'' that also contains a weakly-acting [[GH5]]_46 β-1,6-glucanase <cite>Lu2023</cite>.
  
 
== References ==
 
== References ==

Latest revision as of 08:07, 22 May 2024

Approve icon-50px.png

This page has been approved by the Responsible Curator as essentially complete. CAZypedia is a living document, so further improvement of this page is still possible. If you would like to suggest an addition or correction, please contact the page's Responsible Curator directly by e-mail.


CAZy DB link
https://www.cazy.org/CBM92.html

Ligand specificities

The first published CBM92 domain, and founding member of the CBM92 family, is the Cgk16A-CBM92 from the marine bacterium Wenyingzhuangia aestuarii OF219 [1]. The CBM92 bound specifically to the red algal polysaccharide carrageenan, a substrate of the appended GH16 enzyme domain. It was incapable of binding to other polysaccharide components in red algae including agarose, porphyran, and funoran [1]. Meanwhile, the CBM92 displayed no affinity to several anionic polysaccharides, namely pectin, chondroitin sulfates, dermatan sulfate, and sulfated fucans [1]. The Cgk16A-CBM92 showed no significant difference in the affinity to κ- and ι-carrageenan.

An earlier paper had demonstrated carbohydrate binding by a so-called Bacterial Fascin-like Domain (BFLD) within the GH16 β−1,3-glucanase LamC from a myxobacterial Corallococcus species. Affinity gel electrophoresis showed that the domain, now recognized as a CBM92 domain could bind to β−1,3-glucans [2]. Later, Lu et al showed binding to β-1,6-glucan by a CBM92 domain found within CpGlu30A, a GH30 β-1,6-glucanase [3]. Binding to the Glc- β-1,6-Glc structure found in pustulan, laminarin, scleroglucan, and yeast β-glucan has since been demonstrated for 12 additional members of family CBM92, none of which were able to bind carrageenan [4].

Structural Features

Figure 1. A consensus sequence logo generated from an alignment of 818 CBM92 domain sequences, including all carrageenan- and β-glucan binders characterised as of May 2024.

Several conserved residues (e.g., Phe-70, Arg-72, and Phe-75) were discovered through the multiple sequence alignments of Cgk16A-CBM92 and its close homologs [1], which were suggested to be critical for the ligand binding of that CBM. Phylogenetic analysis by Hao et al later showed that the homologs of Cgk16A-CBM92 represent a small sub-group within the CBM92 family (Fig. 1) [4]. In most other members of CBM92, there are three apparent binding sites identified by a conserved WExF sequence motif. The Trp of this motif is the principal sugar-binding residue, as evidenced by a site-directed mutagenesis study that could abolish binding by altering these residues to Ala [4]. The same paper also showed that wild-type proteins with fewer Trp-containing binding sites generally showed weaker binding to polysaccharide ligand [4]. A conserved CNR motif is found just to the N-terminal of the WExF sequence (Fig. 1). Structural analysis revealed that the Arg of this trio contributes to ligand binding, although it is absent in some binding sites in some proteins [4].

Figure 2. a Overall structure of CpCBM92B with the subdomains distinctly coloured and its ligand binding Trp and Glu residues shown as sticks. b The β-subdomain of CpCBM92B in complex with glucose. c Overlay of the CpCBM92A and -B subdomains showing sequence conservation within all putative binding sites. Single letter residue codes are colored based on the subdomains shown in panel a , and are labelled for subdomains ⍺/β/γ, in that order, with the CpCBM92A codes shown above those for CpCBM92B. Figure generated by Scott Mazurkewich

The crystal structures of CpCBM92A and CpCBM92B reveal a β-trefoil structure comprised of 12 β-strands arranged into 3 subdomains (⍺, β, and γ), like the β-trefoil domains found in Fascin and CBM13 proteins (Fig.2). Soaking experiments of CpCBM92B crystals with glucose, gentiobiose, and sophorose revealed a binding cleft within each subdomain comprising a Trp-Glu binding pair, contributed by the WExF motifs [4]. Ligand-bound structures suggest the potential for end-on binding to glucose and glucan oligo- or polysaccharides of potentially any linkage, but also reveal the possibility for extensions from both the O1 and O6 positions, enabling the observed mid-chain binding to a β−1,6-glucan such as pustulan, or to β-1,6-linked glucosyl substitutions in scleroglucan or laminarin.

Functionalities

Figure 3. Domain architecture of the κ-carrageenase Cgk16A. The enzyme consists of a signal peptide (1-20 amino acids), a GH16 domain (21-347 amino acids), a CBM92 domain (viz., Cgk16A-CBM92; 378-490 amino acids) and a C-terminal Sorting domain (516-581 amino acids).

In the natural context, Cgk16A-CBM92 is a component of the κ-carrageenase Cgk16A [5] (Fig. 3). It thus might maintain the enzyme near its substrate to improve the enzymatic activity via the proximity effect. The same observation was made for a β-1,6-glucan-binding CBM92 found appended to a GH30 β-1,6-glucanase enzyme from the environmental bacterium Chitinophaga pinensis [3].

Members of the CBM92 family are present in different glycoside hydrolase (GH) family sequences, e.g., GH16_17, GH5_46, GH5_54, GH18, GH19, GH30, and GH95. According to the CAZy database, these GH families comprise enzymes with various substrate specificities, including κ-carrageenase (GH16_17), chitinase (GH19 and GH18), fucosidase (GH95), β-1,6-glucanase (GH30), β-1,3-glucanase (GH16), and galactosidase (GH95).

To evaluate the feasibility of Cgk16A-CBM92 as a tool in the in situ investigation of carrageenan, a fluorescent probe was constructed by fusing Cgk16A-CBM92 with a green fluorescent protein. The in situ visualization of carrageenan in red alga Kappaphycus alvarezii was realized by utilizing the fluorescent probe [1].

Family Firsts

First Identified
The first published CBM92 member [1] is a component of the κ-carrageenase Cgk16A [5], which was discovered from a marine bacterium Wenyingzhuangia aestuarii OF219.
First Structural Characterization
The structures of CpCBM92A and CpCBM92B were solved by Scott Mazurkewich using X-ray crystallography [4]. These CBM92 domains flank a GH18 chitinase [6] within a large multi-modular enzyme from Chitinophaga pinensis that also contains a weakly-acting GH5_46 β-1,6-glucanase [3].

References

  1. Mei X, Chang Y, Shen J, Zhang Y, Han J, and Xue C. (2022). Characterization of a Novel Carrageenan-Specific Carbohydrate-Binding Module: a Promising Tool for the In Situ Investigation of Carrageenan. J Agric Food Chem. 2022;70(29):9066-9072. DOI:10.1021/acs.jafc.2c03139 | PubMed ID:35830544 [Mei2022]
  2. Zhou J, Li Z, Wu J, Li L, Li D, Ye X, Luo X, Huang Y, Cui Z, and Cao H. (2017). Functional Analysis of a Novel β-(1,3)-Glucanase from Corallococcus sp. Strain EGB Containing a Fascin-Like Module. Appl Environ Microbiol. 2017;83(16). DOI:10.1128/AEM.01016-17 | PubMed ID:28625980 [Zhou2017]
  3. Lu Z, Rämgård C, Ergenlioğlu İ, Sandin L, Hammar H, Andersson H, King K, Inman AR, Hao M, Bulone V, and McKee LS. (2023). Multiple enzymatic approaches to hydrolysis of fungal β-glucans by the soil bacterium Chitinophaga pinensis. FEBS J. 2023;290(11):2909-2922. DOI:10.1111/febs.16720 | PubMed ID:36610032 [Lu2023]
  4. Hao MS, Mazurkewich S, Li H, Kvammen A, Saha S, Koskela S, Inman AR, Nakajima M, Tanaka N, Nakai H, Brändén G, Bulone V, Larsbrink J, and McKee LS. (2024). Structural and biochemical analysis of family 92 carbohydrate-binding modules uncovers multivalent binding to β-glucans. Nat Commun. 2024;15(1):3429. DOI:10.1038/s41467-024-47584-y | PubMed ID:38653764 [Hao2024]
  5. Shen J, Chang Y, Chen F, and Dong S. (2018). Expression and characterization of a κ-carrageenase from marine bacterium Wenyingzhuangia aestuarii OF219: A biotechnological tool for the depolymerization of κ-carrageenan. Int J Biol Macromol. 2018;112:93-100. DOI:10.1016/j.ijbiomac.2018.01.075 | PubMed ID:29355636 [Shen2018]
  6. Li H, Lu Z, Hao MS, Kvammen A, Inman AR, Srivastava V, Bulone V, and McKee LS. (2023). Family 92 carbohydrate-binding modules specific for β-1,6-glucans increase the thermostability of a bacterial chitinase. Biochimie. 2023;212:153-160. DOI:10.1016/j.biochi.2023.04.019 | PubMed ID:37121306 [Li2023]

All Medline abstracts: PubMed