
Glycoside Hydrolase Family 9

From CAZypedia
Revision as of 22:05, 16 August 2010 by Harry Brumer (talk | contribs)

This page is currently under construction. This means that the Responsible Curator has deemed that the page's content is not quite up to CAZypedia's standards for full public consumption. All information should be considered to be under revision and may be subject to major changes.


Glycoside Hydrolase Family: GH9
Clan: GH-G
Mechanism: inverting
Active site residues: known/known
CAZy DB link: http://www.cazy.org/fam/GH9.html


Substrate specificities

GH Family 9 is an inverting glycoside hydrolase family that mainly contains cellulases and is the second largest cellulase family. Most members are endoglucanases, with a few processive endoglucanases. All of the processive endoglucanases contain a family 3c CBM rigidly attached to the C-terminus of the family 9 catalytic domain (CD) [1]. This CBM is part of the active site and is essential for processivity [1]. CBM3c domains bind weakly to cellulose because they lack several of the conserved aromatic residues that are important for cellulose binding in family 3a and family 3b members [2]. All known plant cellulases belong to family 9. Most of the other members are eubacterial, although there are two archaeal members and some fungal, earthworm, arthropod, chordate, echinoderm and mollusk members. There are two subgroups in family 9: E1, which contains only bacterial cellulases, from both aerobes and anaerobes, and E2, which includes some bacterial and all nonbacterial cellulases [3]. An evolutionary study shows that the eukaryotic members contain two ancient monophyletic groups, one including all animal members and the other including all plant members [4]. All known processive endoglucanase genes are in subgroup E1.

Plant GH9 Enzymes

Early reports described the existence of plant "cellulases" or EGases [5]. Subsequently, cellulases have been shown to be associated with plant cell wall restructuring during cell expansion, with the wall disassembly that accompanies processes such as fruit ripening and abscission (reviewed in [6, 7, 8]), and with cellulose biosynthesis [9, 10, 11]. The amino acid sequences of the first plant "cellulases"/endo-β-1,4-glucanases revealed that these enzymes belong to the CAZy family GH9 glycoside hydrolases [12].

Most plant "cellulases" studied to date are endoglucanases (EC 3.2.1.4) with low or no activity on crystalline cellulose, but with discernible activity on soluble cellulose derivatives, including carboxymethyl cellulose (CMC), on phosphoric acid swollen non-crystalline cellulose, and on numerous plant polysaccharides including xylan, 1,3-1,4-β-glucan, xyloglucan, and glucomannan [13, 14, 15, 16, 17]. The inability of plant "cellulases" to hydrolyze crystalline cellulose distinguishes them from microbial cellulases, whose modular structure and synergistic action with other enzymes facilitate effective degradation of crystalline cellulose. In muro, the substrates of plant cellulases likely include xyloglucan, xylans, and non-crystalline cellulose, especially amorphous regions of cellulose where the microfibrils may be interwoven with xyloglucan.

Plant GH9 subfamilies

In the model plant Arabidopsis thaliana, 25 different GH9 coding regions have been identified. Phylogenetic analysis of the deduced amino acid sequences groups the proteins into nine classes or three subfamilies [8, 17, 18, 19]. Three distinct types of GH9 proteins are present in plants. Class A proteins are membrane-anchored, Class B proteins are secreted, and Class C proteins are also secreted but contain a family 49 carbohydrate binding module (CBM49) [17]. Class A plant EGases have been reported to lack the tryptophans that correspond to the substrate-binding residues at subsites -4, -3, and -2 in T. fusca Cel9A (TfCel9A) [13]. Class C EGases are the only plant EGases to date that contain a tryptophan residue corresponding to the one in subsite -2 in TfCel9A [13, 17]. This tryptophan has been shown to be important for hydrolysis in TfCel9A: when the Trp is replaced by another amino acid, the enzyme retains less than 10% of its normal activity on polymeric cellulose substrates and less than 1% of wild type activity on cellohexaose [13, 20].

Class A

The Class A EGases are integral type II membrane proteins that have a GH9 catalytic core and lack a canonical secretion signal sequence. These enzymes are predicted to have a high degree of N-glycosylation and a long amino-terminal extension with a membrane-spanning domain that anchors the protein to the plasma membrane and/or to intracellular organelles [8, 21]. Membrane-anchored EGases were first described in studies of the KORRIGAN (KOR) genes in Arabidopsis thaliana, which showed that they encode EGases that are required for normal cellulose synthesis or assembly. Plants with mutant alleles of the KOR1 gene are dwarfed, with decreased cellulose content and crystallinity [8, 22, 23]. The role of the Class A EGases in plants is not known. However, the KOR proteins have been proposed to cleave sitosterol-β-glucoside primers from the growing cellulose polymer, or to have a role in editing incorrectly formed growing microfibrils [24]. More recently, it has been shown that during cell expansion, KOR1 is cycled from the plasma membrane through intracellular compartments comprising both the Golgi apparatus and early endosomes; however, the role of KOR1 in cellulose biosynthesis remains to be determined [25]. The catalytic domain of PttCel9A, a Class A GH9 enzyme that is upregulated during secondary cell wall synthesis in Populus tremula x tremuloides, has been biochemically characterized and shown to hydrolyse a narrow range of substrates in vitro, including CMC, phosphoric acid swollen cellulose and cellulose oligosaccharides (DP≥5) [13, 26].

Class B

Class B proteins are the most common form of plant EGases and are associated with virtually all stages of plant growth and development. These enzymes have a GH9 catalytic domain and a signal sequence for ER targeting and secretion. Different isoforms are expressed during fruit ripening, in abscission zones, in reproductive organ development, and in expanding cells [27, 28, 29, 30]. Numerous studies, especially in tomato, have also shown that many Class B EGases are under hormonal control [21, 31, 32].

Class C

Plant Class C GH9 enzymes are the least studied. These proteins are predicted to have a signal sequence followed by a GH9 catalytic domain and a long carboxyl-terminal extension, which contains a CBM49 that has been shown to bind to crystalline cellulose in vitro [17, 19]. CBMs are necessary for activity on crystalline substrates and may promote hydrolysis by increasing the local enzyme concentration at the substrate surface as well as by modifying cellulose microfibril structure (for review see [33]). The catalytic domain (CD) of SlGH9C1 from tomato is promiscuous and can effectively hydrolyze artificial cellulosic polymers, cellulose oligosaccharides, and several plant cell wall polysaccharides [17]. Nevertheless, the activity of the full-length, modular enzyme has still not been characterized. A Class C EGase from rice, OsCel9A, has been shown to be post-translationally modified at the linker region to yield a 51 kDa GH9 CD and a CBM49, and it was suggested that the cleavage is necessary for function [34]. The OsCel9A CD also displays a broad substrate range and was able to hydrolyze CMC, phosphoric acid-swollen cellulose, mixed linkage 1,3-1,4-β-glucan, xylan, glucomannan, cellooligosaccharides (DP≥3) and 1,4-β-xylohexaose [14]. For information regarding the nomenclature of plant GH9 enzymes, please see Urbanowicz et al. 2007 [19].

Kinetics and Mechanism

The processive endoglucanase Cel9A from Thermobifida fusca has high activity on bacterial cellulose and is the only cellulase tested that can degrade crystalline regions in bacterial cellulose by itself, although it prefers amorphous regions [35]. A related cellulase in Clostridium phytofermentans, which is the only family 9 cellulase encoded in its genome, has been shown to be essential for cellulose degradation by this organism. This is the only case in which a single cellulase has been shown to be essential for growth on cellulose [36].

Catalytic Residues

A conserved Glu residue functions as the catalytic acid, and two conserved Asp residues bind the catalytic water. One of these Asp residues functions as the catalytic base; mutation of the other also greatly reduces activity on all substrates [37].

Three-dimensional structures

All known family 9 catalytic domain structures have an (α/α)6 barrel fold with an open active site cleft that contains at least six sugar binding subsites, -4 to +2 [1, 38]. In processive endoglucanases the catalytic domain is joined to a family 3c CBM that is aligned with the active site cleft [1].

Family Firsts

First stereochemistry determination
The stereochemistry of hydrolysis by three family 9 cellulases was determined to be inverting by NMR [39].
First general base residue identification
Asp58 in T. fusca Cel9A was shown to be the catalytic base by site-directed mutagenesis and azide rescue [20].
First general acid residue identification
Glu555 was shown to be the catalytic acid in C. thermocellum CelD by site-directed mutagenesis [40].
First 3-D structure
The structure of endocellulase CelD from Clostridium thermocellum was determined by X-ray crystallography (PDB ID 1clc) [41].


References

  1. PubMed ID:9334746 [Sakon1997]
  2. PubMed ID:8918451 [Tormo1996]
  3. PubMed ID:8540419 [Tomme1995]
  4. PubMed ID:15703240 [Davison2005]
  5. PubMed ID:14097721 [Hall1963]
  6. PubMed ID:10417876 [Campillo1999]
  7. PubMed ID:10322557 [Rose1999]
  8. PubMed ID:12514237 [Molhoj2002]
  9. PubMed ID:9755157 [Nicol1998]
  10. PubMed ID:11351091 [Lane2001]
  11. PubMed ID:11266576 [Sato2001]
  12. Henrissat B. (1991). A classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem J. 1991;280(Pt 2):309-16. DOI:10.1042/bj2800309 | PubMed ID:1747104 [Henrissat1991]
  13. PubMed ID:15287736 [Master2004]
  14. PubMed ID:17056618 [YoshidaKomae2006]
  15. PubMed ID:11069690 [Ohmiya2000]
  16. PubMed ID:11762160 [Woolley2001]
  17. PubMed ID:17322304 [Urbanowicz2007]
  18. PubMed ID:15170254 [Libertini2004]
  19. PubMed ID:17687051 [UrbanowiczBennett2007]
  20. PubMed ID:17369336 [Li2007]
  21. PubMed ID:9037162 [Brummell1997]
  22. PubMed ID:19398462 [Takahashi2009]
  23. PubMed ID:11778054 [Peng2002]
  24. PubMed ID:16284310 [Robert2005]
  25. PubMed ID:18402467 [Rudsander2008]
  26. PubMed ID:10480385 [Brummel1999]
  27. PubMed ID:10555309 [Kalaitzis1999]
  28. PubMed ID:9290636 [Shani1997]
  29. PubMed ID:9301092 [Catala1997]
  30. PubMed ID:1281437 [Bonghi1998]
  31. Boraston AB, Bolam DN, Gilbert HJ, and Davies GJ. (2004). Carbohydrate-binding modules: fine-tuning polysaccharide recognition. Biochem J. 2004;382(Pt 3):769-81. DOI:10.1042/BJ20040892 | PubMed ID:15214846 [Boraston2004]
  32. PubMed ID:17056619 [YoshidaImaizumi2006]
  33. Chen, Arthur J. Stipanovic, William T. Winter, David B. Wilson and Young-Jun Kim. Effect of digestion by pure cellulases on crystallinity and average chain length for bacterial and microcrystalline celluloses. Cellulose 2007;14:283-293. [Chen2007]
  34. PubMed ID:19775243 [Tolonen2009]
  35. PubMed ID:15274620 [Zhou2004]
  36. PubMed ID:11884144 [Geurin2002]
  37. Gebler J, Gilkes NR, Claeyssens M, Wilson DB, Béguin P, Wakarchuk WW, Kilburn DG, Miller RC Jr, Warren RA, and Withers SG. (1992). Stereoselective hydrolysis catalyzed by related beta-1,4-glucanases and beta-1,4-xylanases. J Biol Chem. 1992;267(18):12559-61. PubMed ID:1618761 [Gebler1992]
  38. Lascombe MB, Souchon H, Juy M, and Alzari PM. Three-Dimensional Structure of Endoglucanase D at 1.9 Angstroms Resolution. Deposited 1995, unpublished. [Lascombe1995]
  39. PubMed ID:14871312 [Szyjanowicz]
