Cell Wall Degrading Enzymes in Trichoderma asperellum Grown on Wheat Bran

? Users may download and print one copy of any publication from the public portal for the purpose of private study or research. ? You may not further distribute the material or use it for any profit-making activity or commercial gain ? You may freely distribute the URL identifying the publication in the public portal ? Take down policy If you believe that this document breaches copyright please contact us at vbn@aub.aau.dk providing details, and we will remove access to the work immediately and investigate your claim.


Introduction
In recent years a significant effort has been put into efficient utilization of plant biomass, such as of agricultural crops, crop residues and industrial byproducts, for conversion into a range of valueadded bio-products to maximize the value derived from the biomass feedstock (Bozell & Petersen, 2010;Ragauskas et al., 2006). Plants have evolved complex structural and chemical mechanisms for resisting degradation of their structural sugars by for example fungi (Popper, 2008). To overcome the problem of recalcitrance of the plant cell wall, industry uses a range of pretreatment approaches such as use of cell wall degrading enzymes to convert the plant cell wall polymers into smaller molecules (Chundawat, Beckham, Himmel, & Dale, 2011;Himmel et al., 2007). In nature, various microorganisms produce enzymes that act independently or in synergy to break down the plant cell wall. Although it is not fully known how many enzymes are involved in plant cell-wall degradation, three general categories of enzymes are considered necessary to hydrolyze native cell-wall materials: Cellulases, hemicellulases, and accessory enzymes. The recalcitrance of plant cell walls to enzymatic degradation and the high cost of the necessary hydrolytic enzymes are significant barriers to the global and large-scale production of biofuels and value-added bio-based products (Merino & Cherry, 2007). There is a need to develop more efficient and cost effective enzyme mixtures for the conversion of biomass to fermentable sugars to increase production of cellulose-derived value-added products, such as food, feed, chemicals and biofuels. To address this challenge, it is essential to gain a better understanding of the interactions between plant cell wall polysaccharides and cell wall degrading enzymes (CWDE). The complexity of the plant cell wall is mirrored by the diverse arsenal of CWDE produced by lignocellulose-degrading microbes. Enzymes acting on sugar structures are collectively called Carbohydrate-Active enZymes (CAZymes) (Cantarel et al., 2009). The diversity of CAZymes reflects the structural diversity of plant cell walls. Each type of CAZyme function is represented in multiple families determined by sequence and structural similarities (Henrissat & Bairoch, 1993;Henrissat & Bairoch, 1996;Henrissat, Teeri, & Warren, 1998). There are currently 133 glycoside hydrolase (GH) families in the CAZy database, http:// www.cazy.org/. As a consequence of more genomes being sequenced a large number of enzymes are either labelled as "hypothetical" or even annotated incorrectly (Schnoes et al. 2009;Gerlt et al. 2011). High throughput annotation of CAZymes is done largely based on GH family using Blast or HMMer programs, leaving many proteins without a functional annotation (Lombard, Golaconda Ramulu, Drula, Coutinho, & Henrissat, 2014). A more specific annotation is achieved by a following manual analysis as for example described for the reannotation of Trichoderma reesei CAZymes (Häkkinen et al., 2012).
Many Trichoderma species are strong opportunistic invaders. They are fast growing and abundantly producers of spores, and contain highly active CAZymes and secondary metabolites (Monte, 2010). T. asperellum is a mycoparasitic species that are well known and widely used for their ability to inhibit the growth of plant pathogens. Their multi-enzymatic systems position them as some of the best antimicrobial agents ( T. asperellum showed promising results as an enzyme producer with a higher diversity of hemi-cellulases and β-glucosidases compared to T. reesei. The purpose of the present work was to analyze the GH enzymes in the transcriptome of T. asperellum when grown on wheat bran. This was done partly to understand the plant cell wall degrading system of T. asperellum and to identify enzymes important for the degradation of complex sugar structures, as well as to discover new enzymes. Several Trichoderma genomes are now available, which permitted a comparative study of GHs with the bioinformatic tool Peptide Pattern Recognition (PPR). With PPR it is possible to predict not only the GH family but also the enzyme function, so that the CAZymes may be compared on a functional level (Busk & Lange, 2013).

Biomass
T. asperellum was cultivated on 1.5% (w/v) agar plates (Sigma Aldrich, UK) with 2.5% (w/v) different cellulosic material for 3, 5 and 7 days each at 20 0 C and 30 0 C, where additionally each combination of incubation duration and temperature was conducted at three levels of pH (pH 4, pH 6.5 and pH 8). Cellulosic material included: wheat bran

Activity
Agar plate cultures of T. asperellum on each of the above cellulosic biomass substrates were washed with 10ml H 2 O containing 1% Tween 80 to extract secreted enzymes. Enzyme activity was measured on AZCL-plates prepared using 1% agarose in buffer, pH 5.8, containing 0.1% azurine cross-linked (AZCL) substrates (Megazymes, Bray, Ireland) as described in the protocol. Inoculation was done by placing 15 μL of enzyme blend from the growth plates in holes punched in AZCL-agarose plates. Enzyme activity was indicated by the area (cm 2 ) of blue color zones resulting from hydrolysis of the substrate. The extent of the blue zones, indicating the enzyme activity, was measured after 24h at 30 0 C.

RNA isolation
Mycelia were scraped from the surface of two agar plates on which T. asperellum had grown on wheat bran for seven days at 30 o C, frozen in liquid nitrogen and ground to a powder. RNA was extracted from the mycelia with fenozol and the RNA Total Maxi kit protocol (A&A Biotechnology, Gdynia, Poland).

Transcriptome sequencing
RNA was sequenced by Beijing Genomics Institute (BGI) using the Illumina HiSeq 2000 platform. Reads were assembled using Trinity (Grabherr et al., 2011) by first combining clean reads with a certain length of overlap to form longer fragments without N into contigs. The reads were then mapped back to contigs with paired-end reads in order to detect contigs from the same transcript as well as to determine the distances between those contigs. Trinity then connected the contigs, using N to represent unknown sequences between every two contigs. From this, scaffolds were made. Paired-end reads were used again for gap-filling of scaffolds to obtain sequences with the fewest Ns which could not be extended on either end. Such sequences were defined as unigenes. TGICL (Pertea et al., 2003) at default parameters was used to cluster these unigenes to acquire non-redundant unigenes of the greatest length possible. In the final step, a Blast X alignment was performed between the unigenes and the protein databases of NR, Swiss-Prot, KEGG, and COG. The best aligning results were used to determine the sequence directions of those unigenes. If the results of different databases conflicted with each other, we followed a priority order of NR-Swiss-Prot-KEGG-COG. Orientation and CDS of sequences that have no hits in blast search were predicted using ESTScan (Iseli, Jongeneel, & Bucher, 1999). Original transcript sequences (5'->3') were chosen if their orientations were unable to be determined by above mentioned approaches. Unigene expression levels were calculated using the RPKM method (Reads Per kb per Million reads) (Mortazavi, Williams, McCue, Schaeffer, & Wold, 2008). The formula is RPKM=(1000000*C)/ (N*L*1000). This formula assigns RPKM(A) to be the expression of gene A, C to be number of reads that uniquely aligned to gene A, N to be total number of reads that uniquely aligned to all genes, and L to be number of bases on gene A. The RPKM method is able to eliminate the influence of different gene length and sequencing discrepancy on the calculation of gene expression. Therefore the calculated gene expression can be directly used for comparing the difference of gene expression among samples.

Data processing
Further isolation of GHs was done by searching the obtained unigene annotation for the keywords: "glycoside, hydrolase, GH, EC: 3.2.1". These genes were afterwards checked for duplications by blasting gene fragments in NCBI and reverse blasting these results against a local database of obtained unigenes. Blast results lead to the construction of full unigenes without duplications. CLC Main workbench was used for local blast database and for collecting partial genes into full genes.

Peptide Pattern Recognition
Peptide Pattern Recognition (PPR) was used for the identification of GHs on a functional level in the Trichoderma genomes (Busk & Lange, 2013).

Phylogeny
Sequences were aligned using ClustalX 2.1 (http://clustalx.software. informer.com/2.1/) with default alignment parameters. Phylogenetic analysis was performed using the neighbor-joining algorithm of ClustalX 2.1 with default parameters (gapped regions were included). Bootstrap analysis (1000 trials) provided a measure of confidence for the detected relationships as described above. The resulting trees were visualized by the program Figtree (http://tree.bio.ed.ac.uk/software/ figtree/).

Results
The choice of substrate is important for the successful production of CAZymes, because different biomasses induce distinct fungal enzyme responses depending on the biomass structure and composition. An initial screening of ten carbohydrate-containing growth substrates was performed to induce the widest and largest production of CAZymes (Supplementary information). T. asperellum produced a wide range of enzymes after only three days incubation. The highest activity was recorded for endo-xylanases and β-glucanases on arabinoxylan, xylan and β-glucan. After three days wheat bran induced the widest range of enzyme activity as well as the largest response in terms of halo size, closely followed by duckweed, Spirodella polyrhiza. At five days the overall AZCL response increased for all growth substrates except PDA. The widest activity range with the largest halos was observed with wheat bran substrate at seven days (Table 1). Enzymes induced by growth on wheat bran resulted in activity on 8 out of the 13 tested AZCL-substrates, with the highest activity being recorded for xylanases with halo sizes of 4.9cm 2 and 4.5cm 2 on AZCL-arabinoxylan and AZCL-xylan, respectively. Second, the duckweed Spirodella polyrhiza induced enzymes that were detected on 7 out of 13 tested AZCLsubstrates, but to a lesser degree than wheat bran. This was indicated by significantly less by cellulase and 1,4-β-d-mannanase activity. Enzyme activities when grown on wheat bran were further investigated under different conditions. These included growth durations for five and seven days, temperatures from 20 0 C to 30 0 C and pH 4, pH 6.5 and pH 8 respectively. Neither the temperature nor the pH of the growth medium had any significant effect on the range of activities measured.

CAZymes in the transcriptome of T. asperellum grown on wheat bran:
Duplicates of mycelium from T. asperellum grown on wheat bran at 30 0 C for seven days were harvested and their RNA content was isolated and sequenced according to the description in materials and methods. The sequencing resulted in 12,901,784 reads that were assembled into 32,709 unigenes with an average length of 654nt. The transcriptome sequences were deposited in Transcriptome Shotgun Assembly (TSA) database in Genbank at the NCBI with the accession number SRR1575447. A BLASTp and BLASTx search against the National Center for Biotechnology Information (NCBI)'s protein nonredundant (NR) database showed that 66% of unigenes had significant hits (1e-05). The other 34% of the unigenes had no significant similarity to known proteins or domains in the NCBI database. The latter may contain sequences that mostly covered long untranslated regions of the mRNAs. Among the 32,709 unigenes, growth on wheat bran induced 175 GHs from 48 GH families in T. asperellum ( Figure 1 and detailed list in the supplementary information Table S4). The majority of the GH families were represented by 1-2 genes, but some families were represented with significantly more genes. The most transcribed family was GH18 with 20 genes, all of which were predicted to be chitinases. 14 genes were transcribed from the diverse GH16 family, of which most were predicted to be endo-1,3-β-glucanases. There were 11 genes each from the GH5 family and GH3 family of which the majority predicted to be endo-β-1,4-glucanases and β-glucosidases, respectively. To determine which genes were transcribed the most, the genes were listed in terms of sequenced raw reads. The most transcribed CWDE was shown to be an amylase represented with 18148 raw reads, which was three times more than the next enzymes. A 1,3-β-glucanase and a chitinase were the next most transcribed CAZyme with 7743 and 6831 raw reads, respectively.

Transcriptome derived secretome
To analyze which secreted GHs were transcribed in T. asperellum, the sequenced GHs were analyzed for the presence of a signal peptide. Genes without sequenced 5´ end were evaluated based on whether their homologue from Trichoderma atroviride contains a signal peptide. The transcriptome derived secretome contained 111 enzymes from 37 GH families, which corresponded to 62% of the transcribed GH enzymes covering 75% of the transcribed GH families (supplementary information). The majority of the signal peptide-containing sequences belonged to GH family GH18, GH16 and to some degree GH3 ( Figure  1). By contrast, GH76 was represented to a higher degree in the secretome compared with the transcriptome. Listing genes with signal peptide according to number of raw reads showed that the 34 most transcribed genes corresponded to 90% of the raw reads (Table 2).
Different substrates induced different CAZyme responses in T. asperellum, based on the initial screening of substrates. To analyze the correlation between growth substrate composition and the secreted CAZymes, the enzymes found in the transcriptome derived secretome were divided into groups of starch degrading, cellulose degrading, hemicellulases, chitinases and unknown based on their predicted   function. The transcriptional level based on raw reads of these enzyme groups was compared with the macromolecular composition of wheat bran ( Figure 2). A major part of the GHs predicted to be in the secretome were starch degrading enzymes, especially the highly transcribed amylase. The second most produced enzymes were hemicellulases, corresponding to the relatively high degree of arabinoxylan in wheat bran. The least secreted enzymes were cellulases, where the majority of the transcripts were two endo-1,3-β-glucanases from GH16, a family, where similar enzymes has shown also to contain 1,4-β-glucanases activity. Though, endo-1,3-β-glucanases are found to be more involved in anti-mycrobial actions than cellulose degradation (Vazquez-Garciduenas, Leal-Morales, & Herrera-Estrella, 1998). To a lesser transcription degree, two β-glucosidases (GH3) and a cellobiosidase (GH7) were present in the transcriptome derived secretome. Moreover, a relatively large part of the transcriptome derived secretome was chitinases, probably induced to rearrange the cell wall of fungus itself or as a constant anti-microbial response.

Composition of CAZymes across six Trichoderma species
Using a multigene phylogenetic approach, the Trichoderma genus can be divided into four big clades (Kullnig-Gradinger, Szakacs, & Kubicek, 2002). Clade A "section Trichoderma" consists of among others, T. atroviride, T. hamatum and T. asperellum. Clade B "section Pachybasium" consists of among others T. virens. Clade C "section Longibrachiatum" contains T. longibrachiatum and T. reesei. A small clade D consisting of H. aureoviridis. Peptide Pattern Recognition was used to compare the GHs from the genome of five Trichoderma species and the transcriptome from T. asperellum at a functional level (Table 3). Based on PPR analysis significantly more GHs were identified in the represented species from section Trichoderma and Pachybasium compared to the Longibrachiatum species. T. virens contained a GH profile more like T. asperellum, T. atroviride and T. hamatum rather than T. reesei and T. longibrachiatum. The difference was mostly due to a higher content of hemicellulases and chitinases in the species from Trichoderma and Pachybasium section, but they also contained a higher number of genes within all major enzyme classes (starch degrading, cellulose degrading, hemicellulases, chitinases). PPR analysis identified relatively few starch degrading enzymes, both in terms of functions but also in gene copies, though these were the most transcribed enzyme group based on raw reads. Twice as many hemicellulose degrading as cellulose functions were found across all species, though more were present in T. virens, T. atroviride and T. hamatum compared to T. reesei and T. longibrachiatum. In terms of gene copies, T. reesei and T. longibrachiatum had roughly the same number cellulose acting enzymes as hemicellulose, while T. virens, T. atroviride and T. hamatum expressed around 20% more hemicellulose acting enzymes compared to cellulose acting enzymes. The PPR analysis of GHs at a functional level makes it possible to pinpoint differences in the enzymatic potential of the fungi in terms of specific functions (Table 3). This has revealed for example fewer genes for the Longibrachiatum section within functions with high copy number like chitinase (EC. 3.2.1.14) and β-glucosidase (EC. 3.2.1.21). Further, T. reesei and T. longibrachiatum have lost 9-10 functions, for which the species in section Trichoderma have a low copy number; these include a β-fructofuranosidase, galactan 1,3-β-galactosidase, and α-l-rhamnosidase. The PPR analysis also showed a fructan β-fructosidase to be present only in T. hamatum and an exo-αsialidase present only in T. asperellum. Bacterial exo-α-sialidase has been shown to cleave sialic acids from sialyloligosaccharides, gangliosides and glycoproteins. These glycoconjugates are mostly found on surface exposed locations and thought to be among other things involved in cellto-cell interactions and adhesion (Kiyohara et al., 2011).

Phylogenetic relationship on a single gene level
Five CWDE-encoding genes were chosen to investigate the evolution on a single-gene level as a further examination of the genetic relationship between the Trichoderma species featured in the study. These genes were chosen based on their high transcription level and presence in the transcriptome derived secretome or because of their significance as the only gene with a specific function. The closest blast hits in NCBI were genes from T. atroviride or T. hamatum and the closest 20-30 hits were sequences from other ascomycete genes. The chosen protein sequences were aligned with all other enzymes from the corresponding GH family or function across all the Trichoderma species using ClustalX. When protein sequences were illustrated in a phylogenetic tree, clades form with related sequences from each Trichoderma and often according to their greater phylogenetic relationship on a species level (supplementary information). This generally form clades with one related sequence from each Trichoderma species, where sequences from the section Trichoderma (T. asperellum, T. hamatum and T. atroviride) exhibit closer relationship relative to sequences from section Longibrachiatum (T. longibrachiatum and T. reesei) and to sequences from the outlier T. virens from section Unigene14373_Lyx4 421 α-ϭ,Ϯ-ŵaŶŶosidase * GH92 Table 2: Transcriptome derived secretome represented by the 34th most transcribed genes containing signal peptide, corresponding to 90% of total secretome in terms of raw reads. Enzymes are listed by number of raw reads. The columns contain BGI-annotation as described in materials and methods. PPR annotation is a result of the predicted function by EC number.    Pachybasium. Moreover the genes formed clades with one enzyme from each Trichoderma, although on some clades a gene was missing from one or more Trichoderma. The same trend was seen for the other selected enzymes. To illustrate this point, the phylogenetic tree of endo-1,3-β-d-glucosidasee (EC. 3.2.1.39) is shown ( Figure  3).  1996). These studies indicate that several experimental parameters influence enzyme yields, including incubation time, extraction methods, and substrate loading. Other factors that improved cellulase production by T. reesei during solid state fermentation included relative humidity and temperature, continuous light exposure, and aeration (Mekala, Singhania, Sukumaran, & Pandey, 2008). In our study a range of enzyme activities were assayed by AZCL to evaluate the effect of specific growth substrate on the production of CAZymes by T. asperellum. The analysis revealed that substrates induced different enzyme responses from T. asperellum and that more complex sugars such as wheat bran induced a higher and wider activity range than the simple sugars found in e.g. PDA medium, which induced no activity. Beside the sugar complexity, enzyme production also seemed to be dependent on the degree of pretreatment and hence the accessibility to the sugar structure. This may for example be the case with the enzyme response for wheat straw compared to wheat bran; a relatively mild pretreatment of the wheat straw may not have opened up its structure and allowed access to more complex sugars, and this therefore resulted in the production of a relatively simple enzyme cocktail by T. asperellum. Processed wheat bran on the other hand seemed to have the right composition for allowing a broad range of CAZymes to be produced at the same pretreatment level as wheat straw.

CAZymes in the transcriptome of T. asperellum grown on wheat bran
To elucidate the enzymatic response on a transcript level, the transcriptome of T. asperellum was sequenced and glycoside hydrolases were identified the results. When T. asperellum was grown on wheat bran, 175 GHs from 48 GH families were induced. The genomes of phylogenetically similar fungi like T. atroviride encode 213 GHs (Xie et al., 2014), meaning that around 80% of the GHs present in the genome can be induced by growing T. asperellum on wheat bran. The most represented family was GH18, which was predicted to be chitinases. This response corresponds to other experiments using Trichoderma species that exhibit a high production of chitinases (Marx et al., 2013;Tseng, Liu, Yang, Lo, & Peng, 2008). Moreover, several GH16encoding genes were transcribed. This family contains enzymes with different functions, but all of the expressed GH16s were predicted to be β-glucanases which, like chitinases, are reported to be involved in mycoparasitic activities (Tseng et al., 2008). Compared with a similar experiment with Trichoderma harzianum (Uversky et al., 2014), GH16 genes were upregulated when growing on sugar bagasse compared with when growing on cellulose and lactose, while a GH16 glucan endo-1,3(4)-β-glucosidase from T. reesei has been reported to be highly induced on cellulose (dos Santos Castro et al., 2014). A large transcription of GH5 genes was also observed. The genes from GH5 were predicted to contain several functions, including endo-β-1,6galactanase, endo-1,4-glucanase, β-1,6-glucanase, β-mannanase and xylanase. By contrast, the T. harzianum paper (Uversky et al., 2014) showed that each growth substrate induced its specific GH5 enzyme. This was possibly due to the simpler structure of substrates in that study compared to wheat bran, and hence fewer enzyme types were needed to bring about degradation. A more in-depth analysis of the enzymes is needed to identify which function is related to which growth substrate. Interestingly, the Trichoderma species contain only a few α-glucosidases, making them the smallest group, but α-glucosidases was seen to be the most transcribed enzyme group by T. asperellum, when grown on wheat bran. This may be due to the relatively simple structure of starch, hence needing fewer different enzymes compared to hemicellulose, which due to its more complex structure needs a more complex enzyme cocktail for degradation.

Transcriptome derived secretome
To analyze which secreted GHs were transcribed in T. asperellum, the sequenced GHs were analyzed for the presence of a signal peptide. The GH families present in the transcriptome derived secretome were mostly represented by one enzyme per family, but four families exhibited a higher representation: there were five enzymes from GH16, primarily β-glucanases; GH18, primarily chitinases; GH76, α-glucosidases; and GH3, β-glucosidases and xylosidases. Besides being represented by more enzymes than the other families, these were also among the most transcribed genes. Typically, enzyme blends from Trichoderma are low in β-glucosidase activity and are supplemented with β-glucosidases by co-cultivation with other high β-glucosidase-producing fungi to efficiently hydrolyze sugar structure (Singhania, Patel, Sukumaran, Larroche, & Pandey, 2013). The enhanced β-glucosidase activity of T. asperellum corresponds to other research where this fungus has been grown on sugar bagasse (Marx et al., 2013). But a relatively large part of the transcriptome derived secretome by number of genes was chitinases and β-glucanases. The Trichoderma genus has been reported to produce several β-glucanases and chitinases, which may either be involved in rearranging of the fungus' own cell wall or as a constant anti-microbial response (Ike et al., 2006;Limón & Codón, 2004;Tseng et al., 2008;Uversky et al., 2014). This indicates that wheat bran induces a significant number of antimicrobial enzymes and not just a CAZyme response corresponding with its biomass composition. This might provide the fungi with an advantage in nature, but could represent an energy waste for industrial biomass degradation.

Composition of CAZymes across six Trichoderma species
A drawback of the current classification of GHs in GH families is that each family consists of members with different functions. When a number of enzymes in a GH family have been characterized, the PPR program is able to predict the function of other family members identified only by their sequence. PPR is a non-alignment based method for identifying conserved sequence motifs in biological sequences (for example peptides in proteins) (Busk & Lange, 2013). This provides an opportunity to analyze the fungi in more depth based on GHs and corresponding functions present in the genomes. In the case of family GH76, only one enzyme is characterized, thus PPR is unable to make a reliable prediction. Due to incomplete information about enzymatic properties of the GHs, PPR was only able to predict the function of about 80% of the GHs. The non-identified GHs were subsequently used in Blast search and their conserved domains analyzed to predict their function. Checking the unidentified GHs limits the possibility of them containing functions described in table 3. Based on the PPR analysis T. atroviride, T. hamatum and T. virens contained significantly more GHs than T. reesei and T. longibrachiatum. In the case of T. hamatum PPR identified almost 50% more GHs than in T. reesei. This difference was mostly due to a higher number of hemicellulases and chitinases, but T. reesei and T. longibrachiatum also contained fewer enzymes across all major enzyme groups such as starch degrading cellulases and hemicellulases compared to its mycoparasitic relatives. This was not only the case for enzymes with many copies; based on PPR T. reesei and T. longibrachiatum have lost 9-10 activities. These enzymes belong to a wide range of GH families and are involved in starch, cellulose and hemicellulose degradation. The consequence of this function loss is unclear because T. reesei is regarded as a relatively good lignocellulosic degrader. The overall picture of the PPR analysis corresponds to the current theory of mycoparasitic species, which suggests that the mycotroph-related genes arose in the common ancestor of Trichoderma that had the ancestral mycotrophic life style, and that some of these genes were subsequently lost in saprotrophic T. reesei (Kubicek et al., 2011). This supposedly happened as T. reesei became an efficient saprotroph on dead wood by following wood-degrading fungi into their habitat (Rossman, Samuels, Rogerson, & Lowen, 1999; Xie et al., 2014). These findings are also reflected in the phylogenetic comparison that shows a close relation between T. reesei and T. longibrachiatum and a more distant relation to the other species.

Phylogenetic relationship on a single gene level
Phylogenetic trees of the enzymes illustrate clades with similar genes across the species, placing the individual enzymes according to their greater species phylogenetic relationship (Kullnig-Gradinger et al., 2002). This supports the theory that CAZymes in the Trichoderma species have evolved from a common ancestor, followed by an evolutionary process, which resulted in especially the saprotrophic species losing genes (Kubicek et al., 2011). This created smaller clades, where only homologues from some of the Trichoderma species were present. All Trichoderma species contain 6-7 enzymes predicted to be from GH76, and are thus relatively unchanged by the loss of GHs in saprotrophic species. This indicates that these enzymes may have a key role in the degradation of complex biomass, which is also reflected in the high representation of GH76 family in the secretome of T. asperellum when grown on wheat bran. To predict the function of the GH76s with greater accuracy, more members of the family needs to be characterized. One characterized gene from GH76 is a α-1,6-mannanase (Maruyama & Nakajima, 2000), which shows close similarity to one of the predicted GH76 from T. asperellum. With respect to enzyme discovery, the transcriptome sequencing has elucidated the most transcribed GHs on a specific substrate, which is an indication of which enzymes are important and potentially effective for degradation. PPR on the other hand elucidated the differences between GHs represented

Conclusion
When T. asperellum was grown on wheat bran, 175 GHs from 48 GH families were transcribed, corresponding to more than 80% of the GHs compared with the closely related T. atroviride. Based on number of raw reads, 90% of enzymes bound for secretion, which are used by the fungi use to degrade biomass, consisted of 35 different enzymes divided on 20 different families, indicating that 40% of the glycoside hydrolase families are represented in the secretome. Beside the main carbohydrate degrading enzymes, T. asperellum also transcribed several chitinases and β-glucanases for secretion. This result is suggested to be either for rearrangement of chitin of the fungus itself or a component in anti-microbial action. More detailed insight was gained into the evolutionary differences among Trichoderma spp. by comparing the glycoside hydrolases present in different species. It was shown that the genetically similar T. atroviride, T. virens, T. hamatum and T. asperellum contain a larger and broader repertoire of glycoside hydrolases compared to the genetically similar T. reesei and T. longibrachiatum. This is additionally supported by the phylogenetic relationship of individual enzymes that form clades of homologues with distances corresponding to the overall phylogenetic relationship between the species. Analysis of enzymes on a functional level instead of on a broader family level allowed the identification of specific functions lost in T. reesei and T. longibrachiatum compared to T. atroviride, T. virens, T. hamatum and T. asperellum, as well as unique functions present in certain species. The latter can be of importance for the discovery of novel glycoside hydrolases involved in plant cell wall degradation.