The teachings provided herein are generally directed to a method of converting lignin-derived compounds to valuable aromatic chemicals using an enzymatic, bioconversion process. The teachings provide a selection of (i) host cells that are tolerant to the toxic compounds present in lignin fractions; (ii) polypeptides that can be used as enzymes in the bioconversion of the lignin fractions to the aromatic chemical products; (iii) polynucleotides that can be used to transform the host cells to express the selection of polypeptides as enzymes in the bioconversion of the lignin fractions; and (iv) the transformants that express the enzymes.
1. A system for bioprocessing lignin-derived compounds, comprising:
a host cell transformed by a vector comprising a recombinant polynucleotide having a nucleotide sequence that encodes a polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101, the amino acid sequence conserving residues 19-22, 24, 25, 27-30, 33-36, 39-45, 47, 48, 50-54; 100, 101, 104, 111, 112, 115, 116, 166, 107, 184, 187, 188, 191, 192, and 195 of SEQ ID:101; a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. 2. The system of 3. The system of 4. A system for bioprocessing lignin-derived compounds, comprising:
a transformant including a host cell transformed with a vector comprising a recombinant polynucleotide having a nucleotide sequence that encodes a polypeptide comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101, the amino acid sequence conserving residues 19-22, 24, 25, 27-30, 33-36, 39-45, 47, 48, 50-54; 100, 101, 104, 111, 112, 115, 116, 166, 107, 184, 187, 188, 191, 192, and 195 of SEQ ID:101, the transformant expressing the polypeptide; a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. 5. The system of 6. The system of 7. The system of 8. The system of 9. The system of 10. The system of 11. The system of 12. A system for bioprocessing lignin-derived compounds, comprising:
a transformant including a host cell transformed with a vector comprising a recombinant polynucleotide having a nucleotide sequence that encodes a polypeptide comprising SEQ ID NO:101; or conservative substitutions thereof outside of conserved residues 19-22, 24, 25, 27-30, 33-36, 39-45, 47, 48, 50-54; 100, 101, 104, 111, 112, 115, 116, 166, 107, 184, 187, 188, 191, 192, and 195 of SEQ ID:101, the transformant expressing the polypeptide; a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. 13. The system of 14. The system of 15. The system of 16. The system of 17. The system of 18. The system of 19. A system for bioprocessing lignin-derived compounds, comprising:
a transformant including an a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. 20. The system of 21. The system of 22. The system of 23. The system of
This application claims the benefit of U.S. Provisional Application Nos. 61/403,440, filed Sep. 15, 2010; and 61/455,709, filed Oct. 25, 2010; each application of which is hereby incorporated herein by reference in it's entirety, The instant application is filed with an ASCII compliant text file of a Sequence Listing. The name of the attached file is ALIGP004US01_SEQLIST_AS-FILED.txt, and the file was created Aug. 29, 2011, is 813 KB in size, and is hereby incorporated herein by reference in its entirety. Because the ASCII compliant text file serves as both the paper copy required by §1.821(c) and the CRF required by §1.821(e), the statement indicating that the paper copy and CRF copy of the sequence listing are identical is no longer necessary under 37 C.F.R. §1.821(f), as per Federal Register/Vol. 74, No. 206/Tuesday, Oct. 27, 2009, Section I. 1. Field of the Invention The teachings provided herein are generally directed to a method of converting lignin-derived compounds to valuable aromatic chemicals using an enzymatic, bioconversion process. 2. Description of the Related Art Currently, there is a worldwide, global dependence on petroleum as a deplete-able feedstock for the manufacture of fuels and chemicals. The problems of using petroleum are so well-known and documented that they've become nearly a cliché to the world population. In short, petroleum-based processes are dirty and hazardous. Environmental effects associated with the use of petroleum are known to include, for example, air pollution, global warming, damage from extraction, oil spills, tarballs, and health hazards to humans, domestic animals, and wildlife. Oil refineries, for example, are petroleum-based processes that primarily produce gasoline. However, they are also used extensively to produce valuable and less well-known chemical products used in the manufacture of pharmaceuticals, agrochemicals, food ingredients, and plastics. A clean, green alternative to this market area would be appreciated worldwide. Bioprocesses can present a clean, green alternative to the petroleum-based processes, a bioprocess being one that uses organisms, cells, organelles, or enzymes to carry out a commercial process. Biorefineries, for example, can produce, for example, chemicals, heat and power, as well as food, feed, fuel and industrial chemical products. Examples of biorefineries can include wet and dry corn mills, pulp and paper mills, and the biofuels industry. In leather tanning, hides are softened and hair is removed using proteases. In brewing, amylases are used in germinating barley. In cheese-making, rennin is used to coagulated the proteins in mil. The biofuels industry, for example, has been a point of focus recently, naturally focusing on fuel products to replace petroleum-based fuels and, as a result, has not developed other valuable chemical products that also rely on petroleum-based processes. As such, biorefineries use enzymes to convert natural products to useful chemicals. A natural product, such as the wood that is used in a pulp and paper mill, contains cellulose, hemicelluloses, and lignin. A typical range of compositions for a hardwood may be about 40-44% cellulose, about 15-35% hemicelluloses, and about 18-25% lignin. Likewise, a typical range of compositions for a softwood may be about 40-44% cellulose, about 20-32% hemicelluloses, and about 25-35% lignin. Since all biofuels come from cellulosic biorefineries, where the key raw material is glucose, derived from cellulose, lignin remains underutilized. Lignin is the single most abundant source of aromatic compounds in nature, and the use of lignin is currently limited to low value applications, such as combustion to generate process heat and energy for the biorefinery facilities. In the alternative, lignin is sold as a natural component of animal feeds or fertilizers. Interestingly, however, lignin is the only plant biomass component based on aromatic core structures, and such core structures are valuable in the production of industrial chemicals. One of skill will appreciate that, unfortunately, a major problem to such a use of lignin remains: the aromatic compounds present in the lignin fraction of a biorefinery include toxic compounds that inhibit the growth and survival of industrial microbes. For at least these reasons, processes for converting lignin fractions to industrial products using industrial microbes have not been successful. In view of the above, one of skill will appreciate (i) a clean, green replacement for petroleum-based processes in the production of valuable chemical products that include major markets such as, for example, pharmaceuticals, agrochemicals, food ingredients, and plastics; (ii) a profitable use of the abundant and renewable natural resource available in lignin, which is currently an industrial waste stream that is underutilized as an industrial feedstock; (iii) a selection of host cells that are tolerant to the toxic compounds present in lignin fractions in the feedstock; (iv) a selection of polypeptides that can be used as enzymes in the bioconversion of the lignin fractions to the valuable chemical products; (v) a selection of polynucleotides that can be used to transform host cells to express the selection of polypeptides in the bioconversion of the lignin fractions to the valuable chemical products; (vi) systems that include transformants that express the enzymes, where the transformants can be used to (a) express the enzymes while in direct contact with the lignin fractions or (b) express the enzymes for extraction from the cells, after which the extracted enzymes are used directly in contact with the lignin fractions; and (vii) a clean-and-green method of producing valuable chemical products at higher profits than petroleum-based processes. This invention is generally directed to a recombinant method of producing enzymes for use in the bioconversion of lignin-derived compounds to valuable aromatic chemicals. In some embodiments, the teachings are directed to an isolated recombinant polypeptide, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101. The sequence can conserve residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54, K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195. In some embodiments, the teachings are directed to an isolated recombinant polypeptide, comprising SEQ ID NO:101; or conservative substitutions thereof outside of the conserved residues. The conserved residues can include T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54; K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195. In some embodiments, the teachings are directed to an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101. The amino acid sequence can conserve residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54; K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the teachings are directed to an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the teachings are directed to an isolated recombinant polypeptide, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region consisting of residues 19-54 from SEQ ID NO:101, or conservative substitutions thereof outside of conserved residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, and G54; wherein, the first amino acid region can be located in the recombinant polypeptide from about residue 14 to about residue 59; and, (iii) a second amino acid region consisting of residues 98-221 from SEQ ID NO:101, or conservative substitutions thereof outside of conserved residues K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the second amino acid region is located in the recombinant polypeptide from about residue 93 to about residue 226. In some embodiments, the teachings are directed to an isolated recombinant glutathione S-transferase enzyme, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region having at least 95% identity to residues 19-54 from SEQ ID NO:101 while conserving residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, and G54; wherein, the first amino acid region is located in the recombinant polypeptide from about residue 14 to about residue 59; and, (iii) a second amino acid region having at least 95% identity to residues 98-221 from SEQ ID NO:101 while conserving residues K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the second amino acid region can be located in the recombinant polypeptide from about residue 93 to about residue 226; and, the recombinant glutathione S-transferase enzyme can function to cleave a beta-aryl ether. In some embodiments, the teachings are directed to an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:541; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the teachings are directed to an isolated recombinant polypeptide, comprising (i) a length ranging from about 256 to about 260 amino acids; (ii) a first amino acid region consisting of residues 47-57 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues A47, I48, N49, P50, G52, V54, P55, V56, L57; wherein, the first amino acid region is located in the recombinant polypeptide from about residue 45 to about residue 57; (iii) a second amino acid region consisting of 63-76 from SEQ ID NO:541; and, (iv) a third amino acid region consisting of residues 99-230 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues R100, Y101, K104, D107, M111, N112, S115, M116, K176, L194, I197, N198, S201, H202, and M206; wherein, the second amino acid region is located in the recombinant polypeptide from about residue 94 to about residue 235. In some embodiments, the teachings are directed to an isolated recombinant glutathione S-transferase enzyme, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region having at least 95% identity to 47-57 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues A47, I48, N49, P50, G52, V54, P55, V56, L57; wherein, the first amino acid region can be located in the recombinant polypeptide from about residue 45 to about residue 57; (iii) a second amino acid region consisting of 63-76 from SEQ ID NO:541; and, (iv) a third amino acid region having at least 95% identity to residues 99-230 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues R100, Y101, K104, D107, M111, N112, S115, M116, K176, L194, I197, N198, S201, H202, and M206; wherein, the second amino acid region can be located in the recombinant polypeptide from about residue 94 to about residue 235; wherein, the recombinant glutathione S-transferase enzyme functions to cleave a beta-aryl ether. In some embodiments, an amino acid substitution outside of the conserved residues can be a conservative substitution. And, in many embodiments, the amino acid sequence can function to cleave a beta-aryl ether. The teachings are also directed to a method of cleaving a beta-aryl ether bond, the comprising contacting a polypeptide taught herein with a lignin-derived compound having (i) a beta-aryl ether bond and (ii) a molecular weight ranging from about 180 Daltons to about 3000 Daltons; wherein, the contacting occurs in a solvent environment in which the lignin-derived compound is soluble. In some embodiments, the lignin-derived compound has a molecular weight of about 180 Daltons to about 1000 Daltons. In some embodiments, the solvent environment comprises water. And, in some embodiments, the solvent environment comprises a polar organic solvent. The teachings are also directed to a system for bioprocessing lignin-derived compounds, the system comprising a polypeptide taught herein, a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. The teachings are also directed to a recombinant polynucleotide comprising a nucleotide sequence that encodes a polypeptide taught herein. Likewise, the teachings are also directed to a vector or plasmid comprising the polynucleotide, as well as a host cell transformed by the vector or plasmid to express the polypeptide. The teachings are also directed to a method of cleaving a beta-aryl ether bond, the method comprising (i) culturing a host cell taught herein under conditions suitable to produce a polypeptide taught herein; (ii) recovering the polypeptide from the host cell culture; and, (iii) contacting the polypeptide of claim 1 with a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; wherein, the contacting occurs in a solvent environment in which the lignin-derived compound is soluble. In some embodiments, the host cell can be The teachings are also directed to a system for bioprocessing lignin-derived compounds, the system comprising (i) a transformed host cell taught herein; (ii) a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, (iii) a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting a polypeptide taught herein with the lignin-derived compound in the solvent. This invention is generally directed to a recombinant method of producing enzymes for use in the bioconversion of lignin-derived compounds to valuable aromatic chemicals. Currently, the art is limited in it's ability to control the degradation of lignin to produce useful products, as it's limited in it's knowledge of enzymes that are capable of selectively converting lignin into desired aromatic compounds. Generally, the art knows two basic things: (1) lignin is complex; and (2) bacterial lignin degradation systems are therefore at least as complex as lignin itself. Accordingly, and for at least these reasons, the teachings provided herein offer a valuable, unexpected, and surprising set of systems, methods, and compositions of matter that will be useful in the production of industrially useful aromatic chemicals. In some embodiments, the strains can be selected for (i) having well-characterized aromatic and xenobiotic metabolisms; (ii) annotated genome sequences; and (iii) prior use in fermentation processes at pilot or larger scales. Examples of strains can include, but are not limited to, The model substrate synthesis 150 for use in the biochemical screening for selective activity can be outsourced through a contract research organization (CRO). The enzyme discovery effort can initially be focused on identifying potential beta-etherase candidate genes identified through bioinformatic methods. The identification of candidates having beta-etherase activity is the 1st step towards generating lignin monomers from lignin oligomers present in soluble lignin streams. The fluorescent substrate α-O-(β-methylumbelliferyl)acetovanillone (MUAV), for example, can be used in in vitro assays to identify beta-etherase function (Acme Biosciences, Mt. View, Calif.). The formation of 4 methylumbelliferone (4MU) upon hydrolysis of the aryl ether bond can be monitored by fluorescence, for example, at λex=365 nm and λem=450 nm (or 460 nm). The gene synthesis, cloning, and transformation step 145 can include combining bioinformatic methods with known information about enzymes showing a desired, selective enzyme activity. For example, bioinformatics can produce a putative beta-etherase sequence that shares a significant homology to the The gene synthesis, cloning, and transformation step 145 can be performed using any method known to one of skill. For example, all genes can be synthesized directly as open reading frames (ORFs) from oligonucleotides by using standard PCR-based assembly methods, and using the The enzyme screening 155 is done to identify novel etherases 160. The fluorescent substrate MUAV can be used to screen for and identify beta-etherase activity from the recombinant The lignin stream acquisition 130 includes a waste lignin stream from a biorefinery for testing. A preliminary characterization of one source of such lignin has shown an aromatic monomer concentration of less than 1 g/L and an oligomer concentration of ˜10 g/L. Oligomers appear to be associated with carbohydrates in 10:1 ratio for sugar:phenolics. Some information exists on compounds in the liquid stream, including benzoic acid, vanillin, syringic acid and ferulics, which are routinely quantified in soluble samples. An average molecular weight of ˜280 has been established for the monomers; and the oligomeric components remain to be characterized. The strain tolerance testing 135 Strain tolerance will be determined by cell growth upon exposure to biorefinery lignin. Tolerance to the phenolic compounds in biorefinery lignin waste stream will be critically important to the bioprocess efficiency and high level production of aromatic chemicals by microbial systems. Cell growth will be quantified as a function of respiration by the reduction of soluble tetrazolium salts. XTT (2,3-Bis(2-methoxy-4-nitro-5-sulfophenyl)-2H-tetrazolium-5-carboxanilide inner salt, Sigma) is reduced to a soluble purple formazan compound by respiring cells. The formazan product will be detected and quantified by absorbance at 450 nm. Strain tolerance testing 135 on soluble lignin can be done in liquid format in 48 well plates, for example. Each strain can be tested in replicates of 8, for example, and The strain demonstrated to have the best tolerance characteristics can be transformed with the beta-etherase gene identified as showing the highest biochemical activity. Restriction digestion can be used to excise the ORF fragment from the cloning vector, and the sequence can be sub-cloned into the shuttle vector pMMB206. Constructs cloned in the shuttle vector can be transformed into Feedstock from Biorefinery Processes An example of a starting material might be pretreated lignocellulosic biomass. In some embodiments, the lignocellulose biomass material might include grasses, corn stover, rice hull, agricultural residues, softwoods and hardwoods. In some embodiments, the lignin-derived compounds might be derived from hardwood species such as poplar from the Upper Peninsula region of Michigan, or hardwoods such as poplar, lolloby pine, and eucalyptus from Virginia and Georgia areas, or mixed hardwoods including maple and oak species from upstate New York. In some embodiments, the pretreatment methods might encompass a range of physical, chemical and biological based processes. Examples of pretreatment methods used to generate the feedstock for Aligna processes might include physical pretreatment, solvent fractionation, chemical pretreatment, biological pretreatment, ionic liquids pretreatment, supercritical fluids pretreatment, or a combination thereof, for example, which can be applied in stages. Physical pretreatment methods used to reduce the lignocellulose biomass particle size reduction might utilize mechanical stress methods of dry, wet vibratory and compression-based ball milling procedures. Solvent fractionation methods include organosolve processes, phosphoric acid fractionation processes, and methods using ionic liquids to pretreat the lignocellulose biomass to differentially solubilize and partition various components of the biomass. In some embodiments, organosolve methods might be performed using alcohol, including ethanol, with an acid catalyst at temperature ranges from about 90 to about 20° C., and from about 155 to about 220° C. with residence time of about 25 minutes to about 100 minutes. Catalyst concentrations can vary from about 0.83% to about 1.67% and alcohol concentrations can vary from about 25% to about 74% (v/v). In some embodiments, phosphoric acid fractionations of lignocellulose biomass might be performed using a series of different extractions using phosphoric acid, acetone, and water at temperature of around 50° C. In some embodiments, ionic liquid pretreatment of lignocellulose biomass might include use of ionic liquids containing anions like chloride, formate, acetate, or alkylphosphonate, with biomass:ionic liquids ratios of approximately 1:10 (w/w). The pretreatment might be performed at temperatures ranging from about 100° C. to about 150° C. Other ionic liquid compounds that might be used include 1-butyl-3-methyl-imidazolium chloride and 1-ethyl-3-methylimidazolium chloride. Chemical pretreatments of lignocellulose biomass material might be performed using technologies that include acidic, alkaline and oxidative treatments. In some embodiments, acidic pretreatment methods of lignocellulose biomass such as those described below might be applied. Dilute acid pretreatments using sulfuric acid at concentrations in the approximate range of about 0.05% to about 5%, and temperatures in the range of about 160° C. to about 220° C. Steam explosion, with or without the use of catalysts such as sulfuric acid, nitric acid, carbonic acid, succinic acid, fumaric acid, maleic acid, citric acid, sulfur dioxide, sodium hydroxide, ammonia, before steam explosion, at temperatures between about 160° C. to about 290° C. Liquid hot water treatment at pressure >5 MPa at temperatures ranging from about 160° C. to about 230° C., and pH range between about 4 and about 7. And, in some embodiments, alkaline pretreatment methods using catalysts such as calcium oxide, ammonia, and sodium hydroxide might be used. The ammonia fiber expansion (AFEX) method might be applied in which concentrated ammonia at about 0.3 kg to about 2 kg of ammonia per kg of dry weight biomass is used at about 60° C. to about 140° C. in a high pressure reactor, and cooked for 5-45 minutes before rapid pressure release. The ammonia recycle percolation (ARP) method might be used in flow through mode by percolating ammoniacal solutions at 5-15% concentrations at high temperatures and pressures. Oxidative pretreatment methods such as alkaline wet oxidation might be used with sodium carbonate at a temperature ranging from about 170° C. to about 220° C. in a high pressure reactor using pressurized air/oxygen mixtures or hydrogen peroxide as the oxidants. Biological pretreatment methods using white rot basidomycetes and certain actinomycetes might be applied. One type of product stream from such pretreatment methods might be soluble lignin, and might contain lignin-derived monomers and oligomers in the range of about 1 g/L to about 10 g/L, and xylans. The lignin-derived monomers might include compounds such as gallic acid, hydroxybenzoate, ferulic acid, hydroxymethyl furfural, hydroxymethyl furfural alcohol, vanillin, homovanillin, syringic acid, syringaldehyde, and furfural alcohol. Supercritical fluid pretreatment methods might be used to process the biomass. Examples of supercritical fluids for use in processing biomass include ethanol, acetone, water, and carbon dioxide at a temperature and pressures above the critical points for ethanol and carbon dioxide but at a temperature and/or pressure below that of the critical point for water. Combinations of steam pretreatment and biological pretreatment methods might be applied. For example, a biomass steam can be pretreated at 195° C. for 10 min at controlled pH, followed by enzymatic treatment using commercial cellulases and xylanases at dosings of 100 mg protein/g total solid, and with incubation at 50° C. at pH 5.0 with agitation of 500 rpm. In some embodiments, combinations of hydrothermal, organosolve, and biological pretreatment methods might be used. One example of such a combination is a 3 stage process: Stage 1. Use heat in an aqueous medium at a predetermined pH, temperature and pressure for the hydrothermal process;
Soluble lignin fractions derived using organosolve methods might produce soluble lignins in the molecular weight range of 188-1000, soluble in various polar solvents. Without intending to be bound by any theory or mechanism of action, organosolve processes are generally believed to maintain the lignin beta-aryl ether linkage. Lignin streams from steam exploded lignocellulosic biomass might be used. Steam explosion might be performed, for example, using high pressure steam in the range of about 200 psi to about 500 psi, and at temperatures ranging from about 180° C. to about 230° C. for about 1 minute to about 20 minutes in batch or continuous reactors. The lignin might be extracted from the steam-exploded material with alkali washing or extraction using organic solvents. Steam exploded lignins can exhibit properties similar to those described form organosolve lignins, retaining native bond structures and containing about 3 to about 12 aromatic units per oligomer unit. Supercritical fluid pretreatment can produce soluble lignin fractions that can be used with the teachings provided herein. Such processes typically yield monomers and lignin oligomers having a molecular weight of about <1000 Daltons. Biological pretreatment can produce soluble lignin fractions that can be used with the teachings provided herein. Such lignin streams might contain lignin monomers and oligomers in the range of about 1 g/L to about 10 g/L and have a molecular weight of about <1000 Daltons, and xylans. The lignin-derived monomers might include compounds such as gallic acid, hydroxybenzoate, ferulic acid, hydroxymethyl furfural, hydroxymethyl furfural alcohol, vanillin, homovanillin, syringic acid, syringaldehyde, and furfural alcohol. Feedstock from Wood Pulping Processes Wood pulping processes produce a variety of lignin types, the type of lignin dependent on the type of process used. Chemical pulping processes include, for example, Kraft and sulfite pulping. In some embodiments, the lignin-derived compound can be derived from a spent pulping liquor or “black liquor” from Kraft pulping processes. Kraft lignin might be derived from batch or continuous processes using, for example, reaction temperatures in the range of about 150° C. to about 200° C. and reaction times of approximately 2 hours. Any range of molecular weights of lignin may be obtained, and the useful fraction may range, in some embodiments, from about 200 Daltons to about 4000 Daltons. A Kraft lignin having a molecular weight ranging from about 1000 Daltons to about 3000 Daltons might be used in a bioconversion. In some embodiments, lignin from a sulfite pulping process might be used. A sulfite pulping process can include, for example, a chemical sulfonation using aqueous sulfur dioxide, bisulfite and monosulfite at a pH ranging from about 2 to about 12. The sulfonated lignin might be recovered by precipitation with excess lime as lignosulfonates. Alternatively, formaldehyde-based methylation of the lignin aromatics followed by sulfonation might be performed. Any range of molecular weights of lignin may be obtained, and the useful fraction may range, in some embodiments, from about 200 Daltons to about 4000 Daltons. A sulfite lignin having a molecular weight ranging from about 1000 Daltons to about 3000 Daltons might be used in a bioconversion. Optimization of a system for a particular feedstock should include an understanding of the composition of the particular feedstock. For example, one of skill will appreciate that the composition of a native lignin can be significantly different than the composition of the lignin-derived compounds in a given lignin faction that is used for a feedstock. Accordingly, and understanding of the composition of the feedstock will assist in optimizing the conversion of the lignin-derived compounds to the valuable aromatic compounds. Any method known to one of skill can be used to characterize the compositions of the feedstock. For example, one of skill may use wet chemistry techniques, such as thioacidolysis and nitrobenzene oxidation, coupled with gas chromatography, which have been used traditionally, or spectroscopic techniques such as NMR and FTIR. Thioacidolysis, for example, cleaves the β-O-4 linkages in lignin, giving rise to monomers and dimers which are then used to calculate the S and G content. Similar information can be obtained using nitrobenzene oxidation, but the ratios are thought to be less accurate. In some embodiments, the content of S, G, and H, as well as their relative ratios can be used to characterize feedstock compositions for purposes of determining a bioconversion system design. It is widely accepted that the biosynthesis of lignin stems from the polymerization of three types of phenylpropane units, also referred to as monolignols. These units are coniferyl, sinapyl, and p-coumaryl alcohol. The three structures are as follows: Tables 1A and 1B summarize distributions of p-coumaryl alcohol or p-hydroxyl phenol (H), coniferyl alcohol or guaiacyl (G), and sinapyl alcohol or syringyl (S) lignin in several sources of biomass. Table 1A compares percent lignin in the biomass to the G:S:H. In general, the relative amounts of G, S, and H in lignin can be a good indicator of its overall composition and response to a treatment, such as the bioconversions taught herein. In poplar species, for example, differences can be seen based on the measurement technique as well as species, but in general the S/G ratio ranges from 1.3 to 2.2. This is similar to the hardwood eucalyptus, but higher than herbaceous biomass switchgrass and Miscanthus. This is to be expected given the higher H contents in grass lignin. An optimized nitrobenzene oxidation method has shown S/G ratios of 13 poplar samples from two different sites and obtained values ranging from 1.01 to 1.68. Further, a linear correlation (R2=0.85) has been found in poplar between decreasing lignin content and increasing S/G ratios. The correlation was stronger (R2=0.93) in samples from a single site suggesting a dependency on geographic location. Higher throughput methods can be used for rapid screening of feedstocks. Examples of such methods can include, but are not limited to, near-infrared (NIR), reflectance spectroscopy, pyrolysis molecular beam mass spectrometry (pyMBMS), Fourier transform infrared spectroscopy, a modified thioacidolysis technique, and whole cell NMR after dissolution in ionic liquids. Information on some structural characteristics of lignin, such as S/G ratios, can be rapidly obtained using these methods. The average S:G:H ratio of 104 poplar lignin samples, for example, was determined using the modified thioacidolysis technique, and was found to be 68:32:0.02. In some embodiments, the S, G, and H components in the ratio can be expressed as mass percent. In some embodiments, the S, G, and H components in the ratio can be expressed as any relative unit, or unitless. Any comparison can be used, if the amount of each component directly correlates with the other respective components in the composition. The ratios can be expressed in relative whole numbers or fractions as S:G:H, or any other order or combination of components, S/G, G/S, and the like. In some embodiments, the S/G ratio is used. In some embodiments, the S/G ratio can range from about 0.20 to about 20.0, from about 0.3 to about 18.0, from about 0.4 to about 15.0, from about 0.5 to about 15.0, from about 0.6 to about 12.0, from about 0.7 to about 10.0, from about 0.8 to about 8.0, from about 0.9 to about 9.0, from about 1.0 to about 7.0, or any range therein. In some embodiments, the S/G ratio can be about 0.2, about 0.4, about 0.6, about 0.8, about 1.0, about 1.2, about 1.4, about 1.6, about 1.8, about 2.0, about 2.2, about 2.4, about 2.6, about 2.8, about 3.0, about 3.2, about 3.4, about 3.6, about 3.8, about 4.0, about 4.2, about 4.4, about 4.6, about 4.8, about 5.0, about 5.2, about 5.4, about 5.6, about 5.8, about 6.0, about 6.2, about 6.4, about 6.6, about 6.8, about 7.0, about 7.2, about 7.4, about 7.6, about 7.8, about 8.0, about 8.2, about 8.4, about 8.6, about 8.8, about 9.0, about 9.2, about 9.4, about 9.6, about 9.8, about 10.0, and any ratio in-between on 0.1 increments, and any range of ratios therein. Soluble lignin streams derived from biorefinery or Kraft processes might be used directly in microbial conversions without additional purification or, they might be further purified by one or more of the separation or fractionation techniques prior to microbial conversions. In some embodiments, membrane filtration might be applied to achieve a starting concentration of lignin monomers and oligomers in the 1-60% (w/v) concentration range, and molecular weights ranging from about 180 Daltons to about 2000 Daltons, from about 200 Daltons to about 4000 Daltons, from about 250 Daltons to about 2500 Daltons, from about 180 Daltons to about 3500 Daltons, from about 300 Daltons to about 3000 Daltons, or any range therein. In some embodiments, soluble lignin streams might be partially purified by chromatography using, for example, HP-20 resin. The lignin monomers and oligomers can bind to the resin while highly polar impurities or inorganics that might be toxic to microorganisms can remain un-bound. Subsequent elution, for example, with a methanol-water solvent system, can provide fractions of higher purity that are enriched in lignin monomers and oligomers. A purpose of the present teaching includes the discovery of novel biochemical conversions that create valuable commercial products from various lignin core structures. Such commercial products include monomeric aromatic chemicals that can serve as building block chemicals. One of skill will appreciate that a vast number of aromatic chemicals can be produced using the principles provided by the teachings set-forth herein, and that a comprehensive teaching of every possible chemical that can be produced would be beyond the scope and purpose of this teaching. Commercial products that can be obtained from a bioconversion of lignin-derived compounds, as taught herein, include mono-aromatic chemicals. Examples of such chemicals include, but are not limited to, caprolactam, cumene, styrene, mononitro- and dinitrotoluenes and their derivatives, 2,4-diaminotoluene, 2,4-dinitrotoluene, terephthalic acid, catechol, vanillin, salicylic acid, aminosalicylic acid, cresol and isomers, alkylphenols, chlorinated phenols, nitrophenols, polyhydric phenols, nitrobenzene, aniline and secondary and tertiary aniline bases, benzothiazole and derivatives, alkylbenzene and alkylbenzene sulfonates, 4,4-diphenylmethane diisocyanate (MDI), chlorobenzenes and dichlorobenzenes, nitrochlorobenzenes, sulfonic acid derivatives of toluene, pseudocumene, mesitylene, nitrocumene, cumenesulfonic acid. The teachings herein are also directed to the discovery of novel enzymes. In some embodiments, the enzymes are beta-etherase enzymes. Lignin is the only plant biomass constituent based on aromatic core structures, and is comprised of branched phenylpropenyl (C9) units. The guaiacol and syringol building blocks of lignin are linked through carbon-carbon (C—C) and carbon-oxygen (C—O, ether) bonds. The native structure of lignin suggests its key application as a chemical feedstock for aromatic chemicals. The production of such chemical structures necessitates depolymerization and rupture of C—C and C—O bonds. An abundant chemical linkage in lignin is the beta-aryl ether linkage, which comprises 50% to 70% of the bond type in lignin. The efficient scission of the beta-aryl ether bond would generate the monomeric building blocks of lignin, and provide the chemical feedstock for subsequent conversion to a range of industrial products. The beta-etherase enzyme system has multiple advantages for conversions of lignin oligomers to monomers over the laccase enzyme systems. The beta-etherase enzyme system would achieve highly selective reductive bond scission catalysis for efficient and high yield conversions of lignin oligomers to monomers without the formation of side products, degradation of the aromatic core structures of lignin, or the use of electron transfer mediators required with use of the oxidative and radical chemistry-based laccase enzyme systems. One of skill will recognize the chemical nomenclature used herein as standard to the art. For example, the amino acids used herein can be identified by at least the following conventional three-letter abbreviations in Table 2: The single letter identifier is provided for ease of reference, but any format can be used. The three-letter abbreviations are generally accepted in the peptide art, recommended by the IUPAC-IUB commission in biochemical nomenclature, and are provided to comply with WIPO Standard ST.25. Furthermore, the peptide sequences are taught according to the generally accepted convention of placing the N-terminus on the left and the C-terminus on the right of the sequence listing to again comply with WIPO Standard ST.25. The Recombinant Polypeptides The teachings herein are based on discovery of novel and non-obvious proteins, DNAs, and host cell systems that can function in the conversion of lignin-derived compounds into valuable aromatic compounds. The systems can include natural, wild-type components or recombinant components, the recombinant components being isolatable from what occurs in nature. The term “isolated” means altered “by the hand of man” from its natural state; i.e., if it occurs in nature, it has been changed or removed from its original environment, or both. For example, a naturally occurring polynucleotide or a polypeptide naturally present in a living animal in its natural state is not “isolated,” but the same polynucleotide or polypeptide separated from the coexisting materials of its natural state is “isolated”, as the term is used herein. For example, with respect to polynucleotides, the term isolated means that it is separated from the chromosome and cell in which it naturally occurs. However, a nucleic acid molecule contained in a clone that is a member of a mixed clone library (e.g., a genomic or cDNA library) and that has not been isolated from other clones of the library (e.g., in the form of a homogeneous solution containing the clone without other members of the library) or a chromosome isolated or removed from a cell or a cell lysate (e.g., a “chromosome spread”, as in a karyotype), is not “isolated” for the purposes of the teachings herein. Moreover, a lone nucleic acid molecule contained in a preparation of mechanically or enzymatically cleaved genomic DNA, where the isolation of the nucleic molecule was not the goal, is also not “isolated” for the purposes of the teachings herein. As part of, or following, an intentional isolation, polynucleotides can be joined to other polynucleotides, for mutagenesis, to form fusion proteins, and for propagation or expression in a host, for instance. Isolated polynucleotides, alone or joined to other polynucleotides such as vectors, can be introduced into host cells, in culture or in whole organisms, after which such DNAs still would be isolated, as the term is used herein, because they would not be in their naturally occurring form or environment. Similarly, the isolated polynucleotides and polypeptides may occur in a composition, such as a media formulation, solutions for introduction of polynucleotides or polypeptides, for example, into cells, compositions or solutions for chemical or enzymatic reactions, for instance, which are not naturally occurring compositions, and, therein remain “isolated” polynucleotides or polypeptides within the meaning of that term as it is used herein. A “vector,” such as an expression vector, is used to transfer or transmit the DNA of interest into a prokaryotic or eukaryotic host cell, such as a bacteria, yeast, or a higher eukaryotic cell. Vectors can be recombinantly designed to contain a polynucleotide encoding a desired polypeptide. These vectors can include a tag, a cleavage site, or a combination of these elements to facilitate, for example, the process of producing, isolating, and purifying a polypeptide. The DNA of interest can be inserted as the expression component of a vector. Examples of vectors include plasmids, cosmids, viruses, and bacteriophages. If the vector is a virus or bacteriophage, the term vector can include the viral/bacteriophage coat. The term “expression vector” is usually used to describe a DNA construct containing gene encoding an expression product of interest, usually a protein, that is expressed by the machinery of the host cell. This type of vector is frequently a plasmid, but the other forms of expression vectors, such as bacteriophage vectors and viral vectors (e.g., adenoviruses, replication defective retroviruses, and adeno-associated viruses), can be used. In some embodiments, the polypeptides taught herein can be natural or wildtype, isolated and/or recombinant. In some embodiments, the polynucleotides can be natural or wildtype, isolated and/or recombinant. In some embodiments, the teachings are directed to a vector than can include such a polynucleotide or a host cell transformed by such a vector. In some embodiments, the polypeptide can be an isolated recombinant polypeptide, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101. The sequence can conserve residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54, K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195. In some embodiments, the polypeptide can be an isolated recombinant polypeptide, comprising SEQ ID NO:101; or conservative substitutions thereof outside of the conserved residues. The conserved residues can include T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54; K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195. In some embodiments, the polypeptide can be an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101. The amino acid sequence can conserve residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, G54; K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the polypeptide can be an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:101; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the polypeptide can be an isolated recombinant polypeptide, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region consisting of residues 19-54 from SEQ ID NO:101, or conservative substitutions thereof outside of conserved residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, and G54; wherein, the first amino acid region can be located in the recombinant polypeptide from about residue 14 to about residue 59; and, (iii) a second amino acid region consisting of residues 98-221 from SEQ ID NO:101, or conservative substitutions thereof outside of conserved residues K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the second amino acid region is located in the recombinant polypeptide from about residue 93 to about residue 226. In some embodiments, the polypeptide can be an isolated recombinant glutathione S-transferase enzyme, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region having at least 95% identity to residues 19-54 from SEQ ID NO:101 while conserving residues T19, I20, S21, P22, V24, W25, T27, K28, Y29, A30, H33, K34, G35, F36, D39, I40, V41, P42, G43, G44, F45, G47, I48, E50, R51, T52, G53, and G54; wherein, the first amino acid region is located in the recombinant polypeptide from about residue 14 to about residue 59; and, (iii) a second amino acid region having at least 95% identity to residues 98-221 from SEQ ID NO:101 while conserving residues K100, A101, N104, V111, G112, M115, F116, P166, W107, Y184, Y187, R188, G191, G192, and F195; wherein, the second amino acid region can be located in the recombinant polypeptide from about residue 93 to about residue 226; and, the recombinant glutathione S-transferase enzyme can function to cleave a beta-aryl ether. In some embodiments, the polypeptide can be an isolated recombinant glutathione S-transferase enzyme, comprising an amino acid sequence having at least 95% identity to SEQ ID NO:541; wherein, the amino acid sequence functions to cleave a beta-aryl ether. In some embodiments, the polypeptide can be an isolated recombinant polypeptide, comprising (i) a length ranging from about 256 to about 260 amino acids; (ii) a first amino acid region consisting of residues 47-57 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues A47, I48, N49, P50, G52, V54, P55, V56, L57; wherein, the first amino acid region is located in the recombinant polypeptide from about residue 45 to about residue 57; (iii) a second amino acid region consisting of 63-76 from SEQ ID NO:541; and, (iv) a third amino acid region consisting of residues 99-230 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues R100, Y101, K104, D107, M111, N112, S115, M116, K176, L194, I197, N198, S201, H202, and M206; wherein, the second amino acid region is located in the recombinant polypeptide from about residue 94 to about residue 235. In some embodiments, the polypeptide can be an isolated recombinant glutathione S-transferase enzyme, comprising (i) a length ranging from about 279 to about 281 amino acids; (ii) a first amino acid region having at least 95% identity to 47-57 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues A47, I48, N49, P50, G52, V54, P55, V56, L57; wherein, the first amino acid region can be located in the recombinant polypeptide from about residue 45 to about residue 57; (iii) a second amino acid region consisting of 63-76 from SEQ ID NO:541; and, (iv) a third amino acid region having at least 95% identity to residues 99-230 from SEQ ID NO:541, or conservative substitutions thereof outside of conserved residues R100, Y101, K104, D107, M111, N112, S115, M116, K176, L194, I197, N198, S201, H202, and M206; wherein, the second amino acid region can be located in the recombinant polypeptide from about residue 94 to about residue 235; wherein, the recombinant glutathione S-transferase enzyme functions to cleave a beta-aryl ether. In some embodiments, an amino acid substitution outside of the conserved residues can be a conservative substitution. And, in many embodiments, the amino acid sequence can function to cleave a beta-aryl ether. The teachings include a method of preparing the polypeptides described herein, comprising culturing a host cell under conditions suitable to produce the desired polypeptide; and recovering the polypeptide from the host cell culture; wherein, the host cell comprises an exogenously-derived polynucleotide encoding the desired polypeptide. In some embodiments, the host cell is Initially, a double-stranded DNA fragment encoding the primary amino acid sequence of recombinant polypeptide can be designed. This DNA fragment can be manipulated to facilitate synthesis, cloning, expression or biochemical manipulation of the expression products. The synthetic gene can be ligated to a suitable cloning vector and then the nucleotide sequence of the cloned gene can be determined and confirmed. The gene can be then amplified using designed primers having specific restriction enzyme sequences introduced at both sides of insert gene, and the gene can be subcloned into a suitable subclone/expression vector. The expression vector bearing the synthetic gene for the mutant can be inserted into a suitable expression host. Thereafter the expression host can be maintained under conditions suitable for production of the gene product and, in some embodiments, the protein can be (i) isolated and purified from the cells expressing the gene or (ii) used directly in a reaction environment that includes the host cell. The nucleic acid (e.g., cDNA or genomic DNA) may be inserted into a replicable vector for cloning (amplification of the DNA) for expression. Various vectors are publicly available. In general, DNA can be inserted into an appropriate restriction endonuclease site(s) using techniques known in the art, for example. Vector components generally include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of replication from a plasmid, e.g. pBR322, for example, is suitable for most Gram-negative bacteria, and the 2μ plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells. Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. An example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take the encoding nucleic acid, such as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 77:4216 (1980). A suitable selection gene for use in yeast is the trp1 gene present in the yeast plasmid YRp7 (Stinchcomb et al., Nature, 282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Gene, 10:157 (1980)). The trp1 gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 (Jones, Genetics, 85:12 (1977)). Expression and cloning vectors usually contain a promoter operably linked to the encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well known. Promoters suitable for use with prokaryotic hosts include the .beta.-lactamase and lactose promoter systems (Chang et al., Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)), alkaline phosphatase, a tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,776), and hybrid promoters such as the tac promoter (deBoer et al., Proc. Natl. Acad. Sci. USA, 80:21 25 (1983)). Promoters for use in bacterial systems also will contain a Shine-Dalgarno sequence operably linked to the encoding DNA. Other yeast promoters, which are inducible promoters having the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression are known in the art, e.g. see EP 73,657 for a further discussion. PRO87299 transcription from vectors in mammalian host cells is controlled, for example, by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are compatible with the host cell systems. Transcription of the encoding DNA by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication origin, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5′ or 3′ to the coding sequence but is preferably located at a site 5′ from the promoter. Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5′ and, occasionally 3′, untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding the mutants. In some embodiments, the expression control sequence can be selected from a group consisting of a lac system, T7 expression system, major operator and promoter regions of pBR322 origin, and other prokaryotic control regions. Still other methods, vectors, and host cells suitable for adaptation to the synthesis of the mutants in recombinant vertebrate cell culture are described in Gething et al., Nature, 293:620 625 (1981); Mantei et al., Nature, 281:40 46 (1979); EP 117,060; and EP 117,058. Mutants can be expressed as a fusion protein. In some embodiments, the methods involve adding a number of amino acids to the protein, and in some embodiments, to the amino terminus of the protein. Extra amino acids can serve as affinity tags or cleavage sites, for example. Fusion proteins can be designed to: (1) assist in purification by acting as a temporary ligand for affinity purification, (2) produce a precise recombinant by removing extra amino acids using a cleavage site between the target gene and affinity tag, (3) increase the solubility of the product, and/or (4) increase expression of the product. A proteolytic cleavage site can be included at the junction of the fusion region and the protein of interest to enable further purification of the product—separation of the recombinant protein from the fusion protein following affinity purification of the fusion protein. Such enzymes, and their cognate recognition sequences, can include Factor Xa, thrombin and enterokinase, cyanogen bromide, trypsin, or chymotrypsin, for example. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D. B. and Johnson, K. S. Gene 67:31-40 (1988)), pMAL (New England Biolabs, Beverly, Mass.), pRIT5 (Pharmacia, Piscataway, N.J.), and pET (Strategen), which can fuse glutathione S-transferase (GST), maltose E binding protein, protein A, or a six-histidine sequence, respectively, to a target recombinant protein. Synthetic DNAs containing the sequences of nucleotides, tags and cleavage sites can be designed and provided as a modified coding for recombinant polypeptide mutants. In some embodiments, a polypeptide can be a fusion polypeptide having an affinity tag, and the recovering step includes (1) capturing and purifying the fusion polypeptide, and (2) removing the affinity tag for high yield production of the desired polypeptide or an amino acid sequence that is at least 95% homologous to a desired polypeptide. DNA encoding the mutants may be obtained from a cDNA library prepared from tissue possessing the mRNA for the mutants. As such, the DNA can be conveniently obtained from a cDNA library. The encoding gene for the mutants may also be obtained from a genomic library or by known synthetic procedures (e.g., automated nucleic acid synthesis). Libraries can be screened with probes designed to identify the gene of interest or the protein encoded by it. Screening the cDNA or genomic library with the selected probe may be conducted using standard hybridization procedures, such as described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), which is herein incorporated by reference. An alternative means to isolate the gene encoding recombinant polypeptide mutants is to use PCR methodology [Sambrook et al., supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)]. Nucleic acids having a desired protein coding sequence may be obtained by screening selected cDNA or genomic libraries using a deduced amino acid sequence and, if necessary, a conventional primer extension procedure as described in Sambrook et al., supra, to detect precursors and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. The selection of expression vectors, control sequences, transformation methods, and the like, are dependent on the type of host cell used to express the gene. Following entry into a cell, all or part of the vector DNA, including the insert DNA, may be incorporated into the host cell chromosome, or the vector may be maintained extrachromosomally. Those vectors that are maintained extrachromosomally are frequently capable of autonomous replication in the host cell. Other vectors are integrated into the genome of a host cell upon and are replicated along with the host genome. Host cells are transfected or transformed with the expression or cloning vectors described herein to produce the mutants. The cells are cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach, M. Butler, ed. (IRL Press, 1991) and Sambrook et al., supra, each of which are incorporated by reference. The host cells can be prokaryotic or eukaryotic and, suitable host cells for cloning or expressing the DNA in the vectors herein can include prokaryote, yeast, or higher eukaryote cells. Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily skilled artisan, for example, CaCl2, CaPO4, liposome-mediated and electroporation. Depending on the host cell used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for prokaryotes. Infection with Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include, but are not limited to, eubacteria, such as Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning or expression hosts for the mutants. Suitable host cells for the expression of glycosylated mutants can be derived from multicellular organisms. Invertebrate cells include insect cells such as In some embodiments, a nucleotide sequence will be hybridizable, under moderately stringent conditions, to a nucleic acid having a nucleotide sequence comprising or complementary to the desired nucleotide sequences. In some embodiments, an isolated nucleotide sequence will be hybridizable, under stringent conditions, to a nucleic acid having a nucleotide sequence comprising or complementary to the desired nucleotide sequences. A nucleic acid molecule can be “hybridizable” to another nucleic acid molecule when a single-stranded form of the nucleic acid molecule can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and ionic strength (see Sambrook et al., supra). The conditions of temperature and ionic strength determine the “stringency” of the hybridization. “Hybridization” requires that two nucleic acids contain complementary sequences. However, depending on the stringency of the hybridization, mismatches between bases may occur. The appropriate stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the degree of complementation. Such variables are well known in the art. More specifically, the greater the degree of similarity or homology between two nucleotide sequences, the greater the value of Tm for hybrids of nucleic acids having those sequences. For hybrids of greater than 100 nucleotides in length, equations for calculating Tm have been derived (see Sambrook et al., supra). For hybridization with shorter nucleic acids, the position of mismatches becomes more important, and the length of the oligonucleotide determines its specificity (see Sambrook et al., supra). In some embodiments, the polynucleotides and polypeptides have at least 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to a desired polynucleotide or polypeptide. In some embodiments, the polynucleotides and polypeptides have at least 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent identity to a desired polynucleotide or polypeptide. And, in some embodiments, the polynucleotides and polypeptides have at least 55, 60, 65, 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent similarity to a desired polynucleotide or polypeptide. As described above, degenerate forms of the desired polynucleotide are also acceptable. In some embodiments, a polypeptide can be 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 homologous, identical, or similar to a desired polypeptide as long as it shares the same function as the desired polypeptide, and the extent of the function can be less or more than that of the desired polypeptide. In some embodiments, for example, a polypeptide can have a function that is 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, or any 0.1% increment in-between, that of the desired polypeptide. And, in some embodiments, for example, a polypeptide can have a function that is 110%, 120%, 130%, 140%, 150%, 160%, 170%, 180%, 190%, 200%, 300%, 400%, 500%, or more, or any 1% increment in-between, that of the desired polypeptide. In some embodiments the “function” is an enzymatic activity, measurable by any method known to one of skill such as, for example, a method used in the teachings herein. The “desired polypeptide” or “desired polynucleotide” can be referred to as a “reference polypeptide” or “reference polynucleotide”, or the like, in some embodiments as a control for comparison of a polypeptide of interest, which may be considered a “test polypeptide” or “test polynucleotide” or the like. In any event, the comparison is that of one set of bases or amino acids against another set for purposes of measuring homology, identity, or similarity. The ability to hybridize is, of course, another way of comparing nucleotide sequences. The terms “homology” and “homologous” can be used interchangeably in some embodiments. The terms can refer to nucleic acid sequence matching and the degree to which changes in the nucleotide bases between polynucleotide sequences affects the gene expression. These terms also refer to modifications, such as deletion or insertion of one or more nucleotides, and the effects of those modifications on the functional properties of the resulting polynucleotide relative to the unmodified polynucleotide. Likewise the terms refer to polypeptide sequence matching and the degree to which changes in the polypeptide sequences, such as those seen when comparing the modified polypeptides to the unmodified polypeptide, affect the function of the polypeptide. It should appreciated to one of skill that the polypeptides, such as the mutants taught herein, can be produced from two non-homologous polynucleotide sequences within the limits of degeneracy. The terms “similarity” and “identity” are known in the art. The term “identity” can be used to refer to a sequence comparison based on identical matches between correspondingly identical positions in the sequences being compared. The term “similarity” can be used to refer to a comparison between amino acid sequences, and takes into account not only identical amino acids in corresponding positions, but also functionally similar amino acids in corresponding positions. Thus similarity between polypeptide sequences indicates functional similarity, in addition to sequence similarity. Levels of identity between gene sequences and levels of identity or similarity between amino acid sequences can be calculated using known methods. For example, publicly available computer based methods for determining identity and similarity include the BLASTP, BLASTN and FASTA (Atschul et al., J. Molec. Biol., 1990; 215:403-410), the BLASTX program available from NCBI, and the Gap program from Genetics Computer Group, Madison Wis. In some embodiments, the Gap program, with a Gap penalty of 12 and a Gap length penalty of 4 can be used for determining the amino acid sequence comparisons, and a Gap penalty of 50 and a Gap length penalty of 3 for the polynucleotide sequence comparisons. In some embodiments, the sequences can be aligned so that the highest order match is obtained. The match can be calculated using published techniques that include, for example, Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991, each of which is incorporated by reference herein. As such, the term “similarity” is similar to “identity”, but in contrast to identity, similarity can be used to refer to both identical matches and conservative substitution matches. For example, if two polypeptide sequences have 10/20 identical amino acids, and the remainder are all non-conservative substitutions, then the percent identity and similarity would both be 50%. On the other hand, if there are 5 five more positions where there are conservative substitutions, then the percent identity is 50%, whereas the percent similarity is 75%. In some embodiments, the term “substantial sequence identity” can refer to an optimal alignment, such as by the programs GAP or BESTFIT using default gap penalties, having at least 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, or 99 percent sequence identity. The difference in what is “substantial” regarding identity can often vary according to a corresponding percent similarity, since the factor of primary importance is often the function of the sequence in a system. The term “substantial percent identity” can be used to refer to a DNA sequence that is sufficiently similar to a reference sequence at the nucleotide level to code for the same protein, or a protein having substantially the same function, in which the comparison can allow for allelic differences in the coding region. Likewise, the term can be used to refer to a comparison of sequences of two polypeptides optimally aligned. In some embodiments, sequence comparisons can be made to a reference sequence over a “comparison window” of amino acids or bases that includes any number of amino acids or bases that is useful in the particular comparison. For example, the reference sequence may be a subset of a larger sequence. In some embodiments, the comparison window can include at least 10 residue or base positions, and sometimes at least 15-20 amino acids or bases. The reference or test sequence may represent, for example, a polypeptide or polynucleotide having one or more deletions, substitutions or additions. The term “variant” refers to modifications to a peptide that allows the peptide to retain its binding properties, and such modifications include, but are not limited to, conservative substitutions in which one or more amino acids are substituted for other amino acids; deletion or addition of amino acids that have minimal influence on the binding properties or secondary structure; conjugation of a linker; post-translational modifications such as, for example, the addition of functional groups. Examples of such post-translational modifications can include, but are not limited to, the addition of modifying groups described below through processes such as, for example, glycosylation, acetylation, phosphorylation, modifications with fatty acids, formation of disulfide bonds between peptides, biotinylation, PEGylation, and combinations thereof. In fact, in most embodiments, the polypeptides can be modified with any of the various modifying groups known to one of skill. The terms “conservatively modified variant,” “conservatively modified substitution,” and “conservative substitution” can be used interchangeably in some embodiments. These terms can be used to refer to a conservative amino acid substitution, which is an amino acid substituted by an amino acid of similar charge density, hydrophilicity/hydrophobicity, size, and/or configuration such as, for example, substituting valine for isoleucine. In comparison, a “non-conservatively modified variant” refers to a non-conservative amino acid substitution, which is an amino acid substituted by an amino acid of differing charge density, hydrophilicity/hydrophobicity, size, and/or configuration such as, for example, substituting valine for phenyalanine. One of skill will appreciate that there are a plurality of ways to define conservative substitutions, and any of these methods may be used with the teachings provided herein. In some embodiments, for example, a substitution can be considered conservative if an amino acid falling into one of the following groups is substituted by an amino acid falling in the same group: hydrophilic (Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr), aliphatic (Val, Ile, Leu, Met), basic (Lys, Arg, His), aromatic (Phe, Tyr, Trp), and sulphydryl (Cys). See Dayhoff, M O. Et al. National Biomedical Research Foundation, Georgetown University, Washington D.C.:89-99 (1972), which is incorporated herein. In some embodiments, the substitution of amino acids can be considered conservative where the side chain of the substitution has similar biochemical properties to the side chain of the substituted amino acid. The antimicrobial activity of lignin-derived compounds is a major problem addressed by the systems taught herein. For example, typical industrial fermentation processes might utilize the microbes The phenolic streams or soluble lignin streams derived from pretreated lignocellulosic biomass, for example, might contain aromatic and nonaromatic compounds, such as gallic acid, hydroxymethylfurfural alcohol, hydroxymethylfurfural, furfural alcohol, 3,5-dihydroxybenzoate, furoic acid, 3,4-dihydroxybenzaldehyde, hydroxybenzoate, homovanillin, syringic acid, vanillin, and syringaldehyde. There are several lignin-derived compounds that are antimicrobials. For example, furfural, 4-hydroxybenzaldehyde, syringaldehyde, 5-hydroxymethylfurfural, and vanillin are each known to have antimicrobial activity against One or more of the following benzaldehyde derivatives might be present in the phenolic streams from pretreated lignocellulosic biomass: 2,4,6-trihydroxybenzaldehyde, 2,5-dihydroxybenzaldehyde, 2,3,4-trihydroxybenzaldehyde, 2-hydroxy-5-methoxybenzaldehyde, 2,3-dihydroxybenzaldehyde, 2-hydroxy-3-methoxybenzaldehyde, 4-hydroxy-2,6-dimethoxybenzaldehyde, 2,5-dihydroxybenzaldehyde, 2,4-dihydroxybenzaldehyde, and 2-hydroxybenzaldehyde. Likewise, 2,4,6-trihydroxybenzaldehyde, 2,5-dihydroxybenzaldehyde, 2,3,4-trihydroxybenzaldehyde, 2-hydroxy-5-methoxybenzaldehyde, 2,3-dihydroxybenzaldehyde, 2-hydroxy-3-methoxybenzaldehyde, 4-hydroxy-2,6-dimethoxybenzaldehyde, 2,5-dihydroxybenzaldehyde, 2,4-dihydroxybenzaldehyde, and 2-hydroxybenzaldehyde have each demonstrated antibacterial activity against The antimicrobial activity of lignin-derived compounds creates a need for a strain of microbe that is tolerant to such activity in the reaction environment. The teachings include the identification of recombinant or non-recombinant microbial species that are naturally capable of metabolizing aromatic compounds for the biotransformations of lignin-derived compounds to commercial products. Some examples of microbial species particularly suited for biotransformations of phenolic streams from pretreated lignocellulosic biomass include, but are not limited to, The microbes can be grown in a fermentor, for example, using methods known to one of skill. The enzymes used in the bioprocessing are obtained from the microbes, and they can be intracellular, extracellular, or a combination thereof. As such, the enzymes can be recovered from the host cells using methods known to one of skill in the art that include, for example, filtering or centrifuging, evaporation, and purification. In some embodiments, the method can include breaking open the host cells using ultrasound or a mechanical device, remove debris and extract the protein, after which the protein can be purified using, for example, electrophoresis. In some embodiments, however, the teachings include the use of a microbe, recombinant or non-recombinant, that has tolerance to lignin-derived compounds. A microbe that is tolerant to lignin-derived compounds can be used industrially, for example, to express any enzyme, recombinant or non-recombinant, having a desired enzyme activity while directly in association with the lignin-derived compounds. Such activities include, for example, beta etherase activity, C-alpha-dehydrogenase activity, glutathione lyase activity, or any other enzyme activity that would be useful in the biotransformation of lignin-derived compounds. The activities can be wild-type or produce through methods known to one of skill, such as transfection or transformation, for example. The teachings herein are also directed to the discovery and use of recombinant Research directed to the discovery of a suitable microbe has shown that The teachings are also directed to a method of cleaving a beta-aryl ether bond, the comprising contacting a polypeptide taught herein with a lignin-derived compound having (i) a beta-aryl ether bond and (ii) a molecular weight ranging from about 180 Daltons to about 3000 Daltons; wherein, the contacting occurs in a solvent environment in which the lignin-derived compound is soluble. The term “contacting” refers to placing an agent, such as a compound taught herein, with a target compound, and this placing can occur in situ or in vitro, for example. The teachings are also directed to a method of cleaving a beta-aryl ether bond, the comprising contacting a polypeptide taught herein with a lignin-derived compound having (i) a beta-aryl ether bond and (ii) a molecular weight ranging from about 180 Daltons to about 3000 Daltons; wherein, the contacting occurs in a solvent environment in which the lignin-derived compound is soluble. In some embodiments, the lignin-derived compound has a molecular weight of about 180 Daltons to about 1000 Daltons. In some embodiments, the solvent environment comprises water. And, in some embodiments, the solvent environment comprises a polar organic solvent. The teachings are also directed to a system for bioprocessing lignin-derived compounds, the system comprising a polypeptide taught herein, a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting the polypeptide with the lignin-derived compound in the solvent. The teachings are also directed to a recombinant polynucleotide comprising a nucleotide sequence that encodes a polypeptide taught herein. Likewise, the teachings are also directed to a vector or plasmid comprising the polynucleotide, as well as a host cell transformed by the vector or plasmid to express the polypeptide. The teachings are also directed to a method of cleaving a beta-aryl ether bond, the method comprising (i) culturing a host cell taught herein under conditions suitable to produce a polypeptide taught herein; (ii) recovering the polypeptide from the host cell culture; and, (iii) contacting the polypeptide of claim 1 with a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; wherein, the contacting occurs in a solvent environment in which the lignin-derived compound is soluble. In some embodiments, the host cell can be The teachings are also directed to a system for bioprocessing lignin-derived compounds, the system comprising (i) a transformed host cell taught herein; (ii) a lignin-derived compound having a beta-aryl ether bond and a molecular weight ranging from about 180 Daltons to about 3000 Daltons; and, (iii) a solvent in which the lignin-derived compound is soluble; wherein, the system functions to cleave the beta-aryl ether bond by contacting a polypeptide taught herein with the lignin-derived compound in the solvent. The following examples illustrate, but do not limit, the present invention. Microbial growth and metabolism studies on soluble lignin samples are performed to test the tolerance of microbes on lignin-derived compounds. A set of aromatic and nonaromatic compounds known to inhibit growth of The set of lignin compounds to be tested might be expanded to any of the teachings provided herein. And, the microbial growth and metabolism studies on soluble lignin samples can also be performed actual industrial samples such as, for example, kraft lignins and biorefinery lignins. This example illustrates how prospective enzymes were identified for use with the teachings provided herein. Although never successfully expressed heterologously as an industrial microbe in a commercial scale process, Sphingomonas paucimobilis has been shown to produce enzymes that have some activity in cleaving the beta aryl ether bond in lignin. See Masai, E., et al. Accordingly, the enzyme discovery effort started with running BLAST searches against the two enzymes identified by Masai as having beta etherase activity, “ligE” and “ligF”. See Id. at Abstract. Table 4 lists genes identified in the BLAST searches for initial screening. The nucleotide and amino acid sequences in Table 4 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. This example describes a method for preparing recombinant host cells for the heterologous expression of known and putative beta-etherase encoding gene sequences in The gene sequences with accession numbers in Table 3 were synthesized directly as open reading frames (ORFs) from oligonucleotides by using standard PCR-based assembly methods, and using the LigE, from Accession No BAA2032.1, is listed herein as SEQ ID NO:1 for the protein and SEQ ID NO:2 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in LigE-1, from Accession No ABD26841.1, is listed herein as SEQ ID NO:101 for the protein and SEQ ID NO:102 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in LigF, from Accession No BAA2031.1 (P30347.1), is listed herein as SEQ ID NO:513 for the protein and SEQ ID NO:514 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in LigF-1, from Accession No ABD26530.1, is listed herein as SEQ ID NO:539 for the protein and SEQ ID NO:540 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in LigF-2, from Accession No ABD27301.1, is listed herein as SEQ ID NO:541 for the protein and SEQ ID NO:542 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in LigF-3, from Accession No ABD27309.1, is listed herein as SEQ ID NO:545 for the protein and SEQ ID NO:546 for the gene. An “optimized” nucleic acid sequence was created to facilitate the transformation in This example describes a method for gene expression in Clarified cell extracts were tested in the in vitro biochemical assay for beta-etherase activity on a fluorescent substrate, a model lignin dimer compound α-O-(β-methylumbelliferyl)acetovanillone (MUAV). In vitro reactions were performed in a total volume of 200 ul and contained: 25 mM TrisHCl pH 7.5; 0.5 mM dithiothreitol; 1 mM glutathione; 0.05 mM or 0.1 mM MUAV; 10 ul of clarified cell extract used to initiate the reactions. Following incubation for 2.5 hours at room temperature, a 50 ul sample of the reactions was terminated using 150 uL of 300 mM glycine/NaOH buffer pH 9. The formation of 4 methylumbelliferone (4MU) upon hydrolysis of the aryl ether bond was monitored by the increase in fluorescence at λex=360 nm and λem=450 nm using a Spectramax UV/visible/fluorescent spectrophotometer. The total protein concentrations of the cell lysates were determined using the BCA reagent system for protein quantification (Pierce). Induction might be also performed using IPTG concentrations in the range of 0.01-1 mM. Cell disruption might be also performed using toluene permeabilization, French pressure techniques, or using multiple freeze/thaw cycles in conjunction with lysozyme. Assay conditions might be varied to include TrisHCl at 10-150 mM concentrations and in the pH range of 6.5-8.5; 0-2 mM dithiothreitol; 0.05-2 mM glutathione; 0.01-5 mM MUAV substrate; 22-42° C. reaction temperatures. The biochemical assay might be performed as a fixed time point assay with reaction times ranging from 5 minutes-12 hours, or performed continuously without quenching with glycine/NaOH buffer to extract enzyme kinetic parameters. This example describes the tested biochemical activities of the newly-discovered beta-etherase enzymes. In reactions containing 0.1 mM MUAV substrate, It was also surprising to find that 3 novel As such, the enzyme discovery program unexpectedly and surprisingly generated four (4) novel polypeptides from Four (4) novel gene sequences encoding polypeptides with beta-etherase activity were discovered from One of skill will appreciate that the bioinformatic screen that was used to help identify putative enzymes is not a definitive predictor in itself of biochemical activities, particularly in view of (i) having only one known active enzyme for LigE in a different species, (ii) one known active enzyme for LigF, and (iii) the unexpected extent of such activities discovered. The tests for function therefore had to be performed empirically on the One of skill will also appreciate that the discovery of beta-etherase activities for all 4 One of skill will also appreciate that the discovery of 2 novel beta-etherases from the This example describes the extended use of bioinformatics to identify a pool of putative enzymes in the discovery program. As noted above, the bioinformatic screen that was used to help identify putative enzymes initially was not a definitive predictor in itself of biochemical activities, particularly in view of (i) having only one known active enzyme for LigE in a different species, (ii) one known active enzyme for LigF, and (iii) the unexpected extent of such activities discovered. Having the additional known active enzymes provided more information that could be used to enhance the effectiveness of the bioinformatics in identifying the pool of putative enzymes for both LigE-type and LigF-type enzymes. Sequence to function correlations for the newly discovered beta-etherases were analyzed and identified. A bioinformatic survey of functional domains, essential catalytic residues, and sequence alignments was performed for the Identifying Functional Domains As shown in The Table 5 describes conserved domains and essential amino acid residues in the Table 5 also lists fifteen (15) amino acid residues as conserved and essential for catalytic activity (column 3 of Table 5), according to some embodiments. These include: K100; A101; N104; P166; W107; Y184; Y187; R188; G191; G192; F195; V111; G112; M115; F116. While not intending to be bound by any theory or mechanism of action, these residues appear responsible for the high beta-etherase catalytic activity discovered for the In such embodiments, the essential amino acid residues of the The Table 6 describes conserved domains and essential amino acid residues in the Table 6 also lists sixteen (16) amino acid residues as conserved and essential for catalytic activity (column 3 of Table 6) of the In such embodiments, the essential amino acid residues of the Identifying Additional Functional Domains Bioinformatic methods were used to further understand the protein structure that may result in the desired activities. First, the LigE1 and LigF2 were analyzed together. Amino acid sequence alignments were performed using the Next, regions in LigE1 and LigF2 were analyzed independently in GENBANK. For LigE1, an alignment was performed against the database in GENBANK using the following query sequence: “tispfvwatkyalkhkgfdldwpggftgilertgg” (residues 19-54 of SEQ ID NO:101), from Without intending to be bound by any theory or mechanism of action, they are thought to function as protein disulfide oxidoreductases (PDOs), altering the redox state of target proteins via the reversible oxidation of their active site dithiol. The PDO members of this superfamily include the families of TRX, protein disulfide isomerase (PDI), tlpA, glutaredoxin, NrdH redoxin, and bacterial Dsb proteins (DsbA, DsbC, DsbG, DsbE, DsbDgamma). Members of the superfamily that do not function as PDOs but contain a TRX-fold domain include phosducins, peroxiredoxins, glutathione (GSH) peroxidases, SCO proteins, GSH transferases (GST, N-terminal domain), arsenic reductases, TRX-like ferredoxins and calsequestrin, among others. Table 7 lists 3 subject sequences having high identities (>80%) to residues 19-54 of LigE-1 (SEQ ID NO:101). In some embodiments, these sequences are likely to be essential to catalytic functions similar to those discovered for the The nucleotide and amino acid sequences in Table 7 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. Likewise, for LigF2, separate alignments were performed against the database in GENBANK using the following 2 query sequences: “ainpegqvpvl” (residues 47-57 of SEQ ID NO:541); and “iithttvineyled” (residues 63-76 of SEQ ID NO:541), from i. N terminal residues thought to make contact with the C terminal interface in forming the tertiary protein structure for the GST-N family of proteins; ii. N terminal residues thought to be involved in dimerization of the polypeptides; and, iii. Residues thought to be involved in the binding of glutathione substrate. Table 8 provides the percent identities and similarities to The nucleotide and amino acid sequences in Table 8 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. Table 9 provides the percent identities and similarities to The nucleotide and amino acid sequences in Table 9 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. The bioinformatics provides valuable information about protein structure that can assist in identifying test candidates. For example, the LigE1 has the 98-221 region, which is annotated in the databases as potentially responsible as component of binding and activity, dimerization, and for binding and catalysis in general. While not intending to be bound by any theory or mechanism of action, the variability in active site structures is reflected by the variability in substrate structures. Likewise, upon further research using bioinformatics, it was further discovered that the 19-54 region, which is annotated in the databases as a second region that is potentially responsible as component of the reductase function, and thus potentially responsible for catalysis in addition to the 98-221 region, while having more conservation between members. Obtaining additional structural information that will assist in finding high performing proteins within each family of strains is within the scope of the teachings to the extent that the methodology is known to one of skill. A variety of research techniques are known to one of skill. Bioinformatic methods, such as motif finding, are an example of one way to obtain the additional structural information. Motif finding, also known as profile analysis, constructs global multiple sequence alignments that attempt to align short conserved sequence motifs among the sequences in the query set. This can be done, for example, by first constructing a general global multiple sequence alignment, after which highly conserved regions are isolated, in a manner similar to what is taught herein, and used to construct a set of profile matrices. The profile matrix for each conserved region is arranged like a scoring matrix but its frequency counts for each amino acid or nucleotide at each position are derived from the conserved region's character distribution rather than from a more general empirical distribution. The profile matrices are then used to search other sequences for occurrences of the motif they characterize. LigE-1 and LigF-2 were further examined by comparing their structures to other polypeptides of the LigE-type and LigF-type, respectively. Table 10A shows conserved residues between the polypeptide sequences of LigE and LigE-1, and Table 10B shows shows conserved residues between the polypeptide sequences of LigF and LigF-2. As can be seen, there is a high degree of between-species similarity between LigE and LigE-1 in the LigE-type family. The LigE residues are from As can be seen, there is less between-species similarity between LigF and LigF-2 in the LigF-type family. The LigF residues are from This example provides additional sequences for a second round of assays, the sequences containing the 3 conserved functional domains described herein for the GST_C family of proteins, and belong to the beta-etherase LigE subfamily. Table 11 lists nine (9) additional sequences having identities of 51%-73% at the amino acid level that were identified in the SwissProt database using the The nucleotide and amino acid sequences in Table 11 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. This example describes how native lignin core structures can be hydrolyzed by the action of C alpha-dehydrogenases, beta-etherases, and glutathione-eliminating enzymes. While the LigE and LigF polypeptides, or similar ones described herein, might be sufficient to hydrolyze native lignin structures, it would be useful to discover novel C alpha dehydrogenases ( Table 12 lists potential C alpha-dehydrogenase polypeptide sequences, the LigD-type, for use in conjunction with beta etherases including, but not limited to, LigE/F. The sequences were identified using bioinformatic methods, such as those taught herein. These C alpha-dehydrogenases are classified in the CDD as short-chain dehydrogenase/reductases (SDRs) and are a functionally diverse family of oxidoreductases that have a single domain with a structurally conserved Rossmann fold (alpha/beta folding pattern with a central beta-sheet), an NAD(P)(H)-binding region, and a structurally diverse C-terminal region. Classical SDRs are typically about 250 residues long, while extended SDRs are approximately 350 residues. Sequence identity between different SDR enzymes are typically in the 15-30% range, but the enzymes share the Rossmann fold NAD-binding motif and characteristic NAD-binding and catalytic sequence patterns. Without intending to be bound by any theory or mechanism of action, these enzymes are thought to catalyze a wide range of activities including the metabolism of steroids, cofactors, carbohydrates, lipids, aromatic compounds, and amino acids, and act in redox sensing. Classical SDRs have an TGXXX[AG]XG cofactor binding motif and a YXXXK active site motif, with the Tyr residue of the active site motif serving as a critical catalytic residue (Tyr-151, human prostaglandin dehydrogenase (PGDH) numbering). In addition to the Tyr and Lys, there is often an upstream Ser (Ser-138, PGDH numbering) and/or an Asn (Asn-107, PGDH numbering) contributing to the active site; while substrate binding is in the C-terminal region, which determines specificity. Without intending to be bound by any theory or mechanism of action, the standard reaction mechanism is thought to be a 4-pro-S hydride transfer and proton relay involving the conserved Tyr and Lys, a water molecule stabilized by Asn, and nicotinamide. Extended SDRs have additional elements in the C-terminal region, and typically have a TGXXGXXG cofactor binding motif. Complex (multidomain) SDRs such as ketoreductase domains of fatty acid synthase can have a GGXGXXG NAD(P)-binding motif and an altered active site motif (YXXXN). Fungal type ketoacyl reductases can have a TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs are thought to have lost catalytic activity and/or have an unusual NAD(P)-binding motif and missing or unusual active site residues. Reactions catalyzed within the SDR family can include isomerization, decarboxylation, epimerization, C═N bond reduction, dehydratase activity, dehalogenation, Enoyl-CoA reduction, and carbonyl-alcohol oxidoreduction. The nucleotide and amino acid sequences in Table 12 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. Table 13 lists potential LigG (glutathione-eliminating)-like enzyme sequences for use in conjunction with beta etherases including, but not limited to, LigE/F. The sequences were identified using bioinformatic methods, such as those taught herein. These might be utilized in conjunction with C-alpha dehydrogenases, and/or with LigE/F-like beta-etherases. The LigG-like proteins are annotated in the CDD as glutathione S-transferase (GST)-like proteins with similarity to the GST_C family, the GST-N family, and the thioredoxin (TRX)-like superfamily of proteins containing a TRX fold. The nucleotide and amino acid sequences in Table 13 are incorporated herein by reference in their entirety through the GenBank Accession Numbers. This example describes the creation of a novel recombinant microbial system for the conversion of lignin oligomers to monomers. The heterologous production of beta etherases, Cα dehydrogenases, and other enzymes for the production of lignin monomers and aromatic products in The biochemical activity of a newly-discovered beta-etherase enzyme functionally expressed in This example describes the design and use of recombinant One example of a microbial process to a commercial aromatic compound might be the production of catechol from lignin-derived phenolic compounds. Catechol might be produced from guaiacol using an i) Fractionation of soluble lignin—Concentration or partial purification of soluble biorefinery lignin fractions or phenolic streams using methods known to one of skill. ii) Biotransformation—The biotransformation of the phenolic substrate stream might be carried out in a fed-batch bioprocess using iii) Product separation—The product, such as catechol, might be purified from the aqueous culture broths using standard chemical separation methods such as liquid-liquid extractions (LLE) with solvents of varying polarities applied in a sequential manner. Additional examples of designed biochemical routes to aromatic products are described below: i) lignin-derived syringic acid might be converted to gallic acid via a 2-step biochemical conversion using aryl aldehyde oxidases and demethylases. ii) Lignin-derived vanillin might be converted to protocatechuic acid via a 2-step biochemical conversion using aryl aldehyde oxidases and demethylases. iii) Lignin-derived vanillin might be converted to catechol via a 3-step biochemical conversion using aryl aldehyde oxidases, aromatic decarboxylases, and demethylases. iv) Lignin-derived 2-methoxytoluene might be converted to the urethane precursor 2,4-diaminotoluene via a 4-step biochemical conversion using demethylases, ferulate-5-hydroxylases, 2,4-nitrophenol oxidoreductases, and 2,4-nitrobenzene reductases. In each case, the specific enzymes might be engineered into This example describes potential LigE-, LigF-, LigG-, and LigD-type polypeptides, and the genes encoding them. The potential polypeptides were identified using bioinformatic methods, such as those taught herein. As described above, the query sequences in the initial pass for the LigE-type and LigF-type were Sphingomonas paucimobilis sequences, such as those discussed in Masai, E., et al. Likewise, the query sequences for the LigG-type and LigD-type were also Sphingomonas paucimobilis sequences, such as those discussed in Masai. The following sequences were used in the initial pass for all queries: LigE, from Accession No BAA2032.1, is listed herein as SEQ ID NO:1 for the protein and SEQ ID NO:2 for the gene. LigF, from Accession No BAA2031.1 (P30347.1), is listed herein as SEQ ID NO:513 for the protein and SEQ ID NO:514 for the gene. LigG, from Accession No Q9Z339.2, is listed herein as SEQ ID NO:733 for the protein and SEQ ID NO:734 for the gene. LigD, from Accession No Q01198.1, is listed herein as SEQ ID NO:777 for the protein and SEQ ID NO:778 for the gene. The following sequences were used in a modified query to further refine the LigE-type and LigF-type, and the query sequences were the LigE-1 and LigF-2 that showed the surprising and unexpected results shown in LigE-1, from Accession No ABD26841.1, is listed herein as SEQ ID NO:101 for the protein and SEQ ID NO:102 for the gene. LigF-2, from Accession No ABD27301.1, is listed herein as SEQ ID NO:541 for the protein and SEQ ID NO:542 for the gene. Table 16 lists SEQ ID NOs:1-246, which are potential protein sequences of the LigE-type, as well as a respective gene sequence encoding the protein. Table 17 lists SEQ ID NOs:247-576, which are potential protein sequences of the LigF-type, as well as a respective gene sequence encoding the protein. Table 18 lists SEQ ID NOs:577-776, which are potential protein sequences of the LigG-type, as well as a respective gene sequence encoding the protein. Table 19 lists SEQ ID NOs: 777-976, which are potential protein sequences of the LigD-type, as well as a respective gene sequence encoding the protein. Bioinformatic methods, such as those described herein, can be used to suggest an efficient order of experimentation to identify additional potential enzymes for use with the teachings provided herein. Moreover, mutations and amino acid substitutions can be used to test affects on enzyme activity to further understand the structure of the most active proteins with respect to the enzyme functions sought by teachings provided herein. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, that there are many equivalents to the specific embodiments described herein that have been described and enabled to the extent that one of skill in the art can practice the invention well-beyond the scope of the specific embodiments taught herein. Such equivalents are intended to be encompassed by the following claims. In addition, there are numerous lists and Markush groups taught and claimed herein. One of skill will appreciate that each such list and group contains various species and can be modified by the removal, or addition, of one or more of species, since every list and group taught and claimed herein may not be applicable to every embodiment feasible in the practice of the invention. As such, components in such lists can be removed and are expected to be removed to reflect some embodiments taught herein. All publications, patents, patent applications, other references, accession numbers, ATCC numbers, etc., mentioned in this application are herein incorporated by reference into the specification to the same extent as if each was specifically indicated to be herein incorporated by reference in its entirety.CROSS-REFERENCE TO RELATED APPLICATIONS
SEQUENCE LISTING
BACKGROUND OF THE INVENTION
SUMMARY
BRIEF DESCRIPTION OF THE DRAWINGS
DETAILED DESCRIPTION OF THE INVENTION
Stage 2. Use at least one organic solvent from those described in 6-6c in water for the organosolve step;
Stage 3. Use yeast, white rot basidomycetes, actinomycetes, and cellulases and xylanases in native or recombinant forms for the biological pretreatment step.
Characterization of Lignin-Derived Compounds for Use in Bioconversion
Wheat Straw 16-21 45 46 9 Rice Straw 6 45 40 15 Rye Straw 18 43 53 1 Hemp 8-13 51 40 9 Tall Fescue: Stems 7-10 55 42 3 Internodes 11 48 50 2 Flax 21-34 67 29 4 Jute 15-26 36 62 2 Sisal 7-14 22 76 2 Curaua Leaf fiber 7 29 41 30 Banana Plant Leaf 43 50 7 Piassava Fiber 45 40 9 51 (Plam Tree) Abaca 7-9 19 55 26 Loblolly Pine 29 86 2 12 29 87 0 13 Compression 60 40 Spruce ( 28 94 1 5 MWL 98 2 0 22 14 84 2 27 27 69 4 Birch 22 29 69 2 Beech 26 56 40 4 28 48 49 3
Table 1A compares location of a sample in the biomass, species, and environmental stress to the G:S:H.
White Birch G: S Fiber, S2 layer 12 88 Vessel, S2 Layer 88 12 Ray parenchyma, S-layer 49 51 Middle lamella (fiber/fiber) 91 9 Middle lamella (fiber/vessel) 80 20 Middle lamella (fiber/ray) 100 0 Middle lamella (ray/ray) 88 12 G: S: H Lignin Samples 19 80 1 35 59 6 23 57 20 25 72 3 22 73 6 Lobolly Pine Juvenile Normal 95 5 Wind Opposite 96 4 Wind Compression 89 11 Bent Opposite 96 4 Bent Compression 88 12 indicates data missing or illegible when filed
Fractionation of Lignin-Derived Compounds for Use in Bioconversion
Chemical Products
Enzyme Discovery
Alanine A Ala Leucine L Leu Arginine R Arg Lysine K Lys Asparagine N Asn Methionine M Met Aspartic acid D Asp Phenylalanine F Phe Cysteine C Cys Proline P Pro Glutamic acid E Glu Serine S Ser Glutamine Q Gln Threonine T Thr Glycine G Gly Tryptophan W Trp Histidine H His Tyrosine Y Tyr Isoleucine I Ile Valine V Val Ornithine O Orn Other Xaa Methods of Preparing the Recombinant SDF-1 Polynucleotide and Polypeptides
Microbial Systems—Antimicrobial Lignin-Derived Compounds
Microbial Systems—Suitable Microbes
Microbial Systems—
EXAMPLES
Example 1
1 Syringic acid 2 Syringaldehyde 3 Gallic acid 4 Furfural 5 5-Hydroxymethylfurfural 6 4-hydroxybenzaldehyde 7 Hydroxybenzoate 8 Vanillin 9 Vanillic acid 10 Cinnamic acid 11 o-, m-and p-Coumaric acids 12 2-hydroxy-3-methoxybenzaldehyde 13 2,4,6-trihydroxybenzaldehyde 14 4-hydroxy-2,6-dimethoxybenzaldehyde Example 2
1 ligE Beta- BAA02032.1 etherase 2 ligE-1 Putative ABD26841.1 (62%) (75%) Beta- etherase 3 ligF Beta- BAA02031.1 etherase 4 ligF-1 Putative ABD26530.1 (60%) (77%) Beta- etherase 5 ligF-2 Putative ABD27301.1 (47%) (59%) Beta- etherase 6 ligF-3 Putative ABD27309.1 (37%) (57%) Beta- etherase Example 3
Example 3
Example 4
Example 5
Dimer (residues 98-221 K100; A101; K100->R interface of SEQ ID N104; P166 A101->L; I; V; G; S NO: 101) N104->Q; H; S; A N terminal (residues 98-221 K100; W107; K100->R domain of SEQ ID Y184; Y187; W107->Y; F; A; S interface NO: 101) R188; G191; Y184->W; F; A; S F195 Y187-> W; F; A; S R188->K G191-> L; I; V; A; S F195->W; Y; A; S Lignin/ (residues 98-221 W107; V111; W107->Y; F; A; S substrate of SEQ ID G112; M115; V111-> L; I; G; A; S binding NO: 101) F116; G192; G112-> L; I; V; A; S pocket or F195 M115->S; A; G H site G192-> L; I; V; A; S F195-> W; Y; A; S Dimer (residues 99-230 R100; Y101; K104; R100->K interface of SEQ ID K176 Y101-> W; F; A; S NO: 541) K104->R K176->R N terminal (residues 99-230 R100; D107; L194; R100->K domain of SEQ ID I197; N198; S201; D107->E interface NO: 541) M206 L194-> V; I; G; A; S I197-> L; V; G; A; S N198->Q S201->A; M; G M206->S; A; G Substrate (residues 99-230 D107; M111; N112; D107->E binding of SEQ ID S115; M116; M206; M111->S; A; G pocket NO: 541) H202 N112->Q or H site S115->A; M; G M116->S; A; G M206->S; A; G H202->N; Q; S; M (residues 19-54 of SEQ ID BAA02032.1 89/97 NO: 1) TISPYVWRTKYALKHKGFDI etherase DIVPGGFTGILERTGG (residues 19-54 of SEQ ID YP004533906.1 86/92 NO: 89) PP1Y; glutathione S TISPFVWRTKYALAHKGFD transferase like protein VDIVPGGFTGIAERTGG (residues 19-54 of SEQ ID BAJ11989.1 83/94 NO: 3) 6; TISPFVWATKYAIAHKGFEL beta-etherase DIVPGGFSGIPERTGG (residues 45-55 of ZP_03840063.1 91/91 SEQ ID NO: 983) 29906; glutathione S- AINPKGQVPVL transferase (residues 60-70 of ZP_08683997.1 82/91 SEQ ID NO: 985) 33926; glutathione S- AINPQGQVPAL transferase (residues 43-53 of YP_425114.1 82/91 SEQ ID NO: 987) glutathione S-transferase- AMNPEGEVPVL like protein (residues 46-56 of ZP_05317369.1 82/91 SEQ ID NO: 989) 29256; glutathione S- AINPQGQVPAL transferase (residues 46-56 of ZP_05978410.1 82/91 SEQ ID NO: 991) 25996; glutathione S- AINPQGQVPAL transferase (residues 19-29 of alpha proteobacterium ZP_02189431.1 82/91 SEQ ID NO: 993) BAL199; Glutathione S- AINPAGEVPVL transferase-like protein (residues 31-41 of ZP_01077889.1 91/91 SEQ ID NO: 995) glutathione S-transferase AINPLGQVPVL (residues 46-55 of ZP_03805830.1 90/90 SEQ ID NO: 997) 35198; hypothetical protein INPKGQVPVL PROPEN_04226 (residues 45-55 of AURANDRAFT_7474 EGB13094.1 82/91 SEQ ID NO: 999) AINPQGKVPVL hypothetical protein (residues 107-115 of XP_003019921.1 100/100 SEQ ID NO: 1001) HKI 0517; conserved TVINEYLED hypothetical protein (residues 103-111 of XP_003017304.1 100/100 SEQ ID NO: 1003) CBS 112371; conserved TVINEYLED hypothetical protein (residues 72-80 of XP_003232549.1 100/100 SEQ ID NO: 1005) 118892; glutathione TVINEYLED transferase (residues 62-75 of YP_004533905.1 79/79 SEQ ID NO: 1007) glutathione S-transferase- IITESTVICEYLED like protein (residues 84-92 of XP_003171868.1 89/100 SEQ ID NO: 1009) 118893; hypothetical protein TVINEFLED MGYG_06412 (residues 61-69 of EGE04518.1 89/100 SEQ ID NO: 1011) 127.97; hypothetical protein TVINEFLED TEQG_03389 M 1 A 2 N 4 N 5 T 6 I 7 T 8 Y 10 D 11 L 12 L 14 G 17 T 19 I 20 S 21 P 22 V 24 W 25 T 27 K 28 Y 29 A 30 L 31 K 32 H 33 K 34 G 35 F 36 D 37 D 39 V 41 P 42 G 43 G 44 F 45 T 46 G 47 I 48 L 49 E 50 R 51 T 52 G 53 G 54 E 57 R 58 P 60 I 62 V 63 D 64 D 65 G 66 E 67 V 69 L 70 D 71 S 72 W 73 I 75 E 77 Y 78 L 79 D 80 K 82 Y 83 P 84 D 85 R 86 P 87 L 89 K 100 L 102 D 103 N 104 W 105 W 107 A 110 V 111 G 112 P 113 W 114 C 117 D 121 Y 122 D 124 L 125 S 126 L 127 P 128 Q 129 D 130 Y 133 V 134 S 137 R 138 E 139 L 148 E 149 V 151 Q 152 A 153 G 154 R 155 E 156 R 158 L 159 P 160 L 166 E 167 P 168 R 170 L 173 A 174 W 178 L 179 G 180 G 181 P 184 N 185 A 187 D 188 Y 189 T 198 A 199 S 200 V 201 T 204 P 205 L 207 D 210 D 211 P 212 L 213 R 214 D 215 W 216 R 219 D 222 L 223 G 226 L 227 G 228 R 229 H 230 P 231 G 232 P 235 L 236 F 237 G 238 L 239 R 242 E 243 G 244 D 245 P 246 F 249 R 251 G 254 G 257 N 264 G 266 P 267 T 270 R 275 E 278 M 1 Y 6 P 10 A 12 N 13 S 14 K 16 L 21 E 23 K 24 G 25 L 26 E 29 D 34 F 38 E 39 H 41 F 45 I 48 N 49 P 50 G 52 V 54 P 55 T 65 T 68 I 70 E 72 Y 73 L 74 E 75 D 76 L 85 P 87 D 89 R 97 W 99 K 101 L 161 K 167 E 176 L 179 L 185 Y 190 L 192 A 193 D 194 I 195 P 221 L 223 W 226 R 229 R 233 P 234 A 235 Example 6
7 P28342.1/121736 59 Glutathione S transferase 8 P57108.1/11132235 51 Glutathione S transferase 9 P04907.4/1170090 70 Glutathione S transferase 10 P57109.1/11133449 58 Maleylacetoacetate isomerase 11 P46420.2/1170092 63 Glutathione S transferase 12 Q8L7C9.1/75329755 61 Glutathione S transferase 13 P42769.1/1170093 73 Glutathione S transferase 14 O65857.2/57012737 59 Group; Probable Gluta- thione S transferase 15 O82451.3/57012739 62 Group; Probable Gluta- thione S transferase Example 7
1 YP495487.1 78/88 2 YP496072.1 39/58 3 YP496073.1 39/59 4 YP495984.1 35/56 5 YP497149.1 38/58 1 YP_498160.1 23/41 2 YP_002798340 32/50 Example 8
1 Wiscon- 12518 8 Ad116 17962 14 B-6 7489 sin O 2 3a 12837 9 NRS 16 25308 15 B-9 7492 3 AV-3 13266 10 UWD 478 16 37 9046 4 AV-4 13267 11 113 53800 17 V1 7496 5 AV-5 13268 12 B-1 7484 18 3 9047 6 OP 13705 13 B-4 7487 7 135 53799 — — — — — — [VKMB- 547] Example 9
Catechol
30 x 103 2.34 Antioxident: 4-tert- butylcatechol. Flavors: piperonal; veratrol. Insecticides: carbofuran; propoxur.
Vanillin
20 x 103 6.12 Flavor agent. Precursor for pharmaceutical methyldopa.
2,4-Diaminotoluene
3 x 106 1.65 Precursor to toluene diisocyanates for urethane polymers.
Salicylic acid
1.6 x 103(US) 3.92 Precursor to analgesic drug acetylsalicylic acid. Precursor to fragrances: amyl and methyl esters of salicylic acid.
Aminosalicylic acid
57.38 Tuberculosis drug.
ortho-Cresol
38 x 103 0.8 Precursors to herbicides: 4-chloro-2- methyl- phenoxyacetic acid; 2-(4- chloro-2- methyl- phenoxy)- propionic acid Example 10
1 2 BAA02032.1 LIGE 3 4 BAJ11989.1 beta-etherase [ LIGE 5 6 EFV85608.1 glutathione S-transferase domain-containing LIGE protein [ 7 8 EFW42705.1 predicted protein [ LIGE 9 10 EGE55257.1 Glutathione S-transferase domain-containing LIGE protein [ 11 12 EGP48556.1 glutathione S-transferase domain-containing LIGE protein [ 13 14 EGP57475.1 lignin degradation protein [ LIGE 15 16 EGU12703.1 Glutathione S-transferase [ LIGE ATCC 204091] 17 18 EGU56510.1 glutathione S-transferase domain-containing LIGE protein [ 19 20 NP_053324.1 hypothetical protein pTi-SAKURA_p086 LIGE [ tiorf84 [ 21 22 NP_108131.1 lignin beta-ether hydrolase [ LIGE MAFF303099] >dbj|BAB54276.1|lignin beta- ether hydrolase [ 23 24 NP_354140.2 lignin degradation protein [ LIGE degradation protein [ 25 26 NP_385269.1 putative BETA-etherase (BETA-aryl ether LIGE cleaving enzyme) protein [ 1021] >emb|CAC45742.1|Putative beta- etherase (beta-aryl ether cleaving enzyme) protein [ >gb|AEG03720.1|Glutathione S-transferase domain protein [ >gb|AEH79753.1|putative BETA-etherase 27 28 NP_774067.1 ligninase [ LIGE >dbj|BAC52692.1|ligE [ 29 30 NP_949676.1 putative lignin beta-ether hydrolase LIGE [ >emb|CAE29781.1|putative lignin beta-ether 31 32 P27457.3 RecName: Full = Beta-etherase; AltName: LIGE Full = Beta-aryl ether cleaving enzyme >gb|AAA25878.1|beta-etherase [ 33 34 P_003028922.
hypothetical protein SCHCODRAFT_85860 LIGE [ >gb|EFI94019.1|hypothetical protein 35 36 P_003030384.
hypothetical protein SCHCODRAFT_57691 LIGE [ >gb|EFI95481.1|hypothetical protein 37 38 P_003033715.
hypothetical protein SCHCODRAFT_81614 LIGE [ >gb|EFI98812.1|hypothetical protein 39 40 P_003041213.
hypothetical protein NECHADRAFT_55532 LIGE [ >gb|EEU35500.1|hypothetical protein NECHADRAFT_55532 [ 41 42 XP_382462.1 hypothetical protein FG02286.1 [ LIGE 43 44 P_001207860.
putative glutathione S-transferase (GST) LIGE [ >emb|CAL79645.1|putative glutathione S- 45 46 P_001236206.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABQ32287.1|Glutathione S-transferase, N- terminal domain protein [ 47 48 P_001237901.
putative glutathione S-transferase LIGE [ putative glutathione S-transferase (GST) 49 50 P_001262153.
hypothetical protein Swit_1652 [ LIGE protein Swit_1652 [ 51 52 P_001326465.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABR59630.1|Glutathione S-transferase domain [ 53 54 P_001413220.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABS63563.1|Glutathione S-transferase domain [ 55 56 P_001526182.
glutathione S-transferase [ LIGE glutathione S-transferase [ 57 58 P_001616516.
lignin degradation protein [ LIGE ‘So ce 56’] >emb|CAN96036.1|lignin degradation protein [ 59 60 P_001772944.
glutathione S-transferase domain-containing LIGE protein [ >gb|ACA20510.1|Glutathione S-transferase 61 62 P_001833458.
glutathione S-transferase domain-containing LIGE protein [ 9039] >gb|ACB95969.1|Glutathione S- transferase domain [ 63 64 P_001977695.
beta-aryl ether cleaving enzyme, lignin LIGE degradation protein [ >gb|ACE90517.1|beta-aryl ether cleaving enzyme, lignin degradation protein [ 65 66 P_001993784.
glutathione S-transferase domain-containing LIGE protein [ >gb|ACF03309.1|Glutathione S-transferase domain [ 67 68 P_002280598.
glutathione S-transferase domain [ LIGE >gb|ACI54372.1|Glutathione S-transferase domain [ 69 70 P_002290149.
glutathione S-transferase [ LIGE beta etherase [ OM5] >gb|ACI94284.1|glutathione S- transferase [ >gb|AEI02075.1|putative beta etherase [ 71 72 P_002362903.
glutathione S-transferase domain-containing LIGE protein [ >gb|ACK51541.1|glutathione S-transferase 73 74 P_002502105.
glutathione S-transferase domain-containing LIGE protein [ >gb|ACL61802.1|Glutathione S-transferase domain protein [ 75 76 P_002549116.
lignin degradation protein [ LIGE S4] >gb|ACM36110.1|lignin degradation protein [ 77 78 P_002797805.
glutathione S-transferase-like protein LIGE [ Glutathione S-transferase-like protein 79 80 P_002825455.
putative lignin beta-ether hydrolase LIGE [ >gb|ACP24702.1|putative lignin beta-ether 81 82 P_002975056.
glutathione S-transferase domain protein LIGE [ >gb|ACS55517.1|Glutathione S-transferase domain protein [ 83 84 P_004278359.
lignin degradation protein [ LIGE H13-3] >gb|ADY64039.1|lignin degradation protein [ 85 86 P_004285673.
putative beta-etherase [ LIGE AIU301] >dbj|BAJ82791.1|putative beta- etherase [ 87 88 P_004378290.
glutathione S-transferase-like protein LIGE [ >gb|AEB56538.1|glutathione S-transferase-like 89 90 P_004533906.
glutathione S-transferase-like protein LIGE [ >emb|CCA92088.1|glutathione S-transferase- 91 92 P_004548326.
glutathione S-transferase domain-containing LIGE protein [ >gb|AEG52712.1|Glutathione S-transferase 93 94 P_004613710.
glutathione S-transferase domain-containing LIGE protein [ WSM2075] >gb|AEH89616.1|Glutathione S- transferase domain protein [ 95 96 YP_269568.1 putative lignin beta-etherase [ LIGE lignin beta-etherase [ 97 98 YP_469001.1 beta-aryl ether cleaving enzyme, lignin LIGE degradation protein [ >gb|ABC90274.1|beta-aryl ether cleaving enzyme, lignin degradation protein [ 99 100 YP_487746.1 glutathione S-transferase-like protein LIGE [ >gb|ABD08835.1|Glutathione S-transferase-like 101 102 YP_497675.1 glutathione S-transferase-like protein LIGE [ >gb|ABD26841.1|glutathione S-transferase-like protein [ 103 104 YP_533979.1 glutathione S-transferase-like protein LIGE [ >gb|ABD89660.1|glutathione S-transferase-like 105 106 YP_574731.1 glutathione S-transferase-like protein LIGE [ >gb|ABE60032.1|glutathione S-transferase-like protein [ 107 108 YP_723508.1 glutathione S-transferase-like protein LIGE [ >gb|ABG53035.1|glutathione S-transferase-like 109 110 YP_767183.1 etherase [ LIGE 3841] >emb|CAK07074.1|putative etherase [ 111 112 YP_783091.1 glutathione S-transferase [ LIGE S-transferase [ 113 114 YP_915395.1 glutathione S-transferase domain-containing LIGE protein [ >gb|ABL69699.1|Glutathione S-transferase, N- terminal domain [ 115 116 ZP_02146530. putative beta-etherase (beta-aryl ether cleaving LIGE enzyme) protein [ BS107] >gb|EDQ11875.1|putative beta- etherase (beta-aryl ether cleaving enzyme) 117 118 ZP_02149699. putative beta-etherase (beta-aryl ether cleaving LIGE enzyme) protein [ 2.10] >gb|EDQ08644.1|putative beta-etherase (beta-aryl ether cleaving enzyme) protein 119 120 ZP_02166231. putative beta-etherase (beta-aryl ether cleaving LIGE enzyme) protein [ >gb|EDQ33834.1|putative beta-etherase (beta- aryl ether cleaving enzyme) protein [ 121 122 ZP_02190934. glutathione S-transferase-like protein [alpha LIGE proteobacterium BAL199] >gb|EDP62276.1| glutathione S-transferase-like protein [alpha 123 124 ZP_03503368. Glutathione S-transferase domain [ LIGE 125 126 ZP_03507162. Glutathione S-transferase domain [ LIGE 127 128 ZP_03513891. Glutathione S-transferase domain [ LIGE 129 130 ZP_03519388. Glutathione S-transferase domain [ LIGE 131 132 ZP_03520502. putative etherase [ LIGE 133 134 ZP_05084767. glutathione S-transferase, N-terminal domain LIGE [ glutathione S-transferase, N-terminal domain 135 136 ZP_06688745. lignin degradation protein [ LIGE degradation protein [ 137 138 ZP_06898146. glutathione S-transferase family protein LIGE [ >gb|EFH10151.1|glutathione S-transferase family protein [ 139 140 ZP_07027473. Glutathione S-transferase domain protein [ LIGE sp. 1NLS2] >gb|EFI51229.1|Glutathione S- transferase domain protein [ 141 142 ZP_07373940. beta-etherase [ LIGE >gb|EFL90585.1|beta-etherase [ 143 144 ZP_08328512. Glutathione S-transferase [gamma LIGE proteobacterium IMCC1989] >gb|EGG95341.1| Glutathione S-transferase [gamma 145 146 ZP_08529965. lignin degradation protein [ LIGE ATCC 31749] >gb|EGL63395.1|lignin degradation protein [ 147 148 ZP_08627134. lignin beta-ether hydrolase [ LIGE ether hydrolase [ 149 150 ZP_08631370. Glutathione S-transferase domain-containing LIGE protein [ Glutathione S-transferase domain-containing 151 152 ZP_08634908. Glutathione S-transferase domain-containing LIGE protein [ Glutathione S-transferase domain-containing 153 154 ZP_08635074. glutathione S-transferase domain-containing LIGE protein [ glutathione S-transferase domain-containing 155 156 EGN93792.1 hypothetical protein SERLA73DRAFT_115219 LIGE [ >gb|EGO19163.1|hypothetical protein SERLADRAFT_453680 [ 157 158 EGN94392.1 hypothetical protein SERLA73DRAFT_188253 LIGE [ >gb|EGO19875.1|hypothetical protein SERLADRAFT_478300 [ 159 160 EGN96317.1 hypothetical protein SERLA73DRAFT_186005 LIGE [ >gb|EGO21854.1|hypothetical protein SERLADRAFT_474829 [ 161 162 EGN96924.1 hypothetical protein SERLA73DRAFT_185168 LIGE [ >gb|EGO22516.1|hypothetical protein SERLADRAFT_473468 [ 163 164 EGO00367.1 hypothetical protein SERLA73DRAFT_107446 LIGE [ >gb|EGO25928.1|hypothetical protein SERLADRAFT_415302 [ 165 166 P_001215222.
conserved hypothetical protein [ LIGE hypothetical protein [ 167 168 P_001823934.
hypothetical protein AOR_1_322094 [ LIGE protein product [ 169 170 P_001839188.
hypothetical protein CC1G_07903 [ LIGE hypothetical protein CC1G_07903 [ 171 172 P_001885678.
predicted protein [ LIGE >gb|EDR03530.1|predicted protein [ 173 174 P_002152364.
conserved hypothetical protein [ LIGE conserved hypothetical protein [ 175 176 P_002380998.
conserved hypothetical protein [ LIGE hypothetical protein [ 177 178 P_002392962.
hypothetical protein MPER_07394 LIGE [ >gb|EEB93892.1|hypothetical protein 179 180 P_002468854.
predicted protein [ LIGE >gb|EED86077.1|predicted protein [ 181 182 P_002472522.
predicted protein [ LIGE >gb|EED82308.1|predicted protein [ 183 184 P_002557398.
Pc12g05530 [ LIGE Wisconsin 54-1255] >emb|CAP80180.1| Pc12g05530 [ 185 186 P_003026159.
hypothetical protein SCHCODRAFT_12387 LIGE [ >gb|EFI91256.1|hypothetical protein 187 188 P_003028923.
hypothetical protein SCHCODRAFT_111982 LIGE [ >gb|EFI94020.1|hypothetical protein 189 190 P_003890246.
Glutathione S-transferase domain-containing LIGE protein [ >gb|ADN16971.1|Glutathione S-transferase 191 192 P_003896657.
glutathione S-transferase-like [ LIGE glutathione S-transferase-like [ 193 194 P_003980382.
glutathione S-transferase [ LIGE S-transferase, N-terminal domain protein 4 195 196 P_004110838.
glutathione S-transferase domain-containing LIGE protein [ >gb|ADU46105.1|Glutathione S-transferase domain [ 197 198 P_004143867.
glutathione S-transferase [ LIGE biovar Glutathione S-transferase domain [ 199 200 ZP_01102591. conserved hypothetical protein [ LIGE hypothetical protein [ 201 202 AAA87183.1 auxin-induced protein [ LIGE 203 204 AAG34797.1 glutathione S-transferase GST 7 [ LIGE 205 206 AAO69664.1 glutathione S-transferase [ LIGE 207 208 ACU24385.1 unknown [ LIGE 209 210 ADP99065.1 glutathione S-transferase [ LIGE 211 212 ADY82158.1 putative glutathione S-transferase [ LIGE 213 214 BAA77215.1 beta-etherase [ LIGE 215 216 P_001839584.
hypothetical protein CC1G_12612 [ LIGE hypothetical protein CC1G_12612 [ 217 218 P_002336443.
predicted protein [ LIGE >gb|EEE73479.1|predicted protein [ 219 220 P_003028624.
hypothetical protein SCHCODRAFT_59314 LIGE [ >gb|EFI93721.1|hypothetical protein 221 222 XP_456365.1 DEHA2A00660p [ LIGE CBS767] >emb|CAG84310.1|DEHA2A00660p [ 223 224 XP_572781.1 hypothetical protein [ LIGE var. hypothetical protein CNBH0460 [ >gb|EAL19352.1|hypothetical protein CNBH0460 [ 225 226 P_001236206.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABQ32287.1|Glutathione S-transferase, N- terminal domain protein [ 227 228 P_001237901.
putative glutathione S-transferase LIGE [ putative glutathione S-transferase (GST) 229 230 P_001262153.
hypothetical protein Swit_1652 [ LIGE protein Swit_1652 [ 231 232 P_001326465.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABR59630.1|Glutathione S-transferase domain [ 233 234 P_001413220.
glutathione S-transferase domain-containing LIGE protein [ >gb|ABS63563.1|Glutathione S-transferase domain [ 235 236 P_001526182.
glutathione S-transferase [ LIGE glutathione S-transferase [ 237 238 YP_171459.1 glutathione S-transferase [ LIGE glutathione S-transferase [ glutathione S-transferase [ 239 240 YP_322424.1 glutathione S-transferase-like protein [ LIGE Glutathione S-transferase-like protein 241 242 ZP_01625805. glutathione S-transferase, putative [marine LIGE gamma proteobacterium HTCC2080] >gb|EAW41324.1|glutathione S-transferase, putative [marine gamma proteobacterium 243 244 ZP_01631145. Glutathione S-transferase-like protein [ LIGE Glutathione S-transferase-like protein [ 245 246 ZP_06057261. glutathione S-transferase [ LIGE glutathione S-transferase [ indicates data missing or illegible when filed
247 248 AAB65163.1 glutathione S-transferase, class-phi LigF [ 249 250 AAG34850.1 glutathione S-transferase GST 42 [ LigF 251 252 AAK98535.1 putative glutathione S-transferase LigF OsGSTU7 [ Group] 253 254 AAL61612.1 glutathione S-transferase [ LigF 255 256 ABE86679.1 Intracellular chloride channel [ LigF 257 258 ABE86683.1 Intracellular chloride channel [ LigF 259 260 ABQ96853.1 glutathione S-transferase [ LigF 261 262 ACF15452.1 glutathione-S-transferase LigF [ 263 264 ACG44597.1 glutathione S-transferase GSTU6 [ LigF 265 266 ACJ86045.1 unknown [ LigF 267 268 ACO15091.1 Probable maleylacetoacetate isomerase LigF 2 [ 269 270 ADB11335.1 phi class glutathione transferase GSTF7 LigF [ 271 272 BAB70616.1 glutathione S-transferase [ LigF 273 274 BAF56180.1 glutathione S-transferase [ LigF 275 276 BAJ90004.1 predicted protein [ LigF subsp. predicted protein [ subsp. 277 278 CAI51314.2 glutathione S-transferase GST1 LigF [ 279 280 EAY79299.1 hypothetical protein OsI_34425 [ LigF 281 282 EAZ16758.1 hypothetical protein OsJ_32234 [ LigF 283 284 EEC67342.1 hypothetical protein OsI_34397 [ LigF 285 286 EFV87279.1 glutathione S-transferase LigF [ 287 288 EGN92742.1 hypothetical protein LigF SERLA73DRAFT_190579 [ >gb|EGO26403.1|hypothetical protein SERLADRAFT_463437 [ 289 290 EGU75635.1 hypothetical protein FOXB_13869 LigF [ 291 292 NP_001065115.1 Os10g0525600 [ LigF Group] >gb|AAM12493.1|AC074232_20 putative glutathione S-transferase [ >dbj|BAF27029.1|Os10g0525600 [ 293 294 NP_001065118.1 Os10g0527400 [ LigF Group] >gb|AAM12310.1|AC091680_11 putative glutathione S-transferase [ >gb|AAM12478.1|AC074232_5 putative glutathione S-transferase [ glutathione S-transferase GSTU6, putative, expressed [ Os10g0527400 [ Group] >gb|EEE51298.1|hypothetical protein OsJ_32225 [ 295 296 NP_001065126.1 Os10g0529300 [ LigF Group] >gb|AAK98546.1|AF402805_1 putative glutathione S-transferase OsGSTU18 [ Group] >gb|AAM12302.1|AC091680_3 putative glutathione S-transferase [ >gb|AAM94529.1|putative glutathione S- transferase [ Group] >gb|AAP54753.1|glutathione S- transferase GSTU6, putative, expressed [ >dbj|BAF27040.1|Os10g0529300 [ >gb|EAY79288.1|hypothetical protein OsI_34414 [ >dbj|BAG87628.1|unnamed protein product [ >dbj|BAG97643.1|unnamed protein product [ >dbj|BAG87189.1|unnamed protein product [ 297 298 NP_001065132.1 Os10g0529900 [ LigF Group] >gb|AAM12331.1|AC091680_32 putative glutathione S-transferase [ >gb|AAM94517.1|putative glutathione S- transferase [ Group] >gb|AAP54759.1|glutathione S- transferase GSTU6, putative [ >dbj|BAF27046.1|Os10g0529900 [ >gb|EAZ16763.1|hypothetical protein OsJ_32239 [ Group] 299 300 NP_001105627.1 LOC542632 [ LigF >gb|AAG34835.1|AF244692_1 glutathione S-transferase GST 27 [ 301 302 NP_001152229.1 glutathione S-transferase GSTU6 [ LigF transferase GSTU6 [ 303 304 NP_384409.1 putative glutathione S-transferase LigF protein [ >ref|YP_004550950.1|glutathione S- transferase domain-containing protein [ >emb|CAC41740.1|Putative glutathione S-transferase [ 1021] >gb|AEG06303.1|Glutathione S- transferase domain protein [ >gb|AEG55336.1|Glutathione S- transferase domain protein [ >gb|AEH81005.1|putative glutathione S- transferase protein [ 305 306 XP_001555922.1 hypothetical protein BC1G_05597 LigF [ >gb|EDN24875.1|hypothetical protein BC1G_05597 [ B05.10] 307 308 XP_001805855.1 hypothetical protein SNOG_15716 LigF [ >gb|EAT76811.2|hypothetical protein SNOG_15716 [ SN15] 309 310 XP_002321320.1 predicted protein [ LigF >gb|EEE99635.1|predicted protein [ 311 312 XP_002455784.1 hypothetical protein LigF SORBIDRAFT_03g025210 [ protein SORBIDRAFT_03g025210 [ 313 314 XP_002467606.1 hypothetical protein LigF SORBIDRAFT_01g030860 [ protein SORBIDRAFT_01g030860 [ 315 316 XP_002734706.1 PREDICTED: ganglioside-induced LigF differentiation-associated protein 1-like [ 317 318 XP_002734707.1 PREDICTED: ganglioside-induced LigF differentiation-associated protein 1-like [ 319 320 XP_002737947.1 PREDICTED: Glutathione S-Transferase LigF family member (gst-42)-like [ 321 322 XP_002989538.1 hypothetical protein LigF SELMODRAFT_184606 [ hypothetical protein SELMODRAFT_184606 [ 323 324 XP_003146962.1 glutathione S-transferase domain- LigF containing protein [ >gb|EFO17107.1|glutathione S- transferase domain-containing protein [ 325 326 YP_001187408.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain protein [ 327 328 YP_001239734.1 glutathione S-transferase domain- LigF containing protein [ BTAi1] >gb|ABQ35828.1|putative glutathione S-transferase enzyme with thioredoxin-like domain [ sp. BTAi1] 329 330 YP_001261939.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 331 332 YP_001263066.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 333 334 YP_001414366.1 glutathione S-transferase domain- LigF containing protein [ >gb|ABS64709.1|Glutathione S- transferase domain [ 335 336 YP_001414838.1 maleylacetoacetate isomerase LigF [ >gb|ABS65181.1|maleylacetoacetate isomerase [ 337 338 YP_001684291.1 glutathione S-transferase domain- LigF containing protein [ >gb|ABZ71793.1|Glutathione S- transferase domain [ K31] 339 340 YP_001770584.1 glutathione S-transferase domain- LigF containing protein [ 4-46] >gb|ACA18150.1|Glutathione S- transferase domain [ sp. 4-46] 341 342 YP_002828116.1 predicted glutathione S-transferase LigF protein [ >gb|ACP27363.1|predicted glutathione S-transferase protein [ 343 344 YP_003593122.1 glutathione S-transferase domain- LigF containing protein [ ATCC 21756] >gb|ADG10504.1| Glutathione S-transferase domain protein [ 21756] 345 346 YP_003930867.1 glutathione S-transferase [ LigF Glutathione S-transferase [ 347 348 YP_004434596.1 Glutathione S-transferase domain LigF protein [ 7 + YE-5] >gb|AEE23328.1|Glutathione S- transferase domain protein [ sp. 4H-3-7 + YE-5] 349 350 YP_004620883.1 glutathione S-transferase [ LigF >gb|AEG94864.1|glutathione S- transferase-like protein [ 351 352 YP_067874.1 glutathione S-transferase family protein LigF [ >emb|CAG15111.1|glutathione S- transferase family protein [ 353 354 YP_168502.1 glutathione S-transferase, putative LigF [ >gb|AAV96533.1|glutathione S- transferase, putative [ DSS-3] 355 356 YP_339058.1 glutathione S-transferase LigF [ TAC125] >emb|CAI85615.1|putative glutathione S-transferase [ TAC125] 357 358 YP_612204.1 glutathione S-transferase-like [ LigF sp. TM1040] >gb|ABF62942.1| glutathione S-transferase-like protein [ 359 360 ZP_00954574.1 glutathione S-transferase family protein LigF [ >ref|ZP_00961889.1|glutathione S- transferase family protein [ sp. NAS-14.1] >gb|EAP81303.1| glutathione S-transferase family protein [ >gb|EAP85807.1|glutathione S- transferase family protein [ sp. EE-36] 361 362 ZP_01165363.1 maleylacetoacetate isomerase LigF [ >gb|EAR62715.1|maleylacetoacetate isomerase [ 363 364 ZP_01881157.1 glutathione S-transferase, putative LigF [ >gb|EDM30676.1|glutathione S- transferase, putative [ TM1035] 365 366 ZP_03523367.1 Glutathione S-transferase domain LigF [ 367 368 ZP_04614975.1 Glutathione S-transferase GST-6.0 LigF [ >gb|EEQ00521.1|Glutathione S- transferase GST-6.0 [ ATCC 29473] 369 370 ZP_05125190.1 glutathione S-transferase, N-terminal LigF domain protein [ glutathione S-transferase, N-terminal domain protein [ 371 372 ZP_05786193.1 glutathione S-transferase [ LigF >gb|EEX09309.1|glutathione S- transferase [ ITI-1157] 373 374 ZP_08264339.1 maleylacetoacetate isomerase LigF [ >gb|EGF90974.1|maleylacetoacetate isomerase [ C19] 375 376 ZP_08630058.1 glutathione S-transferase LigF [ >gb|EGP07427.1|glutathione S- transferase [ 377 378 AAG34806.1 glutathione S-transferase GST 16 LigF [ 379 380 AAQ02687.1 tau class GST protein 3 [ LigF hypothetical protein OsI_34421 [ glutathione S-transferase [ 381 382 ADV56298.1 Glutathione S-transferase domain LigF protein [ 383 384 BAB70616.1 glutathione S-transferase [ LigF 385 386 BAJ94610.1 predicted protein [ LigF subsp. 387 388 CAN68934.1 hypothetical protein VITISV_002763 LigF [ 389 390 CBW26056.1 putative glutathione S-transferase LigF [ 391 392 EFW18159.1 glutathione S-transferase [ LigF 393 394 EGF84337.1 hypothetical protein LigF BATDEDRAFT_85058 [ 395 396 NP_001065124.1 Os10g0528400 [ LigF Group] >gb|AAG32472.1|AF309379_1 putative glutathione S-transferase OsGSTU3 [ Group] >gb|AAM12325.1|AC091680_26 putative glutathione S-transferase [ >gb|AAM94544.1|putative glutathione S- transferase [ Group] >gb|AAP54745.1|glutathione S- transferase GSTU6, putative, expressed [ >dbj|BAF27038.1|Os10g0528400 [ >gb|EAZ16756.1|hypothetical protein OsJ_32232 [ Group] 397 398 NP_191835.1 Glutathione S-transferase-like protein LigF [ >emb|CAB83126.1|Glutathione transferase III-like protein [ S-transferase-like protein [ 399 400 NP_717190.1 glutathione S-transferase family protein LigF [ >gb|AAN54634.1|AE015603_8 glutathione S-transferase family protein [ 401 402 NP_769143.1 glutathione S-transferase LigF [ >dbj|BAC47768.1|glutathione S- transferase [ USDA 110] 403 404 NP_900642.1 glutathione transferase zeta 1 LigF [ 12472] >gb|AAQ58646.1|probable glutathione transferase zeta 1 [ 12472] 405 406 XP_001246353.1 glutathione S-transferase [ LigF 407 408 XP_002171087.1 PREDICTED: similar to glutathione S- LigF transferase [ 409 410 XP_002263386.1 PREDICTED: hypothetical protein [ LigF protein product [ 411 412 XP_002263424.1 PREDICTED: hypothetical protein [ LigF protein product [ 413 414 XP_002272099.1 PREDICTED: hypothetical protein LigF isoform 2 [ 415 416 XP_002527848.1 glutathione s-transferase, putative LigF [ glutathione s-transferase, putative [ 417 418 XP_002786341.1 Glutathione S-transferase A, putative LigF [ >gb|EER18137.1|Glutathione S- transferase A, putative [ 419 420 XP_003066789.1 Glutathione S-transferase, putative LigF [ SOWgp] >gb|EER24644.1|Glutathione S-transferase, putative [ 421 422 XP_970577.1 PREDICTED: similar to ganglioside- LigF induced differentiation-associated- protein 1 [ >gb|EFA00477.1|hypothetical protein TcasGA2_TC003336 [ 423 424 YP_001263559.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 425 426 YP_001503032.1 glutathione S-transferase domain- LigF containing protein [ ATCC 700345] >gb|ABV88497.1| Glutathione S-transferase domain [ 427 428 YP_001516981.1 glutathione S-transferase II LigF [ >gb|ABW27665.1|glutathione S- transferase II [ MBIC11017] 429 430 YP_001615392.1 glutathione S-transferase, [ LigF >emb|CAN94912.1|glutathione S- transferase, putative [ 431 432 YP_001685556.1 glutathione S-transferase domain- LigF containing protein [ >gb|ABZ73058.1|Glutathione S- transferase domain [ K31] 433 434 YP_001748054.1 glutathione S-transferase domain- LigF containing protein [ W619] >gb|ACA71685.1|Glutathione S- transferase domain [ 435 436 YP_001804371.1 glutathione S-transferase [ LigF sp. ATCC 51142] >gb|ACB52305.1| glutathione S-transferase [ sp. ATCC 51142] 437 438 YP_002007283.1 glutathione s-transferase protein; gsta LigF protein [ 19424] >emb|CAQ71222.1|putative glutathione S-transferase protein; gstA protein [ 19424] 439 440 YP_002130812.1 glutathione S-transferase LigF [ >gb|ACG78383.1|glutathione S- transferase [ HLK1] 441 442 YP_002220633.1 glutathione S-transferase domain LigF [ 53993] >ref|YP_002426974.1| glutathione S-transferase [ 23270] >gb|ACH84426.1|Glutathione S- transferase domain [ >gb|ACK78121.1|glutathione S- transferase [ 443 444 YP_002482418.1 glutathione S-transferase domain- LigF containing protein [ 7425] >gb|ACL44057.1|Glutathione S- transferase domain protein [ sp. PCC 7425] 445 446 YP_002543747.1 glutathione S-transferase protein LigF [ >gb|ACM25821.1|glutathione S- transferase protein [ 447 448 YP_002974739.1 glutathione S-transferase domain protein LigF [ WSM1325] >gb|ACS55200.1| Glutathione S-transferase domain protein [ 449 450 YP_004065207.1 glutathione transferase LigF [ >gb|ADT70298.1|glutathione transferase [ SM9913] 451 452 YP_004357179.1 glutathione S-transferase [ LigF NFM421] >gb|AEA72175.1|putative glutathione S-transferase [ NFM421] 453 454 YP_004680920.1 glutathione S-transferase [ LigF glutathione S-transferase [ 455 456 YP_468810.1 glutathione S-transferase [ LigF CFN 42] >gb|ABC90083.1|glutathione S- transferase protein [ 42] 457 458 YP_554040.1 glutathione S-transferase [ LigF Glutathione S-transferase [ 459 460 YP_612103.1 glutathione S-transferase-like [ LigF sp. TM1040] >gb|ABF62841.1| glutathione S-transferase-like protein [ 461 462 YP_735310.1 glutathione S-transferase domain- LigF containing protein [ >gb|ABI40253.1|Glutathione S- transferase, N-terminal domain protein [ 463 464 YP_747567.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, C-terminal domain [ 465 466 YP_757227.1 maleylacetoacetate isomerase LigF [ >gb|ABI66289.1|maleylacetoacetate isomerase [ 467 468 YP_868399.1 glutathione S-transferase domain- LigF containing protein [ 3] >gb|ABK46993.1|Glutathione S- transferase, N-terminal domain protein [ 469 470 YP_870498.1 glutathione S-transferase domain- LigF containing protein [ 3] >gb|ABK49092.1|Glutathione S- transferase, N-terminal domain protein [ 471 472 YP_957711.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 473 474 YP_957873.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 475 476 YP_960793.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase, N-terminal domain [ 477 478 YP_963418.1 glutathione S-transferase domain- LigF containing protein [ 18-1] >gb|ABM24864.1|Glutathione S- transferase, N-terminal domain [ 479 480 ZP_01000028.1 glutathione S-transferase family protein LigF [ >gb|EAQ02499.1|glutathione S- transferase family protein [ 481 482 ZP_01459182.1 glutathione S-transferase [ LigF >ref|YP_003956548.1|glutathione s- transferase [ DW4/3-1] >gb|EAU70026.1|glutathione S-transferase [ DW4/3-1] >gb|ADO74721.1|Glutathione S-transferase [ DW4/3-1] 483 484 ZP_02886014.1 Glutathione S-transferase domain LigF [ >gb|EDT08402.1|Glutathione S- transferase domain [ 485 486 ZP_04713937.1 Glutathione S-transferase [ LigF 487 488 ZP_05075049.1 Glutathione S-transferase, N-terminal LigF domain protein [ Glutathione S-transferase, N-terminal domain protein [ 489 490 ZP_05101428.1 glutathione S-transferase protein LigF [ >gb|EEB85730.1|glutathione S- transferase protein [ GAI101] 491 492 ZP_05124402.1 glutathione S-transferase LigF [ >gb|EEE39034.1|glutathione S- transferase [ 493 494 ZP_05926645.1 glutathione S-transferase [ LigF RC341] >gb|EEX64947.1|glutathione S- transferase [ 495 496 ZP_06308936.1 Glutathione S-transferase-like protein LigF [ >gb|EFA69058.1|Glutathione S- transferase-like protein [ 497 498 ZP_06838829.1 Glutathione S-transferase domain LigF protein [ >gb|EFG73275.1|Glutathione S- transferase domain protein [ sp. Ch1-1] 499 500 ZP_08104209.1 glutathione S-transferase III [ LigF >gb|EGA68654.1|glutathione S- transferase III [ 21326] 501 502 ZP_08275708.1 Glutathione S-transferase LigF [ IMCC9480] >gb|EGF30821.1| Glutathione S-transferase [ IMCC9480] 503 504 ZP_08409706.1 glutathione S-transferase LigF [ ANT/505] >gb|EGI73123.1|glutathione S transferase [ 505 506 ZP_08565123.1 glutathione S-transferase [ LigF sp. HN-41] >gb|EGM70872.1| glutathione S-transferase [ sp. HN-41] 507 508 CAA12269.1 ORF 3 [ LigF 509 510 CAC94002.1 glutathione transferase [ LigF 511 512 NP_967294.1 maleylacetoacetate isomerase/ LigF glutathione S-transferase [ >emb|CAE77948.1|maleylacetoacetate isomerase/glutathione S-transferase [ 513 514 P30347.1 RecName: Full = Protein ligF LigF >dbj|BAA02031.1|beta-etherase [ >prf||1914145A beta etherase 515 516 XP_002964271.1 hypothetical protein LigF SELMODRAFT_142654 [ hypothetical protein SELMODRAFT_142654 [ 517 518 YP_001021314.1 glutathione S-transferase-like protein LigF [ >gb|ABM95079.1|glutathione S- transferase-like protein [ 519 520 YP_001862387.1 glutathione S-transferase domain- LigF containing protein [ Glutathione S-transferase domain [ 521 522 YP_002130750.1 glutathione S-transferase LigF [ >gb|ACG78321.1|glutathione S- transferase [ HLK1] 523 524 YP_002825255.1 glutathione S-transferase [ LigF glutathione S-transferase [ 525 526 YP_003908670.1 glutathione S-transferase domain- LigF containing protein [ CCGE1003] >gb|ADN59379.1| Glutathione S-transferase domain protein [ 527 528 YP_004154430.1 glutathione s-transferase domain- LigF containing protein [ EPS] >gb|ADU36319.1|Glutathione S- transferase domain [ 529 530 YP_004229981.1 glutathione S-transferase domain- LigF containing protein [ CCGE1001] >gb|ADX56921.1| Glutathione S-transferase domain protein [ 531 532 YP_004302768.1 glutathione S-transferase, N-terminal LigF domain protein [ SL003B-26A1] >gb|ADZ69468.1| Glutathione S-transferase, N-terminal domain protein [ SL003B-26A1] 533 534 YP_004533892.1 glutathione S-transferase-like protein LigF [ >emb|CCA92074.1|glutathione S- transferase-like [ PP1Y] 535 536 YP_004533893.1 glutathione S-transferase-like protein LigF [ >emb|CCA92075.1|glutathione S- transferase-like [ PP1Y] 537 538 YP_004533905.1 glutathione S-transferase-like protein LigF [ >emb|CCA92087.1|glutathione S- transferase-like [ PP1Y] 539 540 YP_497364.1 glutathione S-transferase-like protein LigF [ 12444] >gb|ABD26530.1|glutathione S- transferase-like protein [ 12444] 541 542 YP_498135.1 glutathione S-transferase-like protein LigF [ 12444] >gb|ABD27301.1|glutathione S- transferase-like protein [ 12444] 543 544 YP_498142.1 glutathione S-transferase-like protein LigF [ 12444] >gb|ABD27308.1|glutathione S- transferase-like protein [ 12444] 545 546 YP_498143.1 glutathione S-transferase-like protein LigF [ 12444] >gb|ABD27309.1|glutathione S- transferase-like protein [ 12444] 547 548 ZP_00952372.1 maleylacetoacetate isomerase LigF [ >gb|EAP91525.1|maleylacetoacetate isomerase [ HTCC2633] 549 550 ZP_00959702.1 glutathione S-transferase, putative LigF [ >gb|EAP78164.1|glutathione S- transferase, putative [ 551 552 ZP_01034543.1 glutathione S-transferase, putative LigF [ glutathione S-transferase, putative [ 553 554 ZP_01057917.1 glutathione S-transferase, putative LigF [ >gb|EAQ44057.1|glutathione S- transferase, putative [ MED193] 555 556 ZP_01223510.1 glutathione S-transferase [marine LigF gamma proteobacterium HTCC2207] >gb|EAS48069.1|glutathione S- transferase [marine gamma proteobacterium HTCC2207] 557 558 ZP_01753989.1 glutathione S-transferase, putative LigF [ >gb|EBA17470.1|glutathione S- transferase, putative [ SK209-2-6] 559 560 ZP_02146800.1 glutathione S-transferase-like protein LigF [ >gb|EDQ11817.1|glutathione S- transferase-like protein [ 561 562 ZP_02150992.1 glutathione S-transferase, putative LigF [ >gb|EDQ07480.1|glutathione S- transferase, putative [ 563 564 ZP_05073592.1 glutathione S-transferase 2 LigF [ >gb|EDZ41252.1|glutathione S- transferase 2 [ 565 566 ZP_05077451.1 glutathione S-transferase LigF [ >gb|EDZ45430.1|glutathione S- transferase [ Y4I] 567 568 ZP_05087035.1 Glutathione S-transferase, N-terminal LigF domain protein [ >gb|EEA92555.1|Glutathione S- transferase, N-terminal domain protein [ 569 570 ZP_05089424.1 glutathione S-transferase [ LigF R11] >gb|EEB71116.1|glutathione S- transferase [ 571 572 ZP_05126316.1 protein LigF [gamma proteobacterium LigF NOR5-3] >gb|EED32863.1|protein LigF [gamma proteobacterium NOR5-3] 573 574 ZP_05126823.1 maleylacetoacetate isomerase [gamma LigF proteobacterium NOR5-3] >gb|EED33370.1|maleylacetoacetate isomerase [gamma proteobacterium NOR5-3] 575 576 ZP_05741946.1 glutathione S-transferase [ LigF TrichCH4B] >gb|EEW58747.1| glutathione S-transferase [ TrichCH4B] indicates data missing or illegible when filed
577 578 BAA77216.1 glutathione S-transferase homolog LigG [ 579 580 YP_004533907. glutathione S-transferase family protein LigG [ >emb|CCA92089.1|glutathione S- transferase family protein 581 582 YP_314808.1 glutathione S-transferase family protein LigG [ >gb|AAZ97003.1|glutathione S- transferase family protein [ 583 584 YP_167289.1 glutathione S-transferase family protein LigG [ >gb|AAV95330.1|glutathione S- transferase family protein [ 585 586 ZP_01011943.1 glutathione S-transferase family protein LigG [ >gb|EAQ14262.1|glutathione S- transferase family protein 587 588 YP_002540613. glutathione S-transferase protein LigG [ >gb|ACM29018.1|glutathione S- 589 590 CAJ81793.1 Novel glutathione S-transferase omega LigG protein [ 591 592 NP_001005086. glutathione S-transferase omega 2 LigG [ >gb|AAH77010.1|MGC89704 protein 593 594 XP_624501.1 PREDICTED: glutathione S-transferase LigG omega-1 [ 595 596 XP_002029736. GM24932 [ LigG >gb|EDW40722.1|GM24932 [ 597 598 NP_001002621. hypothetical protein LOC436894 [ LigG 599 600 XP_002431486. predicted protein [ LigG protein [ 601 602 ADD18952.1 glutathione S-transferase [ LigG 603 604 XP_002093444. GE21298 [ LigG >gb|EDW93156.1|GE21298 [ 605 606 XP_002068563. GK20540 [ LigG >gb|EDW79549.1|GK20540 [ 607 608 NP_001165912. glutathione S-transferase O1 [ LigG 609 610 CAM34501.1 putative glutathione S-transferase LigG [ 611 612 XP_421747.1 PREDICTED: similar to glutathione-S- LigG transferase homolog isoform 2 [ 613 614 XP_002135069. GA23449 [ LigG GA23449 [ 615 616 NP_034492.1 glutathione S-transferase omega-1 [ LigG >sp|O09131.2|GSTO1_MOUSE RecName: Full = Glutathione S-transferase omega-1; Short = GSTO-1; AltName: Full = p28 >gb|AAB70110.1|glutathione-S- transferase homolog [ >dbj|BAC25667.1|unnamed protein product [ >gb|AAH85165.1|Glutathione S- transferase omega 1 [ >dbj|BAE27469.1|unnamed protein product [ 617 618 ZP_03524422.1 glutathione S-transferase domain- LigG containing protein [ 619 620 NP_729388.1 CG6673, isoform A [ LigG isoform A [ >gb|ACZ02426.1|glutathione S- 621 622 ZP_08179398.1 glutathione S-transferase [ LigG >gb|EGD08414.1|glutathione S- 623 624 XP_003218563. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 1 [ 625 626 ABC86304.1 IP16242p [ LigG 627 628 XP_002026470. GL15567 [ LigG >gb|EDW33419.1|GL15567 [ 629 630 NP_001108461. glutathione S-transferase omega 4 LigG [ glutathione S-transferase 13 [ 631 632 NP_999215.1 glutathione S-transferase omega-1 [ LigG PREDICTED: glutathione S-transferase omega-1-like [ >sp|Q9N1F5.2|GSTO1_PIG RecName: Full = Glutathione S-transferase omega-1; Short = GSTO-1; AltName: Full = Glutathione-dependent dehydroascorbate reductase 633 634 NP_001007373. hypothetical protein LOC492500 [ LigG [ 635 636 YP_001566654. glutathione S-transferase domain- LigG containing protein [ SPH-1] >gb|ABX38269.1|Glutathione S- transferase domain [ 637 638 ADY80021.1 omega class glutathione S-transferase LigG [ 639 640 YP_001329158. glutathione S-transferase domain- LigG containing protein [ Glutathione S-transferase domain 641 642 NP_001084924. hypothetical protein LOC431979 LigG [ MGC82327 protein [ 643 644 XP_003396907. PREDICTED: glutathione S-transferase LigG omega-1-like [ 645 646 XP_001368758. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 1 [ 647 648 XP_001983981. GH16193 [ LigG >gb|EDV96329.1|GH16193 [ 649 650 ADK66966.1 glutathione s-transferase [ LigG 651 652 XP_001232808. PREDICTED: similar to glutathione-S- LigG transferase homolog isoform 1 [ 653 654 XP_002068565. GK20354 [ LigG >gb|EDW79551.1|GK20354 [ 655 656 YP_001611239. hypothetical protein sce0602 [ LigG gst2 [ 657 658 XP_001499427. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 1 [ 659 660 NP_384409.1 putative glutathione S-transferase protein LigG [ >ref|YP_004550950.1|glutathione S- transferase domain-containing protein [ >emb|CAC41740.1|Putative glutathione S-transferase [ 1021] >gb|AEG06303.1|Glutathione S- transferase domain protein [ >gb|AEG55336.1|Glutathione S- 661 662 CAG05035.1 unnamed protein product [ LigG 663 664 ZP_01365353.1 hypothetical protein PaerPA_01002475 LigG [ >ref|YP_002440902.1| maleylacetoacetate isomerase [ >ref|ZP_04928412.1|maleylacetoacetate isomerase [ C3719] >gb|EAZ52531.1| maleylacetoacetate isomerase [ >emb|CAW28043.1|maleylacetoacetate isomerase [ 665 666 YP_001348642. maleylacetoacetate isomerase LigG [ >gb|ABR84080.1|maleylacetoacetate isomerase [ 667 668 ZP_04933765.1 maleylacetoacetate isomerase LigG [ >gb|EAZ57884.1|maleylacetoacetate isomerase [ 669 670 NP_250697.1 maleylacetoacetate isomerase LigG [ >sp|P57109.1|MAAI_PSEAE RecName: Full = Maleylacetoacetate isomerase; Short = MAAI >gb|AAG05395.1|AE004627_3 671 672 EFN59352.1 hypothetical protein LigG CHLNCDRAFT_137800 [ 673 674 YP_002945584. glutathione S-transferase domain- LigG containing protein [ S110] >gb|ACS20318.1|Glutathione S- transferase domain protein [ 675 676 XP_002197460. PREDICTED: glutathione S-transferase LigG omega 1 [ 677 678 XP_001971643. GG15075 [ LigG >gb|EDV50669.1|GG15075 [ 679 680 NP_001155757. glutathione S-transferase omega-1-like LigG [ ACYPI008340 [ 681 682 XP_002026468. GL15565 [ LigG >gb|EDW33417.1|GL15565 [ 683 684 XP_001353820. GA19760 [ LigG GA19760 [ 685 686 YP_791232.1 maleylacetoacetate isomerase LigG [ >gb|ABJ11194.1|maleylacetoacetate isomerase [ 687 688 ZP_06879058.1 maleylacetoacetate isomerase LigG [ >ref|ZP_07797003.1|maleylacetoacetate isomerase [ 39016] >gb|EFQ42099.1| maleylacetoacetate isomerase [ >gb|EGM14661.1|maleylacetoacetate 689 690 EFZ22366.1 hypothetical protein SINV_14968 LigG 691 692 ZP_03527925.1 Glutathione S-transferase domain LigG [ 693 694 ABD77536.1 hypothetical protein [ LigG 695 696 XP_002756473. PREDICTED: glutathione S-transferase LigG omega-1-like [ 697 698 XP_001636996. predicted protein [ LigG >gb|EDO44933.1|predicted protein [ 699 700 YP_467831.1 glutathione S-transferase [ LigG CFN 42] >gb|ABC89104.1|glutathione S- transferase protein [ 701 702 NP_103005.1 glutathione-S-transferase [ LigG glutathione-S-transferase [ 703 704 ADY47623.1 Glutathione transferase omega-1 [ LigG 705 706 BAG36430.1 unnamed protein product [ LigG 707 708 XP_002718774. PREDICTED: glutathione-S-transferase LigG omega 1 [ 709 710 3LFL_A Chain A, Crystal Structure Of Human LigG Glutathione Transferase Omega 1, Delta 155 >pdb|3LFL|B Chain B, Crystal Structure Of Human Glutathione Transferase Omega 1, Delta 155 >pdb|3LFL|C Chain C, Crystal Structure 711 712 XP_002805857. PREDICTED: glutathione S-transferase LigG omega-1-like [ >gb|ABO21635.1|glutathione S- 713 714 NP_001007603. glutathione S-transferase omega-1 LigG [ Glutathione S-transferase omega 1 [ glutathione S-transferase omega 1, 715 716 XP_535007.1 PREDICTED: similar to glutathione-S- LigG transferase omega 1 isoform 1 [ 717 718 NP_004823.1 glutathione S-transferase omega-1 LigG isoform 1 [ >sp|P78417.2|GSTO1_HUMAN RecName: Full = Glutathione S-transferase omega-1; Short = GSTO-1 >pdb|1EEM|A Chain A, Glutathione Transferase From >gb|AAF73376.1|AF212303_1 glutathione transferase omega [ >gb|AAB70109.1|glutathione-S- transferase homolog [ >gb|AAH00127.1|Glutathione S- transferase omega 1 [ >gb|AAV68046.1|glutathione S- transferase omega 1-1 [ 719 720 XP_002758417. PREDICTED: glutathione S-transferase LigG omega-1-like [ 721 722 XP_003218564. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 2 [ 723 724 EFN62827.1 Glutathione transferase omega-1 LigG [ 725 726 XP_508020.3 PREDICTED: glutathione S-transferase LigG omega-1 isoform 3 [ 727 728 CAD97673.1 hypothetical protein [ LigG 729 730 BAJ20927.1 glutathione S-transferase omega 1 LigG [synthetic construct] 731 732 ACR43779.1 glutathione S-transferase [ LigG 733 734 Q9Z339.2 RecName: Full = Glutathione S-transferase LigG omega-1; Short = GSTO-1; AltName: Full = Glutathione-dependent dehydroascorbate reductase >gb|ACI32122.1|glutathione S- 735 736 XP_001956909. GF10159 [ LigG >gb|EDV39715.1|GF10159 [ 737 738 XP_001742278. hypothetical protein [ LigG MX1] >gb|EDQ92516.1|predicted protein [ 739 740 XP_002821176. PREDICTED: glutathione S-transferase LigG omega-1-like [ 741 742 XP_003255483. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 1 [ 743 744 YP_325490.1 glutathione S-transferase-like protein LigG [ >gb|ABA24595.1|Glutathione S- transferase-like protein [ 745 746 XP_003208190. PREDICTED: glutathione S-transferase LigG omega-1-like [ 747 748 XP_002068562. GK20539 [ LigG >gb|EDW79548.1|GK20539 [ 749 750 XP_001956911. GF10161 [ LigG >gb|EDV39717.1|GF10161 [ 751 752 ABV24048.1 gluthathione S-transferase omega LigG [ 753 754 ZP_05086262.1 putative glutathione S-transferase protein LigG [ >gb|EEA93528.1|putative glutathione S- transferase protein [ 755 756 AAI28951.1 LOC100037104 protein [ LigG 757 758 XP_001956910. GF10160 [ LigG >gb|EDV39716.1|GF10160 [ 759 760 NP_001099052. glutathione S-transferase omega 2 LigG [ LOC100037104 protein [ 761 762 ZP_03503214.1 Glutathione S-transferase domain LigG [ 763 764 XP_002046961. GJ12198 [ LigG >gb|EDW69303.1|GJ12198 [ 765 766 XP_001956912. GF24331 [ LigG >gb|EDV39718.1|GF24331 [ 767 768 XP_001368790. PREDICTED: glutathione S-transferase LigG omega-1-like isoform 1 [ 769 770 ZP_06308936.1 Glutathione S-transferase-like protein LigG [ >gb|EFA69058.1|Glutathione S- transferase-like protein 771 772 ABJ15788.1 glutathione S-transferase omega 1 LigG [ omega-class glutathione S-transferase 773 774 NP_001037406. glutathione S-transferase omega 2 LigG [ glutathione S-transferase 6 [ 775 776 NP_001040131. glutathione S-transferase omega 1 LigG [ glutathione S-transferase omega 1 indicates data missing or illegible when filed
777 778 Q01198.1 RecName: Full = C alpha-dehydrogenase LigD >dbj|BAA02030.1|C alpha-dehydrogenase [ >dbj|BAA01953.1|C alpha-dehydrogenase [ >gb|AAC60455.1|C alpha-dehydrogenase [ 779 780 YP_495487.1 short-chain dehydrogenase/reductase SDR LigD [ 12444] >gb|ABD24653.1|short-chain dehydrogenase/reductase SDR [ 12444] 781 782 YP_004533898.1 short-chain dehydrogenase/reductase SDR LigD [ >emb|CCA92080.1|short-chain dehydrogenase/reductase SDR [ 783 784 BAH56687.1 Calpha-dehydrogenase [ LigD SYK-6] 785 786 YP_004533921.1 short-chain dehydrogenase/reductase SDR LigD [ >emb|CCA92103.1|short-chain dehydrogenase/reductase SDR [ 787 788 YP_496072.1 short-chain dehydrogenase/reductase SDR LigD [ 12444] >gb|ABD25238.1|short-chain dehydrogenase/reductase SDR [ 12444] 789 790 3IOY_A Chain A, Structure Of Putative Short-Chain LigD Dehydrogenase (Saro_0793) From >pdb|3IOY|B Chain B, Structure Of Putative Short-Chain Dehydrogenase (Saro_0793) From 791 792 YP_496073.1 short-chain dehydrogenase/reductase SDR LigD [ 12444] >gb|ABD25239.1|short-chain dehydrogenase/reductase SDR [ 12444] 793 794 BAH56683.1 Calpha-dehydrogenase [ LigD SYK-6] 795 796 YP_004533920.1 short-chain dehydrogenase/reductase SDR LigD [ >emb|CCA92102.1|short-chain dehydrogenase/reductase SDR [ 797 798 YP_003592832.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ADG10214.1|short-chain dehydrogenase/reductase SDR [ 799 800 YP_495984.1 short-chain dehydrogenase/reductase SDR LigD [ 12444] >gb|ABD25150.1|short-chain dehydrogenase/reductase SDR [ 12444] 801 802 YP_497149.1 short-chain dehydrogenase/reductase SDR LigD [ 12444] >gb|ABD26315.1|short-chain dehydrogenase/reductase SDR [ 12444] 803 804 YP_003592830.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ADG10212.1|short-chain dehydrogenase/reductase SDR [ 805 806 YP_001260886.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABQ66748.1|short-chain dehydrogenase/reductase SDR [ 807 808 YP_001413979.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABS64322.1|short-chain dehydrogenase/reductase SDR [ 809 810 YP_001412300.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABS62643.1|short-chain dehydrogenase/reductase SDR [ 811 812 YP_001412299.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABS62642.1|short-chain dehydrogenase/reductase SDR [ 813 814 BAH56685.1 Calpha-dehydrogenase [ LigD SYK-6] 815 816 NP_959644.1 short chain dehydrogenase [ LigD >ref|YP_880159.1|short chain dehydrogenase [ >ref|ZP_05215302.1|short chain dehydrogenase [ subsp. >gb|AAS03027.1|hypothetical protein MAP_0710c [ short chain dehydrogenase [ alcohol dehydrogenase [ 817 818 ZP_08717023.1 short chain dehydrogenase [ LigD short chain dehydrogenase [ 819 820 ZP_05127447.1 oxidoreductase, short chain LigD dehydrogenase/reductase family protein [gamma proteobacterium NOR5-3] >gb|EED33994.1|oxidoreductase, short chain dehydrogenase/reductase family protein [gamma proteobacterium NOR5-3] 821 822 YP_004555419.1 Estradiol 17-beta-dehydrogenase LigD [ >gb|AEG50913.1|Estradiol 17-beta- dehydrogenase [ 823 824 YP_004230838.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ADX57778.1|short-chain dehydrogenase/reductase SDR [ 825 826 YP_004284589.1 putative oxidoreductase [ LigD putative oxidoreductase [ 827 828 YP_001235233.1 hypothetical protein Acry_2115 [ LigD dehydrogenase/reductase SDR [ 829 830 ZP_01617820.1 hypothetical protein GP2143_09415 [marine LigD gamma proteobacterium HTCC2143] >gb|EAW30413.1|hypothetical protein GP2143_09415 [marine gamma proteobacterium HTCC2143] 831 832 ZP_08629833.1 short-chain dehydrogenase/reductase LigD [Bradyrhizobiaceae bacterium SG-6C] >gb|EGP07476.1|short-chain dehydrogenase/reductase [Bradyrhizobiaceae bacterium SG-6C] 833 834 YP_001853014.1 short-chain type dehydrogenase/reductase LigD [ >gb|ACC43159.1|short-chain type dehydrogenase/reductase [ 835 836 YP_004754457.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|AEK63634.1|short-chain dehydrogenase/reductase SDR [ 837 838 ZP_05129129.1 short-chain dehydrogenase/reductase SDR LigD [gamma proteobacterium NOR5-3] >gb|EED30944.1|short-chain dehydrogenase/reductase SDR [gamma proteobacterium NOR5-3] 839 840 ZP_05223648.1 short chain dehydrogenase [ LigD 841 842 YP_004555383.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|AEG50877.1|short-chain dehydrogenase/reductase SDR [ 843 844 YP_976997.1 short chain dehydrogenase [ LigD >ref|YP_002643932.1|short-chain dehydrogenase [ str. Tokyo 172] >ref|ZP_06432004.1|short- chain type dehydrogenase/reductase [ >ref|ZP_06449040.1|short-chain type dehydrogenase/reductase [ chain type dehydrogenase/reductase [ >ref|ZP_06508748.1|short-chain type dehydrogenase/reductase [ chain dehydrogenase [ short-chain type dehydrogenase/reductase [ >emb|CAL70889.1|Putative short-chain type dehydrogenase/reductase [ >dbj|BAH25164.1|short-chain dehydrogenase [ str. Tokyo 172] >gb|EFD12419.1|short- chain type dehydrogenase/reductase [ >gb|EFD42482.1|short-chain type dehydrogenase/reductase [ 845 846 ZP_01101659.1 Short-chain dehydrogenase/reductase SDR LigD [ >gb|EAQ98875.1|Short-chain dehydrogenase/reductase SDR [ 847 848 ZP_01615364.1 short chain dehydrogenase [marine gamma LigD proteobacterium HTCC2143] >gb|EAW32447.1|short chain dehydrogenase [marine gamma proteobacterium HTCC2143] 849 850 ZP_06436160.1 short-chain type dehydrogenase/reductase LigD [ >gb|EFD16575.1|short-chain type dehydrogenase/reductase [ 851 852 NP_854532.1 short chain dehydrogenase [ LigD PUTATIVE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE [ 853 854 YP_004744317.1 putative short-chain type LigD dehydrogenase/reductase [ >emb|CCC43191.1|putative short-chain type dehydrogenase/reductase [ 855 856 YP_003947586.1 short-chain dehydrogenase/reductase sdr LigD [ >gb|ADO57345.1|Short-chain dehydrogenase/reductase SDR [ 857 858 YP_003951191.1 short-chain dehydrogenase/reductase LigD [ >gb|ADO69364.1|Short-chain dehydrogenase/reductase SDR [ 859 860 YP_583994.1 hypothetical protein Rmet_1846 LigD [ >gb|ABF08725.1|conserved hypothetical protein [ 861 862 NP_215366.1 short chain dehydrogenase [ LigD short chain dehydrogenase [ short chain dehydrogenase [ chain dehydrogenase [ short-chain type dehydrogenase/reductase [ >ref|ZP_04924487.1|hypothetical protein TBCG_00842 [ C] >ref|ZP_04979832.1|hypothetical short- chain type dehydrogenase/reductase [ >ref|ZP_05140274.1|short chain dehydrogenase [ ‘98-R604 INH-RIF-EM’] >ref|ZP_06444578.1|short-chain type dehydrogenase/reductase [ short chain dehydrogenase [ short chain dehydrogenase [ chain type dehydrogenase/reductase [ >ref|ZP_06802023.1|short chain dehydrogenase [ 210] >ref|ZP_06951148.1|short chain 863 864 YP_904525.1 short chain dehydrogenase [ LigD chain type dehydrogenase/reductase [ 865 866 ZP_06851131.1 short-chain dehydrogenase/reductase family LigD oxidoreductase [ >gb|EFG75472.1|short-chain dehydrogenase/reductase family oxidoreductase [ 867 868 YP_003871369.1 3-oxoacyl-[acyl-carrier-protein] reductase (3- LigD ketoacyl-acyl carrier protein reductase) [ >gb|ADM70831.1|3-oxoacyl-[acyl-carrier- protein] reductase (3-ketoacyl-acyl carrier protein reductase) [ E681] 869 870 ZP_05094873.1 oxidoreductase, short chain LigD dehydrogenase/reductase family [marine gamma proteobacterium HTCC2148] >gb|EEB78920.1|oxidoreductase, short chain dehydrogenase/reductase family [marine gamma proteobacterium HTCC2148] 871 872 ZP_01224235.1 probable oxidoreductase dehydrogenase LigD signal peptide protein [marine gamma proteobacterium HTCC2207] >gb|EAS47242.1|probable oxidoreductase dehydrogenase signal peptide protein [marine gamma proteobacterium HTCC2207] 873 874 YP_634033.1 short chain dehydrogenase [ LigD oxidoreductase, short chain dehydrogenase/reductase family [ 875 876 ABL97174.1 short-chain dehydrogenase/reductase LigD [uncultured marine bacterium EB0_49D07] 877 878 NP_335301.1 short chain dehydrogenase [ LigD >ref|ZP_07413312.2|short-chain type dehydrogenase/reductase [ >ref|ZP_07668817.1|short-chain type dehydrogenase/reductase [ >ref|ZP_07669069.1|short-chain type dehydrogenase/reductase [ oxidoreductase, short-chain dehydrogenase/reductase family [ >gb|EFO75870.1|short-chain type dehydrogenase/reductase [ short-chain type dehydrogenase/reductase [ >gb|EFP52129.1|short-chain type dehydrogenase/reductase [ 879 880 ZP_01627272.1 short-chain dehydrogenase/reductase SDR LigD [marine gamma proteobacterium HTCC2080] >gb|EAW39988.1|short-chain dehydrogenase/reductase SDR [marine gamma proteobacterium HTCC2080] 881 882 YP_002774647.1 short chain dehydrogenase [ LigD probable short chain dehydrogenase [ 883 884 YP_004533909.1 short-chain dehydrogenase/reductase SDR LigD [ >emb|CCA92091.1|short-chain dehydrogenase/reductase SDR [ 885 886 ZP_04751842.1 short chain dehydrogenase [ LigD 887 888 ZP_08271356.1 short-chain dehydrogenase/reductase SDR LigD [gamma proteobacterium IMCC3088] >gb|EGG29327.1|short-chain dehydrogenase/reductase SDR [gamma proteobacterium IMCC3088] 889 890 YP_004666338.1 short chain dehydrogenase [ LigD dehydrogenase [ 891 892 YP_001704647.1 putative short chain LigD dehydrogenase/reductase [ >emb|CAM63993.1|Putative short chain dehydrogenase/reductase [ 893 894 ZP_07283949.1 cis-2,3-dihydrobiphenyl-2,3-diol LigD dehydrogenase [ >gb|EFL12318.1|cis-2,3-dihydrobiphenyl- 2,3-diol dehydrogenase [ AA4] 895 896 YP_002005492.1 hypothetical protein RALTA_A1476 LigD [ >emb|CAQ69425.1|putative OXIDOREDUCTASE DEHYDROGENASE [ 897 898 YP_003543705.1 SDR-family protein [ LigD UT26S] >dbj|BAI95093.1|SDR-family protein [ 899 900 YP_759628.1 short chain dehydrogenase/reductase family LigD oxidoreductase [ ATCC 15444] >gb|ABI75402.1| oxidoreductase, short chain dehydrogenase/reductase family [ 901 902 ZP_03543905.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|EED68191.1|short-chain dehydrogenase/reductase SDR [ 903 904 YP_003487191.1 hypothetical protein SCAB_14801 LigD [ >emb|CBG68626.1|putative PROBABLE SHORT-CHAIN TYPE DEHYDROGENASE/REDUCTASE [ 905 906 AEG69105.1 3-oxoacyl-[acyl-carrier-protein] reductase LigD [ 907 908 YP_003841993.1 short-chain dehydrogenase/reductase SDR LigD [ >ref|ZP_07630916.1|short-chain dehydrogenase/reductase SDR [ chain dehydrogenase/reductase SDR [ 909 910 YP_001899010.1 hypothetical protein Rpic_1437 [ LigD dehydrogenase/reductase SDR [ 911 912 ZP_07965490.1 short chain dehydrogenase [ LigD short chain dehydrogenase [ 913 914 NP_250228.1 short-chain dehydrogenase [ LigD hypothetical protein PaerPA_01001998 [ >ref|YP_002441374.1|putative short-chain dehydrogenase [ LESB58] >ref|ZP_04933207.1|hypothetical protein PA2G_00514 [ >gb|AAG04926.1|AE004582_4 probable short-chain dehydrogenase [ hypothetical protein PA2G_00514 [ >emb|CAW28518.1|probable short-chain dehydrogenase [ LESB58] >gb|EGM16253.1|putative short- chain dehydrogenase [ 915 916 YP_001020978.1 hypothetical protein Mpe_A1784 LigD [ >gb|ABM94743.1|putative oxidoreductase dehydrogenase signal peptide protein [ 917 918 YP_003745682.1 oxidoreductase dehydrogenase [ LigD >emb|CBJ43067.1|putative oxidoreductase dehydrogenase [ CFBP2957] 919 920 ADD82954.1 BatM [ LigD 921 922 ZP_06846575.1 short-chain dehydrogenase/reductase family LigD oxidoreductase [ >gb|EFG80090.1|short-chain dehydrogenase/reductase family oxidoreductase [ 923 924 ZP_05041687.1 oxidoreductase, short chain LigD dehydrogenase/reductase family [ oxidoreductase, short chain dehydrogenase/reductase family [ 925 926 YP_726036.1 hypothetical protein H16_A1536 [ LigD conserved hypothetical protein [ 927 928 ZP_08275744.1 Hypothetical Protein IMCC9480_775 LigD [Oxalobacteraceae bacterium IMCC9480] >gb|EGF30787.1|Hypothetical Protein IMCC9480_775 [Oxalobacteraceae bacterium IMCC9480] 929 930 YP_791716.1 putative short-chain dehydrogenase LigD [ >ref|ZP_06879570.1|putative short-chain dehydrogenase [ PAb1] >ref|ZP_07792770.1|putative short- chain dehydrogenase [ putative short-chain dehydrogenase [ >gb|EFQ37866.1|putative short-chain dehydrogenase [ 39016] >gb|EGM15719.1|putative short- chain dehydrogenase [ 931 932 CAQ35702.1 oxidoreductase dehydrogenase protein LigD [ 933 934 ZP_07966320.1 short chain dehydrogenase [ LigD short chain dehydrogenase [ 935 936 YP_002981437.1 hypothetical protein Rpic12D_1478 LigD [ short-chain dehydrogenase/reductase SDR [ 937 938 YP_004685391.1 C alpha-dehydrogenase LigD [ LigD dehydrogenase LigD [ 1] 939 940 ZP_00945631.1 Hypothetical Protein RRSL_01608 LigD [ >ref|YP_002259522.1|oxidoreductase dehydrogenase protein [ Hypothetical Protein RRSL_01608 [ >emb|CAQ61454.1|oxidoreductase dehydrogenase protein [ 941 942 NP_519890.1 hypothetical protein RSc1769 [ LigD >emb|CAD15471.1|probable oxidoreductase dehydrogenase signal peptide protein [ GMI1000] 943 944 ZP_07676733.1 oxidoreductase dehydrogenase signal LigD peptide protein [ >gb|EFP64736.1|oxidoreductase dehydrogenase signal peptide protein [ 945 946 YP_003752456.1 oxidoreductase dehydrogenase [ LigD putative oxidoreductase dehydrogenase [ 947 948 YP_004533099.1 hypothetical protein PP1Y_AT3242 LigD [ >emb|CCA91281.1|conserved hypothetical protein [ 949 950 YP_001564386.1 hypothetical protein Daci_3363 [ LigD chain dehydrogenase/reductase SDR [ 951 952 YP_004488753.1 short-chain dehydrogenase/reductase SDR LigD [ chain dehydrogenase/reductase SDR [ 953 954 YP_001188109.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABP85377.1|short-chain dehydrogenase/reductase SDR [ 955 956 ADP99633.1 short-chain dehydrogenase/reductase SDR LigD [ 957 958 YP_693638.1 short-chain dehydrogenase/reductase family LigD protein [ >emb|CAL17366.1|short-chain dehydrogenase/reductase family [ 959 960 YP_585740.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ABF10471.1|short-chain dehydrogenase/reductase SDR [ 961 962 YP_003277769.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|ACY32473.1|short-chain dehydrogenase/reductase SDR [ 963 964 ZP_08406457.1 hypothetical protein HGR_11311 LigD [ >gb|EGI76405.1|hypothetical protein HGR_11311 [ 19624] 965 966 YP_003842521.1 short-chain dehydrogenase/reductase SDR LigD [ >ref|ZP_07632312.1|short-chain dehydrogenase/reductase SDR [ chain dehydrogenase/reductase SDR [ 967 968 ZP_07043693.1 short-chain dehydrogenase/reductase SDR LigD [ >gb|EFI62855.1|short-chain dehydrogenase/reductase SDR [ 969 970 YP_295629.1 hypothetical protein Reut_A1415 [ LigD chain dehydrogenase/reductase SDR [ 971 972 CBJ37979.1 putative oxidoreductase dehydrogenase LigD [ 973 974 YP_004155471.1 short-chain dehydrogenase/reductase sdr LigD [ >gb|ADU37360.1|short-chain dehydrogenase/reductase SDR [ 975 976 YP_001353681.1 hypothetical protein mma_1991 LigD [ >gb|ABR91341.1|short-chain dehydrogenase/reductase SDR [ indicates data missing or illegible when filed