Location: Sustainable Biofuels and Co-products ResearchTitle: Draft genome sequences of the sophorolipid-producing yeast pseudohyphozyma bogoriensis ATCC 18809
|Ashby, Richard - Rick|
|CAMPOS, PHILIP - Oak Ridge Institute For Science And Education (ORISE)|
|SOLAIMAN, DANIEL - Retired ARS Employee|
Submitted to: Microbiology Resource Announcements
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 11/10/2022
Publication Date: 11/30/2022
Citation: Msanne, J.N., Shao, J.Y., Ashby, R.D., Campos, P., Liu, Y., Solaiman, D. 2022. Draft genome sequences of the sophorolipid-producing yeast pseudohyphozyma bogoriensis ATCC 18809. Microbiology Resource Announcements. https://doi.org/10.1128/mra.00566-22.
Interpretive Summary: The yeast strain Pseudohyphozyma bogoriensis is capable of producing considerable amounts of high-value surface-active molecules (glycolipid biosurfactants, more specifically known as sophorolipids) which have large-scale application potential in the detergent and cosmetic industries. Understanding the DNA genome sequence contained within this yeast strain would offer vital knowledge of the genetic regulations that govern the synthesis of these industrially relevant molecules. This will help make the synthetic process more economical through the development of genetic engineering systems and, coupled with the application of low-value agricultural by-products or other cheap substrates, will provide essential information necessary to further reduce production costs and broaden the application base for these molecules.
Technical Abstract: The aerobic yeast Pseudohyphozyma (formerly Rhodotorula) bogoriensis is gaining attention as a microbial source for the production and secretion of a unique, high-value extracellular glycolipid (e.g., sophorolipid) with numerous possibilities for industrial applications. Here, we report on the genomic sequences of P. bogoriensis ATCC 18809 that would help improve the understanding of relevant metabolic pathways involved in sophorolipid biosynthesis and allow the development of more specific genome manipulation systems. Sequencing was performed using the PacBio Sequel System and the 26 Mbp genome has a GC content of 57% and encodes for 7,818 predicted proteins.