Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.

The widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmac...

Full description

Bibliographic Details
Main Authors: Daniela Vergara, Reggie Gaudino, Thomas Blank, Brian Keegan
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0236878
id doaj-fb6d4311817146f3bb86dd7b1293c402
record_format Article
spelling doaj-fb6d4311817146f3bb86dd7b1293c4022021-03-03T22:04:36ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01159e023687810.1371/journal.pone.0236878Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.Daniela VergaraReggie GaudinoThomas BlankBrian KeeganThe widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmacokinetics of dozens of cannabinoids offer opportunities of high interest in pharmacology research. Retailers in many medical and recreational jurisdictions are typically required to report chemical concentrations of at least some cannabinoids. Commercial cannabis laboratories have collected large chemotype datasets of diverse Cannabis cultivars. In this work a data set of 17,600 cultivars tested by Steep Hill Inc., is examined using machine learning techniques to interpolate missing chemotype observations and cluster cultivars into groups based on chemotype similarity. The results indicate cultivars cluster based on their chemotypes, and that some imputation methods work better than others at grouping these cultivars based on chemotypic identity. Due to the missing data and to the low signal to noise ratio for some less common cannabinoids, their behavior could not be accurately predicted. These findings have implications for characterizing complex interactions in cannabinoid biosynthesis and improving phenotypical classification of Cannabis cultivars.https://doi.org/10.1371/journal.pone.0236878
collection DOAJ
language English
format Article
sources DOAJ
author Daniela Vergara
Reggie Gaudino
Thomas Blank
Brian Keegan
spellingShingle Daniela Vergara
Reggie Gaudino
Thomas Blank
Brian Keegan
Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
PLoS ONE
author_facet Daniela Vergara
Reggie Gaudino
Thomas Blank
Brian Keegan
author_sort Daniela Vergara
title Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
title_short Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
title_full Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
title_fullStr Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
title_full_unstemmed Modeling cannabinoids from a large-scale sample of Cannabis sativa chemotypes.
title_sort modeling cannabinoids from a large-scale sample of cannabis sativa chemotypes.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description The widespread legalization of Cannabis has opened the industry to using contemporary analytical techniques for chemotype analysis. Chemotypic data has been collected on a large variety of oil profiles inherent to the cultivars that are commercially available. The unknown gene regulation and pharmacokinetics of dozens of cannabinoids offer opportunities of high interest in pharmacology research. Retailers in many medical and recreational jurisdictions are typically required to report chemical concentrations of at least some cannabinoids. Commercial cannabis laboratories have collected large chemotype datasets of diverse Cannabis cultivars. In this work a data set of 17,600 cultivars tested by Steep Hill Inc., is examined using machine learning techniques to interpolate missing chemotype observations and cluster cultivars into groups based on chemotype similarity. The results indicate cultivars cluster based on their chemotypes, and that some imputation methods work better than others at grouping these cultivars based on chemotypic identity. Due to the missing data and to the low signal to noise ratio for some less common cannabinoids, their behavior could not be accurately predicted. These findings have implications for characterizing complex interactions in cannabinoid biosynthesis and improving phenotypical classification of Cannabis cultivars.
url https://doi.org/10.1371/journal.pone.0236878
work_keys_str_mv AT danielavergara modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes
AT reggiegaudino modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes
AT thomasblank modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes
AT briankeegan modelingcannabinoidsfromalargescalesampleofcannabissativachemotypes
_version_ 1714813654230106112