Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints

Chen, H, Poon, J, Poon, S, Cui, L, Fan, K and Sze, M 2015, 'Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints', BMC Bioinformatics, vol. 16, no. Supplement 12, pp. 1-8.

Document type: Journal Article
Collection: Journal Articles

Title Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
Author(s) Chen, H
Poon, J
Poon, S
Cui, L
Fan, K
Sze, M
Year 2015
Journal name BMC Bioinformatics
Volume number 16
Issue number Supplement 12
Start page 1
End page 8
Total pages 8
Publisher BioMed Central Ltd.
Abstract Background: Recent quality control of complex mixtures, including herbal medicines, is not limited to chemical chromatographic definition of one or two selected compounds; multivariate linear regression methods with dimension reduction or regularisation have been used to predict the bioactivity capacity from the chromatographic fingerprints of the herbal extracts. The challenge of this type of analysis requires a multidimensional approach at two levels: firstly each herb comprises complex mixtures of active and non-active chemical components; and secondly there are many factors relating to the growth, production, and processing of the herbal products. All these factors result in the significantly diverse concentrations of bioactive compounds in the herbal products. Therefore, it is imminent to have a predictive model with better generalisation that can accurately predict the bioactivity capacity of samples when only the chemical fingerprints data are available. Results: In this study, the algorithm of Stacking Multivariate Linear Regression (SMLR) and a few other commonly used chemometric approaches were evaluated. They were to predict the Cluster of Differentiation 80 (CD80) expression bioactivity of a commonly used herb, Astragali Radix (AR), from the corresponding chemical chromatographic fingerprints. SMLR provides a superior prediction accuracy in comparison with the other multivariate linear regression methods of PCR, PLSR, OPLS and EN in terms of MSEtest and the goodness of prediction of test samples. Conclusions: SMLR is a better platform than some multivariate linear regression methods. The first advantage of SMLR is that it has better generalisation to predict the bioactivity capacity of herbal medicines from their chromatographic fingerprints. Future studies should aim to further improve the SMLR algorithm. The second advantage of SMLR is that single chemical compounds can be effectively identified as highly bioactive components which demands further CD80 bioactivity confirmation.
Subject Bioinformatics
Keyword(s) Chromatographic Fingerprints
Bioactivity Prediction
Ensemble Learning
DOI - identifier https://www.doi.org/10.1186/1471-2105-16-S12-S4
Copyright notice © 2015 Chen et al.; Creative Commons Attribution License 4.0
ISSN 1471-2105
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 4 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 0 times in Scopus Article
Altmetric details:
Access Statistics: 94 Abstract Views  -  Detailed Statistics
Created: Wed, 28 Oct 2015, 10:15:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us