Cheng 2013
Published on 19 September 2013. Downloaded by Christian Albrechts Universitat zu Kiel on 27/10/2014 12:12:59.
Chaihu (Bupleuri Radix), the root of Bupleurum chinense and B. scorzonerifolium, is a traditional Chinese herbal medicine authenticated in the Chinese Pharmacopoeia. There are also several variations available from local herbal markets, for example, the roots of B. falcatum, B. bicaule, and B. marginatum var. stenophyllum. In the current study, we collected 64 Chaihu samples, including 33 authenticated samples and 31 commercial samples. Test solutions of all the samples were analysed by high-performance thin-layer chromatography (HPTLC) to assess the principal bio-active components (saikosaponins). The HPTLC fluorescent images acquired were analyzed by image processing techniques for comprehensive quantification. High dimensional features for both gray-scale and true color images were constructed from the raw images. Classical classification algorithms, including naive Bayes, Support Vector Machine (SVM), K-nearest neighbors, neural network and logistic regression, were used to construct prediction models. To gain an insight into the principal components while evaluating the Chaihu samples, feature selection and ensemble feature selection methods were further combined with the classifiers to enhance the discrimination power. Ensemble feature selection was shown to achieve superior performance. Experimental results demonstrated that the roots of Chaihu from different species of the genus Bupleurum could be readily distinguished so that commercial samples could be easily classified.
Received 9th July 2013
Accepted 19th September 2013
DOI: 10.1039/c3ay41132j
www.rsc.org/methods
This journal is © The Royal Society of Chemistry 2013. Anal. Methods, 2013, 5, 6325–6330
further enhance the discrimination power, ensemble feature selection and various feature selection mechanisms were combined with various classifiers. Extensive experiments are reported and analyzed on the performance of the combination of classification tools and HPTLC. The current study demonstrates that the combination of advanced machine learning techniques and HPTLC can assess the quality of different species of Chaihu in an accurate and effective way.
2 Experimental
2.1 HPTLC experimental sample
Sixty-four batches of Chaihu samples were collected from different herbal markets or harvested from various habitats. Among them, thirty-one samples, including B. chinense, B. scorzonerifolium, B. falcatum, B. longiradiatum, B. bicaule and B. marginatum var. stenophyllum, were authenticated by botanists Prof. Z. D. Wang of Henan Science & Technology University, China and Prof. D. Q. Wang of Anhui University of Traditional Chinese Medicine, China.
2.2 HPTLC experiment setup
The chemical reagents for the experiment were obtained from the Guangzhou Chemical Reagent Factory (Guangzhou, China). Chemical reference standards of saikosaponin a and saikosaponin d were provided by the National Institute for the Control of Pharmaceutical and Biological Products (Beijing, China). Chemical references of saikosaponin c, saikosaponin f and saikosaponin b2 were provided by Henan College of Traditional Chinese Medicine, China.
The experimental procedure for the preparation of the HPTLC fingerprint was as follows:
(1) Preparation of sample solution: A 0.3 g portion of powdered herb was added to 20 mL of a 0.5% solution of pyridine in methanol, which prevents the degradation of saikosaponins a and d. The mixture was refluxed twice in a water bath at 80 °C for 30 minutes and filtered afterwards. The filtrate was evaporated to dryness in a fume cupboard and reconstituted in 3 mL of water before the suspension was applied to a C18 cartridge. After elution with 10 mL of 30% methanol and 20 mL of 80% methanol, successively, the 80% methanol fraction was evaporated to dryness and the residue was dissolved in 2 mL of methanol. The solution was subsequently filtered through a 0.45 μm membrane filter before analysis.
(2) Preparation of reference solutions: A 5 mg portion of each saikosaponin reference was dissolved in 5 mL of methanol.
(3) HPTLC chromatographic conditions: The sample solutions were applied band-wise via an ATS4 auto-sampler (CAMAG, Muttenz, Switzerland) onto a commercial 20 cm × 10 cm pre-coated HPTLC Silica gel 60 plate (Merck). The sample plate was placed into a desiccator with phosphorus pentoxide and dried under vacuum for 2 hours before development. Fifteen milliliters of mobile phase consisting of dichloromethane–ethyl acetate–methanol–water (30 : 40 : 15 : 3, v/v/v/v) was added into a twin-trough chamber, which was left to saturate for 15 minutes. The plate in the chamber was developed upward over a path of 8 cm, sprayed with DMAB reagent and heated at 105 °C on a TLC plate heater (CAMAG) until the colour of the saponins was distinct. The fluorescent images were examined at 365 nm by using a UV viewer cabinet (CAMAG). The images were captured by a Digistore 2 documentation system (CAMAG). The excitation wavelength was 366 nm in the reflection mode and the exposure time was 3 seconds.
A sample image obtained following the aforementioned procedures is shown in Fig. 1(a).
3 Pattern analysis for HPTLC
To obtain an effective discrimination system with machine learning techniques, pattern analysis was essential for our study. These procedures are depicted in Fig. 2.
3.1 HPTLC fingerprint image preprocessing
The raw HPTLC fluorescence images have to be preprocessed to standardize the data in order to prevent any side-effects arising from the experiment, such as image shifting and nonuniform lighting. An example is shown in Fig. 1. The proposed preprocessing method consists of two steps. In the first step, the raw images are converted into gray-scale or true color images, and a noise suppression scheme is applied to enhance the image quality. This step facilitates feature extraction and is used for quantification of the image. In the second step, the denoised images are aligned manually so that the tested images are the same size. The head and tail portions of each standardized image were removed, as nonuniform lighting made these parts of the images imperfect.
3.2 HPTLC image feature construction
3.2.1 Feature calculation. Since the intensity of the preprocessed images shows band-wise variations, their averaged intensity profile could be used to quantify the variation. To be specific, the peak and valley values along the curve at particular positions were estimated as feature values. In Fig. 3, the detected peaks and valleys are plotted as stars. In addition, each
Fig. 1 Demonstration of preprocessing of HPTLC fingerprint images. (a) A raw HPTLC fingerprint image; (b) gray scale transformed image with histogram equalization; (c) image after alignment to have uniform size.
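The feature calculation of Section 3.2.1 (cropping the head and tail of a track, averaging the intensity across the band width, and reading off peak and valley values) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy 9 × 3 gray-scale track, and the use of strict local extrema as the peak/valley detector are all assumptions made for illustration, and the paper's detector and positions may differ.

```python
from typing import List, Tuple

# A gray-scale track: rows follow the migration direction, columns span the band width.
Image = List[List[float]]


def crop_head_tail(track: Image, head: int, tail: int) -> Image:
    """Drop `head` rows from the top and `tail` rows from the bottom (hypothetical helper)."""
    return track[head:len(track) - tail]


def intensity_profile(track: Image) -> List[float]:
    """Average every row across the band width to obtain a 1-D intensity profile."""
    return [sum(row) / len(row) for row in track]


def peaks_and_valleys(profile: List[float]) -> Tuple[List[int], List[int]]:
    """Return indices of strict local maxima and strict local minima (assumed detector)."""
    peaks: List[int] = []
    valleys: List[int] = []
    for i in range(1, len(profile) - 1):
        left, mid, right = profile[i - 1], profile[i], profile[i + 1]
        if mid > left and mid > right:
            peaks.append(i)
        elif mid < left and mid < right:
            valleys.append(i)
    return peaks, valleys


# Toy track (made-up numbers): the first and last rows mimic badly lit edges.
toy_track = [
    [9, 9, 9],
    [1, 1, 1],
    [5, 6, 7],
    [2, 2, 2],
    [1, 0, 2],
    [4, 4, 4],
    [8, 9, 10],
    [3, 3, 3],
    [9, 9, 9],
]

cropped = crop_head_tail(toy_track, head=1, tail=1)
profile = intensity_profile(cropped)
peaks, valleys = peaks_and_valleys(profile)

# Feature vector: profile values at the detected extrema, ordered by position.
features = [profile[i] for i in sorted(peaks + valleys)]
print(profile, peaks, valleys, features)
```

On real plates the profile would first need smoothing, since a strict local-extremum test is sensitive to pixel noise; the sketch omits this to stay short.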
Table 1 Experimental results of various classifiers on the feature subsets with and without PCA processing. The best results for each scheme are highlighted in italics. The overall performance of the second feature subset was the best. PCA processing did not noticeably improve the classification performance
Feature subset (#features)   PCA processing   NB   SMO   RBF-NN   KNN   Logistic   Average
Table 2 Experimental results for the feature selection methods combined with various classifiers. The best results for each scheme are highlighted in italics. The overall accuracy of the second feature subset was far better than that of the other feature subsets
Table 3 Experimental results of the performance of ensemble feature selection technology combined with the Libsvm tool for the four feature sets. The overall accuracy of the second feature set is slightly superior to that of the other sets
Feature set (#features)   #Features   Classification accuracy (%)
I (504)                   40          87.5 ± 3.1
II (258)                  30          95.3 ± 1.6
III (176)                 30          90.6 ± 3.1
IV (53)                   20          93.8 ± 3.1
ensemble of the base classifiers. This is achieved through three steps: (1) elimination of base classifiers with low classification accuracy; (2) removal of those base classifiers that have identical predictions; (3) integration of the base classifiers with the criterion:
\[ \tilde{p} = F\left(\sum u_k v_m\right) \qquad (1) \]
where \tilde{p} is the predicted result after integration, v_m is the predicted result of the base classifier b_m, and u_k is its prediction accuracy. The piecewise function F(·) acts as a threshold (step) function and is defined by
\[ F(x) = \begin{cases} 1, & x > 0.5 \\ 0, & \text{otherwise} \end{cases} \]
Therefore, the integration scheme favours the predictions of the base classifiers with high prediction accuracy.
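The three-step integration described above can be sketched as follows. This is a hedged illustration rather than the authors' code. It assumes binary predictions (0 or 1), accuracy weights normalized to sum to one so that the 0.5 threshold in F is meaningful, and a hypothetical accuracy cut-off `min_acc` for step (1); none of these details is given in the text. The classifier names and numbers are invented.

```python
from typing import Dict, List, Sequence


def threshold(x: float) -> int:
    """F(x) from eqn (1): 1 if x > 0.5, otherwise 0."""
    return 1 if x > 0.5 else 0


def prune(preds: Dict[str, Sequence[int]], acc: Dict[str, float], min_acc: float) -> List[str]:
    """Steps (1) and (2): drop weak base classifiers, then drop duplicated predictions.

    Classifiers are visited from most to least accurate, so the most accurate
    member of a group of identical predictors is the one kept (an assumption).
    """
    kept: List[str] = []
    seen = set()
    for name in sorted(preds, key=lambda n: -acc[n]):
        key = tuple(preds[name])
        if acc[name] >= min_acc and key not in seen:
            kept.append(name)
            seen.add(key)
    return kept


def integrate(preds: Dict[str, Sequence[int]], acc: Dict[str, float], kept: List[str]) -> List[int]:
    """Step (3): accuracy-weighted vote per sample, passed through F."""
    total = sum(acc[n] for n in kept)  # normalization of the weights is assumed
    n_samples = len(preds[kept[0]])
    return [
        threshold(sum(acc[n] / total * preds[n][i] for n in kept))
        for i in range(n_samples)
    ]


# Invented predictions of five base classifiers on four samples.
preds = {
    "b1": [1, 0, 1, 1],
    "b2": [1, 0, 1, 1],  # identical to b1, removed in step (2)
    "b3": [0, 0, 1, 0],
    "b4": [1, 1, 0, 0],
    "b5": [0, 1, 0, 1],  # low accuracy, removed in step (1)
}
acc = {"b1": 0.9, "b2": 0.85, "b3": 0.8, "b4": 0.7, "b5": 0.4}

kept = prune(preds, acc, min_acc=0.6)
fused = integrate(preds, acc, kept)
print(kept, fused)
```

With normalized weights, a sample is labelled 1 only when the classifiers voting for it hold more than half of the total accuracy mass, which is how more accurate base classifiers dominate the result.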
dimensional feature space by using a non-linear mapping algorithm to make it linearly separable. In this paper, two improved training algorithms for SVM were used, that is, the Sequential Minimal Optimization (SMO) [25] classifier and Libsvm [24].
SMO helps to accelerate the solving procedure by breaking a large quadratic programming (QP) problem down into a series of smaller QP problems. SMO improves the scaling and reduces computation time significantly by utilizing the smallest possible QP problems.
In ensemble feature selection, Libsvm is widely used as an efficient SVM tool. There are two steps involved in using Libsvm: (1) the dataset is trained to obtain a model; (2) the model is used to predict the information for the testing dataset. In this paper, a polynomial kernel was used.
3.3.3 K-Nearest neighbors (KNN) classifier. The K-nearest neighbors algorithm [14] was implemented for pattern recognition by using the so-called weighted vote formula to predict the herbal species based on the spatial distances between observation and target vectors.
3.3.4 Radial basis function neural network (RBF-NN) classifier. A neural network [15] based on radial basis functions is an efficient feed-forward neural network [27]. It has best-approximation and global-optimum properties that other feed-forward networks lack. It has a simple structure and fast training speed.
3.3.5 Logistic classifier. Logistic [16] is a classifier for building and using a multinomial logistic regression model with a ridge estimator.
3.3.6 Principal component analysis (PCA) [28]. Principal component analysis (PCA) is widely adopted as a preprocessing procedure which uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables, called principal components. In many analyses, the number of principal components which account for most of the variance in the observed variables is significantly less than that of the original variables [29]. In our study, the correlations among the features of the HPTLC images are high and thus PCA is expected to achieve good performance by reducing the redundant features and improving classification performance.
4 Experimental results
In this section, we demonstrate the performance of the various classifiers on the four feature subsets by combining different feature selection with ensemble feature selection methods. The purpose of this section is to show that the fully automated classification models can achieve high accuracy in discriminating authentic Chaihu samples from fake ones when the raw images are characterized by accurate quantitative measurements.
In the first experiment, the four feature sets were processed independently by PCA to get an economic representation by discarding 5% of the least informative components. The resulting feature representation was then combined with various classifiers to evaluate their discrimination power. The main experimental results are summarized in Table 1. The overall performance of the second feature set was superior to the other feature sets as it reached an accuracy of 90%. Since the color information was omitted in the second feature set, the results imply that removal of the color information can enhance the discrimination power. A possible reason is that the color information may be inaccurate due to imperfect image processing, such as non-uniform lighting. A similar observation was shown in the second example in which the classification performance of the first feature set was vastly inferior to the latter ones. Another point worth noting is that the classification accuracy after PCA did not show obvious improvement over that without PCA preprocessing, possibly because the feature dimensions are only moderate.
In the second experiment, various methods for feature selection were first applied to the four feature subsets. The resulting feature subset was then fed into various classification algorithms, including naive Bayes, SVM, RBF-NN, KNN and logistic classifiers. The averaged performance of each classifier on the different feature subsets is summarized in Table 2. The performance of the second feature set was the best in most of the cases. The results were similar to the first experiment. Furthermore, the performance of the classifiers increased dramatically after feature selection. For example, the naive Bayes classifier reached an accuracy of 93.75% in comparison with 82.81% which was achieved without feature selection. The accuracy of KNN reached 95.31% with feature selection, while 92.19% was achieved without processing. This good performance implies that high accuracy can be obtained by removing redundant information in the feature set.
In the third experiment, the ensemble feature selection method with SVM as the base classifier was tested on the four feature sets and achieved remarkable results. In our experiment, fifteen base classifiers were first constructed for randomly selected features. Extensive experiments were conducted to search for a feature subset with good performance. The performance of the classifier was evaluated via 10-fold cross-validation. Experimental results are summarized in Table 3. The second and fourth feature sets produced optimal and suboptimal results, respectively. Similar to the previous two experiments, classification of the second feature set was the best and an accuracy of 95.3% was achieved.
In order to compare the performance of the different classifiers via different feature selection technologies, Fig. 4 was plotted. As shown in Fig. 4, the ensemble feature selection method combined with the Libsvm tool achieved a significantly superior accuracy for classification in comparison with the other methods.
5 Conclusion
HPTLC has been shown to be promising for the development of chromatographic fingerprint profiling methods to determine complex herb extracts. The pictorial nature of an HPTLC image provides extra intuitively visible measurements for assessing its chemical characteristics. However, quantitative image analysis of HPTLC remains an open problem, as does its clinical potential. Besides,
various contents of saikosaponins among different samples of Chaihu species were observed, which calls for not only assessing the clinical quality by analyzing the multiple marker components individually but also recognizing the entire fingerprint pattern for consistency assurance and authentication purposes.
In the current study, various techniques for machine learning and image analysis were combined to evaluate the chemical quality of Bupleuri Radix through HPTLC. Four inherent feature subsets were first derived to quantify the pictorial characteristics of the HPTLC image. In order to test the discrimination potential of the derived features, various standard machine learning schemes were used. Various feature selection methods, including a filter scheme and an ensemble scheme combined with advanced classifiers, were carried out to assess the fingerprint pattern. Experimental results have confirmed the high accuracy in discriminating various samples of Chaihu species. This study has revealed a promising way of classifying the intrinsic inconsistency of herbal quality when the distribution of principal ingredients in the herb varies from one batch to another.
Acknowledgements
This work was supported by NSFC under award numbers 60902076 and 61372141, and the Fundamental Research Funds for the Central Universities under award number 2013ZM0079.
References
1 C. P. Commission, Pharmacopoeia of the People's Republic of China, Chemical Industry Press, 2011, vol. 1, pp. 196–197.
2 Z. H. Su, S. Q. Li and G. A. Zou, J. Pharm. Biomed. Anal., 2011, 55, 533–539.
3 J. P. Committee, The Japanese Pharmacopoeia, Ministry of Health, Japan Tokyo, 2000, vol. 1, pp. 876–878.
4 C. P. Commission, Pharmacopoeia of the People's Republic of China, Peoples Health Publishing House, 1963, vol. 1, pp. 237–238.
5 P. Xiao, Modern Chinese Materia Medica, Chemical Industry Press, 2002, vol. 1, pp. 784–785.
6 S. B. Chen, H. P. Liu and R. T. Tian, J. Chromatogr., A, 2006, 2, 114–119.
7 R. T. Tian, P. S. Xie and H. P. Liu, J. Chromatogr., A, 2009, 18, 2150–2155.
8 A. Zlatkis and R. Kaiser, HPTLC: High Performance Thin-Layer Chromatography, Elsevier, 1977, vol. 6, pp. 95–126.
9 J. Tamaoka and K. Komagata, FEMS Microbiol. Lett., 1984, 25, 125–128.
10 P. Torrione, K. D. Morton and L. Collins, Chemometrics and Machine Learning for Spectral Analysis, Optical Society of America, 2012, vol. 1, pp. 3–10.
11 Y. Q. Wang, H. X. Yan, R. Guo and F. F. Li, Int. J. Data Min. Bioinf., 2011, 5, 369–382.
12 G. John and P. Langley, Estimating Continuous Distributions in Bayesian Classifiers, Morgan Kaufmann, 1995, vol. 3, pp. 338–345.
13 C. Cortes and V. Vapnik, Mach. Learn., 1995, 3, 273–297.
14 D. W. Aha, D. Kibler and M. K. Albert, Instance-Based Learning Algorithms, Springer, Netherlands, 1991, vol. 1, pp. 37–66.
15 A. Guillén, I. Rojas and González, Neural Process. Lett., 2007, 25, 209–225.
16 S. L. Cessie and J. C. V. Houwelingen, J. Appl. Stat., 1992, 2, 191–201.
17 Q. Huang, Y. Zhuang, X. B. Qiao and X. J. Xu, Acta Phys.-Chim. Sin., 2007, 23, 1141–1145.
18 A. L. Blum and P. Langley, Artif. Intell., 1997, 97, 245–271.
19 H. Liu and R. Setiono, Chi2: Feature Selection and Discretization of Numeric Attributes, IEEE Computer Society, Herndon, Virginia, 1995, vol. 2, pp. 388–391.
20 J. W. Han and M. Kamber, Data Mining: Concepts and Techniques (The Morgan Kaufmann Series in Data Management Systems), Morgan Kaufmann, 1st edn, 2000, vol. 2, pp. 179–220.
21 K. Kira and L. A. Rendell, A Practical Approach to Feature Selection, Morgan Kaufmann Publishers Inc., 1992, vol. 2, pp. 249–256.
22 T. K. Ho, IEEE Trans. Pattern Anal. Mach. Intell., 1998, 20, 832–844.
23 A. Tsymbal, S. Puuronen and D. W. Patterson, Inf. Fusion, 2003, 4, 87–100.
24 C. C. Chang and C. J. Lin, ACM Trans. Intell. Syst. Technol., 2011, 2, 1.
25 Advances in Kernel Methods, ed. B. Schölkopf, C. J. C. Burges and A. J. Smola, MIT Press, Cambridge, MA, USA, 1999, vol. 2, pp. 185–208.
26 H. Wang, A Computerized Diagnostic Model Based on Naive Bayesian Classifier in Traditional Chinese Medicine, IEEE Computer Society, 2008, vol. 1, pp. 474–477.
27 K. Priddy and P. Keller, Artificial Neural Networks: An Introduction, Society of Photo Optical, 2005, vol. 2, pp. 205–234.
28 H. Abdi and L. J. Williams, Wiley Interdiscip. Rev.: Comput. Stat., 2010, 2, 433–459.
29 C. Y. Wang, Z. Y. Chen, C. G. Wu and Y. C. Liang, Medicine Composition Analysis Based on PCA and SVM, Springer, 2005, vol. 9, pp. 1226–1230.