Classification of Wine Samples by Means of ANN
Classification of Wine Samples by Means of ANN
OR I G I N A L P AP E R
Abstract The three-layer artificial neural network origins of wine by reliable chemical analysis techniques
(ANN) model with back-propagation (BP) of error was in combination with modern chemometrical methods
used to classify wine samples in six different regions instead of traditional wine experts. Concerning the
based on the measurements of trace amounts of analysis methods of constituents in wine, there are
B, V, Mn, Zn, Fe, Al, Cu, Sr, Ba, Rb, Na, P, Ca, Mg, many reports such as GC for the analysis of aroma
K using an inductively coupled plasma optical emis- compounds in wine [1], HPLC for the determination
sion spectrometer (ICP-OES). The ANN architecture of phenolic acids [2], spectrophotometry for Fe [3],
and parameters were optimized. The results obtained Mn [4] and preservatives containing benzoic acid,
with ANN were compared with those obtained by sorbic acid, dehydroacetic acid and ethyl parahyd-
cluster analysis, principal component analysis, the robenzoate [5], ion-selective electrode (ISE) methods
Bayes discrimination method and the Fisher discrim- for the simultaneous determination of sulfite and phos-
ination method. A satisfactory prediction result (100%) phate [6] and ethanol [7], capillary zone electrophor-
by an artificial neural network using the jackknife esis (CZE) for phenolic compounds [8], flame atomic
leave-one-out procedure was obtained for the classi- absorption spectroscopy analysis (FAAS) and graphite
fication of wine samples containing six categories. furnace atomic absorption spectroscopy analysis
(GFAAS) for Fe, Cu, Ag, Co, Si, Zn, Pb [9—11], GC-MS
(mass spectrometry) for trifluoroacetylated glycosides
[12], GC-FTIR (Fourier transform infrared)-MS for
Introduction spirits [13], NMR [14] and so on. There is also
a monograph which describes various methods of wine
Wine is one of the most widely consumed beverages in analysis [15]. The inductively coupled plasma optical
the world and has very obvious commercial value as emission spectrometer (ICP-OES) provides a powerful
well as social importance. Therefore, the evaluation of means for fast analysis of a number of elements of the
the quality of wine plays a very important role for both periodic table. There is a problem, however, with how
manufacture and sale. Historically, the quality or geo- to treat the large quantity of data. Fortunately,
graphical origin of wine was determined only through chemometrical methods are able to solve such multi-
tasting by wine experts. However, recently more ad- variate data analysis problems [16, 17]. In the present
vanced instruments have become available. Therefore, paper, cluster analysis, principal component analysis,
it seems reasonable to determine the geographical discrimination analysis methods and artificial neural
networks are applied to wine classification based on
ICP-OES measurement data. Principal component
analysis [18—22], cluster analysis [23—27] and discrim-
Dedicated to Professor Dr. Gerhard Werner on the occasion of his
65th birthday
ination analysis methods [28—31] have found wide ap-
plications in various fields. Recently artificial neural
Li-Xian Sun networks (ANN) [32—43] have become powerful
On leave from the Department of Chemistry, Hunan Normal chemometric tools for modeling complex analytical
University, Changsha, 410081, People’s Republic of China systems and solving those complex non-linear prob-
K. Danzer ( ) · G. Thiel lems in which the relationship between the cause and
Institute of Inorganic and Analytical Chemistry, Friedrich Schiller effect cannot be precisely defined or described by chem-
University Jena, Lessingstrasse 8, D-07743 Jena, Germany ical reasoning. In this paper, trace amounts of
144
B, V, Mn, Zn, Fe, Al, Cu, Sr, Ba, Rb, Na, P, Ca, Mg, K in
wine samples were measured by ICP-OES. Both cluster
analysis and principal component analysis were used to
investigate the structure of the data and the three-layer
ANN model was used to classify the wine samples. The
results obtained by ANN were compared with those
obtained by conventional Bayes discrimination analy-
sis and Fisher discrimination methods.
Chemometric fundamentals
Table 1 List of origins of wine samples and labels by three different methods with two different instruments
Note* each sample was measured 10 times and the number(s) in parentheses represent(s) the sequential No. of measurements with certain
methods and instruments (see (a—d)
! FS(1—3)DEP means that the wine sample FS is processed with the Decomposition method and then measured three times (i.e., from 1st—3rd)
by Plasmaquant 110
" FS(4—6)DIP means that the wine sample FS is processed with the Direct method and then measured three times (i.e., from 4th—6th) by
Plasmaquant 110
# FS(7—9)DIM means that the wine sample FS is processed with the Direct method and then measured three times (i.e., from 7th—9th) by
Maxim I
$ FS(10)SP means that the wine sample FS is processed with the Standard addition method and then measured only once (i.e. 10th) by
Plasmaquant 110
Plasmaquant 110 (Carl Zeiss Jena GmbH) ICP spectrometers were STATISTICA 5.0 [51] was used to carry out the cluster analysis
used to measure the concentrations of the fifteen trace elements of of the wine samples. The other programs were written in MATLAB
seventeen samples at 17 different wavelengths. The details of the [52] and run on IBM 486.
analytical program and the wavelengths used can be found in Table
2. To increase the reliability of the analytical results, each sample
was preprocessed using different methods (i.e., direct method, de-
composition method, and standard addition method) as shown in
Table 1 and was measured by both ICP-OES instruments. For Results and discussion
samples in a complex matrix the standard addition method is prefer-
red [50]. In the case of the direct method, standard solutions Principal component analysis
containing ethanol and different amounts of the elements to be
determined were used for calibration. In order to be able to compen-
sate for intensity differences caused by viscosity influences of the In Fig. 2 the first principal component PC1 of wine
organic components during sample nebulization, the organic com- samples is plotted against the second principal com-
pounds of the wine, in particular the alcohol, must be removed from ponent PC2, but the separations among different cate-
the samples. The wine samples were therefore decomposed using gories of wine samples from this PC1-PC2 scatter
perhydrol. This method, however, is very time-consuming. For more
details of these methods, see [50]. Finally, 170 samples were utilized point plot are not obvious. Therefore, the simple
for both ICP-OES measurements and data analysis because each PC1-PC2 scatter point plot appears to be inadequate
original wine sample was measured 10 times (see Table 1). to distinguish the wine samples.
146
Table 2 Measurement
parameters of the instruments Parameter Maxim I Plasmaquant 110
(Firma Fisons Instruments) (Firma Carl Zeiss Jena
GmbH)
Table 3 Results obtained by using the cluster analysis based on Ward’s method
1st Group 2nd Group 3rd Group 4th Group 5th Group 6th Group
N 5 8 10 15 20 30
! h"0.8, a"0.9
! h"1, a"0.9
different categories of wine samples from the CV1-CV2
scatter point plot are much more obvious than in Fig. 2
or Fig. 3. Therefore, it is possible to classify the wine shows that ANN classifies wine samples from six differ-
samples using this method. The results (Table 4) ob- ent regions with the most satisfactory results.
tained show that an average discrimination rate for
training is 95.3% and the corresponding prediction
rate is 91.8%. Conclusions
supporting this work. We are grateful to Mrs. Schubert in Winzer- 25. Yu RQ, Sun LX, Liang YZ (1995) Classification of materials. In:
vereinigung Freyburg/Unstrut eG., Prof. Dr. J. Dietrich in Kaliva JH (ed) Adaptation of Simulated Annealing to Chemical
Forschungsanstalt Geisenheim, Dr. R. Amann in Staatliches Optimization Problems. Elsevier, Amsterdam, pp 155—179
Weinbauinstitut Freiburg i. Br. and Mrs. Kreisel in Chemisches 26. Sun LX, Danzer K (1996) Journal of Chemometrics (in press)
Untersuchungsamt Mainz for the preparation of wine samples. 27. Shaffer RE, Small GW (1996) Chemometrics Intelligent Labor-
atory Systems 32:95—109
28. Martin MJ, Pablos F, Gonzalez AG (1996) Anal Chim Acta
320:191—197
References 29. Todeschini R, Marengo E (1992) Chemometrics and Intelligent
Laboratory Systems 16:25—35
1. Villen J, Senorans FJ, Reglero G, Herraiz M (1995) J Agric Food 30. Sun LX, Song XH, Xie YL, Liang YZ, Dong YF, Zhao XG,
Chem 43:717—722 Yu RQ (1995) Chin J Hunan Normal University 12:18—23
2. Buiarelli F, Cartoni G, Coccioli F, Levetsovitou Z (1995) 31. Sun LX, Song XH, Yu RQ (1994) Chinese Acta Scientiae
J Chromatogr A 695:229—235 Circumstantiae 14:501—505
3. Capitan Garcia F, Checa R, Avidad R, Capitan-Valley LF 32. Sun LX, Xu F, Tan J, Liang YZ, Yu RQ (1994) Chinese Acta
(1995) Talanta 42:711—715 Pharmaceutica Sinica 29:767—772
4. Zhou JY (1995) Lihua Jianyan, Huaxue Fence 31:42—43 33. Sun LX, Song XH, Yu RQ (1994) China Environmental Science
5. Sasaki, Hideto (1994) Tottori Daigaku Kogakuba Kenkyu 14:259—262
Hokoko 25:169—177 34. Zupan J, Gasteiger J (1991) Anal Chim Acta 248:1—30
6. Yao T, Satomura M, Nakahara T (1994) Talanta, 41:2113—2119 35. Long JR, Gregoriou VG, Gemperline PJ (1990) Anal Chem
7. Kokovkin VV, Smolyakav BS (1995) J Anal Chem 50:519—523 62:1791—1797
8. Gil MI, Garcia Viguera C, Bridle P, Tomas Barberan FA (1995) 36. Wythoff BJ (1993) Chemometrics Intelligent Laboratory Sys-
Z Lebensm Unters Forsch 200:278—281 tems 18:115—155
9. Escobal A, Iriondo C, Laborra C, Ulibarrena E (1995) At Spec- 37. Song XH, Chen Z, Yu RQ (1992) Chemometrics Intelligent
trosc 16:162—164 Laboratory Systems, 16:213—219
10. Soares ME, Bastos ML, Ferreira MA (1995) At Spectrosc 38. Borggaard C, Thodberg HH (1992) Anal Chem 64:545—551
16:256—260 39. Rumelhart DE, Hinton GE, Williams RJ (1986) In: Rumelhart
11. Jorhem L, Sundström B (1995) At Spectrosc 16:226—228 DE, McClelland JL (eds) Parallel distributed processing: explo-
12. Chassange D, Crouzet J, Baumes RL, Lepoutre JP, Bayonove rations in the microstructure of cognition, vol 1. MIT Press,
CL (1995) J Chromatogr A 694:441—451 Cambridge, MA
13. Laber H, Schulz J, Sponholz WR, Bremser W (1995) Fresenius 40. Pao YH (1989) Adaptive pattern recognition and neural net-
J Anal Chem 351:530—535 works. Addison-Wesley
14. Vogels JTWE, Tas AC, van den Berg F, van der Greef J (1993) 41. Ajay L (1994) Chemometrics and Intelligent Laboratory Sys-
Chemometrics Intelligent Laboratory Systems 21:249—258 tems 24:19—30
15. Linskens HF, Jackson JF (1993) Wine analysis. In: Modern 42. Smits JRM, Melssen WJ, Buydens LMC, Kateman G (1994)
methods of plant analysis. New Series, Volume 6 Chemometrics Intelligent Laboratory Systems 22:165—189
16. Danzer K, Venth K (1993) Fresenius J Anal Chem 346:520—524 43. Melssen WJ, Smits JRM, Buydens LMC, Kateman G (1994)
17. Danzer K, Wagner M (1994) Fresenius J Anal Chem Chemometrics Intelligent Laboratory Systems 23:267—291
350:339—343 44. Massart DL, Kaufman L (1988) The interpretation of analytical
18. Einax J, Geiß S, Danzer K (1987) Z Chem 27:332—333 chemical data by the use of cluster analysis. Wiley, New York
19. Einax J, Danzer K (1988) Staub-Reinhalt Luft 49:53—57 45. Sun SG, Pan EP (1990) Practical discrimination analysis.
20. Einax J, Oswald K, Danzer K (1990) Fresenius J Anal Chem Science, Beijing
336:394—399 46. Shatzoff M, Fienberge S, Tsak R (1968) Technometrics
21. Massart DL, Vandeginste BGM, Deming SN, Michotte Y, 10:769—775
Kaufman L (1988) Chemometrics: a textbook, 1st ed. Elsevier, 47. Furnival GM, Wilson RW (1974) Technometrics 16:499—508
Amsterdam, Chapter 21, 22 48. Vogl TP (1988) Biological Cybernetics 59:257—263
22. Alsberg BK, Kvalheim OM (1994) Chemometrics Intelligent 49. Song XH, Yu RQ (1992) Chemometrics Intelligent Laboratory
Laboratory System 24:31—42 Systems 19:101—109
23. Sun LX, Xie YL, Song XH, Wang JH, Yu RQ (1994) Computers 50. Thiel G (1995) PhD Thesis, Friedrich Schiller University Jena
Chem 18:103—108 51. STATISTICA for Windows Tulsa OK, USA
24. Sun LX, Xu F, Liang YZ, Xie YL, Yu RQ (1995) Chemometrics 52. MATLAB for Windows 4.2b, The Mathworks Inc, South
Intelligent Laboratory Systems 25:51—60 Natick 5.0, Statoft Inc, MA, USA