Reference
Reference
and Computer Graphics, Visualization, Computer Vision and Image Processing 2019
Mei-Yi Wu
Graduate Institute of Food Culture and Innovation, National Kaohsiung University of Hospitality and Tourism, Taiwan
ABSTRACT
Wine label information was found to have a positive effect on consumer choice. However, it is not easy for consumers to
read the information on wine labels since these texts are usually not print in English. Wine label information system with
visual recognition function seems to be important as an auxiliary reading tool for customers. In this paper, we proposed
an efficient wine image label recognition method using convolutional neural network(CNN) algorithms. Data
augmentation techniques for machine learning with different image manipulation such as cropping, rotation and hue
modification were applied to enlarge the dataset. Experimental results show a high accuracy on 130 different wine label
images using the proposed CNN model with mixing different image manipulation.
KEYWORDS
Wine Label Image Recognition, Convolutional Neural Network, Augmented Data, Deep Learning
1. INTRODUCTION
In order to encourage consumption in a country without a strong culture of wine drinking, the expanding
wine producers put their efforts on the design of wine labels. Some wine marketing studies make reference to
the importance of wine labels and the information they contain. The information content of wine labels can
be grouped under seven information positioning statements: namely, parentage, nonpareil, manufacture,
attributes, endorsements, end user and end use. Nested within some of these statements is other information
commonly associated with wine labels (Thomas et al., 2003,). However, it is not easy for consumers to read
the information on wine labels since these texts are usually not print in English. A simple approach to solve
this problem is to develop a mobile wine label image recognition system. Customers can take a shot of wine
label photograph with mobile phone camera and the related information about the wine label will be
displayed on the mobile phone.
Two different approaches, traditional image recognition and deep learning algorithms, are generally
applied in the implementation of image recognition systems. Traditional image recognition schemes are
based on specific handcrafted feature extraction. On the contrary, deep learning schemes optimize the
features that are extracted. The major difference between deep learning and conventional methods is that
deep learning automatically learns features from big data, instead of adopting handcrafted features, which
mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data.
Deep learning can automatically learn feature representation from big data, including millions of parameters
which largely explains why these deep learning perform better (Liang et al., 2017). Although great
achievements and perspectives, deep learning algorithms have some relevant challenges to tackle. The most
frequently mentioned problem in the field of machine learning is the lack of sufficient amount of the training
data within the datasets (Mikołajczyk et al., 2018,). In this paper, we have compared and analyzed multiple
methods of data augmentation like cropping, rotation and hue modification to enlarge the dataset. More
details about the proposed method and experimental results can be found in the next session.
479
ISBN: 978-989-8533-91-3 © 2019
2. PROPOSED METHOD
The implementation of proposed wine label image recognition system can be divided into four steps (1) Build
a small-size wine label image dataset (2) data augmentation (3) Design CNN algorithms (4) Wine label
recognition. The flowchart of proposed system is illustrated in Figure 1.
(c)
Figure 2. Build a small-size wine label image dataset. (a) a wine label image shot by mobile phone camera (b) an wine
label image example (c) 5 images captured in different views for each wine bottle
Figure 3. An example for image manipulation with cropping, rotation and hue modification
480
International Conferences Interfaces and Human Computer Interaction 2019; Game and Entertainment Technologies 2019;
and Computer Graphics, Visualization, Computer Vision and Image Processing 2019
Original: accuracy 0.4 Hue modification: accuracy 0.96 Rotation: accuracy 0.83
ACKNOWLEDGEMENT
We would like to thank the Ministry of Science and Technology of ROC for supporting this work in part
under Contract MOST 107-2637-E-328-001.
REFERENCES
Thomas, A., & Pickering, G., 2003, The importance of wine label information. International journal of wine marketing,
15(2), pp. 58-74.
Liang, H., Sun, X., Sun, Y., & Gao, Y., 2017, Text feature extraction based on deep learning: a review. EURASIP journal
on wireless communications and networking, 2017-211, pp.1-12.
Mikołajczyk, A., & Grochowski, M., 2018, Data augmentation for improving deep learning in image classification
problem. In 2018 international interdisciplinary PhD workshop (IIPhDW), pp. 117-122.
Zhang, X., Xv, C., Shen, M., He, X., & Du, W., 2018, Survey of Convolutional Neural Network. In 2018 International
Conference on Network, Communication, Computer Engineering (NCCE 2018).
481