
Proceedings of the Fifth IIEEJ International Workshop on Image Electronics and Visual Computing 2017
Da Nang, Vietnam, February 28 - March 3, 2017

RECOGNITION OF PANEL STRUCTURE IN COMIC IMAGES USING FASTER R-CNN

Hideaki Yanagisawa†, Hiroshi Watanabe†



†Graduate School of Fundamental Science and Engineering, Waseda University

ABSTRACT

For efficient e-comic creation, an automatic extraction technique for comic components such as the panel layout, speech balloons, and characters is necessary. In conventional methods, comic components are extracted using geometric characteristics such as line drawings or connected pixels. However, it is difficult to extract all comic components by focusing on a particular geometric feature, because the components are drawn with a wide variety of expressions. In this paper, we extract comic components with Faster R-CNN regardless of the variety of comic expressions, and recognize the panel structure. Experimental results show that the proposed method succeeds in recognizing 67.5% of panel structures on average.

1. INTRODUCTION

The publishing industry has been shifting from traditional paper publications to e-books. In the Japanese e-book market, e-comics account for 80% of sales [1]. To improve the convenience of e-comics, services using e-comic metadata have been proposed, e.g. comic search systems that retrieve a particular scene or dialogue, or automatic digest generation systems. However, most e-comics are converted from scanned paper comics. Therefore, it is necessary to manually extract comic structure components such as the panel layout, speech balloons, characters (in this paper, we use the word 'character' to mean an actor in a comic), and so on. To reduce the cost of metadata extraction, a technique that extracts comic components automatically is important. In this paper, we evaluate a system that automatically obtains the number of speech balloons and characters in each panel of a comic using Faster R-CNN.

2. RELATED WORK

For panel layout detection, Ishii et al. [2] proposed identifying panels by detecting dividing lines using gradient concentration. Nonaka et al. [3] introduced a panel layout recognition method that detects lines and rectangles, based on the characteristic that panels are often drawn as rectangles. For speech balloon extraction, Tanaka et al. [4] proposed a method that identifies text areas using AdaBoost and detects the white areas of speech balloons. Moreover, in a study on the structure recognition of comics, Arai et al. [5] proposed a detection method for panels, speech balloons, and text areas based on image blob detection and an extraction function using a modified connected component labeling (CCL) method. For character detection, Ishii et al. [6] proposed an approach that uses machine learning with HOG features to detect character face areas. We applied Fast R-CNN to character face detection [7]; in that study, Fast R-CNN showed a higher detection rate than HOG features.

These conventional methods extract comic components according to geometric characteristics, e.g. line detection or connected-pixel extraction. However, in some comic images, panels and speech balloons are drawn with special expressions. Therefore, it is difficult for such methods to detect components that are not drawn in the expected shapes or that overlap other objects.

3. FASTER R-CNN

Girshick et al. [8] proposed Regions with Convolutional Neural Network features (R-CNN) as a general object detection method using a convolutional neural network (CNN). R-CNN detects objects in the following process. First, region proposals are extracted from the input image by selective search [9]. Second, each region proposal is fed to a CNN and image feature values are calculated. Then, the output feature values are classified by a support vector machine (SVM). Finally, the deviation of each region proposal is corrected by bounding box regression. However, R-CNN is slow because it computes the convolutional network features separately for every object proposal. To address this problem, Fast R-CNN was introduced [10]. Fast R-CNN enables end-to-end detector training on shared convolutional features and therefore shows compelling accuracy and speed.

Ren et al. [11] proposed Faster R-CNN as a further improved object detection technique. Faster R-CNN is a single network that connects Fast R-CNN with a Region Proposal Network (RPN) sharing full-image convolutional features with the detection network. The RPN is a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position. In addition, the RPN is trained end-to-end to generate high-quality region proposals, which are used by Fast R-CNN for detection. Therefore, Faster R-CNN detects objects more quickly and shows higher detection accuracy than previous state-of-the-art methods.

4. PROPOSED METHOD

Fig.1 Flow diagram of panel structure recognition (comic images © Atsushi Sasaki)

We propose a method for panel structure recognition from comic images based on the detection of panels, speech balloons, and character faces. We create annotations of comic images by specifying the peripheral region of each component as a rectangle, and three types of detectors are generated by training Faster R-CNN. The flow diagram of panel structure recognition is shown in Fig.1. First, panels are detected from the input image and sorted. The sorting order is based on the height (vertical position) of the detected areas; if the heights are the same, panels are sorted from the right side. Figure 2 shows example images of panel locations and sorting orders. Because there is a slight shift in the position of each panel detected by Faster R-CNN, the y-coordinates are normalized in steps of 50 pixels. Next, speech balloons and character faces are detected. Each of them is assigned to the panel that overlaps more than 50% of its detected area. If a component overlaps 50% or more with multiple panels, as seen in Fig.3, the component is assigned to the panel that comes later in the sorting order. Finally, the numbers of speech balloons and character faces belonging to each panel are obtained.

Fig.2 Examples of panel sorting: (a), (b) (comic images © Hishika Minamisawa)
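The sorting and assignment rules above can be summarized in a short sketch. The following Python code is illustrative only and is not the authors' implementation; boxes are assumed to be axis-aligned rectangles (x1, y1, x2, y2), and all function names are ours.

```python
# Illustrative sketch of the panel sorting and component assignment rules
# described above (not the authors' code). Boxes are (x1, y1, x2, y2).

def sort_panels(panels, y_step=50):
    """Sort panels top-to-bottom, then right-to-left within a row.
    y-coordinates are quantized to y_step pixels to absorb small detection shifts."""
    return sorted(panels, key=lambda b: (round(b[1] / y_step), -b[2]))

def overlap_ratio(component, panel):
    """Fraction of the component box that is covered by the panel box."""
    x1 = max(component[0], panel[0]); y1 = max(component[1], panel[1])
    x2 = min(component[2], panel[2]); y2 = min(component[3], panel[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area = (component[2] - component[0]) * (component[3] - component[1])
    return inter / area if area > 0 else 0.0

def assign_components(components, sorted_panels, thresh=0.5):
    """Assign each balloon/face to a panel it overlaps by more than 50%.
    If several panels qualify, the one later in the sorting order wins."""
    counts = [0] * len(sorted_panels)
    for comp in components:
        hits = [i for i, p in enumerate(sorted_panels) if overlap_ratio(comp, p) > thresh]
        if hits:
            counts[max(hits)] += 1  # panel that comes later in reading order
    return counts
```

The per-panel counts returned by assign_components, computed once for balloons and once for faces, correspond to the output of the flow in Fig.1.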

5. EXPERIMENT

In this section, we evaluate the detection accuracy of comic components using Faster R-CNN, as well as the recognition accuracy of panel structures. In this experiment, we use the implementation published at https://github.com/rbgirshick/py-faster-rcnn [11] for training and evaluation of Faster R-CNN, and use vgg_cnn_m_1024 [12] as the CNN architecture for training. Datasets for training and evaluation are made of comic images provided in the Manga109 database (http://www.manga109.org/) [13]. The training dataset consists of 100 images from each of 20 comic titles drawn by different authors. The test dataset consists of 30 images from each of 5 comic titles, referred to as Comic A to Comic E, drawn by authors different from those of the training images.
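As a concrete illustration of how such a three-class detector can be set up, the sketch below uses the PyTorch/torchvision Faster R-CNN API as a stand-in for the Caffe-based py-faster-rcnn and VGG_CNN_M_1024 configuration used in the paper; the class list follows Section 4, while the library, backbone, and all identifiers are our assumptions.

```python
# Illustrative only: a torchvision stand-in for the Caffe-based setup in the paper.
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

CLASSES = ["__background__", "panel", "balloon", "face"]

def build_detector(num_classes=len(CLASSES)):
    # Start from a pretrained Faster R-CNN and replace the box predictor head
    # so that it outputs the three comic-component classes plus background.
    model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
    in_features = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)
    return model

# During training, each annotated page is paired with a target dict of the form
# {"boxes": FloatTensor[N, 4] in (x1, y1, x2, y2), "labels": Int64Tensor[N]},
# which is the format the torchvision detection models expect.
```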

Fig.3 Example of panel structure recognition: Panel 1 has 2 characters and 3 balloons; Panel 2 has 1 character and 2 balloons (comic image © Hishika Minamisawa)

In this experiment, we define a true positive as a detected area that overlaps the correct area by more than 50%.

5.1. Iteration number

We verified the relationship between the number of training iterations of Faster R-CNN and the average precision (AP) for each comic component. AP is the average of the precision values at each level of recall. In this experiment, AP is calculated for the 2,000 images in the training dataset and the 150 images in the test dataset. Experimental results are shown in Fig.4, where the x-axis indicates the iteration number and the y-axis indicates AP. The results show that the detection rates increase with the number of iterations. In addition, when the iteration number exceeds 70,000, the AP for the training images converges.

Fig.4 Relationship between average precision and iteration number: (a) Panel detection, (b) Speech balloon detection, (c) Character face detection (curves for training and test sets)

5.2. Threshold of confidence

We evaluate the detailed results of comic component detection for the 150 images in the test dataset using the detectors trained with 70,000 iterations. Faster R-CNN calculates a confidence score for the object in each region proposal and outputs a region when its confidence is larger than a threshold. In this experiment, the threshold of confidence is set to 0.6 for panel detection and to 0.8 for speech balloon and character face detection. These thresholds are heuristic values. Experimental results are shown in Table 1. In this table, "Total" is the total number of comic components in the test images, "TP" is true positives, "FN" is false negatives, and "FP" is false positives. We also report recall (R) and precision (P). Table 2 shows the detection results for panels and speech balloons by the method of [5] on the same test set.

Experimental results show that the precision of Faster R-CNN is more than 90%, and the method exceeds the conventional method in panel and speech balloon detection. Examples of detection results are shown in Fig.5. The figure shows that blob extraction has difficulty separating panels when a panel overlaps another panel. In contrast, Faster R-CNN can detect panels independently of such layouts.
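A minimal sketch of this evaluation rule follows, interpreting "overlapping more than 50%" as the intersection area divided by the ground-truth area (intersection-over-union would be a common alternative reading); detections are assumed to have already been filtered by the confidence thresholds above, and all helper names are ours.

```python
# Sketch of the TP/FN/FP counting behind Tables 1 and 2 (our interpretation).

def matches(det_box, gt_box, thresh=0.5):
    """True if the detected box covers more than `thresh` of the ground-truth box."""
    x1 = max(det_box[0], gt_box[0]); y1 = max(det_box[1], gt_box[1])
    x2 = min(det_box[2], gt_box[2]); y2 = min(det_box[3], gt_box[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    gt_area = (gt_box[2] - gt_box[0]) * (gt_box[3] - gt_box[1])
    return gt_area > 0 and inter / gt_area > thresh

def score_detections(detections, ground_truths):
    """Greedy matching of detections to ground truths for one component class."""
    unmatched_gt = list(ground_truths)
    tp = fp = 0
    for det in detections:
        hit = next((g for g in unmatched_gt if matches(det, g)), None)
        if hit is not None:
            tp += 1
            unmatched_gt.remove(hit)  # each ground truth can be matched only once
        else:
            fp += 1
    fn = len(unmatched_gt)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    return tp, fn, fp, recall, precision
```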

5.3. Recognition rate of panel structure

We evaluate the recognition accuracy of panel structures for 30 pages from each of the 5 comics. The recognition accuracy is defined as follows: "B" is the percentage of panels for which the number of speech balloons is correctly extracted, "C" is the percentage of panels for which the number of character faces is correctly extracted, and "B + C" is the percentage of panels for which both numbers are correctly extracted. The experimental results are shown in Table 3. The highest value of B + C is 84.9% for Comic B and the lowest is 52.8% for Comic E.

A typical case of failure in panel structure recognition is a detection failure caused by deformed faces, as shown in Fig.6. In addition, the reason for the low recognition rate of Comic E is that it contains fuzzy panel layouts, as shown in Fig.7. In Fig.6 and Fig.7, the red rectangles show the areas detected as comic components.

Fig.5 Examples of panel detection for flat panels and connected panels: (a) by the method of [5], (b) by Faster R-CNN (comic images © Atsushi Sasaki)

Fig.6 Example of failure to detect character faces
Fig.7 Example of failure to detect panels in Comic E
(comic images in Fig.6 and Fig.7: © Satoshi Arai, © Saya Miyauchi)

Table 1 Results of comic component extraction for 5 comic sources by Faster R-CNN

            Total    TP    FN    FP   R (%)   P (%)
Panel         859   770    90    40    89.5    95.1
Balloon      1190  1161    29    42    97.6    96.5
Character     937   803   134    50    85.7    94.1

Table 2 Results of comic component extraction for 5 comic sources by [5]

            Total    TP    FN    FP   R (%)   P (%)
Panel         859   481   378   183    56.0    72.4
Balloon      1190   790   400   650    66.4    54.9

Table 3 Results of panel structure recognition for 5 comic sources

            B (%)   C (%)   B + C (%)
Comic A      83.0    74.5        68.1
Comic B      91.4    89.8        84.9
Comic C      81.7    72.8        66.3
Comic D      94.6    69.0        65.2
Comic E      62.3    62.9        52.8
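The B, C, and B + C rates in Table 3 can be computed per panel as sketched below; the dictionary keys are illustrative and not taken from the authors' code.

```python
# Sketch of the panel structure scores in Section 5.3 (names are ours).

def structure_rates(panels):
    """panels: one dict per panel with predicted and ground-truth counts,
    e.g. {"balloons": 2, "faces": 1, "gt_balloons": 2, "gt_faces": 2}."""
    if not panels:
        return 0.0, 0.0, 0.0
    b = c = bc = 0
    for p in panels:
        balloon_ok = p["balloons"] == p["gt_balloons"]  # counts toward B
        face_ok = p["faces"] == p["gt_faces"]           # counts toward C
        b += balloon_ok
        c += face_ok
        bc += balloon_ok and face_ok                    # counts toward B + C
    n = len(panels)
    return 100.0 * b / n, 100.0 * c / n, 100.0 * bc / n  # percentages as in Table 3
```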
6. CONCLUSION & FUTURE WORK

In this paper, we evaluated panel structure recognition using Faster R-CNN. Experimental results show that the proposed method succeeds in recognizing 67.5% of panel structures on average.

As future work, there is room for improvement in the detection of panels and character faces that are hard to detect with the present method. One specific technique would be to combine image processing, such as highlighting the division lines of panels, with Faster R-CNN detection. In addition, to obtain metadata for the automatic generation of comic summaries, we need to consider a technique for distinguishing the main characters among the detected character faces.

7. REFERENCES

[1] Internet Media Research Institute: "eComic Marketing Report 2012", Impress R&D, pp.14 (2012).

[2] D. Ishii, K. Kawamura, H. Watanabe: "A Study on Frame Decomposition of Comic Images", IEICE Transactions, Vol.J90-D, No.7, pp.1667-1670 (2007).

[3] S. Nonaka, T. Sawano, N. Haneda: "Development of "GT-Scan", the Technology for Automatic Detection of Frames in Scanned Comic", FUJIFILM RESEARCH & DEVELOPMENT, No.57, pp.46-49 (2012).

[4] T. Tanaka, F. Toyama, J. Miyamichi, K. Shoji: "Detection and Classification of Speech Balloons in Comic Images", Journal of the Institute of Image Information and Television Engineers, Vol.64, No.12, pp.1933-1939 (2010).

[5] K. Arai, H. Tolle: "Method for Real Time Text Extraction from Digital Manga Comic", International Journal of Image Processing, Vol.4, No.6, pp.669-676 (2011).

[6] D. Ishii, H. Watanabe: "A Study on Automatic Character Detection and Recognition from Comics", The Journal of the Institute of Image Electronics Engineers of Japan, Vol.42, No.4 (2013).

[7] H. Yanagisawa, H. Watanabe: "A Study of Multi-view Face Detection for Characters in Comic Images", Proceedings of the 2016 IEICE General Conference, D-12-12 (2016).

[8] R. Girshick, J. Donahue, T. Darrell, J. Malik: "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation", IEEE Conference on Computer Vision and Pattern Recognition (2014).

[9] J. R. R. Uijlings, K. E. A. van de Sande, T. Gevers, A. W. M. Smeulders: "Selective Search for Object Recognition", International Journal of Computer Vision, Vol.102, No.2, pp.154-171 (2013).

[10] R. Girshick: "Fast R-CNN", arXiv:1504.08083 (2015).

[11] S. Ren, K. He, R. Girshick, J. Sun: "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks", Advances in Neural Information Processing Systems (NIPS) (2015).

[12] S. Farfade, M. Saberian: "Multi-view Face Detection Using Deep Convolutional Neural Networks", arXiv:1502.02766 (2015).

[13] Y. Matsui, K. Ito, Y. Aramaki, T. Yamasaki, K. Aizawa: "Sketch-based Manga Retrieval using Manga109 Dataset", arXiv:1510.04389 (2015).
