1 Introduction
If we have enough data, we can apply supervised learning algorithms that automati-
cally create FAQ datasets, as in [7]. After collecting many questions through chatbots
equipped with small datasets, we can apply such supervised learning algorithms to im-
prove the scalability of dataset creation.
The contributions of this study are summarized as follows:
1. To the best of our knowledge, we are the first to create a chatbot that enhances an
e-learning system used in practice at a Japanese university.
2. We propose a novel framework for creating FAQ datasets.
3. We evaluated a chatbot trained on a dataset created with the framework and obtained
a macro-average F1-score of over 81%.
2 Related Works
Many researchers have analyzed Q&A data. Most of these studies seek to improve
user experiences [4, 6] or classification results. As the objective of this study is to
support dataset creation for improving the accuracy of a chatbot, which is essentially
a multi-class classifier, we focus on comparing this study with studies that aim to
improve classification results.
Finding similar questions in order to exploit FAQ data is a popular way to improve the
accuracy of a Q&A classifier. One of the most popular approaches is to train language
or translation models by probability-based estimation or neural networks [2, 3, 5]. This
kind of approach is powerful; however, it assumes that a large amount of data is avail-
able for training the models. In contrast, we assume that only a small amount of Q&A
data is available, and it is therefore difficult to employ methods that estimate such
models. To support creating an FAQ dataset from a small dataset, we design our
framework as unsupervised learning using lexical analysis and an entropy-based method.
Supporting dataset creation is another line of work related to this study. Behúň et al.
propose an automatic annotation tool for collecting ground truth for a purely visual
dataset captured by Kinect [1]. Rodofile et al. design a modular dataset generation
framework for cyber-attacks [7]. These studies assume that large datasets are available
or easy to create; their targets thus differ from ours.
3 Data Collection
We first collected raw data from the logs of users of the e-learning system introduced
at Tokyo Metropolitan University, recording the questions they asked and the answers
provided by the system engineers who managed the e-learning system in practice. We
collected the data from April 1, 2015 to July 31, 2018. The dataset includes 200 Q&A
pairs in total.
4 Categorization
In this section, we introduce our categorization scheme for the collected raw Q&A
data, based on the features of the e-learning system. The objective is to organize the
answers; this is useful for analyzing which features users often have difficulties with
and for understanding which features we should focus on when preparing FAQ data.
From the collected data and a manual investigation thereof, we propose 11 categories,
as shown in Tab. 1.
New Question. Finally, the framework outputs a list of the top-k important words as a
ranking. The ranking function simply sorts the results of Eq. 2. The top-ranked words
may help us create new questions by combining or paraphrasing them.
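A minimal sketch of this output step is given below. The score function is only a
stand-in for Eq. 2 (not reproduced here), with plain corpus frequency as a placeholder
default; the example questions are hypothetical.

from collections import Counter

def top_k_words(questions, k=10, score_fn=None):
    """Return the top-k candidate words as a ranked list.

    score_fn stands in for the word-importance score of Eq. 2;
    corpus frequency is used only as a placeholder default.
    """
    counts = Counter(w for q in questions for w in q.lower().split())
    score_fn = score_fn or (lambda w: counts[w])
    return sorted(counts, key=score_fn, reverse=True)[:k]

# Hypothetical questions; the top-ranked words hint at phrasings for new FAQ entries.
qs = ["how do I upload a report file", "where can I download the lecture slides"]
print(top_k_words(qs, k=5))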
Answer Combining. Eq. 4 yields pairs of answers whose MI scores exceed a given
threshold. Showing pairs is sufficient because we can apply our framework incremen-
tally; in other words, even when three or more answers should be combined, we can
simply apply Eq. 4 to the dataset two or more times. In this simple way, we can combine
two or more similar answers into a single answer.
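The sketch below illustrates this incremental merging. The pairwise scoring function
is an assumed stand-in for the MI score of Eq. 4, and the toy word-overlap score and
answer texts are for illustration only.

from itertools import combinations

def merge_similar_answers(answers, score_fn, threshold):
    """Iteratively combine answer pairs whose pairwise score exceeds `threshold`.

    `answers` maps answer ids to texts; `score_fn` is a stand-in for the
    MI score of Eq. 4 (not reproduced here).
    """
    merged = dict(answers)
    changed = True
    while changed:                                  # repeated application, as described above
        changed = False
        for a, b in combinations(list(merged), 2):
            if score_fn(merged[a], merged[b]) > threshold:
                merged[f"{a}+{b}"] = merged.pop(a) + " " + merged.pop(b)
                changed = True
                break                               # rescan after every merge
    return merged

# Toy word-overlap score used only for illustration.
def overlap(x, y):
    wx, wy = set(x.split()), set(y.split())
    return len(wx & wy) / max(len(wx | wy), 1)

print(merge_similar_answers(
    {"A1": "reset your password here",
     "A2": "you can reset your password here",
     "A3": "upload the report file"},
    overlap, 0.5))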
6 Experimental Evaluation
6.1 Setup
Classification Algorithm. We used IBM Watson to implement the chatbot program.
Data Collection for Evaluation. Tab. 2 shows the statistics of the dataset used for
this evaluation. We used 79 answers and 44 questions to measure the accuracy of the
chatbot. Note that the 44 questions were not used to train the chatbot. Tab. 3 details how
many answers and questions were prepared for each category.
Comparisons. In this paper, we used only the classification algorithm (Watson),
as our framework is designed for dataset creation. To evaluate the effectiveness of the
framework, we used the following two datasets.
Table 3. Number of answers and questions prepared for each category in the test data.

                    C1  C2  C3  C4  C5  C6  C7  C8  C9  C10  C11  Total
Num. of answers      2   3   2   2   2   4   2   1   2    1    2     23
Num. of questions    4   6   3   4   4   9   4   1   4    1    4     44
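As label-based measurements, we use the macro-average precision (maP), recall (maR),
and F1-score (maF). In their standard form (see, e.g., [8]), they are computed as

\mathrm{maP} = \frac{1}{|L|} \sum_{l \in L} \frac{TP_l}{TP_l + FP_l}, \qquad
\mathrm{maR} = \frac{1}{|L|} \sum_{l \in L} \frac{TP_l}{TP_l + FN_l}, \qquad
\mathrm{maF} = \frac{1}{|L|} \sum_{l \in L} \frac{2\, TP_l}{2\, TP_l + FP_l + FN_l}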
where TP, FP, and FN denote true positives, false positives, and false negatives, respec-
tively, and L is the set of labels defined in Tab. 1. Note here that the precision is defined
as the proportion of predicted labels that are truly relevant, while the recall is defined as
the proportion of truly relevant labels that are included in the predictions. The trade-off
between precision and recall is formalized by their harmonic mean, the F1-score. For
the label-based measurements, the higher these scores are, the better the performance
of the model is.
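For instance, when each test question is assigned exactly one answer label, these
label-based scores can be computed with scikit-learn as in the following sketch; the
label ids and arrays are illustrative, not taken from our data.

from sklearn.metrics import precision_recall_fscore_support

# Illustrative ground-truth and predicted answer labels for six test questions.
y_true = ["A2", "A8", "A9", "A11", "A12", "A14"]
y_pred = ["A2", "A8", "A11", "A11", "A12", "A20"]

maP, maR, maF, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0)
print(f"maP={maP:.2f}, maR={maR:.2f}, maF={maF:.2f}")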
Regarding the example-based loss functions, hamming loss (HL), ranking loss (RL),
and log loss (LL) are popular measurements. HL calculates the fraction of wrong labels
over the total number of labels. RL is the proportion of label pairs that are not correctly
ordered. Finally, LL calculates scores from probabilistic confidences; this metric can be
seen as the cross-entropy between the distribution of the true labels and the predictions.
Their formal definitions are given as follows:
HL = \frac{1}{NL} \sum_{i=1}^{N} \sum_{l=1}^{L} [[\, y_{i,l} \neq \hat{y}_{i,l} \,]]    (8)

RL = \frac{1}{N} \sum_{i=1}^{N} \sum_{y_j > y_k} \Big( [[\, \hat{y}_j < \hat{y}_k \,]] + \frac{1}{2} [[\, \hat{y}_j = \hat{y}_k \,]] \Big)    (9)

LL = - \sum_{i=1}^{L} y_i \log(p_i)    (10)
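These losses can be computed, for example, with scikit-learn's implementations; the
following sketch uses toy arrays (not our data) simply to illustrate the calls.

import numpy as np
from sklearn.metrics import hamming_loss, label_ranking_loss, log_loss

# Toy multi-label indicators for N=3 examples and L=4 labels (illustration only).
y_true  = np.array([[1, 0, 0, 1],
                    [0, 1, 0, 0],
                    [0, 0, 1, 0]])
y_pred  = np.array([[1, 0, 1, 1],
                    [0, 1, 0, 0],
                    [1, 0, 0, 0]])          # hard 0/1 predictions for HL
y_score = np.array([[0.8, 0.1, 0.6, 0.7],
                    [0.2, 0.9, 0.1, 0.3],
                    [0.7, 0.1, 0.4, 0.2]])  # probabilistic confidences for RL and LL

print("HL:", hamming_loss(y_true, y_pred))               # fraction of wrong labels
print("RL:", label_ranking_loss(y_true, y_score))        # mis-ordered label pairs
print("LL:", log_loss(y_true.ravel(), y_score.ravel()))  # cross-entropy, averaged by sklearn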
Table 4. Scores of the baseline and our approach. The abbreviated names of the measurements
are: macro-average precision (maP), macro-average recall (maR), macro-average F1-score
(maF), hamming loss (HL), ranking loss (RL), and log loss (LL).
Table 5. Number of answers and questions in the baseline and proposed datasets for each category.

                                        C1  C2  C3  C4  C5  C6  C7  C8  C9  C10  C11  Total
Baseline           Num. of answers       9   9  15   3   3  11   5   1   8    3    3     70
                   Num. of questions    13  18  23   3  12  41  12   5  21    3    4    155
Proposed dataset   Num. of answers       9  13  17   3   3  11   5   2   8    3    5     79
                   Num. of questions    50  68  89  15  19  66  25  11  44   15   25    427
For the example-based loss functions, the smaller these scores are, the better the perfor-
mance of the model is.
Tab. 4 compares all measurements of our framework with those of the baseline. The
conclusion is that using our framework improves every measurement. In particular, the
macro-average precision improves by over 25% compared with the baseline. The main
reason is that we can increase the number of questions: as shown in Tab. 5, the proposed
dataset has more than twice as many questions as the baseline.
We then performed an error analysis. Fig. 2 shows confusion matrices of our approach.
The former shows which answers the chatbot outputs for the test questions, whereas the
latter shows the result of mapping the answers to their categories. In Fig. 2 (a), we use
indices for answers; for example, if we use a question whose correct answer is the second
one, we denote it by A2 in the figure. The indices run from 1 to 79, as our dataset has
79 answers, as shown in Tab. 5. From Fig. 2 (a), we can see that the chatbot sometimes
gives wrong answers to several questions. On the other hand, Fig. 2 (b) shows that the
chatbot makes wrong predictions for only two categories. For a better understanding
of these results, we measured the inner- and inter-category similarity using the Jaccard
index. This measure calculates similarity as the number of unique words shared by two
given sets, normalized by the size of their union. Its formal definition is given as follows:
\mathrm{Jaccard}(q_1, q_2) = \frac{| W_{q_1} \cap W_{q_2} |}{| W_{q_1} \cup W_{q_2} |}    (11)

where W_{q_1} denotes the set of words in a question q_1.

Fig. 2. (a) Answer-level confusion matrix of the proposed dataset. The x axis represents the
correct labels, and the y axis the labels predicted by the classifier. (b) Category-level confusion
matrix of the proposed dataset. The x axis represents the correct labels, and the y axis the labels
predicted by the classifier.

Table 6. Inner-category similarity (Jaccard index) of the baseline and proposed datasets for
each category.

                    C1    C2    C3    C4    C5    C6    C7    C8    C9    C10   C11
Baseline           0.19  0.17  0.13  0.32  0.25  0.10  0.20  0.27  0.12  0.30  0.23
Proposed dataset   0.11  0.09  0.07  0.06  0.22  0.08  0.12  0.07  0.06  0.06  0.03

Tab. 6 shows the inner-category similarity scores calculated with the Jaccard index;
relatively high scores dominate this table. In contrast, Fig. 3 (a) shows the inter-category
similarity scores calculated with the Jaccard index for every combination of two different
categories. Overall, these scores are lower than the inner-category similarities. These ob-
servations indicate that we should improve the quality of the question texts so that
questions within the same category can be distinguished from one another.
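As an illustration, Eq. 11 can be computed as in the following sketch. The whitespace
tokenization and the example questions are placeholders only; Japanese questions would
require a morphological analyzer to obtain the word sets.

def jaccard(q1: str, q2: str) -> float:
    """Jaccard index (Eq. 11) between the word sets of two questions."""
    w1, w2 = set(q1.lower().split()), set(q2.lower().split())
    return len(w1 & w2) / len(w1 | w2) if (w1 | w2) else 0.0

# Hypothetical questions from the same (Uploading) category.
print(jaccard("how do I upload my report", "where do I upload the report file"))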
In addition, from Fig. 2 (b), we can observe that several questions of C5 (Uploading)
and C11 (Basic Usage) are mis-predicted as answers of C1 (Documents) and C9
(Contact), respectively. Mispredictions of C5 questions as C1 are understandable, as
the two categories (C1 and C5) can share file-related words.

Fig. 3. (a) Inter-category similarity of the proposed dataset. (b) Inter-category similarity of the
baseline dataset.

Indeed, Fig. 3 (a) shows that
the Jaccard index score between these two categories is quite high. Next, to identify
why the chatbot wrongly showed an answer of C9 instead of C11, we manually analyzed
the questions of the two categories, C9 and C11. In our dataset, there is a question
about how to enable a function for sending e-mail between teachers and students. This
question is similar to those in C11, which collects questions about how to use the
e-learning system.
Finally, we compared these results for our proposed dataset with those for the baseline.
We show the inner-category similarities, the inter-category similarities, and the answer-
and category-level confusion matrices of the baseline in Tab. 6, Fig. 3 (b), and Fig. 4,
respectively. Looking at all similarity scores (Tab. 6 and Fig. 3 (b)), those of the baseline
are all higher than those of the proposed dataset. This means that our framework can
suggest various words that increase the diversity of the questions without decreasing the
accuracy of the chatbot, since the chatbot trained on our dataset outperforms the baseline.
7 Conclusions
In this paper, we introduced a novel framework for supporting chatbot dataset creation,
specifically for an e-learning system. This framework has two methods: suggesting new
words for new questions and aggregating answers that are semantically similar to each
other.
In the future, we plan to analyze a) which questions users tend to ask in each month.
In this paper, we assumed that all Q&A data occur independently of time. However,
there are some temporal questions: questions regarding registration for classes may
occur early in a semester, and questions about tests may occur late in a semester. This
temporal question analysis may improve the effectiveness of chatbots. Future work also
includes b) a qualitative evaluation. This paper focuses on quantitative evaluations;
however, analyzing what users feel and think about using chatbots is also important for
practical usage.
Fig. 4. (a) Answer-level confusion matrix of the baseline. The x axis represents the correct
labels, and the y axis the labels predicted by the classifier. (b) Category-level confusion matrix
of the baseline. The x axis represents the correct labels, and the y axis the labels predicted by
the classifier.
References
1. Behúň, K., Herout, A., Páldy, A.: Kinect-supported dataset creation for human pose estima-
tion. pp. 55–62. SCCG '14, ACM, New York, NY, USA (2014)
2. Jeon, J., Croft, W.B., Lee, J.H.: Finding similar questions in large question and answer
archives. pp. 84–90. CIKM '05, ACM, New York, NY, USA (2005)
3. Leveling, J.: Monolingual and crosslingual SMS-based FAQ retrieval. pp. 3:1–3:6. FIRE '12 &
'13, ACM, New York, NY, USA (2013)
4. Morris, M.R., Teevan, J., Panovich, K.: What do people ask their social networks, and why?
A survey study of status message Q&A behavior. pp. 1739–1748. CHI '10, ACM, New York,
NY, USA (2010)
5. Otsuka, A., Nishida, K., Bessho, K., Asano, H., Tomita, J.: Query expansion with neural
question-to-answer translation for FAQ-based question answering. pp. 1063–1068. WWW '18,
Republic and Canton of Geneva, Switzerland (2018)
6. Pinto, G., Torres, W., Castor, F.: A study on the most popular questions about concurrent
programming. pp. 39–46. PLATEAU 2015, ACM, New York, NY, USA (2015)
7. Rodofile, N.R., Radke, K., Foo, E.: Framework for SCADA cyber-attack dataset creation. pp.
69:1–69:10. ACSW '17, ACM, New York, NY, USA (2017)
8. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining Multi-label Data, pp. 667–685. Springer US,
Boston, MA (2010)