
International Conference on Innovative Data Communication Technologies and Application (ICIDCA-2023)

IEEE Xplore Part Number: CFP23CR5-ART; ISBN: 979-8-3503-9720-8

Detection of Fake Online Reviews by Using Machine Learning

C. Silpa, School of Computing, Mohan Babu University (Erstwhile SreeVidyanikethan Engg. College), Tirupati, India
P Prasanth, Dept. of Information Technology, SreeVidyanikethan Engg. College, Tirupati, India
S Sowmya, Dept. of Information Technology, SreeVidyanikethan Engg. College, Tirupati, India
Y Bhumika, Dept. of Information Technology, SreeVidyanikethan Engg. College, Tirupati, India
C H Surya Pavan, Dept. of Information Technology, SreeVidyanikethan Engg. College, Tirupati, India
M Naveed, Dept. of Information Technology, SreeVidyanikethan Engg. College, Tirupati, India

2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA) | 979-8-3503-9720-8/23/$31.00 ©2023 IEEE | DOI: 10.1109/ICIDCA56705.2023.10099776

Abstract— Reviews, ratings, and personal stories written by customers on online sites and other services are helpful for both buyers and sellers. By writing reviews, users may increase brand loyalty and help other buyers better understand a product. If customers offer favourable feedback on their items, vendors can increase the sale of their products and build additional profiles. Unfortunately, suppliers may abuse these review processes. One may fabricate good reviews to boost a brand's reputation or attempt to denigrate a rival brand's items by posting fraudulent negative reviews. Sentiment classification based on the textual information in the reviews has been incorporated. Fake reviews are identified and classified by classification models that apply machine learning algorithms. Hence, a supervised learning model is used to label each review as a fake review or a genuine review.

Keywords— Sentimental Analysis, Text mining, Product review, Machine Learning.

I. INTRODUCTION
The advancement of internet technology has significantly changed how people live their lives. E-commerce websites such as Amazon and Flipkart provide internet users with an effective and largely dependable environment for online shopping. More and more business owners are choosing to create their online stores on various platforms. As more customers gradually become accustomed to this method of buying, they share their opinions and experiences online through the e-commerce website's review system. These reviews often reflect the quality of the product or the user experience because the majority of them are written by online shoppers. Before placing an order, more and more consumers have become accustomed to reading online reviews. Moreover, many business owners are aware that the more favourable online reviews they have, the more transactions they have, and the faster they may grow and establish a solid reputation. The major goal is to analyse the key reviewer-centric and review-centric features that have been suggested to identify fraudulent or fake reviews, particularly methods that use supervised machine learning techniques. Opinion spam detection can detect fraudulent reviews, fake stories, fake blogs, fraudulent social networking postings, and deceptive messaging. When detecting fraudulent reviews, review-focused websites like Yelp might be taken into account. Up to this point, unsupervised methods based on graphical techniques, which are not very reliable, have been used to detect bogus reviews. The supervised techniques take into account both the reviewer's behaviour and other attributes derived from the reviews. Yelp reviews, which are categorised using a few criteria, have been taken into consideration as a publicly available large-scale dataset. These reviews are classified using a few well-known supervised classifiers, which categorise them as true or misleading by taking into account different data aspects. Given the complexity of the suggested heterogeneous graph transformer model, significant computer resources may be needed to implement it.

II. LITERATURE SURVEY
Fan Cheng, et al. [1] proposed capturing the complicated interactions between customers, goods, and reviews by using a heterogeneous graph transformer model. Utilizing user preferences and reviewer content analysis, the system seeks to increase the efficiency and accuracy of product suggestions. The study was conducted using a small dataset, which might not accurately reflect real-world circumstances and restricts the generalizability of the results. In addition, given the complexity of the suggested heterogeneous graph transformer model, significant computer resources may be needed to implement it.

J. Wang, et al. [2] evaluated their strategy using two real-world datasets, comparing the outcomes to a number of cutting-edge techniques for false review identification. They find that, with regard to accuracy and F1-score, their strategy performs better than these approaches. The authors discuss the difficulties in deep learning and provide numerous methods and algorithms to solve these problems. They also emphasize the value of learning algorithms in resolving complicated issues and their potential for future development, and introduce a unique method for detecting false reviews that incorporates a number of variables as well as a rolling collaborative training strategy to increase precision. The review of the literature emphasises the importance of research on machine learning and its uses in a variety of sectors.

Fei et al. [3] examine the patterns of review activity and discover that spammers frequently post several reviews in a short period of time. To identify review spammers, the authors create a burstiness-based method that takes into account the distribution of the time gaps between two sequential reviews for each reviewer, and test the strategy against a variety of current spammer detection techniques using a real-world review dataset. By utilizing the burstiness of reviews, this work makes a distinctive contribution to the field of review spammer identification. The authors' findings emphasize how crucial it is to take into account the time patterns of reviewer activity while looking for review spammers. This study has potential implications in e-commerce,
where unreliable and biased customer reviews can significantly influence purchasing decisions.

Luhua Jin, et al. [4] exploit the data in a heterogeneous network made up of customer, product, and review nodes, hoping to improve the quality of product reviews. The study offers a novel model, the heterogeneous graph transformer, which takes the heterogeneous graph as input and produces representations for each node. In terms of computational effectiveness and recommendation accuracy, the findings demonstrate that the suggested model performs better than the alternatives. It offers a fresh approach for raising the quality of product reviews while also demonstrating the usefulness of the heterogeneous graph transformer for e-commerce review systems.

Hina Tufail et al. [5] conducted a literature scan to assess prior studies on the subject and determined that fraudulent reviews can significantly affect customer behavior, sales revenue, and market dynamics. They also covered the difficulties e-commerce businesses confront in identifying and reducing the effects of fake reviews, and offered some viable remedies. The report shows the need for additional research in this field and offers valuable information on the problem of fraudulent reviews in e-commerce.

Z. Liu, et al. [6] address the issue that, because GNNs are sensitive to the structure and characteristics of the graph data, they are unreliable in detecting fraud. The suggested method increases the accuracy and efficiency of GNNs in fraud detection by enhancing their consistency and stability. The study contributes to the expanding area of graph-based detection of fraudulent activities.

J. Wang et al. [7] combine a number of characteristics, including text-based and image-based features, to increase the effectiveness of fake review identification. Thanks to the rolling collaborative training strategy, the model can adapt to new data and avoid overfitting. The authors tested the suggested technique on a data set and found that it performed better than other cutting-edge techniques in terms of precision.

Y. Wu et al. [8] describe the definitions, causes, and effects of false reviews, along with a rundown of the prevention and detection strategies that have been created to address the problem. The report also mentions problems and possibilities for future study: among others, a precise definition of fake reviews is required, and there may be room for new technology to aid in their detection and prevention. The authors suggest a research roadmap for the study of fraudulent online reviews, highlighting the significance of multidisciplinary cooperation and the requirement for data-driven methods.

J. Salminen et al. [9] cover strategies for identifying false reviews as well as approaches for producing them. Additionally, they provide their own contribution to the subject: a collection of authentic and fraudulent ratings and a model for identifying fraudulent reviews that combines language and metadata elements. They also give a thorough assessment of the current state of the art in this field and discuss the difficulties and prospects for further research.

S. N. Tran, et al. [10] evaluate a wide range of currently used methodologies and strategies for spotting false reviews, including network-based methods, machine learning algorithms, and approaches based on natural language processing. The authors also discuss the difficulties in detecting false reviews, such as the abundance of data, the dynamic properties of fraudulent reviews, and the dearth of annotated data. They discuss the need for more study to address the way fake reviews change with market conditions and to increase the precision and efficacy of review spam detection algorithms.

Yashika Goyal, et al. [11] identify fraudulent reviews on e-commerce platforms and point out the shortcomings of conventional approaches. They discuss how SVM and Naive Bayes were used in earlier research to identify fake reviews and how their own approach is superior to these earlier ones. The authors also describe how the suggested method was tested on a sizable dataset and how it was able to identify fraudulent reviews with a high degree of accuracy. The findings of this investigation show how well the suggested approach works at spotting bogus reviews and emphasise its potential for practical use. Overall, the work reviews fake review identification at present and demonstrates how the suggested approach makes a substantial contribution to the area.

III. METHODOLOGY
To sell products, people post unjustified positive reviews about them. Sometimes fake reviews are also written against competing items in an effort to harm their reputation. Some of these are non-reviews, which do not express any views about the goods. It can be challenging to predict the nature of someone's opinion when they make contradictory assertions: a poor review might carry a concealed positive meaning.

Nowadays people use online e-commerce sites to write reviews in their own way from their respective accounts. Additionally, opinions regarding a product can occasionally be both favourable and unfavourable. Given all of these difficulties, it becomes increasingly hard to identify reviews that are fake or that are being exploited to sway consumer opinion. Since consumers these days rely heavily on opinions and reviews, opinion spamming is a huge challenge for e-commerce sites and other service providers, which review detection can help address.

The review system categorizes reviews into fraud reviews and no-fraud reviews in order to identify spammed fake reviews and address the major issue that online websites confront due to opinion spamming. Using Naive Bayes, logistic regression, SVM, and Decision Tree algorithms, this method aims to classify the reviews obtained more accurately. Only a few of the accessible datasets from many sources and categories are used. In addition to the review depth, other aspects are employed to boost accuracy, such as comparing training and test accuracy, detecting the product review type and ratio, and detecting the product review type with the overall score. This supervised learning method uses various machine learning algorithms to identify fraud reviews and no-fraud reviews.
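
Of the supervised classifiers named in the methodology, Naive Bayes is the simplest to write out in full. The following is a minimal sketch of a multinomial Naive Bayes text classifier with add-one smoothing, written against the Python standard library only. It is not the authors' implementation: the class name, the `tokenize` helper, and the four toy reviews are all assumptions invented for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a review and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayesReviewClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, reviews, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(reviews, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocabulary = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, float("-inf")
        for label, doc_count in self.class_counts.items():
            # log P(label) + sum of log P(word | label)
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            for word in tokenize(text):
                count = self.word_counts[label][word] + 1
                score += math.log(count / (total_words + len(self.vocabulary)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data, invented for illustration only.
train_reviews = [
    "best product ever buy now amazing amazing",
    "amazing deal best best buy now",
    "battery lasts two days and the screen is sharp",
    "the strap broke after a month but support replaced it",
]
train_labels = ["fraud", "fraud", "nofraud", "nofraud"]

classifier = NaiveBayesReviewClassifier().fit(train_reviews, train_labels)
prediction = classifier.predict("amazing best buy")
```

Working in log space avoids numeric underflow on long reviews, and the smoothing term keeps a word unseen in one class from forcing that class's probability to zero.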


Fig 1. Workflow process for review system

Figure 1 depicts the implementation's high-level architecture. The following six procedures are used to resolve the issue. The review system techniques and algorithms used in the classification method are combined into a wrapper strategy. Reviews and their opinions were used to evaluate a group of algorithms.

In order to improve classification results, the four most effective algorithms were extracted, and the most effective algorithm among them is determined based on F1 score, accuracy, precision, and the confusion matrix.

Algorithm:

    Reviews ← preprocessing(data)
    model = []
    func_n fraud_detect_n(review, model):
        review_Text = model.predict_n(review)
        if review_Text[1] >= model:
            review_product_type = fraud_review
        else:
            review_product_type = nofraud_review
        end if
    end func_n
    bestalgos ← findBest4Algorithms(reviews)
    bestalgo ← max(fraud_detect)

Data on consumer reviews was gathered from a variety of sources, including Amazon, websites for making airline, hotel, and restaurant reservations, CarGurus, and more, which increases the diversity of the review data. Using NLTK, the full review is parsed into sentences after being provided as input. Punctuation at the beginning and end of the reviews is removed, as are extra white spaces.

To facilitate retrieval, each unique review is segmented into words and saved in a list. Fraud reviews often include five-star ratings to entice buyers of competitors' products, which is vital in the false review identification procedure. Fraud reviews are also less likely to be verified purchases than reviews that are true. With Count Vectorizer, each cell of the matrix represents the frequency with which the matching feature appears in the relevant review.

By removing redundant and unnecessary data from the review dataset, along with noisy and unreliable data, the data are processed and refined. A textual passage is tokenized when it is broken up into separate tokens or words.

Tokenization is a practical method for obtaining relevant data from customer reviews in order to detect fraud in online product reviews.

Split the reviews into tokens or individual words. You may either write your own custom tokenizer or use a pre-trained tokenizer from a natural-language processing library like NLTK.

Eliminate any stop words from the review sentences, which are commonly used terms like "the," "and," and "it." This can help decrease background noise in the data and increase the visibility of the relevant terms.

Tokenization is a crucial stage in the process of detecting fraud in customer reviews, since it can be used to extract useful information from these reviews and increase the precision of fraud detection models.

The performance evaluation is accomplished by applying the evaluation criteria to compare the predicted fraud labels to the actual fraud labels.

To evaluate the performance of the proposed model, the following metrics are used:

1. Accuracy: the proportion of correctly identified samples.
2. Precision: the proportion of genuine positive samples among all samples predicted as positive.
3. Recall: the proportion of genuine positive samples among all real positive samples.
4. F1 Score: the harmonic mean of precision and recall.

These are the outcomes of the test dataset metrics: the suggested model earned an F1 score of 97.01%, accuracy of 97.03%, precision of 97%, and recall of 96%. These findings show how well the suggested methodology works to identify fake reviews.

High efficiency in the fraud detection model is achieved through the following three approaches:

Feature extraction: the most pertinent features for training the machine learning model were chosen with the help of feature selection approaches. As a result, the input data's dimensionality was decreased and the model's effectiveness was increased.

Classification: the Support Vector Machine (SVM) classification technique, which is renowned for its high accuracy and effectiveness in handling huge datasets, was utilized.

Sampling: a strategy for ensuring that the dataset used to train the model had an equal proportion of fraudulent and non-fraudulent reviews. This helped avoid model bias towards a certain class of reviews.

With these three approaches (feature extraction, sampling, and a classification algorithm), high efficiency can be achieved.

The reviews are classified into categories based on whether they are positive, negative, or neutral in terms of emotion. This includes determining whether reviews are positive or negative based on the text's word choice, the emojis used, the review's rating, and other factors. The use of reviews to voice public opinion is one explanation, and it is also more important to express ideas than to state the facts as they are. Advertisers create fictitious reviews with more objective details and emphasise emotions like how happy it made them rather than describing the nature of the product or what it accomplishes. Analysis of the review's sentiment contributes to the determination of whether it is authentic or fraudulent.

Finally, the outcomes of the four algorithms are obtained and compared: SVM has higher accuracy than the other algorithms, namely Naive Bayes, Logistic Regression, and the Decision Tree classifier. The SVM classification algorithm performed better than the remaining models when the values are set to the review_Text and label attributes. The resulting confusion matrix is shown in Fig 2.
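
The preprocessing steps described in the methodology (trimming white space, tokenizing, removing stop words, and building a count matrix) can be sketched without any third-party library. This is only an illustration under stated assumptions: the regular-expression tokenizer stands in for NLTK, `count_vectorize` stands in for a Count Vectorizer, and the stop-word list, function names, and sample reviews are all hypothetical.

```python
import re
from collections import Counter

# A tiny illustrative stop-word list; NLTK ships a much longer one.
STOP_WORDS = {"the", "and", "it", "a", "is", "this", "to", "of"}


def preprocess(review):
    """Strip surrounding whitespace, lowercase, tokenize, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", review.strip().lower())
    return [t for t in tokens if t not in STOP_WORDS]


def count_vectorize(reviews):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in review i."""
    tokenized = [preprocess(r) for r in reviews]
    vocabulary = sorted({t for tokens in tokenized for t in tokens})
    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


# Hypothetical reviews, invented for illustration only.
reviews = [
    "  Great phone, great price!  ",
    "The phone stopped working and it is slow.",
]
vocabulary, matrix = count_vectorize(reviews)
```

Each row of `matrix` is one review and each column one vocabulary term, so a cell holds the frequency of that term in that review, which matches the description of the count matrix in the text.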

IV. RESULTS
A. Generating accuracy for the trained and test results

Table 1: Accuracy of the four algorithms

Model Type                      Accuracy (%)
Naive Bayes                     96.0447
SVM                             97.0398
Logistic regression             96.0447
Decision Tree classification    94.5545

From Table 1, the accuracy values of the four algorithms can be compared; SVM has the highest accuracy among the models.
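
Accuracy, precision, recall, and F1 score all derive from the four counts of a binary confusion matrix, with F1 being the harmonic mean of precision and recall. The sketch below shows the arithmetic. The function name and the counts passed to it are hypothetical and are not taken from the paper's experiments.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F1 from confusion counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical counts with "fraud" as the positive class.
metrics = classification_metrics(tp=90, fp=10, fn=5, tn=95)
```

The guards against a zero denominator matter in practice: a classifier that never predicts the fraud class would otherwise raise a division error when precision is computed.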
B. Classification report

The classification report of the four algorithm models is based on the review dataset.

Table 2: Classification report of precision and recall

S.no  Model                Precision  Recall
1     Naive Bayes          0.98       1.00
2     SVM                  0.96       0.98
3     Logistic regression  0.92       0.99
4     DTC                  0.95       0.98

Table 2 gives the classification report of the four models with their precision and recall values. Naive Bayes obtains the highest precision and recall. The report gives complete information about the metric values.

Table 3: Classification report of F1-score and support

S.no  Model                F1-score  Support
1     Naive Bayes          0.98      195
2     SVM                  0.97      192
3     Logistic regression  0.97      192
4     DTC                  0.94      194

Table 3 gives the classification report of the four models with their F1-score and support values. Naive Bayes has the highest F1-score.

C. Confusion matrix

Fig 2. Confusion Matrix

D. Classification metrics:

Classification report

Table 4: Classification Metrics for SVM

S.no  Metric        Precision  Recall  F1-score  Support
1     Accuracy                         0.95      202
2     Macro_avg     0.58       0.55    0.56      202
3     Weighted_avg  0.93       0.95    0.94      202

Table 4 shows the classification report of the SVM algorithm, with metric values such as macro_avg, weighted_avg, and accuracy.

E. Trained and Test accuracy results:

The bar graphs show the training and test accuracy for the four algorithm models. Figure 3 shows the results on the training data and test data from the dataset, visualizing the accuracy of the Naive Bayes, SVM, Logistic regression, and decision tree classifier methods.
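
A confusion matrix is enough to derive the accuracy, macro average, and weighted average rows reported for SVM in Table 4. The sketch below shows how the two averages are formed. One plausible reading of the gap between them is a test set dominated by one class, but the paper does not give per-class counts, so the matrix here is a hypothetical one chosen only to illustrate the effect, and the function name is an assumption.

```python
def averaged_report(confusion):
    """Summarise a square confusion matrix (rows: actual, columns: predicted).

    Returns accuracy plus macro and weighted averages of per-class
    precision and recall.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    supports = [sum(row) for row in confusion]
    precisions, recalls = [], []
    for k in range(n_classes):
        true_pos = confusion[k][k]
        predicted_k = sum(confusion[i][k] for i in range(n_classes))
        precisions.append(true_pos / predicted_k if predicted_k else 0.0)
        recalls.append(true_pos / supports[k] if supports[k] else 0.0)

    def macro(values):
        # Every class counts equally, however rare it is.
        return sum(values) / n_classes

    def weighted(values):
        # Each class counts in proportion to its support.
        return sum(v * s for v, s in zip(values, supports)) / total

    return {
        "accuracy": sum(confusion[k][k] for k in range(n_classes)) / total,
        "macro_precision": macro(precisions),
        "macro_recall": macro(recalls),
        "weighted_precision": weighted(precisions),
        "weighted_recall": weighted(recalls),
    }


# Hypothetical 202-review test set: class 0 = no fraud, class 1 = fraud.
report = averaged_report([[190, 2], [9, 1]])
```

With these counts the weighted recall equals the overall accuracy, while the macro recall is pulled down by the rare class. This is why a high accuracy can coexist with a much lower macro average.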

Fig 3. Trained and Test accuracy results

F. Viewing Train and test accuracy results:

In Figure 4, the line chart shows the accuracy results of the four models, ranging from 94 to 96.

Fig 4. Train and test data accuracy results in Line chart

The above line chart shows the accuracy of the four algorithms in the form of percentages.

G. Detection of product review type datasets

In Figure 5, the line chart shows the fraud review ratio and the no-fraud review ratio based on the datasets, ranging from 30 to 70.

Fig 5: Product review ratio

H. Detection of product review type ratio:

Table 5: Product review type ratio

S.no  Review type      Ratio (%)
1     Fraud review     33.33
2     No fraud review  66.66

Table 5 shows the product review types in the form of a ratio: 33.33% of the reviews obtained from the review system were fraud reviews and 66.66% were no-fraud reviews.

I. Comparison of algorithms with metrics

For certain values, SVM reaches an accuracy of 96.5, higher than the other algorithms, whose values range from 90 to 95.

[Figure 6: bar chart; x-axis: Naive Bayes, SVM, Logistic regression, DTC; series: accuracy, precision, recall; y-axis: 0 to 120]

Fig 6: Comparison of 4 algorithms.

Figure 6 shows the results and comparison of the four algorithms on three metrics: accuracy, precision, and recall. In this bar plot, SVM shows higher accuracy than the remaining algorithms.

SVM obtains the maximum accuracy, which is 97%, with precision of 55% and recall of up to 85%.

V. CONCLUSION AND FUTURE WORK

Online marketplaces are becoming increasingly concerned about the problem of fake reviews, which may negatively affect both the platform's image and the trustworthiness of the items. We therefore developed a method for detecting false reviews that makes use of machine learning techniques to examine a variety of elements, including review content, user behaviour, and sentiment. The outcomes of our tests revealed that the review system was capable of accurately and precisely identifying fraud reviews, highlighting its potential for use in practical situations.

In future, the fraud detection model can be expanded to handle data from dynamic graphs in additional domains, such as strong product recommendation for e-commerce sites and online spam detection for financial services.

REFERENCES
[1]. Luhua Jin, Songkai Tang and Fan Cheng, "Online Product Review Systems via Heterogeneous Graph Transformer," IEEE Entry, 2022, Vol. 9.
[2]. J. Wang, H. Kan, F. Meng, Q. Mu, G. Shi, and X. Xiao, "Fake review detection based on multiple feature fusion and rolling collaborative training," IEEE Access, vol. 8, pp. 182625–182639, 2020.

[3]. Fei, G., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., & Ghosh, [21]. Kanika and J. Singla, "A Survey of Deep Learning based Online
R. (2021).” Exploiting Burstiness in Reviews for Review Spammer Transactions Fraud Detection Systems," 2020 International
Detection” Vol 7. Conference on Intelligent Engineering and Management (ICIEM),
[4]. S. T ang, L. Jin and F. Cheng, "Fraud Detection in Online Product London, UK, 2020, pp. 130-136, doi:
Review Systems via Heterogeneous Graph Transformer," in IEEE 10.1109/ICIEM48762.2020.9160200.
Access, vol. 9, pp. 167364-167373, 2021, doi:
10.1109/ACCESS.2021.3084924. [22]. V. Gupta, A. Aggarwal and T. Chakraborty, "Detecting and
[5]. Hina T ufail, M. Usman Ashraf, Khalid Alsubhi and Hani Moaiteq Characterizing Extremist Reviewer Groups in Online Product
Aljahdali “The Effect of Fake Reviews on e-Commerce During and Reviews," in IEEE Transactions on Computational Social Systems,
After Covid-19 Pandemic” IEEE Access, Vol 10 . vol. 7, no. 3, pp. 741-750, June 2020, doi:
[6]. Z. Liu, Y. Dou, P. S. Yu, Y. Deng, and H. Peng, “Alleviating the 10.1109/TCSS.2020.2988098.
inconsistency problem of applying graph neural network to fraud [23]. B. Conlin and U. Ruhi, "Current Research Landscape of Machine
detection,” in SIGIR, 2020. Learning Algorithms in Online Identity Fraud Prediction and
[7]. J. Mach. Learn. Res., vol. 7, pp. 1–30, Jan. 2006. [32] J. Wang, H. Detection," 2021 IEEE International Conference on Technology
Kan, F. Meng, Q. Mu, G. Shi, and X. Xiao, ‘‘Fake review detection Management, Operations and Decisions (ICTMOD), Marrakech,
based on multiple feature fusion and rolling collaborative training,’’ Morocco, 2021, pp. 1-6, doi:
IEEE Access, vol. 8, pp. 182625–182639, 2020. 10.1109/ICTMOD52902.2021.9739308.
[8]. Y. Wu, E. W. T. Ngai, P. Wu and C. Wu, "Fake online reviews:
Literature review synthesis and directions for future research", [24]. R. S. Solitario, "Fake Delivery Bookings In Context-Aware Food
Decis. Support Syst., vol. 132, May 2020. Delivery Systems: A Literature And Mobile Apps Review," 2021 1st
[9]. J. Salminen, C. Kandpal, A. M. Kamel, S.-G. Jung and B. J. Jansen, International Conference in Information and Computing Research
"Creating and detecting fake reviews of online products", J. (iCORE), Manila, Philippines, 2021, pp. 1-5, doi:
Retailing Consum. Services, vol. 64, Jan. 2022. 10.1109/iCORE54267.2021.00019.
[10]. R. Mohawesh, S. Xu, S. N. T ran, R. Ollington, M. Springer, Y.
Jararweh, et al., "Fake reviews detection: A survey", IEEE Access, [25]. S. Shehnepoor, R. Togneri, W. Liu and M. Bennamoun, "HIN-RNN:
vol. 9, pp. 65771-65802, 2021. A Graph Representation Learning Neural Network for Fraudster
[11]. D. j. S. K. Sayam Kumar Yashika Goyal, "Fake Reviews Filtering Group Detection With No Handcrafted Features," in IEEE
System Using Supervised Machine Learning," IEEE, vol. 9, no. 14 Transactions on Neural Networks and Learning Systems, doi:
October 2022, p. 10, 2022. 10.1109/TNNLS.2021.3123876.
[12]. S. Alaa, M. A. Farooq, and M. Younas, "Deep Learning Approaches [26]. J. Zhou, Y. -F. Liu and H. -L. Sun, "A Reputation Ranking Method
for Fraud Detection: A Comprehensive Review," in IEEE, vol. 6, no. based on Rating Patterns and Rating Deviation," 2022 5th
14, 2020. International Conference on Data Science and Information
[13]. S. Shehnepoor,R.Togneri,W.Liu and M.Bennamoun, "ScoreGAN:A Technology (DSIT ), Shanghai, China, 2022, pp. 1-6, doi:
Fraud Review Detector Based on Regulated GAN With Data 10.1109/DSIT55514.2022.9943923.
Augmentation," in IEEE Transactions on Information Forensics and
Security,vol.17,pp.280-291,2022,doi:10.1109/TIFS.2021.3139771.S. [27]. K. Joshi, S. Kumar, J. Rawat, A. Kumari, A. Gupta and N. Sharma,
Bagga, A. Goyal, N. Gupta, and A. Goyal, “Credit Card Fraud "Fraud App Detection of Google Play Store Apps Using Decision
Detection using Pipeling and Ensemble Learning,” Procedia Tree," 2022 2nd International Conference on Innovative Practices in
Comput. Sci., vol. 173, pp. 104–112, 2020. Technology and Management (ICIPTM), Gautam Buddha Nagar,
[14]. L. P. Pracidelli and F. S. Lopes, "Fraud identification architecture India, 2022, pp. 243-246, doi:
using data mining and machine learning in a private transport 10.1109/ICIPTM54933.2022.9754207.
company that operates by applications," 2020 15th Iberian
Conference on Information Systems and T echnologies (CIST I), [28]. S. Shehnepoor, R. Togneri, W. Liu and M. Bennamoun,
Seville, Spain, 2020, pp. 1-6, doi: "ScoreGAN: A Fraud Review Detector Based on Regulated GAN
With Data Augmentation," in IEEE Transactions on Information
10.23919/CISTI49556.2020.9140992.
Forensics and Security, vol. 17, pp. 280-291, 2022, doi:
[15]. M. N. Ashtiani and B. Raahemi, "Intelligent Fraud Detection in 10.1109/TIFS.2021.3139771.
Financial Statements Using Machine Learning and Data Mining: A
Systematic Literature Review," in IEEE Access, vol. 10, pp. 72504- [29]. C. G. Harris, "Combining Linguistic and Behavioral Clues to Detect
Spam in Online Reviews," 2022 IEEE International Conference on
72525, 2022, doi: 10.1109/ACCESS.2021.3096799.
e-Business Engineering (ICEBE), Bournemouth, United Kingdom,
2022, pp. 38-44, doi: 10.1109/ICEBE55470.2022.00017.

[16]. G. J. Priya and S. Saradha, "Fraud Detection and Prevention Using Machine Learning Algorithms: A Review," 2021 7th International Conference on Electrical Energy Systems (ICEES), Chennai, India, 2021, pp. 564-568, doi: 10.1109/ICEES51510.2021.9383631.

[17]. C. G. Harris, "Detecting Fake Yelp Reviews Using a Positional LSTM / K-L Divergence Ensemble Approach," 2022 1st International Conference on Information System & Information Technology (ICISIT), Yogyakarta, Indonesia, 2022, pp. 61-66, doi: 10.1109/ICISIT54091.2022.9872788.

[18]. B. Al Smadi and M. Min, "A Critical review of Credit Card Fraud Detection Techniques," 2020 11th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, USA, 2020, pp. 0732-0736, doi: 10.1109/UEMCON51285.2020.9298075.

[19]. S. Vyas and S. Serasiya, "Fraud Detection in Insurance Claim System: A Review," 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India, 2022, pp. 922-927, doi: 10.1109/ICAIS53314.2022.9742984.

[20]. D. H. Bhatt and A. Meniya, "A Review on Machine Learning Methods for Credit Card Fraud Classification," 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India, 2022, pp. 312-318, doi: 10.1109/ICAIS53314.2022.9743014.

[30]. P. Rathore, J. Soni, N. Prabakar, M. Palaniswami and P. Santi, "Identifying Groups of Fake Reviewers Using a Semisupervised Approach," in IEEE Transactions on Computational Social Systems, vol. 8, no. 6, pp. 1369-1378, Dec. 2021, doi: 10.1109/TCSS.2021.3085406.

[31]. Silpa, C., Niranjana, G., Ramani, K. (2022). Fraud detection of review using classification models- An Extensive Study. In: Manogaran, G., Shanthini, A., Vadivu, G. (eds) Proceedings of International Conference on Deep Learning, Computing and Intelligence. Advances in Intelligent Systems and Computing, vol 1396. Springer, Singapore.

[32]. Inayathulla, Mohammed, and C. Silpa. "An Approach to Reduce fraud review spam by Using Machine Learning Techniques." International Journal of Computer Science and Network Security (IJCSNS) 15, no. 9 (2015): 99.

[33]. V Jyothsna, D R Kumar Raja, G Hemanth Kumar, Dileep Chnadra E, "A Novel Manifold approach for fraud detection," Gongcheng Kexue Yu Jishu/Advanced Engineering Science, Vol 54, Issue 02, pp. 2043-2076, 2022.

[34]. Jyothsna, V., Prasad, K.M., Rajiv, K. et al. Review based system using ensemble classifier with Feature Impact Scale. Cluster Comput 24, 2461–2478 (2021).

[35]. M. Ganesh Karthik, Dr. M B Mukesh Krishnan, "Detecting spamming technique using Post Pruning Decision Tree-Synthetic Minority Over Sampling Technique," International Journal of Intelligent Engineering and Systems (Scopus Indexed), Vol. 14, No. 4, 2021.

[36]. K. K. Baseer, M. Jahir Pasha, A. V. Rama Krishna Reddy, Kamarthi Rekha, M. Shaheda Begum, Sandhya E., "Online fraud review monitoring System," Journal of Algebraic Statistics, Volume 13, No. 3, 2022, pp. 559-570, ISSN: 1309-3452.


