Fake Job Prediction
GROUP NUMBER: 32
BACHELOR OF TECHNOLOGY IN COMPUTER SCIENCE & TECHNOLOGY
Submitted By
NAME ENROLLMENT NO. REGISTRATION NO.
ADITYA GHOSH 12020009022288 304202000901063
ANUBHAV SENAPATI 12020009022257 304202000901008
ANNWESHA MAHANTA 12020009022172 304202000900639
RAHUL DAS 12020009022215 304202000900682
SNEHA SARKAR 12020009027009 304202000900828
JOYEE SAHA 12020009022168 304202000900635
TABLE OF CONTENTS
1 ABSTRACT
2 INTRODUCTION
3 PROBLEM STATEMENT
1 Natural Language Processing
2 Naïve Bayes Algorithm
3 SGD Classifier
Naïve Bayes and SGD Classifier are compared on accuracy and F1-scores and
a final model is chosen. These models are used on both the text and numeric
data separately and the final results are combined.
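The comparison on accuracy and F1-score can be illustrated with a minimal sketch. The labels and predictions below are hypothetical toy values (1 = fake, 0 = real), not results from this project, and the helper names `accuracy` and `f1_score` are written here only for illustration.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: 1 = fake job, 0 = real job.
y_true = [0, 0, 0, 0, 1, 1, 0, 1]
predictions = {
    "naive_bayes": [0, 0, 0, 0, 0, 1, 0, 0],  # assumed output, illustration only
    "sgd": [0, 0, 1, 0, 1, 1, 0, 1],          # assumed output, illustration only
}

scores = {
    name: (accuracy(y_true, pred), f1_score(y_true, pred))
    for name, pred in predictions.items()
}
# Pick the model with the higher F1-score on this toy data.
best_model = max(scores, key=lambda name: scores[name][1])
```

F1-score is worth reporting next to accuracy because a classifier that labels almost every posting as real can still score a high accuracy when fake postings are rare.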
WHY THIS ALGORITHM?
Tokenization -> To Lower -> Stop word removal -> Lemmatization
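The four preprocessing steps can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the stop word set and the lemma lookup table are tiny hypothetical stand-ins, whereas a real pipeline would normally rely on an NLP library for both.

```python
import re

# Hypothetical, deliberately tiny resources used only for illustration.
STOP_WORDS = {"a", "an", "the", "is", "are", "and", "for", "to", "of", "in", "we"}
LEMMAS = {"jobs": "job", "offers": "offer", "hiring": "hire", "earning": "earn"}


def preprocess(text):
    """Apply the four steps in order and return the cleaned tokens."""
    # 1. Tokenization: split the text into alphabetic word tokens.
    tokens = re.findall(r"[A-Za-z]+", text)
    # 2. To lower: normalize case so "HIRING" and "hiring" match.
    tokens = [t.lower() for t in tokens]
    # 3. Stop word removal: drop very common words that carry little signal.
    tokens = [t for t in tokens if t not in STOP_WORDS]
    # 4. Lemmatization: map each word to its base form (lookup table here).
    tokens = [LEMMAS.get(t, t) for t in tokens]
    return tokens


cleaned = preprocess("We are HIRING for the best jobs!")
```

Lowercasing is done before stop word removal so that capitalized stop words such as "We" are also filtered out.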
A histogram of character counts is explored to visualize the difference
between real and fake jobs. It shows that although the character count
is fairly similar for both real and fake jobs, real jobs occur with a
higher frequency.
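The counts behind such a histogram can be computed with a short sketch. The postings below are hypothetical examples, and the function name `char_count_histogram` and the bin width of 10 characters are assumptions made for illustration.

```python
from collections import Counter


def char_count_histogram(texts, bin_width=10):
    """Count how many texts fall into each character-count bin.

    Keys are the lower edge of each bin, e.g. 20 covers lengths 20 to 29
    when bin_width is 10.
    """
    counts = Counter()
    for text in texts:
        bin_start = (len(text) // bin_width) * bin_width
        counts[bin_start] += 1
    return dict(sorted(counts.items()))


# Hypothetical postings, for illustration only.
real_posts = [
    "Backend engineer wanted",
    "Junior analyst, full time role",
    "Nurse needed",
]
fake_posts = ["Earn money fast from home"]

real_hist = char_count_histogram(real_posts)
fake_hist = char_count_histogram(fake_posts)
```

Computing the two histograms with the same bin width keeps them directly comparable when plotted on shared axes.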
RESULT AND OUTPUT
The final model used for this analysis is the SGD Classifier. This choice is
based on its metrics as compared to the baseline model. The outcomes of the
baseline model and SGD are presented in the table below: