Aug Batch Project Details

Uploaded by

ankit

Title of the Project: Ensemble ML Modelling to classify Real/Fake jobs

Below are the details of the project for the August Machine Learning Batch.

Students need to work on Ensemble Learning (modeling) for a given problem.

Problem Statement: For a given set of input features, determine the most common prediction
across the models (by majority vote). Also, suggest which algorithm gives the maximum accuracy
on the dataset.
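As a minimal sketch of what "most common answer (as per majority)" could mean in practice, the snippet below combines the predictions of three models by majority vote. The function name `majority_vote` and the prediction lists are hypothetical and made up for illustration; they do not come from the dataset.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by the most models for one input.

    With an odd number of models and two classes there are no ties;
    otherwise the label seen first among the tied ones wins.
    """
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical predictions from three models for four job postings
# (1 = fraudulent, 0 = real). These values are invented for illustration.
model_a = [1, 0, 0, 1]
model_b = [1, 0, 1, 0]
model_c = [0, 0, 1, 1]

# Take the majority label posting by posting.
ensemble = [majority_vote(votes) for votes in zip(model_a, model_b, model_c)]
print(ensemble)  # [1, 0, 1, 1]
```

Using an odd number of models keeps the vote unambiguous for a two-class target.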

Note: In the attached dataset, "Fraudulent" is the target feature. A description of the dataset
can be found here: https://www.kaggle.com/shivamb/real-or-fake-fake-jobposting-prediction

Ask any 4 questions on the dataset of your choice and provide answers for the same. For
instance, for the given dataset questions can be as follows.

Q1) What is the most common title used in job postings in the US?
Q2) Which department has the highest number of fake jobs?
Q3) Which department or function has high-paying jobs in the UK?
Q4) What are the top 3 most commonly used words in Company Profile? (Excluding stopwords)
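A question such as Q4 can be answered with a simple word count. The sketch below is one possible approach, assuming a deliberately tiny hand-written stopword list and three made-up profile strings standing in for the Company Profile column; `top_words` is a hypothetical helper name.

```python
import re
from collections import Counter

# A deliberately tiny stopword list, assumed for illustration only.
# A real analysis would use a fuller list from an NLP library.
STOPWORDS = {"a", "an", "and", "are", "for", "in", "is", "of",
             "our", "the", "to", "we", "with"}


def top_words(texts, n=3):
    """Return the n most frequent non-stopword tokens across all texts."""
    counts = Counter()
    for text in texts:
        # Lowercase and keep alphabetic tokens only.
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)


# Made-up stand-ins for values of the Company Profile column.
profiles = [
    "We are a global team of engineers.",
    "Our team builds software for global clients.",
    "A software company with a global team.",
]
print(top_words(profiles))
```

On the real data, missing profile values would need to be dropped or replaced with empty strings before counting.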

Choose three classification algorithms and build a machine learning model with each.
Compare the accuracy of all three and suggest which algorithm best suits the given problem.

NOTE: For the given dataset "Fraudulent" will be your dependent variable.
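The three-model comparison described above could be sketched as follows with scikit-learn. This is an illustration under stated assumptions, not a prescribed solution: the data is synthetic (generated by `make_classification`) rather than the job postings dataset, and the three algorithms are an arbitrary choice. A hard-voting ensemble of the three is included for comparison.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data: X plays the role of the independent variables,
# y the role of the "Fraudulent" target. Swap in the real features here.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Three base classifiers (an arbitrary choice for this sketch).
base_models = [
    ("logistic_regression", LogisticRegression(max_iter=1000)),
    ("decision_tree", DecisionTreeClassifier(random_state=0)),
    ("random_forest", RandomForestClassifier(n_estimators=100, random_state=0)),
]

# Hard voting: the ensemble predicts the class chosen by most base models.
models = dict(base_models)
models["voting_ensemble"] = VotingClassifier(estimators=base_models, voting="hard")

# Fit every model on the same split and record its test accuracy.
accuracies = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    accuracies[name] = accuracy_score(y_test, model.predict(X_test))

# Report models from most to least accurate.
for name, acc in sorted(accuracies.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {acc:.3f}")
```

Holding the train/test split fixed across models keeps the accuracy comparison fair.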

Evaluation will be done on the following points:

1) Exploratory data analysis, and data cleaning if required
2) At least 3 visualizations of the data using Matplotlib or any other visualization library
3) Questions asked on the dataset and their answers, each with a brief explanation
4) Feature selection and feature engineering, if required for the dataset
5) Ensemble machine learning modelling (3 classification algorithms; 5 is also acceptable)
6) Metrics calculation (with justification of why each metric was used)
7) Summarised write-up at the end
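For the metrics justification in point 6, it may help to see why accuracy alone can mislead when one class is rare. The sketch below computes accuracy, precision, recall, and F1 from scratch on made-up labels; `classification_metrics` is a hypothetical helper, and the class balance shown is an assumption for illustration, not a measurement of the dataset.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs)
    # Guard against division by zero when a class is never predicted.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels: 2 fraudulent postings (1) out of 20.
y_true = [1, 1] + [0] * 18
# A useless model that labels every posting as real (0).
always_real = [0] * 20

metrics = classification_metrics(y_true, always_real)
print(metrics)
```

Here the model that never flags a fraudulent posting still reaches 90% accuracy while its recall is zero, which is the kind of argument a metric justification could make.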

OPTIONAL REQUIREMENT: It will be appreciated if any one algorithm is built from scratch
instead of using a library.
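As one example of what "built from scratch" might look like, here is a minimal k-nearest-neighbours classifier using only the Python standard library. The choice of k-NN, the function name `knn_predict`, and the toy points are assumptions for illustration; any classification algorithm would satisfy the optional requirement.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority label among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Toy two-feature data, invented for illustration: class 0 sits near the
# origin and class 1 sits near (5, 5).
train_X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
train_y = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_X, train_y, (0.5, 0.5)))  # 0
print(knn_predict(train_X, train_y, (5.5, 5.5)))  # 1
```

For real features, scaling the columns to comparable ranges before computing distances usually matters for k-NN.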

Please explain all your steps in clear detail, with comments. State which variables in the
dataset are your independent variables and which is your dependent variable.

Prepare a PDF/Word Document at the end with a Summary of this project and submit it.
Mail subject: Capstone Project August Machine Learning
