0% found this document useful (0 votes)

14 views20 pages

ML Report

The document describes an experiment to analyze the heating process of an electric kettle. Water is poured into the kettle and its temperature, weight, and time taken to boil are recorded. The voltage, current and power consumption are also measured. Graphs of temperature vs. time and weight vs. time are plotted. Rates of temperature rise, evaporation and heat transfer are calculated. The efficiency of the kettle is determined.

Uploaded by

Kalyan Kale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views20 pages

ML Report

Uploaded by

Kalyan Kale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 20

SRES’

SANJIVANI COLLEGE OF ENGINEERING,

KOPARGAON
423601(M.S.)

Department of
Mechatronics Engineering 2022-2023
REPORT ON

“Salary Prediction Based on work

experience ML Web App”

1|Page
CERTIFICATE

This is to certify that this report on Project entitled, “Salary Prediction Based on
work experience ML Web App”
Submitted by,

Sr. No Roll no. NAME PRN NO

1 28 Kale Kalyan Jayant UMX20M1028

For the partial fulfilment of the requirements of Final Year (Mechatronics Engineering) degree
of the Sanjivani COE, Kopargaon embodies the work done by them under our guidance and
supervision in the academic year 2023-2024

Prof. Chaitanya Kale

(Subject-ML)

Prof. R. A. Kapgate Dr. A. G. Thakur

Head of Dept. of Mechatronics Engg. Director, SCOE

2|Page
DECLARATION

We declare that this written submission represents our ideas in our own words; we have
adequately cited and referenced the original sources. We also declare that we have adhered to all
principles of academic honesty and integrity and have not misrepresented or fabricated any
idea/data/fact/source in our submission. We understand that any violation of the above may
cause disciplinary action by the Institute and can also evoke penal action from the sources which
have not been properly cited.

Date: - 18 /05/2023

Place: - Kopargaon

Sr no. Roll no. Name

1 28 Kale Kalyan Jayant

3|Page
ACKNOWLEDGEMENT

"Salary Prediction Based on work experience ML Web App ” has been the

opportunity to express ourselves technically. This has proven to be a steppingstone which will be

of immense help to us as we enter market. We want to express our gratitude to everyone who

helped us by giving moral support and by solving our difficulties. Everyone has contributed

immensely and helped us for the same into the completion of project.

We take this opportunity to express our deep sense of gratitude towards head of

department of Mechatronics Engineering Dr. R. A. Kapgate and our esteemed guide Prof.

Chaitanya Kale for his expert guidance during preparation of this seminar. He has received us

whenever we required his help. In true sense of word, we are grateful to him. We are highly

grateful to our subject coordinator for extending all the facilities in completing this seminar.

We would also like to thank all our friends, who helped us and initiated discussion during

the seminar. Last but not least; we want to acknowledge our beloved parents, who have taken

great pains for our education.

Signature
28 Kale Kalyan Jayant

4|Page
ABSTRACT
This report presents a comprehensive overview of a Machine Learning (ML) web application
designed to predict salaries based on an individual's years of experience and job type. The
primary objective of this project is to harness the power of data transformation and machine
learning algorithms to develop a predictive model that offers valuable insights into salary
expectations.The dataset employed in this study is characterized by its simplicity, with minimal
missing data, and comprises a training dataset featuring key features such as years of experience
and corresponding salary information. Through rigorous data transformation processes, the raw
data is refined to enhance the model's accuracy and reliability.The predictive power of the model
hinges on two fundamental factors: years of experience and job type. By analyzing these key
parameters, the ML model provides a reliable guide for estimating salaries, making it a valuable
tool for both employers and employees in the decision-making process related to
compensation.This web application aims to serve as a practical resource for organizations and
individuals seeking informed salary predictions. The simplicity of the model, combined with its
demonstrated accuracy in predicting salaries based on years of experience, positions it as a
valuable asset for shaping compensation strategies. Ultimately, this project contributes to the
ongoing dialogue surrounding fair and data-driven salary determination, offering a useful tool
for stakeholders navigating the intricacies of compensation decisions in various professional
settings.

5|Page
INTRODUCTION

OBJECTIVES
 Develop a robust Machine Learning (ML) web application with the primary goal of
predicting salaries based on years of experience and job type.
 Implement advanced data transformation techniques to preprocess and refine the raw
dataset, ensuring minimal missing data and enhancing the accuracy of the predictive
model.
 Leverage machine learning algorithms to create a predictive model that provides valuable
insights into salary expectations, focusing on key parameters such as years of experience
and job type.
 Demonstrate the practical utility of the web application as a resource for organizations
and individuals by offering reliable and reasonable predictions for salary determination.
 Contribute to the advancement of fair and data-driven salary determination practices by
developing a model that serves as a guide for compensation decisions, fostering informed
dialogue in professional settings.
 Provide a user-friendly and accessible platform that can be utilized by both employers
and employees, facilitating better decision-making processes related to compensation and
contributing to a more transparent salary negotiation environment.

LIMITATIONS:
 Limited Feature Set: The predictive model relies primarily on years of experience and job
type, potentially overlooking other relevant factors that could influence salary, such as
education level, geographic location, or industry-specific qualifications.
 Data Quality Constraints: The accuracy of the model is contingent upon the quality of the
training dataset. If the dataset contains inaccuracies, biases, or insufficient representation
of diverse job types, the model's predictions may be compromised.
 Static Model: The web application assumes a static relationship between years of
experience, job type, and salary. It does not account for dynamic changes in job markets,
economic conditions, or emerging industry trends, potentially limiting its adaptability
over time.
 Assumption of Linearity: The model assumes a linear relationship between the selected
features and salaries. Non-linear relationships or interactions between variables may not
be adequately captured, affecting the accuracy of predictions.
 Generalization Challenges: The model's performance may vary when applied to different
industries, job markets, or regions, as it may not account for the nuanced salary
determinants specific to each context.

6|Page
 Lack of Individual Context: The model treats individuals as homogeneous entities,
neglecting personal attributes and circumstances that could significantly impact salary
expectations, such as negotiation skills, individual achievements, or unique job
responsibilities.

REQUIREMENTS SPECIFICATION

 Data Analysis and Visualization:

The system must support data analysis methods to explore and understand the
dataset.Users should be able to visualize data trends, distributions, and relationships between
variables.Utilize Seaborn, Matplotlib, and other relevant libraries for effective data
visualization.

 Linear Regression:
Implement linear regression algorithms for modeling the relationship between years
of experience, job type, and salary.Ensure the model is capable of making accurate
predictions based on linear relationships.

 Polynomial Transformation:
Apply polynomial transformation techniques to capture non-linear relationships
between features and salaries.The system should allow users to choose polynomial degrees
for optimization.

One litre of water was poured into

the electrical kettle.
2. The temperature of the water is
determined by using a
thermocouple.

7|Page
3. The kettle is placed on a
weighing scale and the kettle with
water included is weighed.
4. Switched on the electricity.
5. The temperature and the time
until the water is recorded.
6. The time and the reducing
weight are recorded as the water
evaporated.
7. The voltage and the current are
determined in order to determine
the electrical assumption for
the heater.
8. A table is constructed to obtain
the necessary data and the
following output is determined:

8|Page
I. Rate of temperature rise to boil
the water (Table and graph of
Temp vs Time).
II. Rate of water evaporated from
the kettle (Table and graph Weight
of evaporated water
vs Time).
III. Rate of heat transferred of
water to boil.
IV. Rate of heat transferred of 0.5
litres of water to evaporate.
V. Total power consumption.
VI. The voltage and current from
the power supply are determined to
determine power
input.
VII. Efficiency of the kettle.
9|Page
One litre of water was poured into
the electrical kettle.
2. The temperature of the water is
determined by using a
thermocouple.
3. The kettle is placed on a
weighing scale and the kettle with
water included is weighed.
4. Switched on the electricity.
5. The temperature and the time
until the water is recorded.
6. The time and the reducing
weight are recorded as the water
evaporated.
7. The voltage and the current are
determined in order to determine
the electrical assumption for
10 | P a g e
the heater.
8. A table is constructed to obtain
the necessary data and the
following output is determined:
I. Rate of temperature rise to boil
the water (Table and graph of
Temp vs Time).
II. Rate of water evaporated from
the kettle (Table and graph Weight
of evaporated water
vs Time).
III. Rate of heat transferred of
water to boil.
IV. Rate of heat transferred of 0.5
litres of water to evaporate.
V. Total power consumption.

11 | P a g e
VI. The voltage and current from
the power supply are determined to
determine power
input.
VII. Efficiency of the kettle.
One litre of water was poured into
the electrical kettle.
2. The temperature of the water is
determined by using a
thermocouple.
3. The kettle is placed on a
weighing scale and the kettle with
water included is weighed.
4. Switched on the electricity.
5. The temperature and the time
until the water is recorded.

12 | P a g e
6. The time and the reducing
weight are recorded as the water
evaporated.
7. The voltage and the current are
determined in order to determine
the electrical assumption for
the heater.
8. A table is constructed to obtain
the necessary data and the
following output is determined:
I. Rate of temperature rise to boil
the water (Table and graph of
Temp vs Time).
II. Rate of water evaporated from
the kettle (Table and graph Weight
of evaporated water
vs Time).
13 | P a g e
III. Rate of heat transferred of
water to boil.
IV. Rate of heat transferred of 0.5
litres of water to evaporate.
V. Total power consumption.
VI. The voltage and current from
the power supply are determined to
determine power
input.
VII. Efficiency of the kettle.
Technologies/Libraries Used:

 Develop the web app using Python 3 as the primary programming language.

 Utilize Pandas for efficient data manipulation and preprocessing.

 Incorporate NumPy for numerical operations and array manipulations.

 Employ Scikit-learn for machine learning model implementation.

 Integrate Jupyter notebooks for interactive development and documentation.

14 | P a g e
1. Data Overview:

The dataset employed for this salary prediction model is characterized by its simplicity, with

minimal missing data.Raw data includes a training dataset containing features such as years of

experience and their corresponding salaries.A subset of 20% of the training dataset was extracted

to form a test dataset for evaluating the model's performance.

2. Testing Dataset:

A separate testing dataset was utilized for simulating real-world scenarios. This dataset lacks

salary information, resembling situations where predictions are needed for new, unseen data.

3. Information Used for Prediction:

15 | P a g e
The primary feature used for predicting salaries is the 'Years of Experience.'

4. Model Training:

The model training process is encapsulated in the 'model.py' script.Upon completion of training,

the model is saved as 'model.pkl,' a pickle file, for future use and deployment.

5. Run App:

The 'app.py' script is responsible for managing the Flask application and handling APIs.

To initiate the web application, users need to open the command prompt, navigate to the

specified directory, and execute 'python app.py.'

16 | P a g e
17 | P a g e
6. Procedure:

The following steps outline the procedure to run the web application:

Open the command prompt.

Navigate to the designated directory.

Execute the command 'python app.py.'

7. Model Files:

'model.pkl': The serialized pickle file containing the trained machine learning model.

'model.py': Script responsible for training the model and saving it as 'model.pkl.'

8. Project Objective:

The primary goal of this project is to predict employee salaries based on their years of

experience.

9. Model Output:

The trained model provides salary predictions based on the input of years of experience, offering

a valuable tool for compensation decisions.

18 | P a g e
10. Real-world Application:

The model's use of a testing dataset without salary information mirrors real-world scenarios

where predictions are needed for new data points.

11. Flask Application:

The Flask application, managed by 'app.py,' serves as the user interface for interacting with the

model predictions.

19 | P a g e
Conclusions

In conclusion, the provided data and model for the Salary Prediction Web App present a
straightforward yet effective approach to forecasting employee salaries based on years of
experience. The dataset's simplicity, with minimal missing data, facilitates model training and
evaluation. The inclusion of a testing dataset without salary information mimics real-world
scenarios, enhancing the model's applicability to new and unseen data.

The model training process, encapsulated in 'model.py,' yields a serialized pickle file
('model.pkl') for easy deployment and future use. Running the web application, managed by
'app.py' with Flask, is a user-friendly process, simplifying the prediction of salaries through the
input of years of experience.

This project successfully achieves its primary objective of predicting salaries based on a single
feature, providing a valuable tool for compensation decisions. The combination of machine
learning techniques and a Flask application offers a practical solution for organizations and
individuals seeking informed salary predictions.

Looking forward, potential enhancements could include expanding the feature set for more
comprehensive predictions and refining the user interface for an enhanced user experience.
Overall, this project lays a solid foundation for the integration of data science and web
development to address the practical challenges of salary prediction in professional settings.

20 | P a g e

Hands-On Machine Learning With Scikit-Learn, Keras, and TensorFlow 3rd Edition TEXTBOOK
0% (2)
Hands-On Machine Learning With Scikit-Learn, Keras, and TensorFlow 3rd Edition TEXTBOOK
14 pages
Research Proposal
No ratings yet
Research Proposal
31 pages
170 Machine Learning Interview Questios - Greatlearning
100% (1)
170 Machine Learning Interview Questios - Greatlearning
57 pages
Salary Prediction
No ratings yet
Salary Prediction
9 pages
Sigma Plot Statistics User Guide
No ratings yet
Sigma Plot Statistics User Guide
470 pages
Salary Prediction
No ratings yet
Salary Prediction
4 pages
Internship Report
No ratings yet
Internship Report
20 pages
ME P4252-II Semester - MACHINE LEARNING
100% (1)
ME P4252-II Semester - MACHINE LEARNING
48 pages
Factors Causing Tardiness Among The Grade 2 Pupils and Their Academic Performance
100% (3)
Factors Causing Tardiness Among The Grade 2 Pupils and Their Academic Performance
54 pages
4 Completed Action Research Template 2023
No ratings yet
4 Completed Action Research Template 2023
10 pages
Internship Report
No ratings yet
Internship Report
33 pages
Salary Prediction Using Machine Learning
No ratings yet
Salary Prediction Using Machine Learning
4 pages
Machine Learning (Aryan Kumar 7th Sem) PDF
No ratings yet
Machine Learning (Aryan Kumar 7th Sem) PDF
56 pages
Project Synopsis
33% (3)
Project Synopsis
4 pages
Organizational Climate (OCTAPACE) An Insight Into Its Effect On Job Satisfaction in The IT (Information Technology) Sector
100% (1)
Organizational Climate (OCTAPACE) An Insight Into Its Effect On Job Satisfaction in The IT (Information Technology) Sector
63 pages
Introduction To Chemistry Classification and Properties of Matter
100% (1)
Introduction To Chemistry Classification and Properties of Matter
20 pages
Salary Prediction Document
No ratings yet
Salary Prediction Document
30 pages
Downloaded
No ratings yet
Downloaded
159 pages
ETHNOGRAPHIC, HISTORICAL and MIXED-METHOD RESEARCH
No ratings yet
ETHNOGRAPHIC, HISTORICAL and MIXED-METHOD RESEARCH
94 pages
Syllabus (CBCS) : Faculty of Commerce & Business Management, Kakatiya University
No ratings yet
Syllabus (CBCS) : Faculty of Commerce & Business Management, Kakatiya University
50 pages
Internship Report AIML
No ratings yet
Internship Report AIML
40 pages
Final-Report22 3 PDF
No ratings yet
Final-Report22 3 PDF
124 pages
Final Report22.4 PDF
No ratings yet
Final Report22.4 PDF
118 pages
Project Report
No ratings yet
Project Report
11 pages
Mini Project Report
No ratings yet
Mini Project Report
10 pages
New Project Report
No ratings yet
New Project Report
70 pages
Skills: Data Scientist
No ratings yet
Skills: Data Scientist
3 pages
20MCA041
No ratings yet
20MCA041
72 pages
Ourppt
No ratings yet
Ourppt
11 pages
Shwet Mlds
No ratings yet
Shwet Mlds
35 pages
Skripsi Eka Serli Sudarni
No ratings yet
Skripsi Eka Serli Sudarni
62 pages
Unit 1
No ratings yet
Unit 1
32 pages
Batch 1 Job Market Analysis and Prediction-1
No ratings yet
Batch 1 Job Market Analysis and Prediction-1
60 pages
BT4234 - RPT - Mr. Sreenarayanan N M
No ratings yet
BT4234 - RPT - Mr. Sreenarayanan N M
32 pages
Jamal Internship Report
No ratings yet
Jamal Internship Report
39 pages
Salary Predictions
No ratings yet
Salary Predictions
43 pages
Sarumathi Intern18
No ratings yet
Sarumathi Intern18
37 pages
Precision Marketing: The Account-Based Approach
No ratings yet
Precision Marketing: The Account-Based Approach
36 pages
Master
No ratings yet
Master
4 pages
AIML-Curriculum by Pregrad
No ratings yet
AIML-Curriculum by Pregrad
33 pages
02 Provision Electricity Internet Access DepEd Schools School Performance Digital Inclusivity Alampay Navarro
No ratings yet
02 Provision Electricity Internet Access DepEd Schools School Performance Digital Inclusivity Alampay Navarro
29 pages
Group 24 Miniproject
No ratings yet
Group 24 Miniproject
33 pages
Previewpdf
No ratings yet
Previewpdf
27 pages
Running Head: (REFLECTIVE ESSAY)
No ratings yet
Running Head: (REFLECTIVE ESSAY)
20 pages
Avinash PDF
No ratings yet
Avinash PDF
23 pages
G H Raisoni College of Engineering and Management, Pune: Department Name
No ratings yet
G H Raisoni College of Engineering and Management, Pune: Department Name
22 pages
Final Report
No ratings yet
Final Report
22 pages
Group Thesis Part 1
No ratings yet
Group Thesis Part 1
17 pages
Seipp, 1991, Anxiety and Academic Performance
No ratings yet
Seipp, 1991, Anxiety and Academic Performance
17 pages
Regression Analysis by Example - (CHAPTER 7 WEIGHTED LEAST SQUARES)
No ratings yet
Regression Analysis by Example - (CHAPTER 7 WEIGHTED LEAST SQUARES)
18 pages
Preprints202408 0365 v1
No ratings yet
Preprints202408 0365 v1
17 pages
Final File of Research
No ratings yet
Final File of Research
23 pages
Batch 1 Publication
No ratings yet
Batch 1 Publication
16 pages
Dnyaneshwar Ds
No ratings yet
Dnyaneshwar Ds
2 pages
Code Masters
No ratings yet
Code Masters
10 pages
Ai 53
No ratings yet
Ai 53
13 pages
TBS Final Exam 2018
No ratings yet
TBS Final Exam 2018
10 pages
Research Paper 1
No ratings yet
Research Paper 1
9 pages
36-401 Modern Regression HW #7 Solutions: Problem 1 (40 Points)
No ratings yet
36-401 Modern Regression HW #7 Solutions: Problem 1 (40 Points)
12 pages
Job Salaries Prediction System
No ratings yet
Job Salaries Prediction System
9 pages
Gladwin Tirkey Research Paper
No ratings yet
Gladwin Tirkey Research Paper
7 pages
Updated Resume Verdana
No ratings yet
Updated Resume Verdana
7 pages
Synopsis Group 6 Final
No ratings yet
Synopsis Group 6 Final
6 pages
Volume6 Issue3 Paper10 2022
No ratings yet
Volume6 Issue3 Paper10 2022
6 pages
Fammm
No ratings yet
Fammm
8 pages
Business Mathematics Iv-6
No ratings yet
Business Mathematics Iv-6
5 pages
SSRN 3526707
No ratings yet
SSRN 3526707
5 pages
Salary Prediction Abstract
No ratings yet
Salary Prediction Abstract
5 pages
Salary Hike Predictor Synopsis
No ratings yet
Salary Hike Predictor Synopsis
4 pages
Grubbs' Outlier Test
No ratings yet
Grubbs' Outlier Test
2 pages
Unit 9 (STAT 17 Assignment)
No ratings yet
Unit 9 (STAT 17 Assignment)
5 pages
Aishwarya Swetha Data Science
No ratings yet
Aishwarya Swetha Data Science
1 page
KSBResume Ext Copy 1
No ratings yet
KSBResume Ext Copy 1
3 pages
Vignes HJ
No ratings yet
Vignes HJ
2 pages
Goutham Resume
No ratings yet
Goutham Resume
2 pages
CV Snehal Bisen
No ratings yet
CV Snehal Bisen
3 pages
Data Trials and Triumphs
No ratings yet
Data Trials and Triumphs
3 pages
Lampiran Syntax R Sarima
No ratings yet
Lampiran Syntax R Sarima
4 pages
1 - Swati Madhukar Taur
No ratings yet
1 - Swati Madhukar Taur
2 pages
Data Scientist Entryl Level
No ratings yet
Data Scientist Entryl Level
2 pages
Aditya Shebe
No ratings yet
Aditya Shebe
3 pages
CHAPTER 4: Example 4 (Alpha 0.1) : Forecasting Exponential Smoothing
No ratings yet
CHAPTER 4: Example 4 (Alpha 0.1) : Forecasting Exponential Smoothing
2 pages
Aman Kumar Rai - Resume - 2025
No ratings yet
Aman Kumar Rai - Resume - 2025
1 page
Hemant Kumar Jha: Education
No ratings yet
Hemant Kumar Jha: Education
1 page
Ankur - Shukla - DS - Almabetter - Ankur Shukla
No ratings yet
Ankur - Shukla - DS - Almabetter - Ankur Shukla
1 page
Insights Beyond Ir4.0 with Ioe Checksheets For Implementation - a Basic Reference Manual: A Disruptive Digital Technology - Forging Ahead with Industrial Transformation
From Everand
Insights Beyond Ir4.0 with Ioe Checksheets For Implementation - a Basic Reference Manual: A Disruptive Digital Technology - Forging Ahead with Industrial Transformation
Sugumaran RS Ramachandran
No ratings yet
Mastering Partial Least Squares Structural Equation Modeling (Pls-Sem) with Smartpls in 38 Hours
From Everand
Mastering Partial Least Squares Structural Equation Modeling (Pls-Sem) with Smartpls in 38 Hours
Ken Kwong-Kay Wong
3/5 (1)
Earned Schedule
From Everand
Earned Schedule
Walter Lipke
No ratings yet
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
From Everand
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
Dr. GEETHA N DATA SCIENTIST, BENGALURU
No ratings yet
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
From Everand
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
Steven Taylor
No ratings yet

ML Report

Uploaded by

ML Report

Uploaded by

SRES’

SANJIVANI COLLEGE OF ENGINEERING,

“Salary Prediction Based on work

Sr. No Roll no. NAME PRN NO

1 28 Kale Kalyan Jayant UMX20M1028

Prof. Chaitanya Kale

Prof. R. A. Kapgate Dr. A. G. Thakur

Head of Dept. of Mechatronics Engg. Director, SCOE

Sr no. Roll no. Name

great pains for our education.

 Data Analysis and Visualization:

One litre of water was poured into

 Utilize Pandas for efficient data manipulation and preprocessing.

 Incorporate NumPy for numerical operations and array manipulations.

 Employ Scikit-learn for machine learning model implementation.

 Integrate Jupyter notebooks for interactive development and documentation.

to form a test dataset for evaluating the model's performance.

3. Information Used for Prediction:

specified directory, and execute 'python app.py.'

Open the command prompt.

Navigate to the designated directory.

Execute the command 'python app.py.'

a valuable tool for compensation decisions.

where predictions are needed for new data points.

11. Flask Application:

You might also like