Linear Regression Research Paper
Linear Regression Research Paper
1. Introduction
In the field of data analysis and machine learning, linear regression is one of the simplest
and most commonly used algorithms for predicting numerical values. This paper aims to
demonstrate how linear regression can be used to predict an employee’s salary based on
their years of experience, using Python and libraries such as pandas and scikit-learn.
2. Dataset Description
The dataset used in this experiment contains two main columns:
- YearsExperience: Number of years of professional experience.
- Salary: The corresponding salary.
The data was loaded using the pandas library and visually inspected to verify the structure
and column names.
3. Methodology
4. Results
The model successfully captured the linear relationship between years of experience and
salary.
As experience increases, the predicted salary also increases, forming a nearly linear pattern.
The results were visualized as shown below:
5. Conclusion
The linear regression algorithm proved to be effective in predicting salaries based on years
of experience, especially when the relationship between the variables is linear. This method
provides a simple yet powerful way to make salary estimations, which can be useful in areas
like HR planning and compensation analysis.
6. References
- Scikit-learn Documentation: https://scikit-learn.org/
- Python for Data Science Handbook – Jake VanderPlas
- Géron, A. (2019). Hands-On Machine Learning with Scikit-Learn and TensorFlow. O'Reilly
Media.