Crop Yield Prediction Using Machine Learning
Crop Yield Prediction Using Machine Learning
ISSN No:-2456-2165
Abstract:- Agriculture is an important part of the Indian many seeds. This shows that predicting crop yield is a result of
economy and more than half of the country's population the development process and not an explicit interaction.
earns their living from agriculture. Agriculture is an Potential models can now easily calculate actual outcomes;
important part of the growth of human civilization but better production forecasts are still needed. Regression
through the raising of domesticated animals that produce techniques were used to make future predictions, while
food that enables people to survive. Machine learning is descriptive models were used to gain insight from the
used to predict crop yield based on parameters such as collected data and explain the results. Mechanical engineering
rainfall, yield and weather. In addition to being an research presents many challenges when trying to develop
important decision-making tool for crop yield prediction, high-performance models. Choosing the right algorithm for
machine learning also supports crop production and crop the job is crucial; The algorithm and underlying platform need
production-related decision-making. commonly used to be able to handle a lot of data.
algorithms It is a neural network device. Weather, climate
and other environmental factors pose a long-term threat to In the Indian economy, the main occupation of people
agriculture. Machine learning (ML) is important because living in rural areas is agriculture. According to 2018
it provides decision support tools for crop forecasting statistics, its population makes up more than half of the
(CYP) that can help make decisions such as which crops to country, but its GDP accounts for only 17% of the country.
plant and how during the growing season. The main Today's farmers grow products based on centuries of
limitation of neural networks is to reduce the relative error experience. Since the time of traditional methods, there have
and efficiency of crop yield prediction. The main objective been examples of such over- or under-production where the
of crop forecasting is to improve crop production and actual rules for growing a crop are not observed. This is a
various models are used to achieve this goal. This research major source of stress for Indian farmers. Farmer suicides are
helps make agriculture more efficient by demonstrating increasing due to low productivity, inbreeding and high
machine learning's ability to predict crop yields with high production costs. All over the world, soil is being lost, water is
levels of productivity. Design can be a decision support polluted, and farmers' incomes are slowly decreasing. This
tool for farmers, enabling them to make informed problem can be solved by trying new crops, but farmers have
decisions on crop management, resource allocation and difficulty predicting the results of a single crop. If they can
risk mitigation, ultimately increasing agricultural predict which crop will yield the most, they will make more
sustainability and food security. Using the results of this money and support farmers. Forecasting is one of the latest
study, farmers will be able to make informed decisions by changes in agriculture and has a huge impact on the industry.
determining the yield of their crops before planting on It uses satellite imagery, comprehensive data on soil
their farms. parameters, weather conditions and crop history to achieve
optimum results. Therefore, we chose to use the most effective
Keywords:- Crop_Yield_Prediction; Logistic_Regression; regression models, compare their accuracy, and see which
Naive Bayes; Random Forest; Dataset. model best predicted yield when some weather and soil
parameters were not entered.
I. INTRODUCTION
India grows crops throughout the year. Crops are
Machine learning (ML) technology is used in everything classified according to growing season. There are two growing
from predicting mobile phone usage to analyzing consumer seasons in India: Kharif season and Rabi season. Kharif and
behavior in supermarkets. Machine learning has been used in Rabi growing season is from July to October and October to
agriculture for a long time. Predicting crop yield is one of the March. Thereafter crop and weather data were combined into
most difficult problems in horticulture, and to date there are a single file. Crop history information such as crop region,
many models to be applied and evaluated. This question name, season, and other climatic factors affecting the crop are
requires a lot of data because the harvest is affected by many used as independent variables. The yield will be extraordinary.
factors such as soil, weather, environment, use of compost and
Random Forest algorithm can be applied on the the dataset for the classification(To get the optimal price from the dataset). By
using the random forest algorithm works 85% accurately.
Random forests work in two stages; first, it creates a random forest by combining N decision trees, and second, it makes
predictions for each tree created in the first stage.
We can explain the working process in the following steps and ways:
Step 1: Select the content K data from the training process.
Step 2: Create a decision tree with the selected data points (subset).
Step 3: Select the number N for the decision tree you want to create.
Step 4: Repeat steps 1 and 2.
Step 5: For new data, find the prediction for each decision tree and put the new data points in the class that won the most votes.
For this dataset we got the accuracy upto 85% by using random forest algorithm.
REFERENCES