Crop Yield Prediction Using ML Algorithms: A Mini Project Report On

The document describes a mini project report on crop yield prediction using machine learning algorithms. It was submitted by three students to Jawaharlal Nehru Technology University, Hyderabad in partial fulfillment of the requirements for a Bachelor of Technology degree in Information Technology. The project involved analyzing factors like location, soil properties, weather, and other data to predict crop yields using machine learning models.

Uploaded by

Md Talha Khan

A MINI PROJECT REPORT ON

CROP YIELD PREDICTION USING ML ALGORITHMS


Submitted to Jawaharlal Nehru Technology University, Hyderabad,

in partial fulfillment of the requirements for the award of the degree of

BACHELOR OF TECHNOLOGY

IN

INFORMATION TECHNOLOGY

SUBMITTED BY

MUDUGULA AASHISH MUDIRAJ (18J21A1233)


KARKA BHARGAVI (18J21A1219)
SAHIL MANDAVIA (18J21A1243)

Under the guidance of

Mr. P.THIMMA REDDY


Associate Professor

Department of Information Technology

JOGINPALLY B.R. ENGINEERING COLLEGE


(Affiliated to Jawaharlal Nehru Technology University, Hyderabad, T.S.)

Yenkapally (v) , Moinabad (M), Hyderabad (T.S.)-500075

2018-2022
JOGINPALLY B.R ENGINEERING COLLEGE
(Approved by A.I.C.T.E, Affiliated to J.N.T.U. Hyderabad)

Yenkapally , Moinabad Mandal, R.R District , Hyderabad ,(AP)-500075

CERTIFICATE
This is to certify that the project entitled “CROP YIELD PREDICTION USING
ML ALGORITHMS” is the bona fide work carried out by MUDUGULA AASHISH
MUDIRAJ (18J21A1233), KARKA BHARGAVI (18J21A1219) and SAHIL MANDAVIA
(18J21A1243) of B.Tech (INFORMATION TECHNOLOGY) under our guidance and
supervision. The Project Report is submitted to JNTU Hyderabad in partial fulfillment of
the requirements for the award of the B.Tech degree during the academic year 2021-2022.

INTERNAL GUIDE                    HEAD OF THE DEPARTMENT

Mr. P.THIMMA REDDY                Mr. P.SRINIVAS

Associate Professor               Associate Professor

External Examiner
ABSTRACT:
In an agricultural country, the economy depends predominantly on growth in agricultural yield
and on agro-industry products. Data mining is an emerging research field in crop yield analysis.
Yield prediction is a very important issue in agriculture: every farmer is interested in knowing
how much yield to expect.

The project analyzes related attributes such as location and the pH value, from which the
alkalinity of the soil is determined, along with the percentage of nutrients such as Nitrogen (N),
Phosphorus (P), and Potassium (K). Using the location together with third-party services such as
weather and temperature APIs, the type of soil, the nutrient value of the soil in that region, the
amount of rainfall in the region, and the soil composition can be determined.

All these attributes are analyzed, and the data is used to train various suitable machine learning
algorithms to create a model. The system aims to provide a model that predicts crop yield
precisely and accurately and gives the end user recommendations on the required fertilizer ratio
based on the atmospheric and soil parameters of the land, which helps to increase the crop yield
and farmer revenue.
1. INTRODUCTION
1.1 INTRODUCTION TO PROJECT:
Agriculture is the backbone of every economy. In a country like India, where the demand for food
keeps increasing with the rising population, advances in the agriculture sector are required to
meet these needs. Since ancient times, agriculture has been considered the main and foremost
culture practiced in India. Ancient people cultivated crops on their own land to meet their own
needs. These natural crops were cultivated and used by many creatures, such as human beings,
animals and birds, and the green produce of the land supported a healthy life. Since the
invention of new technologies and techniques, the agriculture field has been slowly degrading.
Because of these many inventions, people have concentrated on cultivating artificial, that is
hybrid, products, which leads to an unhealthy life. Nowadays, people lack awareness about
cultivating crops at the right time and in the right place. Because of these cultivation
techniques, seasonal climatic conditions are also changing and affecting fundamental assets like
soil, water and air, which leads to food insecurity. Even after analysing issues such as weather,
temperature and several other factors, there is no proper solution or technology to overcome the
situation we face. In India there are several ways to increase economic growth in the field of
agriculture, and multiple ways to increase and improve the crop yield and the quality of the
crops. Data mining is also useful for predicting crop yield. Generally, data mining is the process
of analysing data from different perspectives and summarizing it into useful information.

Data mining software is an analytical tool that allows users to analyse data from many different
dimensions or angles, categorize it, and summarize the relationships identified. Technically, data
mining is the process of finding correlations or patterns among dozens of fields in large
relational databases. The patterns, associations, or relationships among all this data can provide
information, and this information can be converted into knowledge about historical patterns and
future trends. For example, summary information about crop production can help farmers identify
crop losses and prevent them in future. Crop yield prediction is an important agricultural
problem. Every farmer wants to know how much yield to expect. In the past, yield prediction was
based on the farmer's previous experience with a particular crop. Agricultural yield depends
primarily on weather conditions, pests and the planning of harvest operations. Accurate
information about the history of crop yield is important for making decisions related to
agricultural risk management.
1.2 Problem Statement

To design, develop and implement a training model using different input data, so that the
machine is able to learn the features and predict the crop yield from the data using machine
learning techniques.

1.3 Objectives

The proposed system aims at predicting or forecasting the crop yield by learning from the past
data of the farming land. By considering various factors such as soil conditions, rainfall,
temperature, yield and other entities, the system builds a prediction model using machine
learning techniques. Here we make use of different machine learning techniques such as Random
Forest, Polynomial Regression and Decision Tree. Performance is evaluated based on prediction
accuracy.
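As a rough illustration of this comparison, the sketch below trains the three techniques on
synthetic data and scores each one on a held-out test split. It is not the project's actual code:
the synthetic features, the made-up yield formula, the use of scikit-learn, and the choice of the
R^2 score as the accuracy measure are all assumptions made for the example.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in for the real dataset (assumption: values are invented).
rng = np.random.default_rng(0)
n = 300
high_temp = rng.uniform(25, 40, n)       # average high temperature
low_temp = rng.uniform(10, 25, n)        # average low temperature
precipitation = rng.uniform(0, 300, n)   # precipitation
X = np.column_stack([high_temp, low_temp, precipitation])

# Hypothetical yield relationship, used only so there is a target to learn.
y = (2.0 + 0.01 * precipitation
     - 0.002 * (high_temp - 30) ** 2
     + rng.normal(0, 0.1, n))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# The three techniques named in the objectives.
models = {
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "polynomial_regression": make_pipeline(
        PolynomialFeatures(degree=2), LinearRegression()
    ),
    "decision_tree": DecisionTreeRegressor(random_state=0),
}

# Fit each model and score it on the held-out test split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = r2_score(y_test, model.predict(X_test))

for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: R^2 = {score:.3f}")
```

Scoring on data the models did not see during training keeps the comparison fair, since tree-based
models can fit their training data almost perfectly.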

2. Existing System
One existing method used time-series vegetation indices to optimize a crop growth process
model, and was usually used to predict crop yield in advance. It was noticed that the
multiple-indices-based method was less time-efficient than the single-index-based method. To
support agricultural management decisions, the crop yield trend needed to be analyzed and
reported. The yield variation trend includes temporal and spatial trends, which were studied
using statistical tests and spatial analysis methods, respectively. Because of irrigation
facilities and crop cultivar updating in northeast China, a perception exists that crop yields are
increasing. However, the actual trends remain unknown.

2.1 Disadvantages of Existing System:

● Sampling error due to clustered sampling.
● Less accuracy.
● In this method the crops are classified into four types:
a) Seasonal crops
b) Whole year crops
c) Short time plantation crops
d) Long time plantation crops
● It does not show the estimated yield for each crop in a particular region based on the
important parameters.
To overcome the above disadvantages of the existing system, we propose a new system as
described below.

3. Proposed System
Machine learning mainly consists of three methods of training a model, namely supervised
learning, unsupervised learning and reinforcement learning. Supervised learning is a method in
which known inputs are paired with known outputs, and the model learns the mapping from input
to output. In unsupervised learning the target output is not known, so the model has to be
trained to find the desired output from the data itself. Researchers generally use .csv files of
agricultural datasets for crop yield prediction. The dataset used here is suited to supervised
learning. It consists of different attributes like State, Humidity, Crop, Year, Average low
temperature, Average high temperature, Cloud cover, Precipitation, Potential Evaporation, Yield
etc.

Our proposed system predicts the yield of a crop from the agricultural data which we have
collected and stored in a .csv file. It predicts the yield of the crop in a particular state and
in a particular year. The yield of the crop is determined by several features. Some important
features or parameters which have the highest impact on the agricultural yield and are considered
in the project are listed below.

● Average high temperature
● Average low temperature
● Cloud cover
● Precipitation
● Potential Evaporation
● Area
● Yield
● State

3.1 Advantages:

● It predicts the yield of a crop in a state.
● It predicts which state will have the highest yield for the crop.
● It predicts which crop will have the highest yield in a state.
● It depicts the yield of the crop for a particular state in a particular year in a graphical
representation.
● Simple design and user interface.
● The system aims to help farmers cultivate the proper crop for better yield production.
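The two ranking queries listed above could be answered from a table of predicted yields roughly as
sketched below. The rows, the yield values and the helper names best_state_for_crop and
best_crop_for_state are hypothetical, chosen only for illustration; in the real system the yields
would come from the trained model.

```python
# Hypothetical predictions: (state, crop, predicted yield in arbitrary units).
predictions = [
    ("Telangana", "Rice", 3.1),
    ("Punjab", "Rice", 4.0),
    ("Kerala", "Rice", 2.6),
    ("Telangana", "Maize", 2.7),
    ("Punjab", "Maize", 3.4),
]


def best_state_for_crop(rows, crop):
    """Return the (state, yield) pair with the highest predicted yield for a crop."""
    candidates = [(state, y) for state, c, y in rows if c == crop]
    if not candidates:
        raise ValueError(f"no predictions for crop {crop!r}")
    return max(candidates, key=lambda pair: pair[1])


def best_crop_for_state(rows, state):
    """Return the (crop, yield) pair with the highest predicted yield in a state."""
    candidates = [(c, y) for s, c, y in rows if s == state]
    if not candidates:
        raise ValueError(f"no predictions for state {state!r}")
    return max(candidates, key=lambda pair: pair[1])


print(best_state_for_crop(predictions, "Rice"))
print(best_crop_for_state(predictions, "Telangana"))
```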

4. MODULES

4.1) Dataset collection

4.2) Pre-processing the data

4.3) Transforming the data

[Figure: Data set of average high temperature, average low temperature, cloud cover,
precipitation and potential evaporation -> Pre-processing of the data set -> Prediction model
(ML Algorithm)]

4.1) Dataset collection :

The dataset used for this project was collected from various online sources like Kaggle.com
and data.govt.in. We have taken the agricultural data of a few regions. Some important features
or parameters which have the highest impact on the agricultural yield and are considered in the
project are listed below.
● Average high temperature
● Average low temperature
● Cloud cover
● Precipitation
● Potential Evaporation
● Area
● Yield
● State

4.2) Pre-processing the data :

After the dataset is selected, it has to be pre-processed into a form that you can work
with. The steps include formatting, cleaning and sampling. Formatting converts the selected data
into a format suitable for you to work with. Cleaning is the removal or fixing of missing data.
Sampling is taking a small representative sample of the selected data, which may be much faster
for exploring solutions than using the whole dataset.
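The three steps can be sketched with the Python standard library as follows. The inline CSV text,
its column names and its values are invented for the example, and dropping incomplete rows is only
one possible cleaning choice; the project may instead fix such rows.

```python
import csv
import io
import random

# Invented sample data; two rows have a missing or unparsable value.
RAW_CSV = """State,Year,Precipitation,Yield
Telangana,2015,812.5,2.9
Telangana,2016,,3.1
Punjab,2015,640.0,4.2
Punjab,2016,655.3,n/a
Kerala,2015,2900.1,2.5
Kerala,2016,3050.7,2.6
"""

NUMERIC_COLUMNS = ("Precipitation", "Yield")


def format_row(row):
    """Formatting: convert text fields to typed values; bad numbers become None."""
    out = {"State": row["State"].strip(), "Year": int(row["Year"])}
    for col in NUMERIC_COLUMNS:
        try:
            out[col] = float(row[col])
        except ValueError:
            out[col] = None
    return out


def clean(rows):
    """Cleaning: here, simply drop rows that have a missing numeric value."""
    return [r for r in rows if all(r[c] is not None for c in NUMERIC_COLUMNS)]


def sample(rows, k, seed=0):
    """Sampling: take a small random subset for quick experiments."""
    return random.Random(seed).sample(rows, min(k, len(rows)))


rows = [format_row(r) for r in csv.DictReader(io.StringIO(RAW_CSV))]
cleaned = clean(rows)
subset = sample(cleaned, k=3)
print(len(rows), len(cleaned), len(subset))
```

Fixing the random seed makes the sample reproducible, which helps when comparing experiments run
on the same subset.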

4.3) Transforming the data :

The final step is transforming the selected data. The pre-processed data is transformed
into data that is ready for machine learning algorithms by using various feature engineering
techniques like scaling, feature aggregation and so on. There may be several features that can be
combined into a single feature which would be more meaningful to the problem you are trying to
solve.
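A minimal sketch of the two transformations named above is given here. The records, the field
names and the choice of min-max scaling are assumptions for illustration; the derived avg_temp
feature is one example of combining two features into a single one.

```python
def min_max_scale(values):
    """Scale a list of numbers linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to scale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Invented records using the kinds of features listed in the report.
records = [
    {"avg_high_temp": 34.0, "avg_low_temp": 22.0, "precipitation": 800.0},
    {"avg_high_temp": 30.0, "avg_low_temp": 18.0, "precipitation": 1200.0},
    {"avg_high_temp": 38.0, "avg_low_temp": 26.0, "precipitation": 400.0},
]

# Feature aggregation: combine high and low temperature into one mean value.
for r in records:
    r["avg_temp"] = (r["avg_high_temp"] + r["avg_low_temp"]) / 2

# Scaling: bring precipitation into [0, 1] so its large values do not dominate.
scaled_precip = min_max_scale([r["precipitation"] for r in records])
for r, s in zip(records, scaled_precip):
    r["precipitation_scaled"] = s

for r in records:
    print(r["avg_temp"], r["precipitation_scaled"])
```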
5. SOFTWARE REQUIREMENTS SPECIFICATION

5.1 SOFTWARE REQUIREMENTS :

● Operating System : Microsoft Windows 10

● Programming Language : Python

● Python IDE : Spyder

● Text Editor : Jupyter notebook

● Back End Tools : Python 3.6


● Data Set : Average low and high temperatures,
Percentage of Cloud Cover,
Precipitation and Potential Evaporation.

5.2 HARDWARE REQUIREMENTS :

● Processor with speed : Intel [i3] (2.0 GHz)
● Hard Disk : 40 GB
● RAM : 4 GB
