Complete Project Work
Complete Project Work
CROP PREDICTION
Our Aim :-
1
Title:- A STUDY ON USAGE OF DATA ANALYTICS IN AGRICULTURE FOR
CROP PREDICTION
Objective:-
The following the Objectives of the Project
1. To study the role of Data Analytics application in Agriculture.
2. To examine the advantages of using Data Analytics Techniques for Crop Prediction.
3. To develop a Machine Learning Model for crop prediction by using Python Coding and various
Statistical Concepts.
4. To predict the best crop to be grown based on Soil Composition, Temperature, Ph- Value and
Rainfall.
2
INTRODUCTION
Data Analytics ? :-
Business Analytics is the process by which businesses use statistical methods and
technologies for analysing historical data in order to gain new insight and improve
strategic decision-making. It is a data management solution refers to the use of
methodologies such as data mining, predictive analytics, and statistical analysis in order
to analyze and transform data into useful information, identify and anticipate trends and
outcomes, and ultimately make smarter, data-driven business decisions.
History…
b) DATA MINING: Data mining for business analytics sorts through large datasets
using databases, statistics, and machine learning to identify trends and establish
relationships.
d) TEXT MINING: explores and organizes large, unstructured text datasets for the
purpose of qualitative and quantitative analysis
g) OPTIMIZATION: once trends have been identified and predictions have been
made, businesses can engage simulation techniques to test out best-case scenarios
Agriculture is the backbone of the Indian economy, but the industry currently requires more
support and concentration than any other. India is a country of over a billion people in
population. Around 70% of the population lives in the rural areas. With 40% of the country’s
workforce, agriculture is a major industry and an influencer of the Indian economy. Despite
this all facts, its contribution to the $2.3 trillion economy is just a meager 16% of the entire
GDP.
4
The reason behind this low contribution of Agriculture towards GDP of this country is many.
Agriculture in India lacks institutional attention, proper support from banks in terms of loans
and supportive farmer welfare schemes from the Government side. Sometimes, it even suffers
from a myriad of disasters like depleting groundwater levels in rural areas, climate change,
unpredictable monsoon or lack of rainfall, droughts, floods, unfair price fixing policies of
produce, migration of farmers towards the cities in search of better paying jobs, and more.
Agriculture is one sector responsible for feeding every individual, but the people involved in it
are not taken care properly. After many institutions have failed, time has indeed come for
technology to take care of the change. With newer problems cropping up every day in the most
inevitable indigenous sectors, it is high time we resort to emerging technologies for solutions.
The revolutionary technology that goes by the name “Big Data” has already made impact in
other Indian industries from IT to healthcare. And now, investors and market players are
planning to leverage the potential of Big Data for the benefit of agriculture sector in India.
Apart from major companies, it is the vision of several youth of the country that has attracted
the use of Big Data for farming. For instance, SatSure, founded by the 33-year-old Abhishek
Raju works on using Big Data and its allied technologies like data science and IoT to better the
lives of farmers.
Abhishek shares the fact that he was deeply moved by the rate of farmer suicides and the lack
of application of science and newer technologies in the oldest Indian industry – agriculture. His
solution to this is SatSure. According to him, “The parameters associated with soil health and
crop growth have had a very restricted scope for research and his technology immensely uses
Big Data and Machine Learning technologies to solve the restriction and bring about insights
on crop phenology.”
When we got on a call with Mr. Abhishek Raju, he shared that, “Indian agriculture sector is
much disorganized and heavily cash oriented. Electronic transactions are almost non-existent,
and that is why most of the transactions are unrecorded. We help them provide insights about
farm productivity, when to irrigate, sow, harvest, and the patch of land that can be used by
farmers. We help banking and insurance companies in settlement of risk assessment, crop loss,
5
and offer insights by analyzing current and historical satellite images.”
“The satellite images are not only in a single visual spectrum but we have multiple data layers
which contain images merged into one to gather as much information as possible. This is what
we at SatSure mine. However, data is one thing, and what you infer from data is another thing.
We analyze data to make action oriented conclusion-able intelligence.” He adds.
Facilities like satellite-based filed monitoring, embedded sensors on crops and fields,
predictions on wind direction, fertilizer requirement notifications, pest infestations, GPS-
enabled tractors, water cycles, and more are acting as points of rich data sources that could be
used for better agriculture practices. Besides, Big Data and analytics now also enable
monitoring and supervision for growth rate and nutrient requirements on a plant-by-plant basis.
Moreover, analytics is enabling farmers to make data-based decisions like which crops to plant
for their next harvest. The rich information on soil health, water availability, and predictions on
rainfall and precipitation make this data source. Welcome to the world of Smart Farming.
Smart Farming
Smart Farming is the breakthrough application of science and technology in the field of
agriculture. Smart farming is the application of technologies like IoT, Big Data and analytics
on an agricultural field. It makes use of technologies like the Internet of Things, cloud
computing, Machine Learning, and Big Data to enable farmers to have more insights on the
consequences of their actions and take a much better and informed decision on farming
practices.
The power of smart farming lies in the fact that it goes beyond solving the shortcomings and
pitfalls of agriculture. The application of Big Data is leaving significant impact on the entire
realm of supply-chain, giving predictive insights on farming practices and operations, help
redesign business models, deliver realtime decisions on operations and more.
Jyoti Vaddi of Cropin shares, “The world population is estimated to cross the 10 billion mark
6
by the middle of this century. This population growth combined with urbanization will require
the agricultural production to double. To succeed, Jyoti recommends the need for smart
solutions for fairly produced, sustainable food, feed, and fibre, which is one of the mainstay
principles of CropIn.”
Satsure enables insurance companies, banks, traders, pesticide and seed manufacturing
companies, and farmers the ability to take informed farming decisions by leveraging the
combined potentials of technologies like –
Cloud Computing
Big data
Machine Learning
The Internet of Things
Web-based Software as a Service platforms
Raman Singh Saluja, founder of Gramco Infratech says, “Agriculture is a very physical
business, requiring physicality in terms of handling/warehousing/value addition / etc, which
will continue to be the bedrock.”
When it comes to the use of technologies, he shares that data analytics offers tremendous
potential for improving cost to output ratio, reduce/optimize Input usage, increase yields, offer
timely actionable information and do more.
He further reveals,” At Gramco, there are two initiative underway which will be brought to
market by the 3-4th quarter of 2018. One has been piloted with very encouraging results with
on ground support of a leading insurance company.”
7
During their market opportunity study, Cropin gathered that agribusinesses had minimal and
outdated technological/digital resources, and were not able to make informed data-driven
decisions. She reveals that with consumers keen to know the origin of their food and how it
was produced and processed, there was a need for transparency along the end-to-end
agribusiness supply chain.
The technologies that power Cropin in delivering efficient farming solutions to stakeholders in
its network include:-
These technologies foster an environment for production forecast, risk management and
coverage, output predictability, quality maximization, and increased farm sustainability to
agriculture input companies, banks and financial institutions, insurance companies, farming
enterprises, seed manufacturing companies and government bodies respectively.
This is generally the database where all chunks of data from multiple sensors and resources are
gathered, stored, analyzed, and retrieved for actions. An optimized management information
system should offer information on:
Crops:
Crop stress
Statuses on crop tissue nutrients
Crop population
Weed patches
Fungal or insect infestation
Crop yield
Soil
Physical condition
Soil texture
Structure
Moisture
Nutrients and more
Climate
Humidity
Rainfall
Wind speed
Temperature
Devices
Technology is what puts the smart in smart farming and the following make up the network:
9
Global positioning systems and differential global positioning systems for better
accuracy
Geographical information systems
Remote sensing technologies like data sensors, RADARS, data transmitters, drones,
cameras, and other connected devices
Cloud architecture
The Internet of Things, where devices are capable of communicating with each other
and deliver real-time updates and notifications to farmers on crop statuses, water levels,
moisture content, crop yield, and more.Technologies like Machine Learning, Data
Analytics, and Big Data for the entire process and setup to make sense.
The primary vision of precision farming is to optimize RoI and preserve resources by allowing
farmers and landlords to take optimized and informed decisions from the available field data.
Precision agriculture fosters an environment where farmers can zero-down precise locations in
their fields for the spatial availability of several resources like water availability, topography,
soil fertility, organic matter, nitrogen levels, moisture content, the presence of magnesium,
potassium, and more.
Complemented by services and features like GPS devices, sensors that are even capable of
measuring chlorophyll levels, drones, and satellite imagery, precision agriculture offers a
treasure chest of information for farmers.
Big data is an extensive collection of both structured and unstructured data that can be mined
for information and analyzed to build predictive systems for better decision making. Besides
the government, telecom, healthcare, marketing, education, and several industrial sectors, big
data applications in agriculture are gaining momentum as technologies like livestock
monitoring gadgets, drones, and soil sensors are generating large volumes of data to support
data-driven farming. The ultimate goal is to help farmers, agriculturists, and scientists adopt
1
0
beneficial farming practices.
Big data
When a question arises what is big data, it is a collection of large, complex, and unprocessed
data is called ‘big data’. Due to complexity, big data cannot be processed by conventional data
processing and data management applications and requires advanced tools that can analyze and
process large volumes of data. Big data is characterized by some unique features – volume,
variety, velocity, variability, veracity, and complexity. This vast reservoir of information must
be studied, stored, and processed systematically for its applications in the public sector,
scientific research, agriculture, industry, etc.
Government – Data influx from sources such as sensors, satellites, CCTV and traffic cameras,
calls, emails, social media, IT spaces, academia, etc. calls for efficient data storage and analysis
for better governance and management of the public sector.
Banking – The big data applications in banking & insurance sector handles enormous amounts
of data. Big data analytics are being used to store data, improve scalability, and derive business
insights.
Healthcare – The problem of communication silos that plagues the healthcare industry can be
considerably reduced with the application of big data-based protocols.
Telecom – Real-time analysis of big data provides useful predictions to derive business
insights and strategies such as delivering revenue-generating services while keeping in mind
network and customer considerations.
1
1
Traditional tools are being replaced by sensor-equipped machines that can collect data from
their environments to control their behavior – such as thermostats for temperature regulation or
algorithms for implementing crop protection strategies. Technology, combined with external
big data sources like weather data, market data, or standards with other farms, is contributing to
the rapid development of smart farming.
Sustainability, global food security, safety, and improved efficiency are some of the critical
issues that are being addressed by big data applications in agriculture. Undoubtedly, these
global issues have extended the scope of big data beyond farming and now cover the entire
food supply chain. With the development of the Internet of Things, various components of
agriculture and the supply chain are wirelessly connected, generating data that is accessible in
real-time.
Primary sources of data include operations, transactions, and images and videos captured by
sensors and robots. However, extracting the full potential of this data repertoire lies in efficient
analytics. The development of applications related to risk management, sensor deployment,
predictive modeling, and benchmarking, has been possible due to big data.
Stakeholders involved
Technology and input suppliers are the traditional players who offer their platforms and
solutions to the farmers. Data privacy and security risks compel farmers to form coalitions to
benefit from their data, creating a close and proprietary environment. Big data also attract start-
ups, private firms, non-agricultural tech companies, and public institutions.
The organization of the stakeholders determines the infrastructure of big data solutions – either
proprietary or an open-source system. The development of big data applications in
agriculture will result in either the farmers becoming franchisers in integrated long supply
chains or a scenario in which farmers collaborate with suppliers and the government to engage
in short supply chains.
Boosting productivity – Data collected from GPS-equipped tractors, soil sensors, and other
external sources has helped in better management of seeds, pesticides, and fertilizers while
1
2
increasing productivity to feed the ever-increasing global population.
Access to plant genome information – This has allowed the development of useful agronomic
traits.
Predicting yields – Mathematical models and machine learning are used to collate and analyze
data obtained from yield, chemicals, weather, and biomass index. The use of sensors for data
collection reduces erroneous manual work and provides useful insights on yield prediction.
Risk management– Data-driven farming has mitigated crop failures arising due to changing
weather patterns.
Food safety – Collection of data relating to temperature, humidity, and chemicals, lowers the
risk of food spoilage by early detection of microbes and other contaminants.
Savings – AI and data analytics-driven farming generate significant savings for the agriculture
industry.
Big data analytics has influenced some of the most critical sectors of the economy and will
continue to do so. The big data applications in agriculture are still in their early days, with
challenges that need to be addressed. The full potential of big data will be realized if farmers
and stakeholders come together to develop and adopt innovative crop management techniques
that are data-driven and data-enabled.
If you are interested to know more about Big Data, check out our , which is designed for
working professionals and provides 7+ case studies & projects, covers 14 programming
languages & tools, practical hands-on workshops, more than 400 hours of rigorous learning &
job placement assistance with top firms.
Agriculture is the backbone of Indian economy, still, it suffers from a heaping number of
disasters such as climate change, unpredictable monsoon or lack of it, droughts, floods,
1
3
migration of farmers towards the cities in search of better-paying jobs, and more. People
involved in agriculture are the last to be taken care of, even when they are the one who feeds
the whole country. With institutions failing to support agriculture in terms of providing loans
and farmer welfare schemes, the time has come for technology to take over the change. Data
Science is here for the rescue!
Data is the necessity for industries and hence, data science has a number of applications. After
revolutionizing industries like IT, Banking, Manufacture, Finance, Healthcare, and many more,
it is all set to benefit the agriculture industry.
This is related to building digital maps for soil types and properties. Some people in the
agricultural industry manage so many acres of land, it’s almost impossible to get prompt
updates and alerts about potential problems without help from technology.
Farmers Edge, a Canadian company takes daily satellite images of farms and combines it
1
4
with other relevant data. It includes information from more than 4000 interconnected weather
stations!
Several countries like Ireland also depend on satellite-based soil and crop monitoring to inspect
areas more quickly than traditional methods allow. This helps in deciding what crops should be
grown on a particular piece of land. It saves a lot of time and effort and results in higher yield
production.
2. Weather Prediction
Weather plays a very important role in agriculture production and has an influence on the
growth, development, and yield of crops. Weather aberrations can cause physical damage to
crops and soil erosion.
The quality of crops from the field to the market depends on the weather. Bad weather can
adversely affect the quality of crop during transportation or storage.
Data science experts know how to use tools that identify the patterns and relationships that may
otherwise be hidden. They can draw conclusions that push agricultural science forward through
the examination of specific factors leading to change in weather.
The findings brought about by sifting through databases and studies to conclude things like this
in agricultural processes can bring about remarkable changes.
3. Fertilizers Recommendation
Knowing the exact fertilizer rate is a science and requires a thorough analysis of multiple
factors. Often, hundreds of dynamic parameters have to be considered.
1
5
Such parameters include crop nutrient uptake rates, research data, soil chemical, physical and
biological properties, weather, water composition, land type, soil testing methods, irrigation
techniques, fertilizer characteristics, interactions between fertilizers and many more.
Because of the complexity of finding the “optimal fertilization range”, misuse of fertilizers is a
global phenomenon. The majority of farmers still rely on trial and error, guesswork and
estimation.
The result is, crops do not meet their yield potential, and increase environmental pollution.
Data science professionals are now able to advise the farmers with the right quantity of
fertilizers.
In modern agriculture, advanced algorithms are used to identify the patterns and behavior of
nature that helps in forecasting the invasion of pests and the spread of microscopic diseases.
Advanced analytics in agriculture are informing how farmers should manage pests. Digital
tools and data analysis in agriculture are being utilized to scientifically deal with harmful
insects.
Agricultural pests can quickly cut into a farmer’s profits. But, misusing pesticides can have
adverse effects on people, plants and other living things.
Fortunately, some companies have recruited data science professionals to develop user-facing
platforms that analyze when to apply pesticides and how much to use.
While some insects can be incredibly beneficial to farmers and the crops, others can be toxic
and spread diseases. Disease detection can be done by taking images of the field using drones
and processing them to detect areas within this field that are infected.
Climate change is a looming concern that has already affected the agriculture sector. However,
data science experts are working hard to figure out ways to compensate for the change.
One project involves giving IoT sensors to Taiwanese farmers of rice production so they can
collect information that is necessary about their crops. It will help farmers to optimize their
production cycles, even if climatic changes make it challenging.
1
6
The traditional farming calendar is no longer sufficient due to the extreme climate changes but,
data analysis can revolutionize the future of farming.
Data scientists are also analyzing agriculture soil data to understand how soil can cope with
climate change by releasing greenhouse gases and also how soil can adapt to climate change.
We can also use weather prediction in the automated irrigation system. How exactly? Having a
system that tells you the soil is dry but you don’t need to irrigate because it is going to rain
after a few hours seems to be an interesting use case.
All the countries in the world are currently in a situation where they are required to use water in
a very efficient manner.
According to the recent studies, water is becoming more and more in short supply worldwide
and more than one-third of the world population would face total water shortage by the year
2025.
In agriculture too, the major problem which farmers face is the water scarcity, so to improve
the usage of water, one of the irrigation systems- using drip irrigation which is implemented as
automated irrigation system for small scale farms.
According to a recent press, DATOS project applied artificial intelligence, machine learning,
and other data science techniques to remotely sensed data. Apart from that, it used systems to
produce geospatial outputs that can be used for disasters, agriculture and other purposes.
The DATOS Project has developed a way to map out crops by using satellite images and by
extracting the temporal signature of crops determined through radar satellite images.
Moreover, DATOS produces flood situation maps by retrieving satellite images and letting AI
identify flooded areas from these images.
It is able to detect floods in areas that are hit by heavy rainfall and sends these mapped out
areas to the respective DOST regional offices in the event of severe weather disruptions.
1
7
Data science is also used to detect objects from satellite images. An example is the current
partnership with the Bataan Peninsula State University in order to help the institution automate
the mapping of their province’s mango trees.
Other objects that have been detected from satellite images include road networks, ships, land
cover classes, and built-up areas.
A good example is in Egypt, where farmers use water pumps to collect water from the
river Nile to their crops. Advanced water sprinklers are used to irrigate large fields and this
helps the crops to get enough water which is essential in their growth. This method is
useful in the efficient usage of water.
The invention of the mobile app such as “FamGraze” for farmers manage their grass
more effectively by suggesting the cheapest feed for their livestock. It saves time and there
is no need for any paper or spreadsheets. This helps in working faster and being accurate in
and off the field!
Modern transportation technology facilities help farmers easily transport fertilizers or
other farm products to different parts of the world. For example – consumers in Dubai will
get fresh carrots from Africa on the same day when the carrots leave the garden in Africa.
Summary
These were just a few cases showing how one can use data science in the work of an
agricultural corporation right now because, in the future, we cannot predict what opportunities
it can bring. Technology has played a huge role in developing this industry. Today, it is
possible to grow crops in a desert by use of agricultural biotechnology and there is much more
scope in the future.
Data scientists are part of virtually every major industry, and agriculture is no exception. Here
are six ways data scientists bring advantages to the agricultural sector.
1. Receiving Useful Data to Help Fight Food Scarcity and Empower Small Farmers
1
8
Since data scientists have tools to process and analyze gigantic amounts of data efficiently,
projects are underway to determine how that information might help small-scale farmers join in
the battle to solve worldwide food shortages.
In September 2018, a coalition launched a project that will run through 2030 and look at data
from approximately 500 million farmers in impoverished areas from 50 countries.
The people behind the project hope the data will show whether agricultural investments in
various countries are paying off and help develop policies for the farmers. On a larger scale, this
project aligns with the United Nations’ Sustainable Development Goals to double the
agricultural productivity and incomes of farmers in developing nations and help them reduce
world hunger.
Agricultural pests can quickly cut into a farmer’s profits. But, misusing pesticides can have
adverse effects on people, plants and other living things. Fortunately, some companies recruit
data scientists to help them develop user-facing platforms that analyze when to apply pesticides
and how much to use.
In another case, Saillog, an Israeli startup, developed a smartphone app called Agrio that
informs farmers of the diseases currently affecting their crops or ones found on surrounding
farms.
Data scientists know how to use tools that identify patterns and relationships that may otherwise
remain hidden. As such, they can draw conclusions that push agricultural science forward
through the examination of specific factors. For example, researchers know trace minerals
1
9
positively affect the metabolic functions of livestock and poultry, while carotenoids play a role
in increasing egg yolk quality and nutrition.
The findings brought about by sifting through databases and studies to conclude things like
these show how seemingly small factors in agricultural processes can bring about substantial
changes. When animal feed brands, farmers and others in the agriculture sector take advantage
of such findings from data scientists, they could positively change their operations and get better
results.
Climate change is a looming concern that has already affected the agriculture sector. However,
data scientists are hard at work figuring out ways to compensate for the shift.
One project involves giving IoT sensors to Taiwanese rice farmers so they can collect crucial
information about their crops. It’ll all go into a database used to help farmers optimize their
production cycles, even when climate change makes that task exceptionally challenging.
Following the traditional farming calendar is no longer sufficient because of climate change.
But, data analysis could forever change the future of farming.
Scientists are also scrutinizing agricultural soil data to improve their understanding of how soil
contributes to climate change by releasing greenhouse gases, as well as how soil data might aid
in adapting to climate change. Collecting this kind of information is tricky, but scientists believe
it could fill in knowledge gaps associated with the relationship betweensoil and climate change.
A poor yield can result in a devastating season for farmers, as well as all the entities that depend
on the crops. IBM has a platform that estimates corn yields two to three months in advance,
reducing unpleasant surprises for agricultural professionals.
Similarly, researchers at the University of Illinois depend on seasonal forecasts and satellite data
to make end-of-season predictions earlier than usual. Lab results indicate this new method is
even more accurate than the U.S. Department of Agriculture’s real-time data.
2
0
6. To Provide Images of Crops and Land
Some people in the agricultural industry manage so many acres of land, it’s impossible to get
prompt updates and alerts about potential problems without help from technology. Canadian
company Farmers Edge takes daily satellite images of farms and combines it with other relevant
data, including information from more than 4,000 connected weather stations.
Moreover, several countries, including Ireland and Ghana, depend on satellite-based crop
monitoring to inspect areas more quickly than traditional methods allow.
Data scientists play an essential role in furthering the use of satellite imagery for these purposes
and others. Pilot projects and new platforms could help those experts determine the methods
most appropriate for tackling current and future needs within the agriculture sector.
Agriculture has become increasingly high-tech over the years. Now, much more than in the past,
agriculture professionals can dig into data and use it to make highly informed decisions. The
advancements of data scientists are making this reality possible, both now and for the
foreseeable future.
PROFILES:
INDUSTRY PROFILE
Agro-based industries are industries that use plant and animal-based agricultural output
2
1
as their raw material. Also, they add value to agricultural output by processing and
producing marketable and usable products. Some examples of agro-based industries in
India include Textile, Sugar, Vegetable Oil, Tea, Coffee and Leather goods industries.
2
2
REVIEW OF LITERATURE
In a time series analytic model, formerly observed values are used to forecast the output.It
uses the concept of time series which is a series of data points arranged in time. However,
here time is considered as an independent variable with an ambition to forecast the future. It
is used in forecasting crops, vegetation price movement and price fluctuation in market
The following is the outcome of the research, Farmers also grow non-food
items like rubber, cotton, jute etc. More than 70% of the household in the
rural area depend on agriculture. This domain provides employment to more
than 60% of the total population and has a contribution to GDP also (about
17%).
This research has discussed the application of mathematical model like fuzzy
logic designs in optimization of the crop yield, artificial neural networks in
validation studies, genetic algorithms designs in accessing the fitness of the
model applied, decision trees, and support vector machines to study soil,
climate conditions and water regimes related to crop growth and pest
management in agriculture.
This surveys the technical achievements in the field of Rice crop yield prediction.
It also Discussed methodology, comprehensive survey of many proposed approaches to predict
rice crop yield and applications. It also discusses various data mining methods used for
prediction of crop yield for rice. Rising better plans to foresee crop productivity in various
climatic conditions can help farmer and different partners in essential basic leadership as far as
agronomy and product decision.
Precision agriculture gives not only great challenges but also great
opportunities for Computer Scientists specially who are working in the field
of data analysis. Most batch processing tools follow Apache Hadoop which is
an open-source software which is reliable and very fast. Also, Hadoop
framework has been used for taking decision on storage and distribution of
the products in an efficient way without wastage.
This research has revealed that “Tools for better demand and yield prediction
should be developed”. Among the suggested future directions, agricultural robots which are
self-operated to identify weeds and remove them, tools with high performances, programming
language for big data analysis can be identified. It is revealed that data privacy is an issue in big
data analysis in agriculture because misusing of data.
This research has identified the growth of big data was doubled at the end of
2016. Also, big data is characterized by 3Vs to 4Vs which is meant by
Volume, Variety, Velocity and Veracity. Through this research, we came to
know that Map Reduce, Storm and Apache Spark are emerging tools in big
data analytics. Most batch processing tools follow Apache Hadoop which is
an open-source software which is reliable and very fast.
The framework Hadoop is having two major parts which are HDFS and
Hadoop MapReduce. Apache Spark is also a computing framework for real
time data as well as having memory oriented architecture with
flexibility.This research has also revealed that “Apache Hadoop and Apache
Spark most popular among big data analytics applications”. In order to
minimize the restrictions regarding data collecting, data storage and data
analyzing it is proposed to use deep learning techniques in near future.
The crops that were considered in the model for prediction include coriander,
pulses, cotton, paddy, sorghum, groundnut, sugarcane, banana and vegetables.
Different attributes of the soil were considered in order to predict the crop,
which included pH, depth, erosion, permeability, texture, drainage, dater
holding and soil color. The technique used was ensembling, which combined
the power of using two or more different models for better prediction. The
ensembling technique used was called the Majority Voting Technique.
The paper hypothesizes analysis of Explorative Data and considers the design
of different types of predictive models. A data set is taken as a sample data
set, and different regression techniques are tried to recognise and examine
each property. Specific regression methods discussed here are Multiple
Linear, Linear, Non-Linear, Polynomial, Ridge regression and Logistic. Using
this article, we obtain a comparative study of the different algorithms in data
analytics. This helped in determining which algorithm is most appropriate to
the proposed system.
The crops were inspected and graded depending on an examination to estimate crop
yielding. This categorisation is found from different data mining algorithms. This
paper provides a perception into various grouping rules, such as K-Nearest
Neighbour and Naive Bayes. By making use of this document, we evaluated the
classification rules and established which all will match the set of data we will be
using in our project.
Step-by-step Process :-
Step – 1:
Step – 2:
Step – 3:
Step - 4:
a) Data Visualizations
a)
Data Analysis and Inferences
The Dataset also consists of Soil Conditions required to grow the Crops. They are :-
The Dataset consists of Climatic conditions required to grow the crops. They are :-
Temperature
Humidity
Rainfall.
The Dataset consists of 22 unique crops such as Maize, Wheat, Mango, Watermelon, Rice,
Chickpea, Kidney beans, Pigeon peas, Moth beans etc.
a) Clustering
K Means algorithm to perform Clustering analysis
b)
PROBLEM STATEMENT
4. RESEARCH METHODOLY
b) The source of the Data is collected only from ‘Harvestify’, ‘GitHub’ and
‘Kaggle’.
c)
d) The crop is suggested based on the Soil Health, Ground water Table,
Seasoning effects etc. This study is confined only these four parameters.
Sources of Data: -
Sampling Techniques: -
Statistical Software’s:-
a) Anaconda Software
b) Jupyter Notebook
c) Python Coding tools are used to analyze the collected Data.
Youtube link :- https://www.youtube.com/watch?v=323jreAO8O0
Youtube :- https://www.analyticsvidhya.com/blog/2020/11/artificial-intelligence-in-
agriculture-using-modern-day-ai-to-solve-traditional-farming-problems/
REFERENCE
2019:-
1. Viviliya, B. and Vaidhehi, V., “The Design of Hybrid Crop Recommendation System
using Machine Learning Algorithms”. International Journal of Innovative Technology
and Exploring Engineering, 2019, 9(2), pp.4305-4311.
5.M. Peixeiro, “The Complete Guide to Time Series Analysis and Forecasting,” Medium, 07-
Aug-2019. [Online]. Available: https://towardsdatascience.com/the-complete-guide-to-time-
seriesanalysis-and-forecasting-70d476bfe775. [Accessed: 06-Jan-2020].
2018:-
1. Z. Doshi, S. Nadkarni, R. Agrawal and N. Shah, "AgroConsultant: Intelligent Crop
Recommendation System Using Machine Learning Algorithms," 2018 Fourth
International Conference on Computing Communication Control and Automation
(ICCUBEA), Pune, India, 2018, pp. 1-6. doi: 10.1109/ICCUBEA.2018.8697349.
2017:-
1. T. Edwin, “Onion, tomato price spike: season not the only reason”, Nov. 13, 2017. [Online].
Available: https://www.thehindubusinessline.com/economy/agri-business/onion-tomato-price-
spike-season-not-the-only-reason/article9957255.ece#. [Accessed Feb. 22, 2020].
2. Dhivya B H, Manjula R, Siva Bharathi S, Madhumathi R. A Survey on Crop Yield
Prediction based on Agricultural Data, International Journal of Innovative Research in
Science, Engineering and Technology. 2017; 6(3).
3.Jharna Majumdar, Sneha Naraseeyappa, Shilpa Ankalaki. Analysis of agriculture data using
datamining techniques: application of big data. Journal of Big data. 2017.
4. Swarupa Rani. The Impact of Data Analytics in Crop Management based on Weather
Conditions. International Journal of Engineering Technology Science and Research. 2017;
4(5):299-308.
5. DakshayiniPatil, Dr. M .S Shirdhonkar, “Rice Crop Yield Prediction using Data Mining
Techniques: An Overview”, International Journal of Advanced Research in Computer Science
and Software Engineering, Volume 7, Issue 5, ISSN: 2277 128X,2017.
6.Mukesh kumar , Prof.Mayura Nagar, "Big Data analytics in agriculture and distribution
channel," in IEEE International Conference on Computing Methodologies and
Communication, 2017.
10. Ch. Chandra Sekhar, J. Uday Kumar, B. Kishor Kumar, Ch. Sekhar, "Big Data Analytics on
Indian Crop Planning to Increase Agricultural Production," Advanced Science and Technology
Letters, vol. 147, p. 7, November 2017.
2016:-
1. S. Pudumalar, E. Ramanujam, R. H. Rajashree, C. Kavya, T. Kiruthika and J. Nisha,
"Crop recommendation system for precision agriculture," 2016 Eighth International
Conference on Advanced Computing (ICoAC), Chennai, 2017, pp. 32-36. doi:
10.1109/ICoAC.2017.7951740.
2. T.R. Lekhaa, “Efficient Crop Yield and Pesticide Prediction for Improving Agricultural
Economy using Data Mining Techniques”, International Journal of Modern Trends in
Engineering and Science (IJMTES), 2016, Volume 03, Issue 10.
1. R. Kumar, M. P. Singh, P. Kumar and J. P. Singh, "Crop Selection Method to maximize crop
yield rate using machine learning technique," 2015 International Conference on Smart
Technologies and Management for Computing, Communication, Controls, Energy and
Materials (ICSTM), Chennai, 2015, pp. 138-145. doi: 10.1109/ICSTM.2015.72254.