Agricultural Commodity Price Prediction Model A Machine Learning
Agricultural Commodity Price Prediction Model A Machine Learning
https://doi.org/10.1007/s00521-023-08528-7 (0123456789().,-volV)(0123456789().
,- volV)
ORIGINAL ARTICLE
Received: 17 August 2022 / Accepted: 21 March 2023 / Published online: 6 April 2023
The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023
Abstract
An efficient machine learning-based framework for crop price prediction is proposed in this paper to assist the farmers in
estimating their profit-loss beforehand. The proposed work is composed of four functional blocks, such as crop yield
prediction, determination of supply, demand prediction and crop price prediction. The input datasets consist of the various
field values, such as yield, remaining crop at the end of the year, import, demand and price of a crop. Various time series-
based algorithms, such as autoregression, moving average, autoregressive moving average, autoregressive integrated
moving average and exponential smoothing, are used to forecast the crop yield. The supply of the crop is determined as a
sum of three variables, i.e., the predicted crop yield, residue and import values. The demand for the crop is predicted from a
year alone as the demand has more correlation with year over other factors. The crop price from demand, supply and year is
predicted using different approaches, which include the time series method, statistical approaches and machine learning
techniques. Finally, these three techniques for price prediction are compared to determine the best model having minimum
root-mean-square error value. In the proposed work, the decision tree regressor is found to be the best model, for predicting
crop price, over others. The superiority of the proposed work over existing approaches, in terms of various aspects, is
shown by simulation results.
Keywords Crop price prediction Crop yield Demand and supply Machine learning Agriculture
123
15110 Neural Computing and Applications (2023) 35:15109–15128
received by farmers from the domestic market for a specific LOESS are used to forecast the crop yield. The ARIMA
agricultural commodity produced within a year [6]. The method is found to be the best, in terms of minimum
important determinants influencing the crop price are yield, absolute percentage error (MAPE) value, among others
crop type, rainfall, historical prices etc. Another promising used in predicting the crop yield. After that, the supply of
factor in predicting the crop price is the demand [7] which the crop is determined as a sum of three variables, such as
measures the quantity of the crop that consumers can the predicted crop yield, residue and import values. Next,
purchase during a given period. Furthermore, the amount the demand for the crop is predicted from the year alone as
of a crop available for consumers as a supply [8] is the demand has more correlation with year over other
obtained from yield, residue and import, which can affect factors. Furthermore, the crop price from demand, supply
the variation of crop price. Here, the residue is the quantity and year is predicted using different approaches, which
of the crop which remains unsold at the end of the year and include the time series method (ARIMA), statistical
is available for consumption in the next year, while import regression (SR) and various ML techniques such as deci-
denotes the amount of the crop, not produced within the sion tree regressor (DTR), random forest (RF), K-nearest
country, which is brought from another country [9]. neighbor regressor (KNN) and kernel ridge(KR). Finally,
In the early days, crop price prediction was performed these three techniques for price prediction are compared to
by the farmers using their past experience in terms of determine the best model having a minimum root-mean-
historical data [10]. Gradually, much research is being square error (RMSE) value. In the proposed work, the best
done to predict the crop price with growing interest. Pre- model DTR having the least RMSE is found over other
diction of the crop price of agricultural products is also models. The superiority of the proposed work over existing
needed to anticipate the negative impact of future price approaches is shown in various aspects. Hence, the major
changes ahead of time. In a real scenario, every farmer contributions of our work are summarized as follows:
desires to know about his profit if he cultivates a particular
• Crop yield is forecasted using different time series-
crop in the next season. This can improve farmer’s living
based ML algorithms, and subsequently, the best one
standards. Most existing research works on predicting crop
among alternatives is determined in terms of having
price using time series methods. However, supply and
minimum MAPE.
demand are the foremost factors in deciding the price of a
• The amount of supply is determined from predicted
commodity [11]. Hence, the question remains open to
yield, residue data and import data.
predicting the crop price more accurately considering those
• Forecasting of demand is obtained by the most accurate
factors. Usually, a considerable part of forecasting crop
time series prediction method from historical data.
yield and crop price in the agricultural framework cannot
• Predicting the crop price for the next year is obtained by
be delineated in a fundamental stepwise procedure, espe-
using different methods such as time series, statistical
cially with complex, incomplete, ambiguous and strident
and ML techniques and the best one amidst others
datasets [12]. In such a scenario, numerous studies indicate
having a minimum RMSE value is subsequently
that machine learning (ML) algorithms [13] have com-
determined.
paratively an improved potential over conventional statis-
• The superiority of the proposed work over existing
tics in order to develop an agricultural framework with
approaches is shown by various performance compar-
remarkable forecasting ability. Henceforth, it motivates us
ison through simulation results.
to develop an ML-based efficient crop price prediction
model using significant parameters related to it, which can The rest of this paper is organized as follows: Section 2
assist the farmers’ anticipations of substantial price chan- presents the literature survey for the completeness of the
ges in the future and their consequences. proposed work. Section 3 introduces useful preliminaries
An efficient crop price prediction using ML is proposed with respect to the proposed work. Section 4 describes the
here to assist the farmers in estimating their profit-loss proposed methodology. Section 5 covers results from dif-
beforehand. The input datasets for the proposed work ferent models and forecasting the real price data. Finally,
consist of the various field values of the crop (corn) for 47 Sect. 6 concludes the paper.
number of years. The fields of the input datasets are yield,
remaining crop at the end of the year, import, demand and
price of the crop for each year. Initially, the various time 2 Literature survey
series-based ML algorithms, such as exponential smooth-
ing (ES), autoregression (AR), theta model (Theta), The price of different crops is predicted by analyzing the
autoregressive integrated moving average (ARIMA), previous rainfall data [14, 15]. In order to predict the crop
moving average (MA), autoregressive moving average price, several contemporary ML techniques are used [5]. In
(ARMA) and seasonal-trend decomposition (STL) using another work [16], an integrated feature selection of
123
Neural Computing and Applications (2023) 35:15109–15128 15111
ARIMA with computational intelligence approaches has Another prediction model for crop price is discussed
been applied for crop price prediction. Other than the using time series methods [27]. However, this work has yet
ARIMA, the components of the proposed integrated fore- to consider other input parameters except historical prices
casting models include artificial neural networks (ANNs) in order to develop this price prediction model. In [28], a
and support vector regression (SVR). The paper by [12] comparison based on different algorithms, such as ARIMA,
predicts the crop price for the next rotation. This work is SVR and XGBoost, is introduced concerning price pre-
based on finding suitable data models that help in achieving diction. In this context of price forecasting, the authors of
generality for price prediction. Again, different data mining [29] have used optimal lag selection to improve the pre-
techniques were evaluated on different datasets to solve diction performance in terms of time and reduced error.
crop price prediction issues of seasonal crops [17]. Here, However, the significance of supply and demand factors
the equipped marketing information is analyzed for inte- [30] in forecasting agricultural commodity prices was
gration of domestic markets. Subsequently, the price beyond the scope of their work. Similarly, another paper
transmission from markets to farm gates is perceived, [31] predicts the crop price using statistical regression on
which can lead to a sustainable profit. Another paper [18] time series data of monthly price for 14 years. The price
highlights yield and price forecasting for crop decision forecast of a crop using ARIMA is presented by [32].
planning. For yearly yield prediction, they introduce a However, it is restricted only to the prediction of the
weather-based time-dependent dataset. Here, the price is monthly price, which would not be beneficial for the
predicted as a sum of the future prices on a commodity farmers in their season-based agriculture. The work of [33]
basis. compares various models and finds ARIMA as suitable for
In order to make decisions for agricultural marketing, an crop price prediction. An ANN-based approach is used to
intelligent crop price prediction model is described in [19]. predict the future crop price [34].
It is addressed by different ML models which can predict In a nutshell, several procedures to predict crop price
the prices of crops in advance. An adaptive crop price accurately are discussed in various ways. It is observed that
prediction for agriculture applications is proposed by [20]. most of these techniques have used the time series method
Another time series-based approach [21] is proposed to or weather data to predict the price of the crop. However,
explore the crop price and crop yield prediction of selected the utility of supply and demand to predict the crop price is
crops to identify the relevant information with respect to beyond the scope of their work. As per our best knowledge,
the market prices and crop yields. Some of the uncertain no prior research was considered earlier to predict price
conditions, such as climate changes, fluctuations in the using the supply and demand of the crop, even after these
market and flooding, to the agricultural process have been being the prime factors of pricing a crop. Another scope of
addressed here. The research work by [22] discusses the research is still open for further reduction of training and
prediction of crop yield using regression analysis. In this testing errors in developing a crop price prediction model,
work, several influencing factors are mentioned which are which in turn can be beneficial to assist the farmers in
related to crop yield prediction. In [23], a framework for decision-making on their profit. Hence, the work proposed
crop price forecasting is designed by analyzing the time in this paper addresses an efficient ML-based technique to
series data. Here, the major features related to developing obtain the reduced error in price prediction using the sup-
crop price prediction models are the historical weather data ply and demand of that particular crop. Furthermore, the
that influence crop production and transportation and data best alternative training algorithms are determined from the
quality-related features obtained by performing statistical perspective of error reduction in crop price prediction.
analysis. Using ML approaches, another location-aware Hence, this ML-based crop price prediction framework,
crop price prediction is discussed [24]. In this work, the using crop yield, supply and demand in agriculture, is
future price of vegetables is predicted depending on the comprehensively presented next.
suitable location for the profitable production of agricul-
tural products. Forecasting of both crop yield and price
using ML techniques have presented in [25]. The predic- 3 Preliminaries
tion of crop price is mainly dependent on factors, such as
rainfall, temperature, market prices, land area and past crop For a comprehensive presentation of the proposed work,
yield. The authors of this paper also predict the price and the descriptive statistics of the input data is shown next. In
the gain for the next twelve months over the past twelve addition, various performance measures used to highlight
months. Another research work employs intelligent tech- efficiency of the proposed work are introduced.
niques on historical prices to predict the crop price for
different agricultural products [26].
123
15112 Neural Computing and Applications (2023) 35:15109–15128
The dataset (D2-corn) (https://www.ers.usda.gov/data-pro The Block 1 considers an input D2 corn½}yield}. Dif-
ducts/feed-grains-database/feed-grains-yearbook-tables/) ferent time series-based ML algorithms, as shown in Fig. 2,
[35] is obtained from the department of agriculture, United are used to train the model for crop yield prediction in
States (U.S.). The D2-corn was created on Friday, Block 1 which is represented as follows:
December 10, 2021. It consists of the fields such as yield,
Y^Xi ¼ fX ðD2 corn½}yield}:headði 1ÞÞ
residue and import of the crop, corn, year wise for 47 years. ð1Þ
In Table 1, the statistical summary of D2-corn is shown. þ b1X þ erriX ; i=1,2, ..., n
Another dataset, D3-corn, obtained from the same [35] was In (1), Y^Xi is the amount of crop yield predicted by an
also created on Friday, December 10, 2021. The D3-corn algorithm ‘‘X’’ for ith instance, headði 1Þ returns the first
includes the fields year, demand and price for the same 47 (i-1) number of instances, b1X is an additive term for X and
years. It has similar statistics on the crop corn, like D2- errXi is the error in prediction for ith instance of D2
corn, for 47 years. The statistical summary of D3-corn is corn½}yield} by the algorithm X. In Block 1, time series-
shown in Table 2. It is significant to mention here that the based ML algorithms, such as ES, AR, Theta, ARIMA,
field, year, is common to both D2-corn and D3-corn, so that MA, ARMA and STL as shown in Fig. 2, are used to train
an efficient integration can be performed for successive the prediction model using Eq. (1). Finally, all of these
operations. In Tables 1 and 2, ’N’ denotes the number of
seven predicted values, such as Y^ES , Y^AR , Y^Theta , Y^ARIMA ,
samples in that corresponding dataset. In addition, the
description of all parameters of D2-corn and D3-corn are Y^MA , Y^ARMA and Y^STL , are compared. Subsequently, the
outlined in Table 3 for a better understanding of the pro- best one (Y^best ) for the crop yield prediction is determined
posed work. by the following:
Y^best ¼ min Y^method :mapeðÞ
3.2 Performance measures method2ðES;AR;Theta;ARIMA;MA;ARMA;STLÞ
ð2Þ
The following measures [36], shown in Table 4, are used to
In (2), the min operator returns the algorithm for which
determine the performance of the work proposed in this
MAPE is minimum and mape() returns the MAPE of the
paper.
corresponding algorithm. Thus, Y^best corresponds to one of
the algorithms, addressed earlier, has the minimum MAPE
in prediction over others.
4 Proposed methodology
The proposed work aims to predict the crop price for the
4.2 Determination of supply
next farming season. The proposed methodology is com-
In Block 2, the value of supply is determined from values
prised of four individual functional blocks as shown in
of other parameters such as the amount of predicted crop
Fig. 1. The respective functions of these are-(i) Block 1:
Crop yield prediction, (ii) Block 2: Determination of sup- yield from Block 1, i.e., Y^best , the import value of that crop
ply, (iii) Block 3: Demand prediction and (iv) Block 4: in the current year, i.e., D2 corn½}import} and residue
Crop price prediction. These functional blocks are num- left-over at the end of last year, i.e., D2 corn½}residue}.
bered in a sequence according to the order of their occur- Henceforth, the residue can be expressed as follows:
rence in the proposed methodology. The entire Rn ¼ Sðn1Þ C ðn1Þ ð3Þ
methodology is described next.
where Sðn1Þ is the supply amount for the crop in ðn 1Þth
year and C ðn1Þ denotes the total amount of the crop con-
sumed in the ðn 1Þth year. Therefore, the supply of the
123
Neural Computing and Applications (2023) 35:15109–15128 15113
Table 3 Description of parameters used in D2-corn and D3-corn 4.3 Demand prediction
Parameter Description
It is already discussed that the demand has a positive
Year Year correlation with the parameter ‘‘Year’’. So, the demand can
Yield Yearly yield of the crop be predicted using various time series methods. In Block 3,
Residue Remaining crop at the end of the year four methods, such as exponential smoothing (ES),
Import Yearly import of the crop autoregression (AR), moving average (MA) and autore-
Demand Yearly demand of the crop gressive integrated moving average (ARIMA), are used to
Price Yearly price of the crop obtain a predicted value for the demand of the corre-
sponding crop. Each of these methods is briefly introduced
here.
• ES: This method predicts the demand value by using a
crop is determined as a sum of the values of three weighted sum of past observations with exponentially
parameters by the following: decreasing weight for older observations. It can be
Sn ¼ Y^best þ I n þ Rn ð4Þ expressed by the following:
where Sn is the supply amount for nth year and Y^best is the D^nES ¼a D3 corn½}demand}:headðn 1Þ
ðn1Þ
ð5Þ
predicted crop yield obtained in Block 1. Here, for nth þ ð1 aÞ DES
year, the parameter I n denotes the import value. It is to be
noted here that the parameter Sn has a significant impact on where D^nES is the predicted demand for nth year, D3
the crop price prediction. corn½}demand}:headðn 1Þ is the demand data up to
ðn1Þ
ðn 1Þth year, DES is the actual demand of ðn 1Þth
year and a is the smoothing factor having a value from
U ¼ P Pn 12
n
Y2 Y^2
i¼1 þ i¼1
n n
123
15114 Neural Computing and Applications (2023) 35:15109–15128
123
Neural Computing and Applications (2023) 35:15109–15128 15115
the sum of the number of time periods. It can find a approaches. The predicted crop price by these algorithms
constantly updated average demand by considering are obtained by the following equations:
equal weightage to all the past values. This MA method
• For SR:
uses a way to predict the demand from D3 corn as
follows: P^iREG ¼FREG ðD23 corn; a3reg Þ
ð9Þ
1 X n1 þ b3reg þ eireg ; i ¼ 1; 2; :::; n
D^nMA ¼ D3 corn½}demand}:locðiÞ ð7Þ
t i¼n1t
where P^iREG denotes the predicted price for ith instance
of D23 corn, a3reg is the coefficient matrix. Another
where D^nMA is the value of demand for the nth year and
‘t’ is the window size of moving average. term b3reg acts as an additive term in (9) and eireg
• ARIMA: It is based on both AR and MA processes. It denotes the error in prediction of ith instance of
has an ability to generalize for non-stationary series. D23 corn.
The predicted demand by ARIMA is obtained by the • For ARIMA:
following: P^nARIMA ¼U1 D3 corn½}price}:locðn 1Þ þ :::
D^nARIMA ¼/1 D3 corn½}demand}:locðn 1Þ þ ::: þ Up D3 corn½}price}:locðn pÞ
þ /p D3 corn½}demand}:locðn pÞ þ h1 kn1 þ ::: þ H1 Kn1 þ ::: þ Hq Knq þ b3ARIMA þ l2
þ hq knq þ b2ARIMA þ l1 ð10Þ
ð8Þ
In (10), P^nARIMA is the predicted price for nth instance,
where D^nARIMA
is the predicted demand for nth instance D3 corn½}price}:locðiÞ is the ith instance, ‘p’ is the
of D3-corn[‘‘demand’’], ’p’ is the order of the autore- order of the autoregression, ‘q’ is the order of the
gression, ’q’ denotes the order of the moving average, h moving average, K is MA parameter, U is AR param-
is MA parameter, / is AR parameter, D3 eter, Pnp are lagged values, Knq are lagged errors of
corn½}demand}:locðn pÞ are lagged values, knq are moving average, b3ARIMA is an additive term and l2 is
lagged errors of moving average, b2ARIMA is an additive the constant.
term and l1 denotes a constant. • For ML:
Thus, these four time series methods have different pro- P^iX ¼ fX ðD23 corn; a3ml Þ þ b3ml þ eiml ; i ¼ 1; 2; :::; n
cessing techniques, which can lead to distinct values of
ð11Þ
prediction for each. Then, the outputs obtained from the
four methods are compared, and subsequently, the best one where P^X denotes the price prediction using algorithm
(Dnbest ) is selected, which has minimum error for demand ‘X’, a3ml is the coefficient matrix, b3ml is the additive
prediction over others. term and eiml is the error for the ith instance. It is to be
mentioned here that four different ML algorithms, such
4.4 Crop price prediction as DTR, KNN, RF and KR, are used to train over the
training dataset to develop the crop price prediction
The function of crop price prediction for next year (Yn ) in model.
Block 4 is detailed in Fig. 3. Initially, the dataset D3-corn
and the supply (Sn ) are considered as the inputs of Block 4. After the completion of the training phase, the perfor-
The feature ‘‘price’’ from D3-corn is selected. It is noted mance of those training models are evaluated using test
that D3 corn½}price} is partitioned to keep the first 70% datasets, such as D3 corn½}price}:Test and
samples as D3 corn½}price}:Train and the rest 30% D23 corn:Test. Finally, the best one amidst others having
samples as D3 corn½}price}:Test for training and testing, the least error is selected as the proposed crop price pre-
respectively. Then, D3 corn½}price}:Train is used to diction model, Modelproposed , as shown as follows:
train the model by the ARIMA method as a time series Modelproposed ¼ min P^method :rmseðÞ ð12Þ
method2ðREG;ARIMA;MLÞ
data. Meanwhile, the inputs D3-corn and Sn are integrated
to obtain the dataset, D23 corn which is partitioned into In (12), the min operator returns the algorithm for which
two subsets, such as D23 corn:Train and RMSE is minimum and rmse() returns the RMSE of the
D23 corn:Test, in a 70:30 ratio using random sampling. corresponding algorithm. Thus, for any unseen sample
Thus, the D23 corn:Train is used to train the model by comprising of (Sn , Dnbest , Yn ), the proposed model can
using statistical regression (SR) and different ML predict the crop price for the next year more accurately.
123
15116 Neural Computing and Applications (2023) 35:15109–15128
123
Neural Computing and Applications (2023) 35:15109–15128 15117
frequency distributions of the parameters. The frequency relationship between each variable by simply staring at the
distributions of more number of parameters are nearly parameters of datasets. Figure 6a can indicate the correla-
normal, except a few parameters have positive skewness. tions between each pair of variables for D2-corn. Here, it is
As these variables are normally distributed in both D2-corn shown that most of the variables are highly correlated with
and D3-corn, Pearson’s correlation is preferred over each other. In Fig. 6b, the correlations between each pair of
Spearman’s correlation to highlight the linear relationships variables for D3-corn are shown. Here, a few of the vari-
between the variables. The correlation analysis is signifi- ables are highly correlated with each other.
cant here as it would be very difficult to understand the
123
15118 Neural Computing and Applications (2023) 35:15109–15128
Figure 7 shows the values of MAPE for all time series- emphasizes the deviations of predicted yield from actual
based ML algorithms used to predict the crop yield. These yields by what percentage of actual yields, so it focuses on
ML algorithms are trained over the training subset of the magnitude of the error. Thus, the ARIMA(4,1,1) is
D2 corn½}yield}, and subsequently, these are tested over selected as the best in terms of reduced MAPE, for crop
the testing subset of the same. The train-test split for D2 yield prediction.
corn½}yield} is 85% and 15%, respectively. The prediction Figure 8 shows the normal distributions of all crop yield
performance of all these training algorithms is measured algorithms used in the proposed work and that of test data.
using the test dataset. It is observed that the MAPE is It shows the spread of the yield and subsequent weightage
highest for ES and lowest for ARIMA(4,1,1). The MAPE of the various values of the yield. No test dataset can show
123
Neural Computing and Applications (2023) 35:15109–15128 15119
123
15120 Neural Computing and Applications (2023) 35:15109–15128
123
Neural Computing and Applications (2023) 35:15109–15128 15121
123
15122 Neural Computing and Applications (2023) 35:15109–15128
123
Neural Computing and Applications (2023) 35:15109–15128 15123
123
15124 Neural Computing and Applications (2023) 35:15109–15128
123
Neural Computing and Applications (2023) 35:15109–15128 15125
the test datasets, and KNN performs worst. It means that the crop price. Simulation results show that the ML
KNN finds it difficult to fit when the training dataset is less. approach is more suitable for crop price prediction than
Finally, DTR is shown as best for crop price prediction in other time series methods and statistical regression tech-
terms of MAPE. The proposed price prediction using DTR niques. A comprehensive result analysis is contributed after
is found to produce a better result of 14.02% than 16.19% postprocessing of data obtained in simulations. Conse-
[34] in terms of MAPE, as highlighted in Table 9. The quently, the convincing results obtained by data analysis
Theil’s U for predicting the crop price using DTR is found can assist the farmers in deciding crop prices, which can
to be 0.456864. So, according to the U statistics measure improve their economy. Thus, the results observed from
mentioned earlier, it is revealed that the proposed model the precision and efficiency tests illustrate the effectiveness
can be considered a better price prediction method. and versatility of the proposed ML algorithms for crop
price prediction. By building an ML-based price prediction
environment, the proposed method makes it feasible for the
6 Conclusion farmer to bring about the crop price prediction with less
expert dependency and minimum prior knowledge. The
An efficient crop price prediction model is proposed to proposed work can assist the farmers and organizations to
predict the crop price using supply, demand and year data. take beneficial decisions ahead of time which can help
As per our best knowledge, no prior research work pre- them to improve their living standards. Experimenting with
dicting the crop price was done using such supply and data prediction through a wide range of ML predictive
demand of that particular crop, even after these being the algorithms can be observed as a basis for decision-making.
prime factors of pricing a crop. So, from the proposed Exploration of more crop price prediction parameters with
framework, a new research direction has emerged where respect to capita income, the health impact of crop, the
the supply and demand of a particular crop are utilized effect of any disastrous event, the price of neighboring
under a single domain toward obtaining crop price pre- countries, future inflation, and availability of alternative
diction. The prediction model for crop pricing is well crops can be included in the current framework to construct
presented as a significant measure of crop yield. However, a more robust working model in the future. Further
the effect of crop yield in predicting crop price needed to improvement in the computing efficiency of the training
be addressed suitably in existing research works in this process is an intriguing option to be concentrated.
price prediction domain. An exhaustive analysis of the
training datasets can provide ample insights to accurately
forecast the required market price of the crop. Through the Appendix
dataset prediction results, it is evident that the price pre-
diction agent administers the process, suggesting that the See Tables 10, 11, 12, 13 and 14.
proposed method can precisely define the characteristics of
ES 12 – – – Add – – Auto – – –
AR – 3 – – – –
Theta 12 – – Auto True
ARIMA (4,1,1) – 4 1 1 – –
MA – – 3
ARMA 3 1
STL – – 3 7 1 1
P = number of lagged forecast errors; D = number of differencing needed; Q = number of autoregressive
terms; SP= number of observations in seasonal period; S = smoothing parameter; SS = length of seasonal
smoother; TS = length of trend smoother; S degree = seasonal degree; S jump = seasonal jump
123
15126 Neural Computing and Applications (2023) 35:15109–15128
Table 11 Coefficients and performance summary of the model using ARIMA (4,1,1) for crop yield
Components Coef Std err z P[ j z j [0.025 0.975]
MA(3) – – – 3 – – – –
MA(4) 4
ES 12 – Add Auto
AR (1) – 1 – –
ARIMA (1,0,1) 1 0 1
123
Neural Computing and Applications (2023) 35:15109–15128 15127
Acknowledgements We thank National Institute of Technology 11. Choi J-S, Helmberger PG (1993) How sensitive are crop yields to
Durgapur, India, for providing access to the computational resources price changes and farm programs? J Agricult Appl Econ
and literatures. 25(1):237–244
12. Samuel P, Sahithi B, Saheli T, Ramanika D, Kumar NA (2020)
Data availability The datasets generated during and/or analyzed Crop price prediction system using machine learning algorithms.
during the current study are available from the corresponding author Quest J J Softw Eng Simul
on reasonable request. 13. Medar R, Rajpurohit VS, Shweta S (2019) Crop yield prediction
using machine learning techniques. In: 2019 IEEE 5th interna-
tional conference for convergence in technology (I2CT), pp 1–5.
Declarations IEEE
14. Akshay Prassanna S, Harshanand B, Srishti B, Chaitanya R,
Conflict of interest All authors of this research paper declare that they Kirubakaran Nithiya Soundari S, Kumar VM, Varshitha Chen-
have no conflict of interest. namsetti VG, Maurya PK (2020) Crop value forecasting using
decision tree regressor and models. Eur J Mol Clin Med, 7(2):
2020
References 15. Dhanapal R, AjanRaj A, Balavinayagapragathish S, Balaji J
(2021) Crop price prediction using supervised machine learning
1. Rehman U, Bahar F, Saad A, Bhat M, Singh L, Mahdi S, Mir MS, algorithms. J Phys Conf Ser 1916:012042
Bashir M, Bhat TA, Palmo T (2022) Trends, challenges and 16. Shao YE, Dai J-T (2018) Integrated feature selection of Arima
prospects of underutilized food crops under temperate ecologies: with computational intelligence approaches for food crop price
a review prediction. Complexity 2018:1–17
2. Rob V (2021) Cattaneo a poverty reduction through the devel- 17. Lavanya K, Raguchander T, Iyengar N (2013) Svm regression
opment of inclusive food value chains. J Integ Agricult and Sonn based approach for seasonal crop price prediction. Int J
20(4):964–978 u-and e-Serv Sci Technol 6(4):155–168
3. von Braun J, Afsana K, Fresco LO, Hassan M (2021) Food 18. Kantanantha N, Serban N, Griffin P (2010) Yield and price
systems: seven priorities to end hunger and protect the planet. forecasting for stochastic crop decision planning. J Agricult Biol
Nature Publishing Group Environ Stat 15(3):362–380
4. Tian X, Engel BA, Qian H, Hua E, Sun S, Wang Y (2021) Will 19. Ghutake I, Verma R, Chaudhari R, Amarsinh V (2021) An
reaching the maximum achievable yield potential meet future intelligent crop price prediction using suitable machine learning
global food demand? J Clean Prod 294:126285 algorithm. ITM Web Conf 40:03040
5. Rakhra M, Soniya P, Tanwar D, Singh P, Bordoloi D, Agarwal P, 20. Bayona-Oré S, Cerna R, Tirado Hinojoza E (2021) Machine
Takkar S, Jairath K, Verma N (2021) Crop price prediction using learning for price prediction for agricultural products
random forest and decision tree regression: a review. Mater 21. Dhanasekaran K, Ramprasath M, Sathiyamoorthi V, Poornima N,
Today Proc. https://doi.org/10.1016/j.matpr.2021.03.261 Jayaraj IA (2021) Meta-learning based adaptive crop price pre-
6. Fafchamps M (1992) Cash crop production, food price volatility, diction for agriculture application. In: 2021 5th international
and rural market integration in the third world. Am J Agricult conference on electronics, communication and aerospace tech-
Econ 74(1):90–99 nology (ICECA), pp 396–402. IEEE
7. Bennett AJ, Bending GD, Chandler D, Hilton S, Mills P (2012) 22. Sellam V, Poovammal E (2016) Prediction of crop yield using
Meeting the demand for crop production: the challenge of yield regression analysis. Indian J Sci Technol 9(38):1–5
decline in crops grown in short rotations. Biol Rev 87(1):52–71 23. Jain A, Marvaniya S, Godbole S, Munigala V (2020) A frame-
8. Fischer R, Byerlee D, Edmeades G (2014) Crop yields and global work for crop price forecasting in emerging economies by ana-
food security. Canberra ACT ACIAR, 8–11 lyzing the quality of time-series data. arXiv preprint arXiv:2009.
9. Smith TR (1935) The wheat surplus. Geograph Rev 04171
25(1):107–116 24. Gamage A, Kasthurirathna D (2019) Agro-genius: crop predic-
10. James HS, Sykuta ME (2006) Farmer trust in producer-and tion using machine learning. Int J Innov Sci Res Technol, 4(10)
investor-owned firms: evidence from Missouri corn and soybean 25. Mulla SA, Quadri S (2020) Crop-yield and price forecasting
producers. Agribusiness Int J 22(1):135–153 using machine learning. Int J Anal Exp Modal Anal
12:1731–1737
123
15128 Neural Computing and Applications (2023) 35:15109–15128
26. Brunda SS, Nimish L, Khan A (2020) Crop price prediction using 34. Anggraeni W, Mahananto F, Rofiq MA, Andri KB, Zaini Z,
random forest and decision tree regression Subriadi AP, et al. (2018) Agricultural strategic commodity price
27. Sabu KM, Kumar TM (2020) Predictive analytics in agriculture: forecasting using artificial neural network. In: 2018 International
forecasting prices of arecanuts in kerala. Procedia Comput Sci seminar on research of information technology and intelligent
171:699–708 systems (ISRITI), pp 347–352. IEEE
28. Yuan CZ, Ling SK (2020) Long short-term memory model based 35. Feed Grains: Yearbook Tables. https://www.ers.usda.gov/data-
agriculture commodity price prediction application. In: Pro- products/feed-grains-database/feed-grains-yearbook-tables/.
ceedings of the 2020 2nd international conference on information Accessed: 2010-09-30
technology and computer communications, pp 43–49 36. Chicco D, Warrens MJ, Jurman G (2021) The coefficient of
29. Yuan CZ, San WW, Leong TW (2020) Determining optimal lag determination r-squared is more informative than SMAPE, MAE,
time selection function with novel machine learning strategies for MAPE, MSE and RMSE in regression analysis evaluation. Peer J
better agricultural commodity prices forecasting in malaysia. In: Comput Sci 7:623
Proceedings of the 2020 2nd international conference on infor-
mation technology and computer communications, pp 37–42 Publisher’s Note Springer Nature remains neutral with regard to
30. Gale D (1955) The law of supply and demand. Math Scandi- jurisdictional claims in published maps and institutional affiliations.
navica, 155–169
31. Mishra MK, Sisodia B, Rai V (2019) Forecasting technique of
Springer Nature or its licensor (e.g. a society or other partner) holds
price of potato of Uttar Pradesh. J Pharm Phytochem 8(3):60–62
exclusive rights to this article under a publishing agreement with the
32. Kumari RV, Venkatesh P, Ramakrishna G, Sreenivas A (2019)
author(s) or other rightsholder(s); author self-archiving of the
Chilli price forecasting using auto regressive integrated moving
accepted manuscript version of this article is solely governed by the
average (Arima). Int Res J Agricul Econ Stat 10(2):290–295
terms of such publishing agreement and applicable law.
33. Ohyver M, Pudjihastuti H (2018) Arima model for forecasting the
price of medium quality rice to anticipate price fluctuations.
Procedia Comput Sci 135:707–711
123