Talla Umesh, IV B.Tech Student, Dept of IT, Sreenidhi Institute of Science and Technology (A), Hyderabad.
Dr. Sreenivas Mekala, Associate Professor, Dept of IT, Sreenidhi Institute of Science and Technology (A), Hyderabad.
JETIR2206616 Journal of Emerging Technologies and Innovative Research (JETIR) www.jetir.org g107
© 2022 JETIR June 2022, Volume 9, Issue 6 www.jetir.org (ISSN-2349-5162)
depending on the network architecture used, e.g., Convolutional Neural Networks, Recurrent Neural Networks and Unsupervised Networks.

The LSTM model was introduced with the objective of modelling long-term dependencies and determining the optimal time lag for time series problems. An LSTM network is composed of one input layer, one recurrent hidden layer, and one output layer. The basic unit in the hidden layer is the memory block, containing memory cells with self-connections memorizing the temporal state and a pair of adaptive, multiplicative gating units controlling information flow in the block. The memory cell is primarily a recurrently self-connected linear unit, called the Constant Error Carousel (CEC), and the cell state is represented by the activation of the CEC. The multiplicative gates learn when to open and close. By keeping the network error constant, the vanishing gradient problem can be solved in LSTM. Moreover, a forget gate is added to the memory cell, preventing the gradient from exploding when learning long time series.

2. LITERATURE REVIEW:

As with many bio-systems, plant growth is a highly complex and dynamic environmentally linked system. Therefore, growth and yield modeling is a significant scientific challenge. Modeling approaches vary in a number of aspects (including scale of interest, level of description, integration of environmental stress, etc.). According to Todorovski and Dzeroski (2006) and Atanasova et al. (2008), two basic modeling approaches are possible, namely "knowledge-driven" and "data-driven" modeling. The knowledge-driven approach relies mainly on existing domain knowledge. In contrast, a data-driven modeling approach is capable of formulating a model solely from gathered data without necessarily using domain knowledge. Data-driven models (DDM) include classical Machine Learning techniques, artificial neural networks (Daniel et al., 2008), support vector machines (Pouteau et al., 2012), and generalized linear models. Those methods have many desirable characteristics, such as imposing fewer restrictions or assumptions, the ability to approximate nonlinear functions, strong predictive abilities, and the flexibility to adapt to inputs of a multivariate system (Buhmann, 2003).

According to Singh et al. (2016), and as reviewed by Liakos et al. (2018), Machine Learning (ML), linear polarizations, wavelet-based filtering, vegetation indices (NDVI) and regression analysis are the most popular techniques used for analyzing agricultural data. Besides the aforementioned techniques, a methodology that has recently been gaining momentum is deep learning (DL) (Goodfellow et al., 2016).

DL belongs to the machine learning computational field and is similar to ANN. However, DL is about “deeper” neural networks that provide a hierarchical representation of the data by means of various operations. This allows larger learning capabilities, and thus higher performance and precision. A strong advantage of DL is feature learning, i.e., automatic feature extraction from raw data, with features from higher levels of the hierarchy being formed by composition of lower-level features (Goodfellow et al., 2016).

DL can solve more complex problems particularly well, because of the more complex related models
(Pan and Yang, 2010). These complex models employed in DL can increase classification accuracy and reduce error in regression problems, provided there are adequately large data-sets available describing the problem. Gonzalez-Sanchez et al. (2019) presented a comparative study of ANN, SVR, M5-Prime and kNN ML techniques and Multiple Linear Regression (MLR) for crop yield prediction in ten crop datasets. In their study, Root Mean Square Error (RMSE), Root Relative Square Error (RRSE), Normalized Mean Absolute Error (MAE) and Correlation Factor (R) were used as accuracy metrics to validate the models. Results showed that M5-Prime achieved the lowest errors across the produced crop yield models.

The results of that study ranked the techniques from best to worst, according to RMSE, RRSE, R, and MAE, in the following order: M5-Prime, kNN, SVR, ANN and MLR. Another study by Nair and Yang-Won (2016) applied four ML techniques, SVM, Random Forest (RF), Extremely Randomized Trees (ERT) and Deep Learning (DL), to estimate corn yield in Iowa State. Comparisons of the validation statistics showed that DL provided more stable results, overcoming the overfitting problem. Stem diameter is considered one of the important parameters describing the growth of plants during the vegetative growth stage.

Also, the variation of stem diameter has been widely used to derive proxies for plant water status and is therefore applied in optimisation strategies for plant-based irrigation scheduling in a wide range of species. Plant stem diameter variation (SDV) refers to the periodic shrinkage and recovery movement of the plant stem during the day and night. This periodic variation is related to plant water content and can be used as an indicator of change in plant water content. During active vegetative growth and development, crop plants rely on the carbohydrate gained from photosynthesis and the translocation of photo-assimilates from the site of synthesis to sink organs (Yu et al., 2015). The fundamentals of stem diameter variations have been well documented in a substantial amount of literature (Vandegehuchet et al., 2014).

It has been documented that SDV is sensitive to water and nutrient conditions and is closely related to the responses of crop plants to changes in environmental conditions (Kanai et al., 2008). The stem diameter is an important parameter describing the growth of crop plants under abiotic stress during the vegetative growth stage. Therefore, it is important to generate stem diameter growth models able to predict the response of SDV to environmental changes and plant growth under different conditions. Many studies emphasize the need to critically review and improve SDV models for assessment of environmental impact on crop growth (Hinckley and Bruckerhoff, 2011). Daily SDV models have been developed to accurately predict inter-annual variation in annual growth in balsam fir (Abies balsamea L.) (Duchesene and Houle, 2011). Inclusion of daily data in growth-climate models can improve predictions of the potential growth response to climate by identifying particular climatic events that escape a classical dendroclimatic approach (Duchesene and Houle, 2011).

However, models for predicting SDV and plant growth using environmental variables have so far remained limited. Tomato crop growing in
greenhouse environment is considered a dynamic and complex system, with few models having been studied for it up to now. In the literature, TOMGRO (Jones et al., 1999) and TOMSIM (Heuvelink, 1996) are considered the main applicable dynamic growth models. Those models represent biomass partitioning, crop growth, and yield as a function of several climate and [...] limited application to practical settings, their complexity, the difficulty in estimating initial parameter values and the need for calibration and validation in every new environment, grower uptake has been limited. The Tompousse model was developed by Abreu et al. (2000) to predict tomato yield in terms of the weight of harvested fruits.

The model was developed by examining the relationship between environmental parameters in heated greenhouses in the southern part of France. A linear relationship between flowering rate and fruit growth was the basic assumption used in this model. However, the model performance was poor when tested in unheated plastic greenhouses in [...] by Adams (Adams, 2002), based on a form of graphical simulation tool. The main objective of the model was to represent weekly fluctuations of greenhouse tomato yield in terms of fruit size and harvest rate. Hourly climate data were used to estimate the rate of growth of leaf truss and the [...] tools that can help farmers in making decisions.

3. METHODOLOGY:

This project consists of the following modules:

1) Upload Dataset: Using this module we will upload the dataset.
2) Dataset Cleaning: Using this module we will find empty values in the dataset and replace them with [...]
3) Train & Test Split: Using this module we will split the dataset into two parts, called training and testing. All machine learning algorithms take 80% of the dataset to train the model, and the remaining 20% is used to test its prediction accuracy. When prediction accuracy is high, the Mean Square Error, Root Mean Square Error and Mean Absolute Error will be low.
4) Run SVR Classifier: Using this module we will train the SVR model on the 80% training split and use the 20% test split to calculate its performance.
5) Run Random Forest Classifier: Using this module we will train the Random Forest model on the 80% training split and use the 20% test split to calculate its performance.
6) Run LSTM Classifier: Using this module we will train the LSTM model on the 80% training split and use the 20% test split to calculate its performance.
7) Predict Plant & Yield Growth: Using this module we will upload test data and then apply the LSTM model to predict its growth value.
Fig 5: In above screen we got RMSE, MAE and MSE error for SVR algorithm and now click on ‘Run Random Forest Algorithm’ button to train random forest algorithm
Fig 8: In above screen I am uploading ‘test.txt’ file and now click on ‘Open’ button to predict growth for test data
5. CONCLUSION:
REFERENCES: