Rainfall Prediction Using Modified Linear Regression: Submitted By: John Philip O. Echevarria Lazan, Rolan
Rainfall Prediction Using Modified Linear Regression: Submitted By: John Philip O. Echevarria Lazan, Rolan
Regression
Submitted by:
Lazan, Rolan
ABSTRACT
Analytics often involves studying past historical data to research potential trends.
Weather condition is the state of atmosphere at a given time in terms of weather variables
like rainfall, cloud condition, temperature, etc., the existing models use data mining
techniques to predict the rainfall. The main disadvantage of these systems is that it
doesn’t provide an estimate of the predicted rainfall. The system calculates the average of
values and understand the state of atmosphere, which doesn’t yield estimate result. This
order to obtain the prediction of rainfall through the temperature and wind speed.
INTRODUCTION
Rain and bad weather conditions bring direct consequences to several sectors of
the economy. One of those that suffers the most is precisely the maritime transport sector.
Some shipment products cannot be transported under rain, which brings about a deep
impact in logistics and in business generation. Some Brazilian ports register up to 110
days of rain per year, which means that these places might have their activities suspended
In the Philippines, shipping has always played an important role in the economy
of the country, since the Spanish colony and it has improved over the years. As
mentioned earlier, weather affects the maritime transportation and most likely the
shipment of products especially those vulnerable goods which cannot be unloaded in the
ports while raining. It is therefore necessary to predict the weather condition of the ports
Analytics often involve studying past historical data to research potential trends.
Weather condition is the state of atmosphere at a given time in terms of weather variables
like rainfall, cloud conditions, temperature, etc., the existing models use data mining
techniques to predict the rainfall. The main disadvantage of these systems is that it
doesn't provide an estimate of the predicted rainfall. The system calculates average of
values and understand the state of atmosphere, which does not yield estimate results (S.
rainfall in the ports of Metro Manila that serve as the unloading area of mostly imported
goods. The Linear Regression method is modified in order to obtain the most optimum
error percentage by iterating and adding some percentage of error to the input values.
This method provides an estimate of rainfall using different atmospheric parameters like
average temperature and relative humidity to predict the rainfall. The linear regression is
applied on the set of data and the coefficients are used to predict the rainfall based on the
corresponding values of the parameters. The main advantage of this model is that this
model estimates the rainfall based on the previous correlation between the different
atmospheric parameters. Thus, an estimate value of what the rainfall could be at a given
time period.
In prediction or forecasting study, the rule of thumb is that you should have at
least 50 but preferably more than 100 observations (Box and Tiao 1975). In this context,
monthly data of rainfall in port area from 2000 to 2017 will be utilized to form at least
120 data or observations to predict the future rainfall condition of the area.
In line with maritime sector, the result of this study can be used to provide the maritime
organization with the best possible information about the future, anticipate the possible
challenges and guide them in their decision making and policy creation for the improvement of
For many years the effect of global warming on food security, rainfall and temperature
patterns has received great attention from policy makers and academics.
BACKGROUND
Linear Regression
It is a method used for defining the relation between a dependent variable (Y) and one or
more independent variables or explanatory variables, denoted by (X). For multiple explanatory
i=1,…n,
Where y denotes the dependent variable(rainfall) where i=1,2 …n denotes the
Rainfall=(AvgTemp* )+
The data necessary for the system to predict rainfall are prev
METHODOLOGY
The historical data in the PORT AREA for period years 2000 to 2018 was collected from
PAG-ASA. The mean annual rainfall values and use modified linear regression to perform the
prediction of rainfall in our system. The process of this method is explained in this following
steps.
1. The input data set are examined. The input data of training set is obtained from 2000-
2. The set data contains the average rainfall cover from 2000-2018 from the input data
sets
3. The linear regression is applied on the data sets and the rainfall is forecasted using the
rainfall.
4. The error percentage is calculated by subtracting the predicted value from the actual
Correlations
Notes
Correlations
Average
Average Rainfall Temperature
N 18 18
Average Temperature Pearson Correlation .268 1
N 18 18
REGRESSION
/MISSING LISTWISE
/STATISTICS COEFF OUTS R ANOVA
/CRITERIA=PIN(.05) POUT(.10)
/NOORIGIN
/DEPENDENT AverageRainfall
/METHOD=ENTER AverageTemperature.
Regression
Notes
Variables Entered/Removeda
Variables Variables
Model Entered Removed Method
1 Average
. Enter
Temperatureb
Model Summary
1 5.35933375494
.268a .072 .014
2995
Total 495.145 17
Coefficientsa
Standardized
Unstandardized Coefficients Coefficients
CONCLUSION
Rainfall is the major cause for many of the natural disaster like flashflood, drought,
tsunamis. So in order to prevent these natural calamities, we should be able to predict the cause
of the source. The proposed system can be used to estimate the rainfall over required period so
that the respective authorities can take precautions to prevent the loss of life and property. This
data is used to perform the necessary calculation to predict the rainfall from the average
temperature, and wind speed.