Lesson 4_ Developing Models
Lesson 4_ Developing Models
Lesson objectives
To introduce the student to the practical steps of model development
At the end of the lesson the student should be in a position to design a simple simulation
model
Introduction
Model development refers to the process of creating, testing, and refining a mathematical
or computational model that represents a system or process of interest.
The goal of model development is to create a model that accurately represents the system
or process and can be used to make predictions or test hypotheses.
The process of model development typically involves the following steps:
i. Define the problem or system of interest and identify the variables that need to
be modeled.
ii. Choose an appropriate modeling approach based on the available data and the
goals of the model. This could involve choosing between different types of models,
such as linear regression, neural networks, or decision trees.
iii. Collect and preprocess data to be used in training and validating the model. This
may involve cleaning and transforming data, handling missing values, and
selecting relevant features.
iv. Train the model using the data and an appropriate algorithm. This involves
adjusting the model parameters to minimize the difference between the predicted
and actual values.
v. Validate the model by testing its performance on a separate set of data that was
not used for training. This helps ensure that the model is not overfitting to the
training data and can generalize to new data.
vi. Refine the model as needed based on the validation results. This could involve
adjusting the model parameters, choosing different features or algorithms, or
collecting additional data.
vii. Deploy the model for use in real-world applications.
The process of model development can be iterative, with each round of testing and
refinement leading to a better model. It is also important to evaluate the assumptions
and limitations of the model and to communicate the results and uncertainties to
stakeholders.
The generic steps of model development can be grouped into five blocks of the modeling
cycle
Figure 1: Main steps of a modeling cycle
Model calibration is the process of adjusting the model parameters to improve the fit of
the model to observed data. Model calibration involves finding the optimal values of the
model parameters that result in the best possible agreement between the model
predictions and the observed data.
The process of model calibration involves the following steps:
i. Collect and preprocess data: The first step in model calibration is to collect and
preprocess the data that will be used to calibrate the model. This may involve
cleaning and filtering the data, and transforming it into a suitable format for use
in the model.
ii. Specify the model structure: The next step is to specify the structure of the model,
including the equations, variables, and parameters that will be used to simulate
the system being modeled.
iii. Select initial parameter values: Once the model structure is specified, initial values
are selected for the parameters of the model. These initial values may be based
on previous studies, literature, or expert knowledge.
iv. Run simulations and compare to data: The model is then used to simulate the
system being modeled, and the results of the simulations are compared to the
observed data. This comparison is used to evaluate the fit of the model to the
data.
v. Adjust parameters and repeat: If the fit of the model to the data is not satisfactory,
the parameters of the model are adjusted and the simulation is run again. This
process is repeated until the model provides an adequate fit to the observed data.
vi. Evaluate model performance: Once the model has been calibrated, it is important
to evaluate its performance. This may involve comparing the model predictions to
additional data, conducting sensitivity analyses to evaluate the impact of different
parameter values on the model predictions, or using statistical tests to evaluate
the goodness-of-fit of the model.
The process of model calibration is an iterative process that involves adjusting the model
parameters until the model provides an adequate fit to the observed data.
Calibration is an important step in model development, as it helps to ensure that the
model accurately represents the system being modeled and provides reliable predictions.
2. Holdout validation: In holdout validation, a portion of the data is set aside for testing, and
the model is trained on the remaining data. The model is then evaluated based on its
performance on the testing data.
3. Bootstrap validation: In bootstrap validation, the model is trained on multiple samples of
the data, each of which is randomly drawn with replacement from the original data. The
results from each sample are then combined to evaluate the performance of the model.
4. Sensitivity analysis: Sensitivity analysis involves varying the inputs to the model to
evaluate the impact on the model's output. This helps to identify the inputs that have the
greatest impact on the model's performance and can help to improve the accuracy and
reliability of the model.
5. Out-of-sample testing: In out-of-sample testing, the model is tested on data that was not
used in the model development process. This helps to assess the generalizability of the
model and to identify any potential biases or limitations in the model.
Example:
Yang HM, Lombardi Junior LP, Castro FFM, Yang AC (2021) Mathematical modeling of the
transmission of SARS-CoV-2—Evaluating the impact of isolation in São Paulo State (Brazil)
and lockdown in Spain associated with protective measures on the epidemic of CoViD-19.
PLoS ONE 16(6): e0252271. https://doi.org/10.1371/journal.pone.0252271
ii. Climate models: Climate models are used to study the Earth's climate system and to
project future climate change scenarios. These models are used to inform climate policy
decisions, including the design of mitigation and adaptation strategies, and the evaluation
of the potential impacts of climate change on natural and human systems.
iii. Economic models: Economic models are used to study the behavior of markets and the
impact of policy decisions on the economy. These models are used to inform a wide range
of economic policy decisions, including tax policy, trade policy, and monetary policy.
iv. Transportation models: Transportation models are used to study the behavior of
transportation systems, and to evaluate the potential impacts of infrastructure
investments or policy changes. These models are used to inform transportation planning
decisions, including the design of new transportation networks, the pricing of
transportation services, and the evaluation of the potential impacts of transportation
policy changes.
v. Financial models: Financial models are used to study the behavior of financial markets
and to evaluate the potential risks and returns of different investment strategies. These
models are used to inform a wide range of financial decision-making, including asset
allocation, risk management, and the valuation of financial instruments.
Additional reading
1. Kerr, L. A., Cadrin, S. X., & Secor, D. H. (2010). Simulation modelling as a tool for examining the
consequences of spatial structure and connectivity on local and regional population dynamics.
ICES Journal of Marine Science, 67(8), 1631-1639.
2. Higgins, S. I., Richardson, D. M., & Cowling, R. M. (2001). Validation of a spatial simulation model
of a spreading alien plant population. Journal of Applied Ecology, 38(3), 571-584.