Unit II - RM Notes
Unit II - RM Notes
RESEARCH METHODOLOGY
UNIT II: Quantitative Methods for problem solving
Statistical Modelling and Analysis:
Statistical modelling and analysis are integral components of research methodology,
employed to draw meaningful insights from data and make informed decisions.
These techniques are particularly prevalent in fields such as science, social science,
economics, and engineering. Here's an overview of statistical modelling and analysis
in research methodology:
I. Statistical Modelling:
i. Definition: Statistical modelling involves the construction of
mathematical models that represent relationships between variables
in a dataset. These models serve as simplified representations of real-
world phenomena, allowing researchers to make predictions or
inferences.
ii. Types of Models:
a. Descriptive Models: Summarize and describe the main features of
a dataset without making predictions.
b. Predictive Models: Use data to make predictions or estimate future
outcomes.
c. Inferential Models: Draw inferences about a population based on a
sample of data.
iii. Common Models:
a. Linear Regression: Models the relationship between a dependent
variable and one or more independent variables.
b. Logistic Regression: Models the probability of a binary outcome.
c. Time Series Models: Used for analysing time-dependent data.
d. Bayesian Models: Incorporate prior knowledge and update beliefs
based on new evidence.
II. Statistical Analysis:
i. Definition: Statistical analysis involves the application of statistical
methods to examine and interpret data. It helps researchers identify
patterns, relationships, and trends in the data.
ii. Steps in Statistical Analysis:
In summary, statistical modelling and analysis play a crucial role in the research
methodology by providing researchers with the tools to analyse data, make
predictions, and draw meaningful conclusions. These techniques contribute to the
robustness and validity of research findings.
Here's how probability distributions are relevant in the context of time series analysis
within research methodology:
I. Stochastic Processes:
Time series data is often modelled as a stochastic process, where each
observation is considered a random variable. The probability distribution of
these random variables is essential for characterizing the inherent randomness
or uncertainty in the data.
IV. Forecasting:
Probability distributions play a key role in forecasting future values of a time
series. Forecast intervals, which indicate the uncertainty around predictions,
are often derived from probability distributions. For example, prediction
intervals for future values can be constructed based on the distribution of
forecast errors.
3. Data Description: Descriptive statistics are used to summarize and describe the
characteristics of the collected data. This includes calculating measures of central
tendency (e.g., mean, median, mode) and measures of dispersion (e.g., range,
variance, standard deviation) to understand the distribution and variability of the
data.
4. Exploratory Data Analysis (EDA): EDA involves visually exploring the data using
graphs, charts, and plots to identify patterns, trends, and relationships among
variables. This step helps researchers generate hypotheses and guide further
statistical analysis.
These multivariate methods offer powerful tools for analysing complex data structures
and uncovering relationships among multiple variables in research across various
disciplines. Researchers should carefully select the appropriate method based on
their research questions, data characteristics, and assumptions underlying the
analysis.
I. Correlation:
i. Definition: Correlation measures the strength and direction of the
relationship between two variables. It indicates how much one variable
changes when the other variable changes. Correlation does not imply
causation, meaning that just because two variables are correlated does
not mean that one causes the other.
ii. Types of correlation:
1. Positive correlation: When both variables move in the same
direction. As one variable increases, the other also increases (or as
one decreases, the other also decreases).
2. Negative correlation: When variables move in opposite directions.
As one variable increases, the other decreases (or vice versa).
3. Zero correlation: There is no apparent relationship between the
variables.
iii. Correlation coefficient: This is a statistical measure that quantifies the
strength and direction of the relationship between two variables. The
most common correlation coefficient is Pearson's correlation coefficient,
denoted as r, which ranges from -1 to +1. A correlation coefficient close
to +1 or -1 indicates a strong relationship, while a coefficient close to 0
indicates a weak relationship.
II. Regression:
i. Definition: Regression analysis is a statistical method used to model the
relationship between a dependent variable and one or more independent
variables. It helps to understand how changes in the independent
variables are associated with changes in the dependent variable.
ii. Types of regression:
1. Simple linear regression: When there is only one independent variable
used to predict the dependent variable.
researchers might use the correlation between stock prices and interest rates
to predict future stock market movements.
4) Model Building: Regression analysis is used to build models that describe the
relationship between variables in a dataset. Researchers select independent
variables that are theoretically or empirically relevant to the dependent variable
and use regression analysis to estimate the parameters of the model. The
resulting model can then be used to make predictions and draw conclusions
about the relationships between variables.
these assumptions are met. If the assumptions are violated, researchers may
need to transform the data or use alternative modelling techniques.