Unit 2
Unit 2
1) Randomization
2) Replication
3) Local Control
Factorial Design:
● Factorial designs involve the
simultaneous study of two or more
factors (independent variables)
and their interactions.
● It helps assess how multiple
factors influence the response
variable.
● Factorial designs can be 2x2 (two
factors with two levels each), 3x2,
2x3, and so on, depending on the
number of factors and levels.
Types of DOE
Nested Design:
● Nested designs are appropriate
when the experimental units are
hierarchically structured or
nested within larger groups.
● For example, in educational
research, students may be
nested within classrooms, which
are in turn nested within schools.
Types of DOE
Repeated Measures Design:
● Repeated measures designs involve multiple measurements on the same
subjects over time or under different conditions.
● Commonly used in longitudinal studies or when assessing changes in response
over time
Types of DOE
Sequential Design:
● Sequential designs
involve collecting data in
stages, with the option to
stop the experiment
early if statistically
significant results are
observed.
● They are often used in
clinical trials and quality
control processes.
Types of DOE
Taguchi Design:
Taguchi designs, developed by Japanese engineer Genichi
Taguchi, are aimed at optimizing processes and products by
identifying factors that influence variability and quality.
Factors and Levels: In a Taguchi experiment, factors are variables
that can be adjusted or controlled, and levels are the different
settings or values that each factor can take. Factors can be
classified as controllable (those you can change) and
uncontrollable (those beyond your control).
Orthogonal Arrays: Taguchi designs often use orthogonal arrays,
which are structured tables that specify the combinations of factor
levels to be tested in an experiment. Orthogonal arrays ensure
efficient and systematic experimentation by reducing the number of
experimental runs required while still providing valuable
information.
Control Factors and Noise Factors: Taguchi experiments
distinguish between control factors (those factors you want to
optimize) and noise factors (factors that introduce variability but are
not the focus of optimization). The goal is to find settings of control
factors that are robust to variations caused by noise factors.
Signal-to-Noise (S/N) Ratios: Taguchi introduced the concept of
signal-to-noise ratios as objective functions for optimization. S/N
ratios measure the sensitivity of the system's performance to
variations in factors. The higher the S/N ratio, the better the
Data Types in DOE
In Design of Experiments (DOE), data types refer to the nature of the data that are collected
during an experimental study. Understanding the data types is crucial for designing the
experiment, choosing the appropriate statistical analysis, and drawing valid conclusions.
Continuous Data:
● Continuous data, also known as quantitative or numerical data, represent measurements that
can take any value within a range.
● These data are typically obtained through instruments or measurements that provide numeric
results.
● Examples of continuous data include:
● Length measurements (e.g., height, width, distance).
● Temperature readings.
● Weight or mass measurements.
● Time intervals.
● Concentrations (e.g., chemical concentrations).
● Continuous data are often analyzed using statistical techniques such as t-tests, analysis of
variance (ANOVA), regression analysis, and correlation analysis.
Data Types in DOE
Categorical Data:
Ordered Categories: Ordinal data consist of categories or levels that have a natural
order or ranking. This ranking implies that one category is "higher" or "lower" than
another, but the intervals between categories may not be equal.
● Likert scale responses (e.g., strongly agree, agree, neutral, disagree, strongly
disagree).
● Educational attainment (e.g., high school diploma, bachelor's degree, master's
degree).
● Socioeconomic status (e.g., low income, middle income, high income).
● Customer satisfaction ratings (e.g., very satisfied, somewhat satisfied, neutral,
somewhat dissatisfied, very dissatisfied).
Data Types in DOE
Ordinal data are non-numeric and are often represented using labels or descriptive terms.
These labels indicate the position or ranking of each category.
Arithmetic operations (e.g., addition, subtraction, multiplication) are not meaningful for
ordinal data because the intervals between categories may not be equal or well-defined.
Therefore, calculating means or medians may not provide accurate information about the
data.
Rank-Based Analysis: When analyzing ordinal data, rank-based statistical methods are
often used. These methods focus on the order or ranking of the categories rather than their
numeric values.
Non-parametric tests like the Wilcoxon signed-rank test or the Mann-Whitney U test are
commonly used for ordinal data analysis.
Spearman's rank correlation coefficient is used to assess the strength and direction of
relationships between ordinal variables.
Classification problems in design of experiments
Computer Vision:
● Time Series Classification: Categorizing time series data into different classes or states.
● Forecasting: Predicting future values or trends in time series data.
Computer Security:
Autonomous Systems:
● Autonomous Vehicle Perception: Classifying objects and obstacles for self-driving cars.
● Robot Navigation: Navigating robots in complex environments and avoiding obstacles.
● Human Activity Recognition: Identifying and classifying human activities from sensor data.
Classification problems in design of experiments
Biometrics:
● Disease Classification: Diagnosing diseases based on medical data (e.g., X-rays, MRIs).
● Drug Discovery: Classifying compounds for drug development.
Environmental Science:
These classification problems represent just a subset of the many applications of AI and
computer science in various domains. Solving these problems often involves the use of
machine learning algorithms and deep learning models to make accurate predictions and
classifications.
Data collection
Data collection is a crucial step in the research and analysis process. It involves
gathering relevant and accurate information to answer research questions or
achieve specific objectives.
There are various methods and tools available for collecting data, each with its
own advantages and limitations.
Here are some common data collection methods and tools:
Data collection – Methods and Tools
1. Surveys and Questionnaires:
- Method: Surveys involve asking individuals a set of standardized questions to gather
information about their opinions, preferences, behaviors, or characteristics.
- Tools: Online survey platforms (e.g., SurveyMonkey, Google Forms), paper-based
questionnaires, interviews (structured, semi-structured, or unstructured), telephone
surveys.
2. Observations:
- Method: Researchers directly observe and record behaviors, events, or activities to
gather data. This can be done in controlled settings (laboratories) or natural environments
(field observations).
- Tools: Notebooks, cameras, video recording equipment, mobile devices for real-time
data entry.
Data collection – Methods and Tools
3. Experiments:
- Method: Controlled experiments involve manipulating variables and observing
their effects on the outcome. This method is often used to establish cause-and-
effect relationships.
- Tools: Laboratory equipment, experimental setups, sensors, data loggers.
4. Case Studies:
- Method: In-depth examination of a single individual, group, or situation to gain
a comprehensive understanding of a specific context.
- Tools: Interviews, observations, archival data, documents, photographs.
Data collection – Methods and Tools
5. Secondary Data Analysis:
- Method: Analyzing existing data that was collected for other purposes. This can
include data from sources such as government agencies, research organizations, or
previous studies.
- Tools: Statistical software (e.g., R, Python), database management systems, data
visualization tools.
6. Content Analysis:
- Method: Systematically analyzing textual, visual, or audio content to identify
patterns, themes, and meanings.
- Tools: Text analysis software (e.g., NVivo, ATLAS.ti), image recognition tools,
sentiment analysis tools.
Data collection – Methods and Tools
7. Ethnography:
- Method: Immersing researchers in a cultural or social context to gain insights
into the experiences and perspectives of individuals within that context.
- Tools: Field notebooks, audio or video recording equipment, participant
observation techniques.