
Methods and Stats in I/O

• Science
• Research
• Data Analysis
• Descriptive and Inferential
• Correlation and Regression
• Psychometrics
Psychometrics
• Study of how people respond to tests and questionnaires
• Measurement
• System of rules for assigning numbers to represent a person’s standing on some underlying characteristic
• Latent Variable
• Theoretical variable of interest
• Cannot be observed directly
• Measure
• Operational definition of construct
• Imperfect indicator of latent variable
Applications of Psychometrics
• Validity
• Measurement Precision
• Reliability
• Understanding error
• Scale Development
• Item selection
• Computer Adaptive Testing
The Concept of Validity
• How well a test fulfills the function for which it is being used
• Can we predict Y from X?
• Does the test measure what it claims to measure?
• Does the test “look right”?
• A match between empirical relations and theoretical relations
• A property of tests
Validity as a property of tests
• A test is valid for measuring an attribute if variation in the attribute
causes variations in the test scores

• The attribute must exist and have a causal impact on test scores

• Therefore, if one does not have an idea of how variations in the attribute produce variations in measurement outcomes, one cannot have a clue as to whether the test measures what it should measure
Criterion-Related Validity
• A couple of definitions
• Predictor
• The test chosen or developed to assess attributes (e.g., abilities) identified as important
for successful job performance
• Criterion
• An outcome variable that describes important aspects or demands of the job
• The variable that we want to predict when evaluating the validity of the predictor

• Criterion-Related
• Correlation of test scores (predictor) with job performance (criterion)
• Represented as the validity coefficient (i.e., a correlation)
Criterion-Related Validity
• Predictive Validity
• Predictor scores correlate with criterion scores collected at a later point in time

• Concurrent Validity
• Predictor scores correlate with criterion scores collected at the same time
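
A minimal sketch of the validity coefficient as a correlation: the data, sample size, and variable names below are invented for illustration, not taken from the slides.

```python
import numpy as np

# Hypothetical data: predictor = test scores collected at hire,
# criterion = later job performance ratings.
rng = np.random.default_rng(0)
predictor = rng.normal(50, 10, size=200)
criterion = 0.5 * predictor + rng.normal(0, 10, size=200)

# The validity coefficient is simply the Pearson correlation
validity = np.corrcoef(predictor, criterion)[0, 1]
print(f"Validity coefficient: {validity:.2f}")
```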
Predicting Preference to Work Alone
Content-Related Validity
• The content of the predictor and criterion represent an adequate
sample of important work behaviors and KSAOs defined by the job
analysis

• Using the knowledge of incumbents (or subject matter experts, SMEs), we make logical connections between tests and job performance
Construct-Related Validity
• Construct
• Concept that a test is intending to measure
• A broad representation of a human characteristic

• Construct Validity
• The integration of validity evidence which is important for determining the
meaning of test scores
• Correlation between similar and dissimilar tests should be in the predicted
direction (and sometimes strength)
• Evidence from other sources (literature reviews, studies, theories, etc.)
Multi-Trait Multi-Method Matrix

                                       1    2    3    4    5
1. Verbal Ability Test
2. Interview Rating of Communication  .5
3. Sample Lecture                     .4   .6
4. Test of I/O Knowledge              .2   .1   .1
5. Interview Rating of I/O Knowledge  .1   .3   .1   .4
6. Number of Top Tier Pubs            .1   .1   .1   .5   .4
Variable                              F1   F2
1. Verbal Ability Test                .8   .1
2. Interview Rating of Communication  .6   .2
3. Sample Lecture                     .7   .2
4. Test of I/O Knowledge              .1   .8
5. Interview Rating of I/O Knowledge  .4   .5
6. Number of Top Tier Pubs            .2   .7
Measurement Theory
• Classical Test Theory
• Item Response Theory
Classical Test Theory
• The main idea in CTT is that observed scores can be decomposed into
a true score and an error component
• Observed = True + Error

• The true score is defined as the expected value of the observed scores
• Derived from the Theory of Errors
• The central limit theorem
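
A small simulation makes the decomposition concrete. This sketch uses only numpy and invented numbers; it shows that the mean of many error-perturbed administrations converges on the fixed true score, i.e., the true score is the expected value of the observed scores.

```python
import numpy as np

rng = np.random.default_rng(1)
true_score = 25.0                        # one person's fixed true score
errors = rng.normal(0, 3, size=10_000)   # random error on each administration
observed = true_score + errors           # Observed = True + Error

# Averaging many repeated observations recovers the true score
print(f"Mean observed score: {observed.mean():.2f} (true score = {true_score})")
```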
Reliability
• Reliability
• Consistency or stability of a measure
• A measure is said to be reliable when you get the same results at different
times, with different users, or in different situations

• More precisely,
• It indicates the fraction of observed variance that is systematic, as opposed to
random
• In CTT, reliability is the squared correlation between true and observed scores
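
Both statements above can be checked directly on simulated data. In this sketch (hypothetical values; error variance 0.25 vs. true variance 1, so reliability should be 1/1.25 = 0.8), the variance ratio Var(T)/Var(X) and the squared true-observed correlation agree.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 10_000
true = rng.normal(0, 1, size=n)       # true scores across people
error = rng.normal(0, 0.5, size=n)    # random error, independent of true scores
observed = true + error

# Reliability: fraction of observed variance that is systematic (true)
rel_variance = true.var() / observed.var()

# In CTT this equals the squared correlation between true and observed scores
rel_corr_sq = np.corrcoef(true, observed)[0, 1] ** 2

print(f"Var(T)/Var(X) = {rel_variance:.3f}, corr(T, X)^2 = {rel_corr_sq:.3f}")
```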
Types of Reliability
• Test-Retest
• Equivalent Forms
• Internal Consistency
• Inter-Rater
Test-Retest Reliability
• Calculated by correlating measurements taken at time 1 with
measurements taken at time 2
• Represented as a correlation coefficient
• Higher the correlation, higher the reliability
Equivalent Forms Reliability
• Calculated by correlating measurements from a sample of individuals
who complete two different forms of the same test

• Split halves are another form of equivalent forms
Internal Consistency
• Assesses how consistently the items of a test measure a single
construct
• Affected by the number of items in the test, and
• Correlations among test items
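
The slide does not name a specific index, but the most common internal-consistency estimate, coefficient alpha, depends on exactly the two quantities listed: the number of items and the correlations among them. A minimal sketch with hypothetical data:

```python
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """Coefficient alpha for a respondents-by-items score matrix."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()   # sum of item variances
    total_var = items.sum(axis=1).var(ddof=1)     # variance of total scores
    return (k / (k - 1)) * (1 - item_vars / total_var)

# Hypothetical data: 100 respondents, 4 items tapping one construct
rng = np.random.default_rng(3)
trait = rng.normal(0, 1, size=(100, 1))
items = trait + rng.normal(0, 1, size=(100, 4))   # each item = trait + noise
print(f"alpha = {cronbach_alpha(items):.2f}")
```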
Internal Consistency
Mini-IPIP vs. Students: Internal Consistency
• Extraversion (.77) (.79)
• Agreeableness (.70) (.66)
• Conscientiousness (.69) (.73)
• Neuroticism (.68) (.52)
• Openness to Experience (.65) (.76)
Agreeableness items:
2. Sympathize with others’ feelings
7. Am not really interested in others (R)
12. Believe others have good intentions
17. Am not interested in other people’s problems (R)

Neuroticism items:
4. Have frequent mood swings
9. Am relaxed most of the time (R)
14. Get upset easily
19. Seldom feel blue (R)
Inter-Rater Reliability
• The reliability of several different individuals making judgements
• Assesses how much consensus there is in ratings
• Absolute vs. relative agreement
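
The sketch below illustrates the absolute-vs.-relative distinction with two hypothetical raters who rank candidates identically but differ in leniency: relative agreement is perfect while absolute agreement is poor.

```python
import numpy as np

# Two raters judging the same 5 candidates (hypothetical ratings)
rater_a = np.array([2, 3, 4, 5, 6])
rater_b = rater_a + 2    # same rank order, consistently 2 points more lenient

# Relative agreement: do the raters order candidates the same way?
relative = np.corrcoef(rater_a, rater_b)[0, 1]    # = 1.0, perfect

# Absolute agreement: do they give the same actual scores?
mean_abs_diff = np.abs(rater_a - rater_b).mean()  # = 2.0, consistently off

print(f"relative agreement r = {relative:.2f}, mean |difference| = {mean_abs_diff:.1f}")
```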
Validity & Reliability

[Figure: four target diagrams illustrating the combinations: neither valid nor reliable; reliable but not valid; fairly valid but not very reliable; valid & reliable]
Item Response Theory
• Items have a number of parameters
• Difficulty
• Discrimination
• Guessing
• IRT can estimate these values and provide useful data
• Item and test information
• Ability level estimates
• Applications include
• Study of item bias
• Creating equivalent forms
• Computer adaptive testing
Item Information
• Item 1 (a = 2, b = -1)
• Item 2 (a = 2, b = -0.5)
• Item 3 (a = 1, b = 1)
• Item 4 (a = 1.5, b = 2)
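
Assuming the two-parameter logistic (2PL) model, which matches the a and b parameters listed above (no guessing parameter), item information is I(θ) = a²P(θ)(1 − P(θ)). A sketch:

```python
import numpy as np

def p_correct(theta, a, b):
    """2PL probability of a correct response."""
    return 1 / (1 + np.exp(-a * (theta - b)))

def item_info(theta, a, b):
    """2PL item information: I(theta) = a**2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a**2 * p * (1 - p)

# (a, b) pairs from the slide
items = [(2.0, -1.0), (2.0, -0.5), (1.0, 1.0), (1.5, 2.0)]
for i, (a, b) in enumerate(items, start=1):
    # Information peaks at theta = b, with height a**2 / 4
    print(f"Item {i} (a={a}, b={b}): peak info {item_info(b, a, b):.2f} at theta = {b}")
```

High-discrimination items carry the most information, but only near their own difficulty; that is what makes item selection in CAT worthwhile.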
Test Information
• The information provided by a set of items is simply the sum of the item information:

  TI(θ) = Σᵢ Iᵢ(θ)

• Example: test information for Items 1 and 4
Which item adds the most?
• Start with Items 1 & 4

• If we add Item 2

• If we add Item 3
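
A sketch of this comparison (self-contained, with item_info as in the previous block): it evaluates how much information Item 2 versus Item 3 would add to the Items 1 and 4 baseline at a few trait levels. Which item "adds the most" depends on where on the θ scale precision is needed.

```python
import numpy as np

def item_info(theta, a, b):
    """2PL item information (as in the previous sketch)."""
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return a**2 * p * (1 - p)

items = {1: (2.0, -1.0), 2: (2.0, -0.5), 3: (1.0, 1.0), 4: (1.5, 2.0)}
theta = np.array([-1.0, 0.0, 1.0, 2.0])   # trait levels to compare at

# Baseline: test information = sum of item information for Items 1 and 4
base = item_info(theta, *items[1]) + item_info(theta, *items[4])
print("theta:     ", theta)
print("Items 1+4: ", np.round(base, 2))
for candidate in (2, 3):
    total = base + item_info(theta, *items[candidate])
    print(f"+ Item {candidate}:  ", np.round(total, 2))
# Item 2 adds the most information at low theta, where Item 1 already works;
# Item 3 helps more in the gap around theta = 1 where Items 1 and 4 give little.
```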
Computerized Adaptive Testing
• CATs are used in many popular tests today
• SAT, ACT, GRE

• In CAT, we choose as the next item the one we expect to supply the most information about the individual’s trait level
Intro To CAT
1. Pick an initial item
2. Based on the response, estimate θ
3. Using the current θ, select the item with maximum I(θ)
4. Based on the response, update the θ estimate
5. Check the stopping rule
• e.g., stop if SE < .3
6. If the stopping rule is not met, repeat steps 3-5
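
A compact sketch of this loop, under assumptions the slide leaves open: a 2PL model, a grid-based standard normal prior with EAP θ estimates (matching the prior-distribution example that follows), and a hypothetical administer() stand-in for actually presenting an item.

```python
import numpy as np

# Item parameters (a, b) follow the example on the slides below
items = {1: (1.0, 0.0), 2: (1.5, 2.0), 3: (2.0, -1.0), 4: (2.0, 1.0)}
grid = np.linspace(-4, 4, 161)
posterior = np.exp(-grid**2 / 2)        # standard normal prior (unnormalized)
posterior /= posterior.sum()

def p_correct(theta, a, b):
    return 1 / (1 + np.exp(-a * (theta - b)))

def administer(item_id):
    """Hypothetical: present the item, return 1 (correct) or 0 (wrong)."""
    return int(np.random.rand() < 0.5)

available = set(items)
while available:
    theta_hat = (grid * posterior).sum()          # current EAP estimate

    # 3. Pick the remaining item with maximum information at theta_hat
    def info(i, th=theta_hat):
        a, b = items[i]
        p = p_correct(th, a, b)
        return a**2 * p * (1 - p)
    item = max(available, key=info)
    available.remove(item)

    # 4. Update the posterior with the 2PL likelihood of the response
    a, b = items[item]
    resp = administer(item)
    posterior *= p_correct(grid, a, b) if resp else 1 - p_correct(grid, a, b)
    posterior /= posterior.sum()

    # 5. Stopping rule: posterior SD (the SE of theta) below .3
    mean = (grid * posterior).sum()
    se = np.sqrt(((grid - mean)**2 * posterior).sum())
    if se < 0.3:
        break
```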
Example: Prior Distribution
Item 1 (a = 1, b = 0)
Item 2 (a = 1.5, b = 2)
Item 3 (a = 2, b = -1)
Item 4 (a = 2, b = 1)

[Plot: prior probability distribution over θ, from −4 to 4]
Item 1 Correct
Item 1 (a = 1, b = 0)
Item 2 (a = 1.5, b = 2)
Item 3 (a = 2, b = -1)
Item 4 (a = 2, b = 1)

[Plot: updated distribution over θ after a correct response to Item 1]
Item 2 Wrong
Item 1 (a = 1, b = 0)
Item 2 (a = 1.5, b = 2)
Item 3 (a = 2, b = -1)
Item 4 (a = 2, b = 1)

[Plot: updated distribution over θ after an incorrect response to Item 2]
Item 3 Correct
Item 1 (a = 1, b = 0)
Item 2 (a = 1.5, b = 2)
Item 3 (a = 2, b = -1)
Item 4 (a = 2, b = 1)

[Plot: updated distribution over θ after a correct response to Item 3]
Item 4 Wrong
Item 1 (a = 1, b = 0)
Item 2 (a = 1.5, b = 2)
Item 3 (a = 2, b = -1)
Item 4 (a = 2, b = 1)

[Plot: updated distribution over θ after an incorrect response to Item 4]
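
The updating shown on the preceding four slides can be reproduced with a simple grid approximation: start from a standard normal prior (an assumption; the slides do not state the prior's exact form) and multiply in the 2PL likelihood of each observed response.

```python
import numpy as np

# Responses in the order shown on the slides:
# Item 1 correct, Item 2 wrong, Item 3 correct, Item 4 wrong
items = {1: (1.0, 0.0), 2: (1.5, 2.0), 3: (2.0, -1.0), 4: (2.0, 1.0)}
responses = [(1, 1), (2, 0), (3, 1), (4, 0)]

grid = np.linspace(-4, 4, 161)
posterior = np.exp(-grid**2 / 2)    # standard normal prior (unnormalized)
posterior /= posterior.sum()

for item, correct in responses:
    a, b = items[item]
    p = 1 / (1 + np.exp(-a * (grid - b)))
    posterior *= p if correct else (1 - p)
    posterior /= posterior.sum()
    eap = (grid * posterior).sum()
    print(f"After Item {item} ({'correct' if correct else 'wrong'}): "
          f"EAP theta = {eap:.2f}")
```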
Research on PROMIS
• Our team at IIT was able to improve the PROMIS CAT by reducing the number of items by 50% and making the CAT more efficient in general
• We did this by performing a multidimensional CAT
