0% found this document useful (0 votes)

59 views3 pages

Simulating Multivariate Structures

This document describes how to simulate multivariate data structures in R for purposes such as demonstrating regression, correlation, factor analysis, or structural equation modeling. It provides an example function to simulate data based on a specified measurement model relating observed variables to latent factors and a structural model describing relationships between latent variables. Several examples are given that vary the models to include errors in measurement, correlated predictors, and multiple indicators of latent variables.

Uploaded by

Diogo Mendes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views3 pages

Simulating Multivariate Structures

Uploaded by

Diogo Mendes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Simulating multivariate structures

[Link]

Simulating multivariate structures using R

The following examples shows how to simulate a multivariate structure with a particular measurement model and a particular structural model. This example produces data suitable for demonstrations of regression, correlation, factor analysis, or structural equation modeling. See the mvtnorm package for more elegant ways to simulate covariance matrices. The set of procedures shown here are meant to help the user think about structural models. The basic logic is in terms of a measurement (factor) model relating observed variables to a set of unobserved factors. Then we have an effects model that describes how the latent variables are interrelated. (This is the basic logic of structural equation modeling, but of course, here we are doing it in reverse.) First we create a function (mes) that does the work. Parameters to be passed to this function are a factor model (also known as a measurement model) relating how each item relates to a number of latent factors. Then we create an effects model, which is a set of path coefficients between the latent variables. In this page, as well as most of my examples, the blue text can be copied directly into R.

mes <- function(fmodel,effect,numberofcases=1000) {

# define a general function in terms of a factor model and an effects matrix

numberofvariables <- dim(fmodel)[1] #problem size determined by input to the function numberoflatent <- dim(fmodel)[2] tmodel <- t(fmodel) #transpose of model # fmodel %*% tmodel #show the resulting measurement structure communality=diag(fmodel%*%tmodel) uniqueness=1-communality errorweight=sqrt(uniqueness) errorweight=diag(errorweight) #find how much to weight true scores and errors given the measurement model

#how much to weight the errors

latentscores <- matrix(rnorm(numberofcases*(numberoflatent)),numberofcases) #create true scores for the latent variables #round(cor(latentscores),2) #if uncommented, this shows the sample true score correlation matrix of the factors latentscores <- latentscores%*%effect #create true scores to reflect structural relations between the factors # round(cor(latentscores),2) #note that the factors are now correlated truescores <- latentscores %*% tmodel #round(cor(truescores),2) #show the true score correlation matrix (without error) error<- matrix(rnorm(numberofcases*(numberofvariables)),numberofcases) #create normal error scores error=error%*%errorweight observedscore=truescores+error allscores<- [Link](observedscore,truescores) return(allscores) } #end of function mes

The first example is the classic multiple regression of two predictors and one criterion. We are assuming no errors of measurement and uncorrelated (in the population) predictors. The values of beta for X1 and X2 may be changed for alternative demonstrations. Note that the unique variance of the Y variable is specified in terms of what is not predicted by X1 and X2.
#example 1 2 predictors, 1 criterion variable no errors of measurement

beta1 <- .5 beta2 <- .6 uniquey <- sqrt(1-(beta1^2 + beta2^2)) fmodel <- matrix(c (1,0,0, 0,1,0, 0,0,1), nrow=3,ncol=3,byrow=TRUE) effect <- matrix(c ( 1,0,beta1, 0,1,beta2, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) [Link] <- mes(fmodel,effect) names([Link]) <- c("X1","X2", "Y","T1","T2","Ty") round(var([Link]),2) summary(lm(Y ~ X1+X2,data=[Link]))

This next example is merely two correlated predictors, still with no measurement error.
#example 2 2 predictors, 1 criterion variable no errors of measurement correlated predictors

beta1 <- .5 beta2 <- .6 rx1x2 <- .5 uniquex2 <- sqrt(1-rx1x2) uniquey <- sqrt(1-(beta1^2 + beta2^2)) fmodel <- matrix(c (1,0,0, 0,1,0, 0,0,1), nrow=3,ncol=3,byrow=TRUE) effect <- matrix(c ( 1,rx1x2,beta1, 0,beta2,0, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) [Link] <- mes(fmodel,effect) names([Link]) <- c("X1","X2", "Y","T1","T2","Ty") round(var([Link]),2) round(cor([Link]),2) summary(lm(Y ~ X1+X2,data=[Link]))

Here we modify example 1 by introducing errors of measurement into the criterion variable. This does not affect the relationships between the true scores, but does between the observed scores.
#example 3 2 predictors, 1 criterion variable errors in measurement of criterion

beta1 <- .5 beta2 <- .6 uniquey <- sqrt(1-(beta1^2 + beta2^2)) fmodel <- matrix(c (1,0,0, 0,1,0,

1 de 3

04/11/2011 23:57

Simulating multivariate structures

[Link]

0,0,.5), nrow=3,ncol=3,byrow=TRUE) effect <- matrix(c ( 1,0,beta1, 0,1,beta2, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) [Link] <- mes(fmodel,effect) names([Link]) <- c("X1","X2", "Y","T1","T2","Ty") round(var([Link]),2) round(cor([Link]),2) summary(lm(Y ~ X1+X2,data=[Link]))

Now we introduce errors in measurement on the predictors. An interesting exercise is to vary the amount of error for the X1 and X2 variables. Also, compare the effect of changing the beta weights.
#example 4 2 predictors, 1 criterion variable errors in measurement of predictors

beta1 <- .5 beta2 <- .6 uniquey <- sqrt(1-(beta1^2 + beta2^2)) fmodel <- matrix(c (.8,0,0, 0,.6,0, 0,0,1), nrow=3,ncol=3,byrow=TRUE) effect <- matrix(c ( 1,0,beta1, 0,1,beta2, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) [Link] <- mes(fmodel,effect) names([Link]) <- c("X1","X2", "Y","T1","T2","Ty") round(var([Link]),2) round(cor([Link]),2) summary(lm(Y ~ X1+X2,data=[Link]))

Now introduce errors in measurement into both predictors as well as the criterion variable.
#example 5 2 predictors, 1 criterion variable errors in measurement

beta1 <- .5 beta2 <- .6 uniquey <- sqrt(1-(beta1^2 + beta2^2)) fmodel <- matrix(c (.8,0,0, 0,.6,0, 0,0,.5), nrow=3,ncol=3,byrow=TRUE) effect <- matrix(c ( 1,0,beta1, 0,1,beta2, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) [Link] <- mes(fmodel,effect) names([Link]) <- c("X1","X2", "Y","T1","T2","Ty") round(var([Link]),2) round(cor([Link]),2) summary(lm(Y ~ X1+X2,data=[Link]))

Now, finally, we can consider what happens in the case where we have errors of measurement in the predictors as well as the criterion, but more importantly, we have multiple indicators of all of the latent variables. This is, of course, an example of the general problem encountered in structural equation modeling. By having multiple indicators of the latent variables, we are able to estimate the errors of measurement that affected all of the previous models but that was not possible to estimate. For this case, notice that the factor model (fmodel) has more than one variable loading on each factor. The effects matrix is the same as before. This particular example then rescales the observed score matrix to put the variables into more "realistic" terms. The model may be seen as having three estimates of ability, two of achievement, and three of performance. The particular example assumes that there are 3 measures of ability (GREV, GREQ, GREA), two measures of motivation (achievment motivation and anxiety), and three measures of performance (Prelims, GPA, MA). These titles are, of course, arbitrary and can be changed easily.
#example 6 9 variables on 3 factors, with the first two predicting the 3rd

title<-

"data set for Psychology 405: Psychometric Theory"

#<---title goes here

#measurement (factor) model for 3 factors and 9 variables fmodel <- matrix(c (.9, 0, 0, .8, 0, 0, .7, .7, 0, 0, .6, 0, 0, -.8, 0, 0, 0, .7, 0, 0, .6, 0, 0, .5), nrow=numberofvariables,ncol=numberoflatent,byrow=TRUE)

#structural model for 3 factors (the diagonals reflect unique variance, the off diagonals the structure coefficients beta1 <- .7 beta2 <- .6 uniquey <- sqrt(1-(beta1^2 + beta2^2)) effect <- matrix(c ( 1,0,beta1, 0,1,beta2, 0,0,uniquey),nrow=3,ncol=3,byrow=TRUE) observedscore <- mes(fmodel,effect) #1000 subjects is the default

round(cor(observedscore),2)

#show the correlation matrix #give the data "realistic" properties

GREV=round(observedscore[,1]*100+500,0) GREQ=round(observedscore[,2]*100+500,0)

2 de 3

04/11/2011 23:57

Simulating multivariate structures

[Link]

GREA=round(observedscore[,3]*100+500,0) Ach=round(observedscore[,4]*10+50,0) Anx=round(-observedscore[,5]*10+50,0) Prelim=round(observedscore[,6]+10,0) GPA=round(observedscore[,7]*.5+4,2) MA=round(observedscore[,8]*.5+3,1) data=[Link](GREV,GREQ,GREA,Ach,Anx,Prelim,GPA,MA) summary(data) #basic summary statistics round(cor(data),2) #show the resulting correlations #it is, of course, identical to the previous one

#example 6 9 variables on 3 factors, with the first two predicting the 3rd # The particular example assumes that there are 3 measures of ability (GREV, GREQ, GREA), two measures of motivation (achievment #motivation and anxiety), a

title<-

"data set for Psychology 405: Psychometric Theory"

#<---title goes here

#measurement (factor) model for 3 factors and 9 variables fmodel <- matrix(c (.9, 0, 0, .8, 0, 0, .7, 0, 0, 0, .6, 0, 0, .8, 0, 0, 0, .7, 0, 0, .6, 0, 0, .5), nrow=numberofvariables,ncol=numberoflatent,byrow=TRUE)

#structural model for 3 factors (the diagonals reflect unique variance, the off diagonals the structure coefficients effect <- matrix(c(1,0,.7, 0,1,.6, 0,.0,.39),nrow=numberoflatent,byrow=TRUE) observedscore <- mes(fmodel,effect) #1000 subjects is the default

round(cor(observedscore),2)

#show the correlation matrix #give the data "realistic" properties sds <-c(100,100,100,10,10,1,.5,.5,rep(1,8)) means <- c(500,500,500,50,50,10,4,3,rep(0,8)) t <-observedscore *sds+means GREV=round(observedscore[,1]*100+500,0) GREQ=round(observedscore[,2]*100+500,0) GREA=round(observedscore[,3]*100+500,0) Ach=round(observedscore[,4]*10+50,0) Anx=round(-observedscore[,5]*10+50,0) Prelim=round(observedscore[,6]+10,0) GPA=round(observedscore[,7]*.5+4,2) MA=round(observedscore[,8]*.5+3,1) data=[Link](GREV,GREQ,GREA,Ach,Anx,Prelim,GPA,MA) summary(data) #basic summary statistics round(cor(data),2) #show the resulting correlations #it is, of course, identical to the previous one

#this data set has been saved

and may be used for another analyses

datafilename="[Link] dataset =[Link](datafilename,header=TRUE) #read the data file

part of a short guide to R Version of May 4, 2004 - revised October 22, 2005 to be somewhat more readable

William Revelle Department of Psychology Northwestern University

3 de 3

04/11/2011 23:57

Lecture 10
No ratings yet
Lecture 10
5 pages
GuideToIRTinvarianceUsingMIRT (ANCHOR)
No ratings yet
GuideToIRTinvarianceUsingMIRT (ANCHOR)
10 pages
Introduction to SEM with lavaan
No ratings yet
Introduction to SEM with lavaan
33 pages
T G S E M W L V: I. Specification
No ratings yet
T G S E M W L V: I. Specification
25 pages
Guide To IRT Invariance Tests in R
No ratings yet
Guide To IRT Invariance Tests in R
14 pages
Simple Regression Model Fitting
No ratings yet
Simple Regression Model Fitting
5 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
Linear Regression Analysis in R
100% (1)
Linear Regression Analysis in R
15 pages
Linear Latent Variable Models in R: Odel Building ON Linear Constraints
No ratings yet
Linear Latent Variable Models in R: Odel Building ON Linear Constraints
2 pages
WEEK
No ratings yet
WEEK
17 pages
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
No ratings yet
ASSIGNMENT NO - 2, FDAS - SUMANYAKUMARI - Bfia
6 pages
QMM: Exercise Sheet 8 - Structural Equation Model: Structural Regression
No ratings yet
QMM: Exercise Sheet 8 - Structural Equation Model: Structural Regression
3 pages
Regression and Classification Analysis
No ratings yet
Regression and Classification Analysis
101 pages
R Notesss
No ratings yet
R Notesss
12 pages
Summary Statistics and Data Analysis in R
No ratings yet
Summary Statistics and Data Analysis in R
11 pages
Linear Regression
No ratings yet
Linear Regression
22 pages
R Programming Practical Exercises
No ratings yet
R Programming Practical Exercises
13 pages
R Console
No ratings yet
R Console
6 pages
Introduction To Econometrics With R
No ratings yet
Introduction To Econometrics With R
18 pages
Make Up Cat
No ratings yet
Make Up Cat
6 pages
DS File Et C1 23
No ratings yet
DS File Et C1 23
15 pages
Module - 4 (R Training) - Basic Stats & Modeling
No ratings yet
Module - 4 (R Training) - Basic Stats & Modeling
15 pages
A028 GLM-SC3
No ratings yet
A028 GLM-SC3
137 pages
Business Analytics C-2
No ratings yet
Business Analytics C-2
7 pages
Linear Regression
No ratings yet
Linear Regression
17 pages
Computational Psychology
No ratings yet
Computational Psychology
39 pages
R Functions for Statistical Analysis
No ratings yet
R Functions for Statistical Analysis
4 pages
Mock Exam - Appendix
No ratings yet
Mock Exam - Appendix
15 pages
Videos and Tutorials On Data Analysis in The Psychometrics Lab
No ratings yet
Videos and Tutorials On Data Analysis in The Psychometrics Lab
13 pages
Lab 5
No ratings yet
Lab 5
6 pages
Understanding Interactions in Linear Models
No ratings yet
Understanding Interactions in Linear Models
4 pages
Multiple Variables: Regression
No ratings yet
Multiple Variables: Regression
14 pages
UNIT 2 Notes
No ratings yet
UNIT 2 Notes
8 pages
R Practicals
No ratings yet
R Practicals
32 pages
Statistical Modelling
No ratings yet
Statistical Modelling
39 pages
04 BasicAnalyses
No ratings yet
04 BasicAnalyses
44 pages
Multivariable Regression Guide
No ratings yet
Multivariable Regression Guide
79 pages
Regression in R
No ratings yet
Regression in R
40 pages
Lavaan Package in RStudio
No ratings yet
Lavaan Package in RStudio
39 pages
Uni T - 2 - R Programming
No ratings yet
Uni T - 2 - R Programming
10 pages
MultivariableRegression Summary
No ratings yet
MultivariableRegression Summary
15 pages
Principal Component Regression Guide
No ratings yet
Principal Component Regression Guide
12 pages
Essential R Commands Guide
No ratings yet
Essential R Commands Guide
11 pages
R Stastics PDF
No ratings yet
R Stastics PDF
30 pages
Regression Model Building Guide
No ratings yet
Regression Model Building Guide
41 pages
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
No ratings yet
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
8 pages
Gary Chamberlain Econometric S
No ratings yet
Gary Chamberlain Econometric S
152 pages
Analysis of Panel Data 2019
No ratings yet
Analysis of Panel Data 2019
29 pages
Multivariate Assign
No ratings yet
Multivariate Assign
11 pages
An Introduction To The Psych Package: Part I: Data Entry and Data Description
No ratings yet
An Introduction To The Psych Package: Part I: Data Entry and Data Description
63 pages
A+B A-B B AB A: Homework 1
No ratings yet
A+B A-B B AB A: Homework 1
3 pages
Modern Regression 1 - hw6
No ratings yet
Modern Regression 1 - hw6
11 pages
Exam 1 Notes
No ratings yet
Exam 1 Notes
4 pages
Econometrics 2019 PDF
No ratings yet
Econometrics 2019 PDF
143 pages
SEM Analysis in R for Researchers
No ratings yet
SEM Analysis in R for Researchers
15 pages
Samkhya Philosophy: Yoga Veda Institute
No ratings yet
Samkhya Philosophy: Yoga Veda Institute
7 pages
Euro J of Education - 2015 - Miller - Learning The Future and Complexity An Essay On The Emergence
No ratings yet
Euro J of Education - 2015 - Miller - Learning The Future and Complexity An Essay On The Emergence
11 pages
Measurement of Creativity
No ratings yet
Measurement of Creativity
14 pages
Financial Problem
67% (15)
Financial Problem
42 pages
Action Plan
No ratings yet
Action Plan
3 pages
Political Science - Wikipedia
No ratings yet
Political Science - Wikipedia
48 pages
Geometric Optics Manual
No ratings yet
Geometric Optics Manual
7 pages
Factors Influencing Employee Performance
100% (1)
Factors Influencing Employee Performance
5 pages
African Philosophy: Ubuntu & Maat
No ratings yet
African Philosophy: Ubuntu & Maat
8 pages
Theory of Machines Lab: Department of Mechanical Engineering University of Engineering & Technology Lahore, Pakistan
No ratings yet
Theory of Machines Lab: Department of Mechanical Engineering University of Engineering & Technology Lahore, Pakistan
31 pages
Ulysses' Leadership and Character Traits
100% (1)
Ulysses' Leadership and Character Traits
2 pages
Ethics Group 5
No ratings yet
Ethics Group 5
11 pages
Advanced Logistics and Material Management Course
No ratings yet
Advanced Logistics and Material Management Course
8 pages
2b Chakra Rulebook
No ratings yet
2b Chakra Rulebook
2 pages
Understanding Tawhid in Islam
No ratings yet
Understanding Tawhid in Islam
144 pages
Naess A 1995
No ratings yet
Naess A 1995
9 pages
Castel, Pierre-Henri. The Coming Evil
No ratings yet
Castel, Pierre-Henri. The Coming Evil
22 pages
Legal Battle: Allado & Mendoza Case
No ratings yet
Legal Battle: Allado & Mendoza Case
9 pages
Figurative Language Game: Can You Discover The Missing Picture by Answering Questions About Figurative Language?
No ratings yet
Figurative Language Game: Can You Discover The Missing Picture by Answering Questions About Figurative Language?
17 pages
International Handbook On The Continuing Professional Development of Teachers
100% (2)
International Handbook On The Continuing Professional Development of Teachers
335 pages
A Comparative Essay On Ellen Goodman
0% (2)
A Comparative Essay On Ellen Goodman
3 pages
The Persistence of Complexity - Re-Reading Donna Haraway's Cyborg Manifesto (Gandy)
No ratings yet
The Persistence of Complexity - Re-Reading Donna Haraway's Cyborg Manifesto (Gandy)
4 pages
Contract Law Case Analysis
100% (1)
Contract Law Case Analysis
14 pages
Novdec Waec Time Table
No ratings yet
Novdec Waec Time Table
5 pages
Gerunds vs. Infinitives Explained
No ratings yet
Gerunds vs. Infinitives Explained
3 pages
Textbook Review
No ratings yet
Textbook Review
9 pages
Question Paper Unit g481 Mechanics
No ratings yet
Question Paper Unit g481 Mechanics
16 pages
The 100 Basic Facts For Multiplication and Addition
No ratings yet
The 100 Basic Facts For Multiplication and Addition
5 pages
Chapter - 3 Research Design
No ratings yet
Chapter - 3 Research Design
19 pages
Trostruka Mantra
No ratings yet
Trostruka Mantra
4 pages