Using Excel To Clean and Prepare Data
Using Excel To Clean and Prepare Data
for Analysis
R. Jason Weiss
Development Dimensions International
Robert J. Townsend
California State UniversityFullerton
Cleaning data and preparing it for analysis is one of those
thankless jobs that is dull, laborious, and painstaking, no matter which way you
slice it. The cost of a mistake is considerable, too, as you will discover if you
try to report an observed F of 317. We think the burden can be greatly reduced
with some help from our old friend, Excel. Its true that many of us already use
Excel to clean and prepare data for analysis, but our sense is that few people
leverage Excels considerable strengths in a systematic way. In this article, we
describe a power users approach to cleaning and preparing your data with
Excel. We suggest a phased approach that produces analysis-ready data with-
out destroying the original dataset. Well also look at ways to document your
dataset so that it will make sense when reviewed at a later point, or by other peo-
ple. We conclude with a note about a presentation at the upcoming SIOP con-
ference that needs your input!
Strip out undesirable characters. Often, our data have undesirable char-
acters that are useful for visually displaying the information but can trip up sta-
tistical analyses. Consider the phone number column, for example. We want
References
Weiss, R. J. (2004a, July). Leading Edge: Programming Excel macros. TIP: The Industri-
al-Organizational Psychologist, 42, 127134.
Weiss, R. J. (2004b, April). Leading Edge: Using Excel forms. TIP: The Industrial-Orga-
nizational Psychologist, 41, 6169.