Data Management
Data Management
April, 2017
1
Part-I
Data Management
Part-II
Data Capturing Tools
2
Data Management overview outline
Definitions and Principles
Unforeseen Problems and Solution Tools
DM Process and Questionnaire Development
Questionnaire Handling
Procedures for Completion of Questionnaire
Archiving of Questionnaire
Tsegaye Hailu
Definitions and Principles
“Data management” is a general term covering
procedures both for:
– the collection of data at study sites and
– the quality control of those data before and after they
have been submitted to a statistical analysis or
coordinating centre.
Data management includes all aspects of data
planning, handling, analysis, documentation and
storage, and takes place during all stages of a study.
Data management team is responsible for producing
high quality databases containing high quality data
meet operational, clinical and regulatory requirements
4
WHY COLLECT THE DATA ?
Tsegaye Hailu
WHY COLLECT THE DATA ? (1)
To meet objectives of study & health and patient
management strategy
• Data Content
Tsegaye Hailu
To Achieving Quality of Data:
Leading to:
Credible health research data.
Tsegaye Hailu
When does Data Management Begin?
Begins with the overall planning process of the research
/ survey, or whatever the purpose is.
Tsegaye Hailu
Data Management motto: G I G O
Tsegaye Hailu
PROBLEMS
WHAT POSSIBLE
PROBLEMS
CAN YOU ENVISAGE?
Tsegaye Hailu
UNFORESEEN PROBLEMS
Data and/or software occasionally gets corrupted for
some unknown reasons.
Tsegaye Hailu
UNFORESEEN PROBLEMS cont’d
Tsegaye Hailu
FIELD AND DATA ENTRY PROBLEMS
Difficulty getting exact date from Subjects.
Dates: _ _ /01/ 98, 1 month ago, weeks ago.
Writing eligibly
Tsegaye Hailu
How Can Data Management Solve this Problem?
Tsegaye Hailu
TOOLS FOR THE PROBLEMS (1)
DETECTING ERRORS IN DATA
Manual Checking: Manually going through forms
Interviewers (F. Workers), Data supervisors (if any)
Tsegaye Hailu
TOOLS FOR THE PROBLEMS (2)
DETECTING ERRORS IN DATA
VERIFICATION:
• Used to ensure that data entered is actually data
on the questionnaire. This is normally
accomplished by double entry (entry by two
different clerks).
Tsegaye Hailu
TOOLS FOR THE PROBLEMS (4)
DATA PROCESSING PROCEDURES
Archiving of data
Tsegaye Hailu
DM: ISSUES IN D’VELOPING COUNTRIES
Tsegaye Hailu
CONCLUSION
Tsegaye Hailu
DATA MANAGEMENT IN OR FORMS
Questionnaire / Form:
Participants’ data are collected in the study questionnaire with
unique identifiers on each form and specimen label.
Responsibility:
It is the investigators’ responsibility to ensure accuracy,
legibility and completeness of data entry in the questionnaire
and in all other required report forms and logs.
Tsegaye Hailu
DATA MANAGEMENT PROCESS
on going eQES
Data Entry Instructions
design and validation
Qu
Data Entry erie
s Monitor
-----------------
ries
Data Validation Que Investigator
Clean File
Statistical Analysis
Completion of OStudy
QES Archiving
Report
Tsegaye Hailu
Purpose of Designing Questionnaire
Collects relevant data in a specific format
in accordance with the protocol
compliance with regulatory requirements; IEC/IRB
Tsegaye Hailu
Questionnaire Relationship to Protocol
Tsegaye Hailu
Questionnaire Development Process
Designer: Drafts questionnaire from protocol
• Avoid duplication
Tsegaye Hailu
Poorly Designed Questionnaire
• Data not collected
Tsegaye Hailu
Submitting Questionnaire
“The investigator should ensure the accuracy completeness,
legibility, and timeliness of the data reported to the sponsor in
the CRFs (Questionnaire) and in all required reports.”
ICH-GCP 4.9.1
Tsegaye Hailu
Questionnaire Safety and Precautions
•Keep questionnaires in a well-protected location.
Tsegaye Hailu
PROCEDURE
1. State of the Questionnaires:
• Verify that each questionnaire page conforms to the
procedures to be performed on that study day.
Tsegaye Hailu
PROCEDURE 3 CONT’D
Tsegaye Hailu
MAKING CORRECTIONS:
a. Authorized actions:
• Cross out the wrong entry with a single line
• Write the correct entry alongside/above/under the
wrong entry
• Initial the correction
• Date the correction
d. Prohibited actions:
Use the correcting fluids
Erasing or overwriting entries
Intentionally entering false data
Illegible entries
Tsegaye Hailu
Examples of Data onto CRFs
1. Entering the Data:
Tsegaye Hailu
Examples of Data onto CRFs 1 cont’d
__/___ 2004
dd mm yyyy
– The data field. Please record the date in the
European format (i.e. day/ month/year).
Tsegaye Hailu
Examples of Data onto CRFs 1 cont’d
2. Correction procedure
_08_/_05_/2016 TH
09/05/2016
Tsegaye Hailu
Data Entry and Validation
Data processing errors are errors that occur
after data have been collected.2 Examples of
data processing errors include:
Transpositions (e.g., 19 becomes 91)
• Copying errors (e.g., 0 (zero) becomes O)
• Coding errors (e.g., a racial group).
Routing errors (e.g., the interviewer asks the wrong
question or asks questions in the wrong order)
• Consistency errors (contradictory responses, such as
the reporting of a hysterectomy after the respondent
has identified himself as a male)
• Range errors (responses outside of the range)
Tsegaye Hailu
Data Entry and Validation cont’d
To prevent such errors, you must identify the stage
at which they occur and correct the problem.
Methods to prevent data entry errors include:
Manual checks during data collection (e.g., checks
for completeness, handwriting legibility)
• Range and consistency checking during data entry
(e.g., preventing impossible results, such as ages
greater than 110)
• Double entry and validation following data entry
• Data analysis screening for outliers during data
analysis
Tsegaye Hailu
DM AND Questionnaire (6)
BACK-UPs & ARCHIVING :
• Back-up of data entered should be on DM’s computer/
CD /USB at end of each day/ week/month appropriately.
48
Data Capturing Tools outline
• Secure and web-based. Input data from anywhere in the world with secure web authentication,
data logging, and Secure Sockets Layer (SSL) encryption.
• Fast and flexible Conception to production-level database in less than one day.
• Multisite access. Projects can be used by researchers from multiple sites and institutions.
• Fully customizable. You are in total control of shaping your database or survey.
• Advanced question features. Auto-validation, branching logic, and stop actions.
• Mid-study modifications. You may modify the database or survey at any time during the study.
• Data import functions. Data may be imported from external data sources to begin a study or to
provide mid-study data uploads.
• Save your survey or forms as PDFs. Generate a PDF version for printing in order to collect
52
Login interface
53
After login
54
Different Features
55
OpenClinica
• The world’s most widely-used,
open-source software for clinical research
• 1st released in 2005
• Designed to meet the diverse needs of
modern research environments
• Built as a lightweight, extensible, and modular
application
• Web brower
56
Important Features of OpenClinica
• Organization of research by study protocol and site.
• Dynamic generation of web-based CRFs in portable Excel
templates.
• Management of longitudinal data for recurring patient visits
• Data import/export tools for migration of study datasets.
• Interfaces for data query and retrieval across subjects, time, and
clinical parameters
• Compliance with regulatory guidelines e.g. 21 CFR Part 11
• Built on robust and scalable technology infrastructure interoperable
with relational databases
57
Login Interface
58
After login and different features
59
After login and different project
60
Working with OpenClinica
• Policy determination needed
• Required human and material resources allocated
• When know-how is established, utilization requires only
5 main steps:
– Designing
– Creating CRF’s
– Event definitions
– Data Entry
– Data Extraction
61
Designing CRFs
Done in excel using a blank CRF template
provided by Openclinica
62
Uploading CRFs
63
Event Definition
64
Data Entry
65
Epidata
67
How to work with EpiData?
Work Process toolbar“
Define Data
69
EpiData(2)
Close the form as well as the Epi-Editor
Proceed to next section
Create DataFile
Accept the ”first.qes” and ”first.rec” names
for "make datafile“
Data form saved as first.qes
Data file which will contain the data, saved as
first.rec.
70
Add checks of Data Entry
Click Add checks of Data Entry
Add checks specify rules for data entry
71
Now add value labels to a variable
72
Data Entry
• Continue with Enter Data
• Simply activate the Enter data on the toolbar
and accept first.rec for data entry
• Double Entry of Data
Toos->
prepare
double
data entry
73
Export, Analysis and options
• Export to any data format
74
Data management and analysis using stata
• Running Stata
• Stata windows shown below
75
Data management using stata(6)
• Simple linear regression – regress, rvfplot,
other diagnostics
• Correlation – corr, spearman, ktau – I tend not
to use corr because of the sensitivity to the
normality assumption for tests and confidence
intervals
• Only pwcorr and not corr provide test of
significance
76
THANK YOU
77