ICSSR Data Service
ICSSR Data Service
Pallab Pradhan,
Scientist - B
[email protected]
www.inflibnet.ac.in
1
Background
ICSSR, established by Government of India in 1969, with its 27 research institutes
and 6 regional centres across India; large volume of data are being generated by
these research institutes as well as individual researchers through research
projects funded by the ICSSR.
National Policy on Data Sharing and Accessibility (NPDSA) was initiated and published in
2012 by Ministry of Science and Technology, Government of India to increase the
accessibility and easy sharing of non-sensitive data available either in digital or analog
forms that is generated using public funds by various Ministries / Departments /
Subordinate offices / Organizations / Agencies of Government of India as well as States.
The NDSAP policy is designed to promote data sharing and enable access to Government
of India owned data for national planning, development and awareness
ICSSR Data Service: Indian Social Science Data Repository is created to support Social
Science Research with free availability of data to everyone for use, reuse and republish
without restrictions or minimal restrictions.
INFLIBNET Centre 2
Background
ICSSR has signed MoU with MoSPI. Decision to acquire data from other social science
institutes, ministries and other Government agencies are on process and will be taken
soon.
Workshop on Data Repository was organized on 7 th and 8th January, 2015 at UK Data
Centre, University of Essex, UK as per MoU between ICSSR and UK RC.
Task for setting-up Indian Social Science Data Repository was assigned to the INFLIBNET
Centre as a project for a period of two years.
The Centre explored a number of open source software including CKAN, DCAN, NADA,
LifeRay, Fedora, etc. NADA has been identified as most suitable platform for hosting the
repository, specially for MoSPI datasets.
INFLIBNET Centre 3
Background
Reviewed features and functionalities of various International Social Science Data Repositories under
Consortium of European Social Science Data Archives (CESSDA), Australian Data Archive, ICPSR (Uni. of
Michigan), etc.
The NADA software is installed on a server specifically acquired for the repository at the INFLIBNET Centre.
All 135+ datasets received from MoSPI through ICSSR have been extracted from raw files, transformed and
uploaded into the repository.
The INFLIBNET Centre has signed a license agreement with the UK Data Service for using Humanities and
Social Science Electronic Thesaurus (HASSET) to index datasets.
All datasets in the repository are indexed using Humanities and Social Science Electronic Thesaurus
(HASSET).
Detailed documentation including: i) ICSSR Data Service: Data Deposit and Access Policy Guideline; ii)
Introductory Brochure; iii) User’s Manual on How to Make Effective Use of Data Repository; iv) User’s Manual
on SPSS, STATA and R.
ICSSR Data Service was launched formally on 20th June, 2016 by Dr. T.C.A. Anant, Secretary and Chief
Statistician of India, MoSPI at ICSSR, New Delhi.
INFLIBNET Centre 4
Objectives
To serve as a national data service for promoting powerful research environment through sharing and reuse
of data among social science community in India.
To acquire, process, organize, preserve and host research data and its metadata along with ETL (extract,
transform and load) facilities of raw data in social sciences and related domains collected from diverse
sources for easy sharing and access.
To facilitate online submission, access, search, browse, discovery, conversion, analysis and visualization of
data through intuitive interfaces.
To impart training and spread awareness about benefits of data sharing and reuse amongst social science
research community in India.
Interact, cooperate and collaborate with other national and international data services and repositories for
data and resource sharing and improved management of data services.
INFLIBNET Centre 5
ICSSR Data Service – DR - Policy Framework (Available Online)
Data Access to data Metadata: DDI Depositors, Data Scope: subjects &
Preservation, objects, Use and Standard Moderation by languages, Types
Withdrawal and reuse of data (Access & Reuse repository, Data of research data,
Succession Plans objects, Tracking of metadata, quality Status of the data,
users and usage Metadata types requirements, Versions, File
statistics and sources, Confidentiality & formats, Volume &
Metadata disclosure, size limitations
schemas and Embargo period
standards) and status, Rights
& ownership
Sources of Data
MoSPI ICSSR
(MoU Signed) Institutes
Data
Repository
Private
Data Sources &
Individual Government
s Researchers, Orgizations
Students, &
Others NGOs
INFLIBNET Centre 7
Dataset Category and Contributions
NSS Funding
Universities
(92 Datasets) Agencies
ASI Govt.
Colleges
(43 Datasets) Organizations
Private
Researchers Organization
s
Students NGOs
INFLIBNET Centre 8
Stakeholders
INFLIBNET Centre 9
Platform : ICSSR Data Service
http://www.icssrdataservice.in
INFLIBNET Centre 10
Microdata Catalog
INFLIBNET Centre 11
Central Data Catalog
Collections
Datasets
1 st
INFLIBNET Centre 12
Sorting and Exporting Data List
Country
Year
Sort
Title
Popularity
INFLIBNET Centre 13
Search : Study and Variable Description
• By Year
Filter • By Country
• By Data Access
• By Collection
INFLIBNET Centre 14
Metadata : At a glance
Title
Region
Year
Organization
Collection
Created Date
Modified Date
Views
INFLIBNET Centre 15
Metadata: Detailed
Documentatio
n
Study
Description
Data
Description
Get Microdata
INFLIBNET Centre 16
Metadata: Documentation
• Schedule Information
• List of Households
Questionnaire
s
• Key Reports
Reports
• State Code
• District Code
• List of towns
Other • Sub Stratum size
Materials • Allocation of samples
INFLIBNET Centre 17
Metadata: Study Description
DDI (XML)
Dublin Core
(RDF)
PDF
Data Export
Over Sam Collect
Data
Metad
view pling Access
ion ata
INFLIBNET Centre 18
Data
Blocks
Variabl
es
Dataset
INFLIBNET Centre 19
Metadata: Variable Description
Valid
Cases
Invalid
Cases
Format
and Type
Width
INFLIBNET Centre 20
User Registration Workflow
INFLIBNET Centre 21
User Registration Form
•Pa
P Or Cr
•First
•Des
Nam
e ign ss
•Last
Nam er atio
n
ga wo e
e
•Email
Addr so •Org ni rd
•Co d
ani
ess
•Cont
n zati za nfi e
act
on
Num
ber
al •Ad tio rm
nt
Pa
n
•Coun
dre
ss ial
try
•Purp ss
ose
•ID
•Sta wo
Proof te rd
INFLIBNET Centre 22
Acknowledgement email for account creation
INFLIBNET Centre 23
Activation link in email for account activation
INFLIBNET Centre 24
Submission/Deposit of data to ICSSR Data
Service
Supports almost all preferred machine readable data formats for hosting into the
repository:
raw or preliminary data;
data that are ready to use and ready for full release;
unit‐level summary data; and
tabulated, analysed and derived data, etc.
Two ways to deposit data: online or offline along with the filled and signed copies of
prescribed Data Deposit Form and Licence Agreement Form.
INFLIBNET Centre 25
Types of access to data/datasets
Open Data Safeguarded Data Controlled Data
•Access to data is •Data is accessible •Access to data
free without any only to the would be
process of registered users controlled through
registration / only after a Secured Lab,
authorization to prescribed online stored in a secured
one and all without registration server.
any restrictions. process or •Data declared as
•User can freely authorization by sensitive and
download these the ICSSR Data highly confidential,
data directly from Service once they by Government of
the repository. agree to the terms India policies, will
and conditions of be accessible only
INFLIBNET Centre data usage policy. through this mode. 26
Conditions for data withdrawal
Violation of copyright
Confidentiality concerns
INFLIBNET Centre 27
Conditions for data use/re-use
Users are strictly prohibited to use the data for any kinds of commercial benefits
Misuse of data, data modification, repackaging, and redistribution of datasets in other formats
are not permitted
User should not breach the intellectual property rights of the data owner while reusing the data
Individuals or organizations responsible for misuse of the datasets might be debarred from using
the ICSSR Data Service
INFLIBNET Centre 28
How to download data?
INFLIBNET Centre 29
How to explore & analyse data online ?
INFLIBNET Centre 30
Explore Online
In Explore Online, User can visualize the results as charts and tables. Further, user have the
option to select pre-derived and pre-selected data from the drop-down menu / list
available to generate charts and tables.
INFLIBNET Centre 31
ICSSR Data Analytics
Data Univariat Transfor
Selection e Analysis mation
INFLIBNET Centre 32
Data Selection: Select Data tables
S
e
l
e
c
t
i
o
n
o
f
D
a
t
a
t
a
b
l
e
s
INFLIBNET Centre 33
Data Selection: Select Variables
Select
Data
table
Select
Select Key
Desired
Variable(s
Variable(s
)
)
INFLIBNET Centre 34
Data Selection: Select Variables
INFLIBNET Centre 35
Data Selection: Select Base Table
Select Base
Table
Select type of
join
Click on
Submit
INFLIBNET Centre 36
Data table based on selection
INFLIBNET Centre 37
Univariate Analysis: Qualitative
INFLIBNET Centre 38
Univariate Analysis: Qualitative
INFLIBNET Centre 39
Univariate Analysis: Quantitative
INFLIBNET Centre 40
Univariate Analysis: Quantitative
INFLIBNET Centre 41
Transformation
Transformation
Computation
Aggregation Recoding
Computing
Filtering
a new
variable Data
INFLIBNET Centre 42
Compute new variable
Compute a new
variable
Addition
Subtraction
Multiplication
Division
Deciles
Log
Exponential
or more
INFLIBNET Centre 43
Compute new variable
INFLIBNET Centre 44
Filter Data
INFLIBNET Centre 45
Data Aggregation
Select
Select Apply
Break
Variabl Functi
Variabl
e on
e
INFLIBNET Centre 46
Data Recoding
T
y
p
e
n
e
w
n
a
m
e
o
f
L
e
v
e
l
s
(
N
a
m
e
f
o
r
R
e
c
o
d
i
n
g
)
INFLIBNET Centre 47
Cross Tabulation
Select Operation
Row % X 100
Column % X 100
Row % X 1000
Column % X 1000
Row Fraction
Column Fraction
INFLIBNET Centre 48
Pivot Analysis
• Tabulation
Pivot • Filtering
Analysis • Visualization
• Aggregation
INFLIBNET Centre 49
Pivot Analysis
Charts
• Table Bar
• Heat Map
• Row Heat Map
• Column Heat Map
• Tree Map
• Line
• Bar
• Stacked Bar
• Area
• Scatter
INFLIBNET Centre 50
Pivot Analysis
Aggregation
• Count
• Count Unique Values
• List Unique Values
• Sum
• Integer Sum
• Average
• Minimum
• Maximum
• Sum over Sum
• 80% Upper Bound
• 80% Lower Bound
• Sum as Fraction of Total
• Sum as Fraction of Rows
• Sum as Fraction of Columns
• Count as Fraction of Total
• Count as Fraction of Rows
• Count as Fraction of Columns
INFLIBNET Centre 51
Data Visualization
INFLIBNET Centre 52
Data Visualization
INFLIBNET Centre 53
Data Visualization
INFLIBNET Centre 54
Data Visualization
INFLIBNET Centre 55
How to cite data/datasets after its use?
The ICSSR Data Service expects that users of it’s data from the repository should
provide correct citation and acknowledgement for data used by them.
For Example:
Central Statistics Office (Industrial Statistics Wing), MOSPI, Government of India. Annual
Survey of Industries: 1983-84 [computer file]. × Edition. Kolkata, India: ICSSR Data Service
[distributor], October 2012. SN: ×, DOI/Handle: xxx xxx xxx.
INFLIBNET Centre 56
Thank You…!