Monitoring and Supporting Data Conversion
NO: _____
MODULE TITLE: Monitoring and Supporting Data Conversion
NOMINAL DURATION: 40 hrs
LO1. Monitor data conversion
1.1. Defining Concepts of Data Conversion and Data Terminologies
Data are raw facts or unorganized items (such as letters, numbers, or symbols) that refer to, or represent,
conditions, ideas, or objects.
It can be qualitative or quantitative.
Qualitative data is descriptive information (it describes something).
Quantitative data is numerical information (numbers). It can be either discrete or continuous.
Discrete data can only take certain values (like whole numbers).
Continuous data can take any value (within a range).
Put simply: discrete data can be counted; continuous data can be measured.
Example:
Qualitative:
- It is brown and black
- It has long hair
- It has lots of energy
Quantitative:
- Discrete:
  - It has 4 legs
  - It has 10 fingers
- Continuous:
  - It weighs 25.5 kg
  - It is 565 mm tall
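As a rough illustration, part of the example above can be written as a Python record in which each kind of data maps to a typical type. The field names and the kind_of helper are made up for this sketch, and classifying by type alone is a simplification: a measured value stored as a whole number would still be continuous data.

```python
# Hypothetical record for the animal described above; field names are illustrative.
animal = {
    "colour": "brown and black",  # qualitative: descriptive text
    "hair": "long",               # qualitative: descriptive text
    "legs": 4,                    # quantitative, discrete: counted
    "weight_kg": 25.5,            # quantitative, continuous: measured
    "height_mm": 565.0,           # quantitative, continuous: measured
}


def kind_of(value):
    """Classify a value by its Python type (a simplification, see the note above)."""
    if isinstance(value, bool):
        return "qualitative"
    if isinstance(value, int):
        return "quantitative (discrete)"
    if isinstance(value, float):
        return "quantitative (continuous)"
    return "qualitative"


# Map every field name to the kind of data it holds.
kinds = {name: kind_of(value) for name, value in animal.items()}
```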
Data conversion is the process of converting a file or database from one format (or one physical environment)
to another.
Often, when data is moved from one system to another, some form of data conversion is required to convert the data
to a format the receiving system can interpret.
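The idea of converting data so that a receiving system can interpret it can be sketched as follows. This is a minimal example that assumes the source system exports CSV text and the receiving system expects JSON; the column names and sample rows are invented for illustration.

```python
import csv
import io
import json

# Hypothetical export from a source system, as CSV text.
source_csv = "id,name,weight_kg\n1,Rex,25.5\n2,Mia,4.2\n"


def csv_to_json(csv_text):
    """Convert CSV text to a JSON array of objects, typing the numeric columns."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        rows.append({
            "id": int(row["id"]),                  # text -> whole number
            "name": row["name"],                   # stays text
            "weight_kg": float(row["weight_kg"]),  # text -> decimal number
        })
    return json.dumps(rows, indent=2)


target_json = csv_to_json(source_csv)
```

Note that the conversion changes more than the file layout: CSV holds every value as text, so the numeric columns are converted explicitly before they are written as JSON numbers.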
Types of conversion:
Database conversion (SQL, MySQL, MS Access, XLS, XML, etc.)
File format conversion (e.g., PDF to Word)
Image conversion (GIF to JPG, TIFF, PNG, etc.)
Character or string conversion (numeric to alphabetic or vice versa)
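The last type in the list, character or string conversion, can be sketched in Python as shown here. The function names are hypothetical, and the A=1 ... Z=26 mapping is only one possible numeric/alphabetic scheme chosen for illustration.

```python
def parse_number(text):
    """Convert a string to int or float; raises ValueError if it is not numeric."""
    text = text.strip()
    try:
        return int(text)
    except ValueError:
        return float(text)


def letter_to_number(letter):
    """Map 'A'..'Z' (either case) to 1..26."""
    letter = letter.upper()
    if len(letter) != 1 or not "A" <= letter <= "Z":
        raise ValueError(f"not a single letter A-Z: {letter!r}")
    return ord(letter) - ord("A") + 1


def number_to_letter(number):
    """Map 1..26 back to 'A'..'Z'."""
    if not 1 <= number <= 26:
        raise ValueError(f"not in the range 1-26: {number!r}")
    return chr(ord("A") + number - 1)
```

For example, parse_number("25.5") gives the number 25.5, str(565) gives the string "565", and letter_to_number("C") gives 3.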
1.2. Reading and Analyzing Existing Data Conversion Documents
The data conversion process can often be a complex and difficult task during an implementation.
A data conversion begins with analysis of your source data and continues through to system
testing and user acceptance.
Throughout the conversion process, we perform quality control checks to ensure correctness of the conversion.
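One common form of quality control check is to compare the source and the converted data directly. The sketch below assumes both sides can be loaded as lists of rows with the same column names; it compares row counts and a per-row hash. The function names and sample rows are hypothetical, and real checks would be defined in the conversion documents.

```python
import hashlib


def row_digest(row):
    """Hash a row's values in a fixed column order so rows can be compared."""
    joined = "|".join(str(row[key]) for key in sorted(row))
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def check_conversion(source_rows, target_rows):
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    if len(source_rows) != len(target_rows):
        problems.append(
            f"row count differs: {len(source_rows)} source vs {len(target_rows)} target"
        )
    source_digests = {row_digest(row) for row in source_rows}
    target_digests = {row_digest(row) for row in target_rows}
    missing = source_digests - target_digests
    if missing:
        problems.append(f"{len(missing)} source row(s) missing or altered in target")
    return problems


source = [{"id": 1, "name": "Rex"}, {"id": 2, "name": "Mia"}]
good_target = [{"id": 1, "name": "Rex"}, {"id": 2, "name": "Mia"}]
bad_target = [{"id": 1, "name": "Rex"}, {"id": 2, "name": "Mi"}]  # truncated name
```

Here check_conversion(source, good_target) returns an empty list, while check_conversion(source, bad_target) reports one altered row.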
1.3. Understanding Data and Its Characteristics
1.3.1. Data Conversion Systems and Tools
A data conversion tool can convert data in both directions, from and to a wide variety of
formats, including:
SQL Server Tables
Oracle Tables
ODBC Tables
OLE DB Tables
Microsoft Access Tables
XML Files
Once a conversion type is defined, it can be saved and reused either in a future conversion or as a step within a
batch conversion.
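The idea of a saved, reusable conversion definition can be sketched as follows. This example uses Python's built-in SQLite database as a stand-in for the database types listed above and exports a table to XML. The dictionary layout, the table_to_xml function, and the sample table are all hypothetical and do not reflect the format of any particular tool.

```python
import sqlite3
import xml.etree.ElementTree as ET

# A hypothetical, reusable conversion definition (not the format of any real tool).
conversion = {
    "source_table": "customers",
    "columns": ["id", "name"],
    "root_tag": "customers",
    "row_tag": "customer",
}


def table_to_xml(connection, definition):
    """Export the configured columns of a table as an XML string."""
    column_list = ", ".join(definition["columns"])
    # Table and column names come from the trusted definition, not from user input.
    query = f"SELECT {column_list} FROM {definition['source_table']} ORDER BY 1"
    root = ET.Element(definition["root_tag"])
    for record in connection.execute(query):
        row = ET.SubElement(root, definition["row_tag"])
        for column, value in zip(definition["columns"], record):
            ET.SubElement(row, column).text = str(value)
    return ET.tostring(root, encoding="unicode")


# Build a small in-memory table to convert.
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
connection.executemany(
    "INSERT INTO customers VALUES (?, ?)", [(1, "Abebe"), (2, "Sara")]
)
xml_text = table_to_xml(connection, conversion)
```

Because the definition is plain data, it could be stored and passed to table_to_xml again later, or run as one step among several in a batch.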
1.3.2. Data Modeling Methodologies
Data modeling is the formalization and documentation of existing processes and events that occur during
application software design and development.
Data modeling techniques and tools capture and translate complex system designs into easily understood
representations of the data flows and processes, creating a blueprint for construction or re-engineering.
A data model can be thought of as a diagram or flowchart that illustrates the relationships between data.
There are several different approaches to data modeling, including:
- Conceptual Data Modeling - identifies the highest-level relationships between different entities.
- Logical Data Modeling - illustrates the specific entities, attributes and relationships involved in a business
function.
- Physical Data Modeling - represents an application and database-specific implementation of a logical data
model.
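The difference between a logical and a physical data model can be sketched in code. In this example the logical model is written as Python dataclasses, and the physical model is the same design expressed as table definitions for one specific database (SQLite is assumed here). The Customer and Order entities are invented for illustration.

```python
import sqlite3
from dataclasses import dataclass


# Logical model: entities, attributes and a relationship, independent of any database.
@dataclass
class Customer:
    customer_id: int
    name: str


@dataclass
class Order:
    order_id: int
    customer_id: int  # relationship: each order belongs to one customer
    total: float


# Physical model: the same design implemented for one specific database (SQLite).
PHYSICAL_SCHEMA = """
CREATE TABLE customer (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL
);
CREATE TABLE customer_order (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customer(customer_id),
    total       REAL NOT NULL
);
"""

connection = sqlite3.connect(":memory:")
connection.executescript(PHYSICAL_SCHEMA)

# List the tables that the physical schema created.
tables = [
    row[0]
    for row in connection.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
```

The physical model adds details the logical model leaves out, such as storage types, key constraints, and a table name that avoids the reserved word ORDER.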
1.3.3. Data Conditioning and Cleaning
Data conditioning (Pre-processing) is the use of data management and optimization techniques which result in
the intelligent routing, optimization and protection of data for storage or data movement in a computer system.
Data cleaning is the act of detecting and removing or correcting dirty data (i.e., data that is incorrect, out-of-date,
redundant, incomplete, or formatted incorrectly).
Data cleaning helps to increase the overall efficiency of your data management systems and leads to an increase
in the productivity of the organization.
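A minimal sketch of data cleaning, assuming records with a name and an e-mail field, is shown here. It corrects formatting (extra spaces, inconsistent letter case), removes incomplete records, and removes redundant ones. The field names, the rules, and the sample data are assumptions made for the example.

```python
def clean_records(records):
    """Trim text, normalise e-mail case, and drop incomplete or duplicate rows."""
    cleaned = []
    seen_emails = set()
    for record in records:
        name = (record.get("name") or "").strip()
        email = (record.get("email") or "").strip().lower()
        if not name or not email:
            continue  # incomplete: a required field is missing
        if email in seen_emails:
            continue  # redundant: this e-mail was already kept
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email})
    return cleaned


dirty = [
    {"name": "  Abebe ", "email": "ABEBE@example.com"},  # badly formatted
    {"name": "Abebe", "email": "abebe@example.com "},    # redundant
    {"name": "Sara", "email": None},                     # incomplete
    {"name": "Hana", "email": "hana@example.com"},       # already clean
]
clean = clean_records(dirty)
```

Of the four input records, two remain after cleaning: one for Abebe and one for Hana.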
1.3.4. Data Transformation and Integration
Data transformation is one step of the process collectively known as extract, transform, and load (ETL), which is
one of the most important processes in implementing a data warehouse from different data sources.
Data integration is the process of combining heterogeneous data sources into a single queryable schema so as to
get a unified view of the data.
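Transformation and integration can be sketched together. The example below assumes two hypothetical sources that describe employees with different field names and value types; each row is transformed into one shared schema and the results are combined into a single list that can be queried uniformly.

```python
# Two hypothetical sources that describe the same kind of entity differently.
source_a = [{"emp_id": 1, "full_name": "Abebe Kebede", "salary_birr": 12000}]
source_b = [{"id": "2", "first": "Sara", "last": "Tesfaye", "salary": "15000.50"}]


def transform_a(row):
    """Map a row from source A to the unified schema."""
    return {
        "id": int(row["emp_id"]),
        "name": row["full_name"],
        "salary": float(row["salary_birr"]),
    }


def transform_b(row):
    """Map a row from source B to the unified schema."""
    return {
        "id": int(row["id"]),
        "name": f"{row['first']} {row['last']}",
        "salary": float(row["salary"]),
    }


def integrate():
    """Extract from both sources, transform to one schema, load into one list."""
    unified = [transform_a(row) for row in source_a]
    unified += [transform_b(row) for row in source_b]
    return sorted(unified, key=lambda row: row["id"])


employees = integrate()
```

After integration every row has the same three fields (id, name, salary), regardless of which source it came from.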
1.3.5. Sorting, Updating, Exporting and Converting Data
Sorting data
Sorting data is the process of arranging items into a meaningful order so that you can analyze them more effectively.
Example:
sort text data into alphabetical order
sort numeric data into numerical order
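Both kinds of sorting can be sketched with Python's built-in sorted function. The sample values are invented for illustration.

```python
names = ["Sara", "abebe", "Hana"]
numbers = [25.5, 4, 565, 10]

# Alphabetical order, ignoring upper/lower case.
alphabetical = sorted(names, key=str.casefold)

# Numerical order, smallest first and largest first.
ascending = sorted(numbers)
descending = sorted(numbers, reverse=True)

# Records can be sorted by any one field.
records = [{"name": "Sara", "age": 30}, {"name": "Abebe", "age": 25}]
by_age = sorted(records, key=lambda record: record["age"])
```

Without the key argument, the text sort would compare upper-case letters before lower-case ones and put "abebe" last.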
Updating Data
The modification of data that is already in the database is referred to as updating. The update operation allows you
to change an existing database record in a logical or physical file. You can update individual rows, a subset of rows,
or all the rows in a table. Each column can be updated separately without affecting other columns.
UPDATE table_name
SET column1 = value1, column2 = value2, ...
WHERE some_column = some_value;
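The UPDATE statement above can be tried out with Python's built-in sqlite3 module. This is a minimal sketch; the products table and its rows are invented for illustration.

```python
import sqlite3

connection = sqlite3.connect(":memory:")
connection.execute(
    "CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL)"
)
connection.executemany(
    "INSERT INTO products (id, name, price) VALUES (?, ?, ?)",
    [(1, "Pen", 5.0), (2, "Book", 50.0), (3, "Bag", 300.0)],
)

# Update one column in one row; the WHERE clause selects which row changes.
cursor = connection.execute("UPDATE products SET price = ? WHERE id = ?", (6.0, 1))
rows_changed = cursor.rowcount

# Without a WHERE clause, every row in the table is updated.
connection.execute("UPDATE products SET price = price * 2")
connection.commit()

# Read the result back as a name -> price mapping.
prices = dict(connection.execute("SELECT name, price FROM products"))
```

The first statement changes only the row with id 1 and leaves the name column untouched; the second doubles the price in all three rows.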
To perform an update, you need three pieces of information:
1. The name of the table and column to update,
2. The new value of the column,