Case Study - Data Stage
Case Study - Data Stage
With emerging competition in the telecom sector they faced with the problems like high
attrition rate of employees, churn of customers etc. They employed a consultancy farm which has come up with a suggestion to
implement data warehousing solution.
They presented the following high level design.
Extraction
EMP
database
Oracle
Transformation
D
A
T
A
S
T
A
G
E
Loading
DA
TA
TERA DATA
S
TA
GE
Data Mart
in Oracle
Data Mart
in Oracle
Extraction
Format Check
Rejection(File & Table Record)
Staging
Dimensional Model
Loading
Data Mart
Report
Considerations were made to acquire employee data oracle database from the tables like TALENT_ACQUISITION, EMP_DETAILS,
EMP_LEAVE, PROJECT_DETAILS, EMP_ALLOCATION, EMP_PERFORMANCE, EMP_SALARY, TALENT_DEVELOPMENT, EMP_TRAINING
and EMP_TERMINATION.
CDR data will come in the format of flat file with comma separated and with Unix new line character as record separated
Customer data will come in the format of flat file with comma separated and with Unix new line character as record separated
ABC Company was recommended to use IBM IIS data stage tool to do the ETL job.
Teradata Database will be in 3NF data Model.
Data marts will be added for reporting purpose.
DATE_TO_FULLFIL
COST_OF_RECRUITMENT
SKILL_SET_REQ
MIN_EDU_QUALIFICATION
EXPERIENCE_REQ
EMP_DETAILS
EMP_ID
NAME
ADDRESS
CONTACT_NO
EMAIL_ID
SEX
DATE_OF_BIRTH
EMP_LEAVE
EMP_ID
ENTILED_LEAVE
LEAVE_PER_MONTH
LEAVE_TAKEN
CL
SL
PROJECT_DETAILS
PROJECT_ID
PROJECT_NAME
PROJECT_START_DATE
PROJECT_END_DATE
PROJECT_MANAGER
WORK_EXP
DATE_OF_JOINING
EMP_ALLOCATION
EMP_ID
PROJECT_ID
DAYS_WORKED
ALLOCATION_START_DT
ALLOCATION_ED_DT
EMP_PERFORMANCE
EMP_ID
PROJECT_ID
INDIVITUAL_GOAL
PROJECT_GOAL
SELF_RATING
MANAGER_RATING
ACTUAL_PERFORMANCE_RATING
EMP_SALARY
EMP_ID
SALARY
BANK_ACC_NO
SAL_DATE
VAR_PAY
BASIC
LOP_DAY
TALENT_DEVELOPMENT
TRAINING_ID
TRAINING_NAME
TRAINING_START_DATE
TRAINING_END_DATE
NO_OF_DAYS
EMP_TRAINING
EMP_ID
TRAINING_ID
TRAINER_NAME
NO_OF_DAYS
TRAINING_FEEDBACK
TRAINER_NAME
TRAINING_COST_TYPE
TRAINING_COST
EMP_TERMINATION
EMP_ID
TERM_DATE
EMP DATABASE
Table Structure.txt
REASON
REMARKS