Srikanth
Srikanth
Senarios-1
1. I have a flat file, in which I have two fields, emp_id, emp_name.
The data is like this-
emp_id, emp_name
Soha101, ali101, kahn101 khan102,
Siva102, shanker102, reddy.
How to merge the names so that my output is like this
Emp_id Emp_name
101 Soha ali Kahn?
A:-In expression transformation use an output port ...in expression window write
emp_id||empname
2. How to join a Flat and Relational Source without using (Joiner, Update, Lookup)
Transformations... is it possible? If yes how?
A: - not possible
3. I have a source which relational, I am trying to populate to target flat file with one
column for daily date which is sysdate, I want to populate the sysdate coulmn with
DD/MM/YYYY format.
Kindly provide a solution for this. Clear that my target is flat file.
A: - In expression transformation create one out put port, and write like:
TO_DATE ('SYSDATE'.'DD/MM/YYYY').Connect this port to target.
A-In Informatica 8.6.0 multiple repositories can be created under node. The domain can have multiple nodes.
6. Router T/R is active but some people are saying some times passive what is the Reason behind that?
A: - First of all Every Active transformation is a Passive transformation, But every passive not Active.
In Router Transformation there is a special feature with Default group. Because of Default Group its passive.
We can avoid this Default group by some transformation Settings, Now It’s Active.
7. I want to run an informatica workflow after completion of oracle procedure. That
Procedure is not running through informatica and can be run at any Time in
Database. Informatica is in windows environment. Is it possible? If yes please
Explain?
A: - This can be possible with UNIX. Create a shell scripts which first has to Execute the
Stored procedure or Package and we have command to check the completion or
Procedure
After that use Pmcmd command in the same UNIX to start the workflow.
Informatica Senarios-2
8. in a single mapping, more than 500 sources (legacy, VSAM, relational) will be
loading into only one target. Whenever I retrieve the data (any record) from target, I
need to find the details that the record belongs to which source?
A: - After every Source qualifier transformation just keep an Expression with flag.Dont go
Single Source qualifier transformation. Keep 1 source qualifier Trans for one table.
11. If no. of source columns is changing every time (First time it is 10 next time it is
20 so on). How to deal with it without changing mapping?
A: - If I understand this question properly, it says that the no. of "Source" columns are changing. I do not agree with this scenario.
Probably in Data warehousing, you won't find such a design. As far as DWH is concerned, it takes the data from the OLTP systems
& after performing some operations (E-extract, T- transform) it finally loads the data in some targets. Here, as per the question, the
question itself arises for the OLTP design. No any OLTP (or database design principal) system suggests a varying number of
columns. So, please do not get confused by such trivial kind of questions.DWH is a much disciplined subject & it follows a very good
standards. Please go through the concepts first. You will get a clear picture of DWH then.
`13. Can we create index and drop index in existing table while using informatica?
A: - I know 4 ways in INFORMATICA 1) Source Analyzer window- (source table, Using key ports (enable, disable)).
2) Source qualifier Trans-(Sql override)
3) Target override
4) Pre sql, Post sql
14. If we are using an aggregator but forget to mention the group by port .what will
be the Output?
A: - If we miss to enable any of the port as GROUP BY, the aggregator will write the lat
row of
the table to the next transformation.
15. There are n numbers of flat file of exactly same format are placed in a folder. Can we load these flat file’s data one by one to a single
relational table by a Single session?
A: - Use source type as Indirect File Type and source file name as a file having the
Names of
All the n flat files to be read.
Informatica Senarios-3
15. Why do we go for update strategy TR in SCD rather using the session
Properties?
A: - Session Properties like pre Source Rows INSERT, UPDATE, REJECT, DELETE,
Using
Session Properties we can do single flow only .SCD applicable for Insert, Update at a
Time using Update Strategy Trans only. Using Update Trans we can create SCD
Mapping
16. How many mapplets u have created? And what is the logic used?
A: - We can create any No of Mapplets for 1 mapping. There is no limit for Mapplets.
Every mapplet can have a Logic or logics,,,,,, There is no limit for logics.
17. LOOKUP Condition is nothing but a Join condition? What type of join condition
it,
By Default? Using the Lookup Condition How many types of relational
conditions
We can make?
A: - as per my understanding. Lkp is always behave like left outer join. It will give you all
Matched records as well as unmatched records which are not present in base
table...and
Those unmatched records are will be null in case of unconnected lkp trn.....
A: - When we don't have primary keys defined on database level. And still we need
update on
This target from Informatica. We need to define keys at informatica level and use
Update
Override in target property. This way we can update the table.
20. from Source 100 rows are coming, on target there are 5 m rows which options
Is better to match data 1. Joiner 2 No cache 3. Static 4. Dynamic?
A: - Here we will use joiner for better performance. We will join the two sources making source table as master source. So only 100
comparisons will be done. So it will be very faster.
Whereas in static and dynamic we have to look up on the target which is very large 5m
Rows. So caching will take more time.
Informatica Senarios-4
22. What are set operators in Oracle?
A: - UNION, UNION ALL, MINUS and INTERSECT
23. How I can Schedule the Informatica job in "Unix Corn scheduling tool”?
A: - we can do this by using crontab file in UNIX, for this we need to schedule the Power
Centre job. or we can use "at" command in UNIX to schedule the job.
24. How can I generate Sequence Numbers to Target Table (with out using
Sequence Gen Trans, Rank Trans).
A: - Use database Sequence generator call this from stored procedure or dummy lookup
Query Or You can also use expression transformation. Create two ports one is
Variable
And assign it to 0 and another one is output port and Write the logic to increment it
(o_seq=v_Seq+1)
25. Can any one explain me step by step how scd will work?
Selects all rows. Caches the existing target as a lookup table. Compares logical key columns in the source against corresponding
columns in the target lookup table. Compares source columns against corresponding target columns if key columns match. Flags
new rows and changed rows. Creates two data flows: one for new rows, one for changed rows. Generates a primary key for new
rows. Inserts new rows to the target. Updates changed rows in the target, overwriting existing rows.
26. When we load flat files into target tables how do we identify duplicates? And where do load the duplicate records for further
reference? How do we do chage data capture? Is this slowly changing Dimension technique?
A: - I have an idea after sql transformation go thruogh 2 Agg Trans, 2 Router Trans
Agg1-gorup by col count=1 to router Trans
Agg2-group by col count<>1 to router Trans (I think “it will help u”)
From 2nd router Transformation we have a Separate Target Table.
Is this slowly changing Dimension technique?
Change data capture (CDC) Mean newly Inserts, Updates Based on Data loading
Time,
This Inserts, Updates only slowly changing Dimension technique.
27.I have table name called Team and I have name and DOJ in that table in oracle, when I retrieve the table in Informatica DOJ
shows with date and time , I want want to know is it possible to get only date(MMDDYYYY) in the date data type,
A: - TO_CHAR (DOJ_port, 'MMDDYYYY')
29. How can we load first and last record from a flat file source to target?
A: - After sql Trans, go with AGG, RANK transformation, in RankTrans Properties set the
Rank
1 only---1 row returns, In AggTrans Don’t do any column for Group--Last row returns,
We need 2 Target tables. 1 for 1st record,2 for last record, if u using the UNIONtrans
OneTarget table is enough,,,
Informatica Senarios-5
30. Diff B/W MAP Parameter, SESSION Parameter, Database connection session parameters? It’s possible to create 3parameters
at a time? If Possible which one will fire FIRST?
A: - we can pass all these three types of parameters by using Perameterfile.we can
declare all in one parameter file.
A mapping parameter is set at the mapping level for values that do not change from
session to session for example tax rates.
Session parameter is set at the session level for values that can change from sesion to
session, such as database connections for DEV, QA and PRD environments.
The database connection session parameters can be created for all input fields to
connection objects. For example, username, password, etc.
It is possible to have multiple parameters at a time?
The order of execution is wf/s/m.
31. How to run two workflow (not a sessions) sequentially, wha0t is the process?
A: - The best way is obviously to run WF1 and then call WF2 using PMCMD command in
the last session of WF1 (as a post session task).If you absolutely want to ensure that the
second wf starts only after graceful completion of wf1 then simply add a command task
for the pmcmd and use the piple to validate that the previous task is completed properly.
Or
We can run the workflow sequentially .for that we need to write a ksh shell scripts or
batch command and use cmd command
33. Which gives the more performance when compare to fixed width and delimited
File? And why?
A: - fixed width, because there are no delimiters to check so the performance will
increase.
35. How to extract original records at one target & Duplicate records at one target?
A:- Source -> SQ -> Sorter ->Expression ->Router(or 2 filters) ->Targets
37. Without using Lookup & Sequence Generator, How to generate Sequence?
39. How will you remove the duplicate records from flat file without using sorter?
A: - Use aggregator transformation and group by all ports and create one port for
checking
Count...and pass the results accordingly to target tables.
40. How to join the two flat files using the joiner t/r if there is no matching port?
A:-Connect the source Qualifier of two different flat files to two different Exp Trans. Create
a
Dummy output port in both the exp trans. then using that port connects the joiner Tran.
41. What is the difference between Oracle performance and Informatica
Performance? Which performance is better?
A:-oracle performance deals with the source &targets. Informatica performance deals with
the
Transformations. For efficiency result both are impotent...
43. Suppose you have 2000 records in one table and 12000 in another which one
you will consider as master and detail?
A:-We will consider the one with lesser number of records as master as with this
approach the
Data to be cached would have to be less and hence the performance can be improved.
46. This is a scenario in which the source has 2 cols -10 A, 10 A,20 C,30 D,40 E,20 C
And there should be 2 targets one to show the duplicate values and another
Target for distinct rows.
T1 T2
10 A 10 A
20 C 20 C
30 D 40 E which transformation can be used to load data into target?
Informatica Senarios-7
A:- 1.Sorce - Source qualifier – Target Check- Select distinct option
2.Source --Source qualifier- aggregator – Target group by-empno
3.Source --Source qualifier- sorter – Target Check- Select distinct option
4.Source -Source qualifier -Expression –Target -Source qualifier: check sorted by
Empno
Expression:-port expression
in_empno
var-empno=var_duplicate
var_duplicate=in_emp
flag= IIF(var_duplicate=var_emp,'Y','N')
5.Source -Source qualifier -Rank-Expression –Target
Dense Rank:
12<--2nd position
2<--3rd position
34Same ranks are assigned to same totals/numbers/names. The next rank follows the serial
Number.
49. What is the method of loading 5 flat files of having same structure to a single
Target and which transformations I can use?
Two Methods.
1.write all files in one directory then use file repository concept(don’t forget to type source
file type as indirect in the session).
2.use union t/r to combine multiple input files into a single target.
50. Supposesession is configured with commit interval of 10,000 rows and source
has 50,000 rows. Explain the commit points for Source based commit and Target
Based commit. Assume appropriate value wherever required.
Source based commit will commit the data into target based on commit interval.so,for
every 10,000 rows it will commit into target.
Target based commit will commit the data into target based on buffer size of the
target.i.e. it commits the data into target when ever the buffer fills.Let us assume that the
buffer size is 6,000.So,for every 6,000 rows it commits the data.
………..@...........