Interview Questions

Q: We need to find the sales difference between the current month and the previous month
for each store.

Here is the query to find the sales difference between the current month and the previous
month:

SELECT month, store, sales,
       sales - LAG(sales) OVER (PARTITION BY store ORDER BY month) AS Diff
FROM sales_table;

The LAG function allows us to access the previous row's sales value for the same store.
PARTITION BY store ensures we calculate the difference for each store separately. ORDER BY
month defines the order in which rows are processed to calculate the lag. Finally, we
subtract the lagged (previous) sales from the current sales to get the difference, so a
drop in sales gives a negative Diff. The first month of each store has no previous row, so
its Diff is NULL.

1. In case of a job failure in production, to whom will you report?

2. Suppose a job is already running and some source file is missing: will the job run
or not?

3. Suppose I have a parent table and a child table: which table is loaded to the target
first, parent or child? Or are both loaded to the target at the same time?

4. Informatica validates the mapping, but it fails when we run the session. What is the
option that we need to change to run this mapping?

Please provide answers, sir.

What is your role in your team?

What is lookup?
->Lookup transformation is used to look up a source, source qualifier, or target to
get the relevant data.
The Lookup transformation is used to retrieve data based on a specified
lookup condition

What are the issues you faced in your project?


->

What is fact and dimension table and difference?

Explain about dynamic lookup?


-> A dynamic lookup updates its cache while the session runs: as rows pass through,
new or changed rows are inserted or updated in the cache

Types of joins you used in your project?


->normal join and left join

Commands you used in SQL?


->

How will you remove duplicates from a table?


-> DELETE
FROM EMPLOYEES
WHERE ROWID NOT IN (
SELECT MIN(ROWID)
FROM EMPLOYEES
GROUP BY employee_id
)
(Oracle syntax: keeps one row per employee_id and deletes the other copies)
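
A minimal sketch of the "keep one row per key, delete the rest" idea, run against an in-memory SQLite database through Python's standard sqlite3 module. The table contents are made up for illustration, and SQLite's `rowid` stands in for Oracle's `ROWID`; this is an assumption for the demo, not the project's actual table.

```python
import sqlite3

# In-memory database with some duplicated employee rows (sample data is hypothetical).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (employee_id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [(1, "Asha"), (2, "Ravi"), (1, "Asha"), (3, "Meena"), (2, "Ravi")],
)

# Keep the first physical row of every employee_id and delete the other copies.
conn.execute(
    """
    DELETE FROM employees
    WHERE rowid NOT IN (
        SELECT MIN(rowid) FROM employees GROUP BY employee_id
    )
    """
)

remaining = conn.execute(
    "SELECT employee_id, name FROM employees ORDER BY employee_id"
).fetchall()
print(remaining)
```

A query with HAVING COUNT(employee_id) > 1 is still useful for finding which keys are duplicated, but deleting by employee_id alone would remove every copy, including the one that should stay.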

What do you mean by data warehouse concepts?


-> A data warehouse is a central repository of data that can be used for business
insight

Did you work on performance tuning? Explain it.


->

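What is the logic for SCD type 2?


->
1. Read the source in the source qualifier
2. Use a lookup on the target dimension, matching on the natural (business) key of
the current record
3. Use expression transformation to compare the source values with the looked-up
values and flag each record as new, changed or unchanged
4. Use router transformation to separate new records from changed records
5. Insert new records with a key from the sequence generator; for changed records, use
update strategy to expire the old version (end date / current flag) and insert the
new version as the current record

The sorter / expression / router flow is the one used to separate duplicates:
1. Read the source in the source qualifier
2. Sort the records in the sorter transformation
3. Use expression transformation to compare previous and current record, you
need to use variable logic. Flag the first record as distinct record, flag
following records as duplicate records
4. Use router transformation to route distinct records to distinct target, and
duplicate to duplicate target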
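
SCD type 2 keeps history: when a tracked attribute changes, the current dimension row is expired and a new version is inserted. The sketch below shows that logic with Python's sqlite3 module. The table `dim_customer`, its columns and the helper `apply_scd2` are hypothetical names chosen for illustration, not an Informatica API or the project's real schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE dim_customer (
        customer_sk INTEGER PRIMARY KEY AUTOINCREMENT,  -- surrogate key
        customer_id INTEGER,                            -- natural (business) key
        city        TEXT,                               -- tracked attribute
        start_date  TEXT,
        end_date    TEXT,
        is_current  INTEGER
    )
    """
)


def apply_scd2(conn, customer_id, city, load_date):
    """Insert a new customer, or expire the current version and add a new one."""
    # "Lookup" step: fetch the current version for this natural key, if any.
    row = conn.execute(
        "SELECT customer_sk, city FROM dim_customer "
        "WHERE customer_id = ? AND is_current = 1",
        (customer_id,),
    ).fetchone()
    # "Expression" step: compare incoming value with the current version.
    if row is not None and row[1] == city:
        return "unchanged"
    if row is not None:
        # "Update strategy" step: expire the old version.
        conn.execute(
            "UPDATE dim_customer SET end_date = ?, is_current = 0 "
            "WHERE customer_sk = ?",
            (load_date, row[0]),
        )
    # Insert the new (or first) version as the current record.
    conn.execute(
        "INSERT INTO dim_customer "
        "(customer_id, city, start_date, end_date, is_current) "
        "VALUES (?, ?, ?, NULL, 1)",
        (customer_id, city, load_date),
    )
    return "new" if row is None else "changed"


results = [
    apply_scd2(conn, 101, "Pune", "2024-01-01"),
    apply_scd2(conn, 101, "Pune", "2024-02-01"),
    apply_scd2(conn, 101, "Hyderabad", "2024-03-01"),
]
history = conn.execute(
    "SELECT customer_id, city, start_date, end_date, is_current "
    "FROM dim_customer ORDER BY customer_sk"
).fetchall()
print(results)
print(history)
```

After the three loads the dimension holds two rows for customer 101: the expired Pune version and the current Hyderabad version.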

Accenture Interview Questions

How do you remove duplicates if your source is a flat file

What is SCD Type 3 and difference between SCD Type 2 and SCD Type 3

How to load 100000 records into little chunks of 15000 and how will Informatica
identify that it should send first 15000 to one and next 15000 to next one

Explain Update Strategy Transformation

How will you concatenate using UNIX

*********************************************************************************
Tell me the most complex mapping which you have developed in your project?
->

We have implemented a framework job in our project.

This framework job is a consolidated workflow which contains all the stage, dimension
and fact sessions.

In all these sessions, we have parameterized the connections and session logs and
given the values in the parameter file.

We have scheduled and automated this framework job. It runs automatically every day
and loads the data from source to stage, stage to dimension and then to the fact table.

All the source to stage mappings are implemented using incremental load, the
dimensions are loaded using SCD type 2 logic, and then the data is loaded to the fact
table.

This is the most complex job which I have worked on in my project.
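
The incremental load from source to stage can be sketched as follows, assuming a simple "load only rows newer than what stage already has" rule. The tables `src_orders` and `stg_orders`, the `updated_at` column and the helper `incremental_load` are hypothetical; the real project may track the last load time in a mapping variable or a parameter file instead.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE src_orders (order_id INTEGER, amount REAL, updated_at TEXT);
    CREATE TABLE stg_orders (order_id INTEGER, amount REAL, updated_at TEXT);
    INSERT INTO src_orders VALUES (1, 10.0, '2024-01-01');
    INSERT INTO src_orders VALUES (2, 20.0, '2024-01-02');
    """
)


def incremental_load(conn):
    """Copy only source rows newer than the latest row already in stage."""
    last_loaded = conn.execute(
        "SELECT COALESCE(MAX(updated_at), '1900-01-01') FROM stg_orders"
    ).fetchone()[0]
    conn.execute(
        "INSERT INTO stg_orders "
        "SELECT order_id, amount, updated_at FROM src_orders "
        "WHERE updated_at > ?",
        (last_loaded,),
    )
    # Return the total number of rows now present in stage.
    return conn.execute("SELECT COUNT(*) FROM stg_orders").fetchone()[0]


stage_after_first_run = incremental_load(conn)   # loads the two existing rows
conn.execute("INSERT INTO src_orders VALUES (3, 30.0, '2024-01-03')")
stage_after_second_run = incremental_load(conn)  # loads only the new row
print(stage_after_first_run, stage_after_second_run)
```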


***************************************************************************

Interview questions - Tech Mahindra


Tell me about yourself?
About projects?
Features of source qualifier query?
-> We can write a custom SQL query

What is lookup?
-> Lookup is used to look up data from a source or target based on a specified
condition

Difference between connected and unconnected lookup?


->
What are dimension tables?
-> Fact: quantitative data; we can perform calculations on it
Dimension: descriptive data that gives information about the fact fields

What is star schema?


-> Star schema: a fact table surrounded by dimension tables.
Snowflake: a fact table surrounded by dimension tables, where a dimension table is
again connected to other dimension tables

How to delete duplicates in SQL?


-> DELETE
FROM EMPLOYEES
WHERE ROWID NOT IN (
SELECT MIN(ROWID)
FROM EMPLOYEES
GROUP BY employee_id
)

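What is a correlated subquery?


-> A correlated subquery is a subquery that refers to a column of a table in the
outer query (a table that is not in the subquery's own FROM clause), so its result
depends on the outer row being processed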
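
A small illustration of a correlated subquery, run with Python's sqlite3 module. The employees table and its sample rows are assumptions for the demo. The inner query uses `e.department_id` from the outer query, so the department average is worked out for the department of each outer row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employees (name TEXT, department_id INTEGER, salary INTEGER)"
)
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Asha", 10, 500), ("Ravi", 10, 300), ("Meena", 20, 400), ("John", 20, 450)],
)

# Employees who earn more than the average salary of their own department.
# The subquery references e.department_id, a column of the outer query's table.
above_avg = conn.execute(
    """
    SELECT e.name
    FROM employees e
    WHERE e.salary > (
        SELECT AVG(i.salary)
        FROM employees i
        WHERE i.department_id = e.department_id
    )
    ORDER BY e.name
    """
).fetchall()
print(above_avg)
```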

Difference between RANK and DENSE_RANK?


*************************************************

Interview questions - Broadridge Financial

1. Unix command to list files older than 500 days


-> find . -maxdepth 1 -type f -mtime +500

2. If the workflow rejected 50 records in production, what is your next step?


->
3. Source flat file to target flat file mapping with a simple expression; the workflow
is taking 15 hrs but it has to be reduced to 15 mins. What can you do?
-> First we need to look at the ETL side to see which part is taking the longest:
the source, the transformation or the target. We need to analyse the session log and
workflow log. We can use a sorter, we can eliminate duplicates, and if possible we can
try to reduce the transf

4. Max salary in each group


-> Select e.department_id,max(e.salary) from EMPLOYEES e
group by e.department_id

5. Persistent cache
-> The Integration Service (IS) saves the cache and reuses it in the next process run

6. Target override
-> Useful to update the target without using the update strategy transformation. You
can find the Target Update Override option in the target definition properties tab

7. Flat file name with today's date and timestamp


->
8. If a target has 14 records, the 15th record is similar to the 1st record and this
15th record should be rejected; how can you achieve this?
-> We will use an expression here. We will assign the input to a variable port
(v_port) and compare; if both are equal then that record will be rejected

Difference between file partition vs relational table ?


->Flat file databases are typically plain text files that store one record per
line, with record fields delimited by whitespace or a delimiting character. Flat
file databases can be read directly by a variety of software applications.

In addition to the data tables, relational databases use "indexes" to quickly find
records based on search criteria. Relational databases generally require a
relational database management system (RDBMS) to manage and access the data
***********************************************************************
Interview questions from ACS

How to generate dynamic parameter file?


->

What are the data models you worked on in your project?


-> Entity-Relationship (ER), Relational

Name 2 dimension and 2 fact tables in your project?


-> Dimension: Subject, Site. Fact: MH, AE, CM

Where will you implement incremental load?


-> In the source to stage mappings

How can you load the dimension tables and fact tables?


-> By incremental load and SCD type 2

Tell me the mapping flow in SCD type 2?


-> Expression, Sequence Generator, Router, Lookup, Update Strategy

How does lookup work?


-> Based on the given condition, it checks for the record in the source or target and
returns it

What is the use of cache? Types of caches?


-> Informatica queries the database, brings the whole set of rows to the Informatica
server and stores it in a cache file.
Types: data cache and index cache

How aggregator cache works?


->Index. Stores group values as configured in the group by ports.
Data. Stores calculations based on the group by ports.

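Command to delete duplicates in Unix?


-> uniq (it removes only adjacent duplicate lines, so sort first: sort file | uniq,
or sort -u file)

What is an index? Types of indexes?


-> An index allows faster retrieval of records. Types: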
Clustered Index
Non-Clustered Index
Column Store Index
Filtered Index
Hash Index
Unique Index

How to get 2nd highest salary?


-> select * from EMPLOYEES where salary = (select max(salary) from EMPLOYEES
where salary < (select max(salary) from EMPLOYEES));
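
Two common ways to get the second highest salary, sketched with Python's sqlite3 module: a nested MAX subquery and DENSE_RANK. The sample rows are made up, and the window functions need SQLite 3.25 or newer (an assumption about the environment). The last query also shows how RANK and DENSE_RANK differ when salaries tie.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?)",
    [("Asha", 900), ("Ravi", 900), ("Meena", 700), ("John", 700), ("Sara", 500)],
)

# Approach 1: the highest salary that is below the overall maximum.
by_subquery = conn.execute(
    """
    SELECT name, salary FROM employees
    WHERE salary = (
        SELECT MAX(salary) FROM employees
        WHERE salary < (SELECT MAX(salary) FROM employees)
    )
    ORDER BY name
    """
).fetchall()

# Approach 2: DENSE_RANK gives tied salaries the same rank without gaps,
# so rank 2 is always the second distinct salary.
by_dense_rank = conn.execute(
    """
    SELECT name, salary FROM (
        SELECT name, salary,
               DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
        FROM employees
    )
    WHERE rnk = 2
    ORDER BY name
    """
).fetchall()

# RANK leaves gaps after ties (1, 1, 3, 3, 5); DENSE_RANK does not (1, 1, 2, 2, 3).
ranks = conn.execute(
    """
    SELECT salary,
           RANK() OVER (ORDER BY salary DESC),
           DENSE_RANK() OVER (ORDER BY salary DESC)
    FROM employees
    ORDER BY salary DESC, name
    """
).fetchall()
print(by_subquery, by_dense_rank, ranks)
```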

How to get dept wise highest salary?


->Select e.department_id,max(e.salary) from EMPLOYEES e
group by e.department_id;
***********************************************************************************
************

Hi shammi, I attended the NTT Data interview but he didn't ask many questions. He
finished the interview within 5 mins maybe and I received no mail, so I think I got
rejected.
As far as I remember he asked:

When do we use bulk mode and normal mode, and what are the differences?


-> In normal load the database writes log entries for the rows being loaded, and in
bulk load the database log is bypassed. That is the reason bulk load loads the data
faster, but if anything goes wrong the data cannot be recovered. In normal load, since
the log is created, the lost data can be recovered.

Which is better, the LIKE operator or EXISTS?


-> If you need to check for existence of values in another table, the EXISTS
operator is preferred
-> LIKE is used to search for a specified pattern in a column

- Which has more preference in the source qualifier: SQL override or the other options?
->

- Difference between unique key and primary key


-> A primary key consists of one or more fields that uniquely identify each record in
a table; it cannot contain NULLs and a table can have only one. A unique key also
prevents two rows from having duplicate entries in a column, but it can allow NULLs
and a table can have several.

- What methodology do you use, agile or something else, and do you have sprints?
->
If we don't give any group by port, what happens in the aggregator transformation?
-> It returns only a single row, the last row, because there is no group by port to
group the data on

*****************************************************************

1) What was your role in your previous company? What did you do?
2) What type of requirements do you get and how do you implement them?
->
3) What was the City Bank requirement and what did you do?

4) What were the America health care requirements and what did you do in that?

5) Can you explain your project architecture?

6) What are the complex queries you have written?

7) What difficulties did you face in your project?

1. What is the size of your data warehouse?


Ans: The size is taken care of by my team lead, but to my knowledge it would be
approximately 25 to 30 GB

2. How many records will you get in your project?


In production, it depends on the table. I have seen a few tables having 2 million
records.
1. What type of data do you work with?
->
2. What are the developments you made?
->
3. What is the use of the source qualifier?
->
4. How does the Informatica tool work?

Q: We need to find the sales difference between the current month and the previous month?
month  store  sales  Diff
1      A      138    NA
2      A      134    -4
3      A      112    -22
4      A      132    20
5      A      110    -22
6      A      129    19
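
The Diff column above can be reproduced with the LAG window function. Below is a minimal sketch using Python's sqlite3 module, loading the six rows from the table into a table named sales_table. It assumes SQLite 3.25 or newer for window function support; in Oracle the same SELECT applies. Diff is current sales minus previous sales, and the first month has no previous row, so it comes back as NULL (None in Python), shown as NA in the table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_table (month INTEGER, store TEXT, sales INTEGER)")
conn.executemany(
    "INSERT INTO sales_table VALUES (?, ?, ?)",
    [(1, "A", 138), (2, "A", 134), (3, "A", 112),
     (4, "A", 132), (5, "A", 110), (6, "A", 129)],
)

# LAG(sales) returns the previous month's sales within the same store.
rows = conn.execute(
    """
    SELECT month, store, sales,
           sales - LAG(sales) OVER (PARTITION BY store ORDER BY month) AS diff
    FROM sales_table
    ORDER BY store, month
    """
).fetchall()

diffs = [row[3] for row in rows]
print(diffs)
```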

https://youtu.be/Iv9qBz-cyVA

ValueMomentum Interview Questions


----------------------------------------------------
Tell me about your roles and responsibilities.
How you do data validation after session is completed.

What is the flow of development in your project.

What is lookup transformation.

What is source qualifier transformation.

Disadvantages of source qualifier transformation.

How you know workflow is successfully completed or not.

Difference between static and dynamic lookup transformation.

How to generate key in target table without using sequence generator transformation.

How you performed testing in your project.

What is SCD type2.

What is dimensions.

How you write test cases.


---------------------------------------------------
SQL:
What is the difference between function and procedures.
How to calculate second highest salary from table.

How to remove duplicate records from table.

How to fetch record from a particular value of a column.


----------------------------------------------------
UNIX
How to remove a file.
How to create file.
How to compare content of a file.

https://youtu.be/Iv9qBz-cyVA

https://informaticareference.wordpress.com/category/oracle-interview-questions/page/12/

Deloitte 1st round interview questions

1. What is a surrogate key


->
2. What is a view, write syntax
->
3. What is rank function, write syntax
->
4. Explain about complex mapping in your project
->
5. Difference between data warehouse and data mart
->
6. Rank transformation
->
7. Where do we use SQL override
-> When the source is a database

8. Explain about SCD type 1 and SCD type 2


->
9. Difference between router and filter
->
10. What are facts and dimensions
->
11. How do you troubleshoot when a workflow has failed
-> We can see the session log and workflow log

12. Difference between session log and workflow log


-> A session log is created for each session and a workflow log is created for the
entire workflow

13. Explain about event wait task, mail task


-> Event wait: it will wait until the file comes to the specified location
Mail: we can send an email to the developer and team members whenever a session fails
or succeeds

14. How to update target table without primary column


->
15. Explain about normalizer transformation
16. Explain about command task
17. What is incremental aggregation
18. Difference between star schema and snowflake schema

Deloitte 2nd round interview questions


1. How to delete duplicate records by using SQL query
2. Explain about complex mapping in your project
3. Do you have knowledge of Unix
4. Do you have knowledge of tasks in Workflow Manager
5. Have you done any performance tuning in Informatica mappings
6. Difference between router and filter
7. Scenario based question on router (if we have source column a=100, in router we
have three groups like a>50, a>70, a>100,
so data will flow in how many groups)
8. What is pushdown optimization
9. Explain about normalizer transformation
10. Do you have knowledge of indexes
11. Do you know about performance tuning in the database
12. Explain about parameters at session level and mapping level
13. Router is active or passive transformation, explain why?
14. Do you know about migration

TCS Questions
Interview happened for an hour.

General Questions:

1. Tell me about yourself


2. Explain about your project flow.
3. Which methodology you are following, are you involved in requirements gathering?
4. How work will be assigned to you?
5. What are the sources used in your project?
6. Which schema are you using, explain it and Why that schema is used when we have
some other schemas?
7. How do you migrate code from Dev to higher environments(SIT, PROD) and process?
8. Approximate amount of data you're handling in PROD on daily basis?
9. What is datawarehouse and datamart?

Informatica:

1. What is surrogate key?


2. SQ transformation?
3. What if we define SQL Query and user defined in SQ, which one it will pick?
4. What is lookup and types.
5. Use of Unconnected lookup?
6. What is joiner transformation and why can't we use lookup in place of joiner?
7. Difference between mapping parameters and variables?
8. Explain about incremental load?
9. Some errors which you have faced and how you resolved them?
10. Difference between SCD1 and SCD2?
11. Explain about SCD2 mapping.
12. Source bottlenecks and how to reduce it?
Oracle:

1. How to find duplicate records in a table


2. Scenario: Based on the dept Id, after grouping I want to display the employee
details in a sequence manner(like for first group by set 1,2,3 and next group by
set 1,2,3..)
3. Views and its feature.

UNIX:

1. How to find 2nd row in a file.


2. How you run a workflow and how the workflows are being managed (Any scheduler or
manual?)
3. How to find the count of records in a file
4. How we will get to know whether previous command executed in a script is
successful or not.

TCS Managerial Round:

1. Why you want to switch?


2. If there is a requirement to support over weekend, where there is critical due
to timelines.. will you be able to support and manage?
3. What other technologies you know?
4. Are you comfortable if been placed in Big Data Management projects?

Interview Q
1) Joiner and Lookup difference, and can Joiner replace Lookup? If yes, in which
cases
2) 1 table has 3 records, i.e., A, B, C and the requirement is to move each row ten
times to each target table. How do you do it in the mapping?
3) Different types of caches and which transformations have caches
4) Normalizer Transformation
5) Transaction control Transformation
6) How to bring more than one column in unconnected lookup
8) Delete and truncate in Oracle
9) Dept wise max salary
10) How to check file size in Unix
11) How to move files from one environment to another Unix environment.

Mindtree

1 project
2 target load order
3 target dynamically
4 mapping variable and parameter
5 scd type 2 exp
6 md5 function
7 lookup caches
8 dimension types, what is junk dimension
9 how to return multiple lookup ports
10 have you done session partitioning
11 dense rank and rank
12 star or snowflake, which are you using and why
13 incremental load
14 which transformations are used in your project
15 how to populate source file name to target
16 complex scenario you faced in your project

Some questions are missing; they were totally on Informatica and Unix: pmcmd, cp and mv

Impetus
1 project
2 views and mvs
3 have you worked on views
4 type of data you are getting for your project
5 if we have 10 records, in how many ways can we filter
6 how will you update records in target
7 generate flat file target dynamically
8 every day you are getting a new file; how will you process it, and after the file is
used what will you do with that file
9 explain a mapping you developed from scratch
10 how can we convert rows into columns
11 Normalizer transformation
12 Update strategy transformation
How will you update without update strategy
13 in how many ways can we override a query

-------
SQL
1 analytical function
gave a table, get salary in asc order
rank and dense rank example, difference

2 joins: they gave two tables and asked to write join queries for them

3. if null is populating in a particular record, like the 3rd row dept name, how will
you replace that with xyz

Interview Questions faced:


1) Tell about your project and what is the DW you have used in your project.
2) In the project, what is the data you have received.
3) Table names you have used in the project
4) SCD type 2
5) Active and passive transformation
6) Sorter is which type of transformation and why?
7) Performance tuning techniques used in project
I was in a blind spot while answering the first 3 questions. Though I tried, he
could make out that I am not a real-time guy. Got a question on the company business
details of the 2nd project placed in the resume.

Hi All,
Can anyone list the 4 to 5 minimum things we need to know in Unix so that it will be
useful for all?
1) Since Informatica PC is installed on Unix, how do we open it?
Ans: Is it through the PMCMD command?

2) We will use Unix only if the source is a flat file.


Ans: Yes. If the answer is no, can anybody tell me what other things we do?
3) How do we open a file in Unix?
Ans: Directly through WinSCP, or do we have any other channel?
4) For adding strings to a file, do we use the command below?
Ans: SED command
5) What are the bare minimum top 3 to 4 other commands we use in UNIX so that we can
mention them in an interview?

1) Is this question about how to open the PowerCenter clients? If yes, we have to
install the client on our Windows machine and configure the domain/repository.
PMCMD is used for executing the workflow; it works on both Unix and Windows.
2) No, it purely depends on the project/client data flow. Ex: if the data is
processed by some other application like an ERP system (SAP), those programs can
only write the data into files, not directly into DB tables. We use the data
generated by those other applications for analysis. To analyse the data we need to
send it to the DW in the form of DB tables, and from there the reporting and other
analysis will begin.
3) WinSCP is a GUI client to connect to Unix/Windows machines; the other way to open a
file in Unix is the vi editor (learn more about the vi command).
5) Learn more about the text processing commands; as an Informatica developer it is
important to know about (grep, sed, awk, cp, mv, chmod, vi, cat, more, head, tail, etc.)

Today I gave an interview for Infosys


He has 15 yrs of exp in ETL
He asked just 8 questions
1. Tell me about yourself
2. Mapping parameter and variable
3. About reusable transformations and mapplets
4. What is agile
5. What is a scrum call
6. What is a sprint
7. What is a sprint retrospective meeting
8. How will you give updates in the status call

UST Global
1. Tell me about your project
2. Active and passive transformations
list some active and passive
3. SCD type explanation and flow of data
4. How will you do error handling
5. Performance tuning and bottlenecks handled in your project
6. SQL query to fetch 2nd highest salary
decode and case
case statement and decode statement example
7. Delete duplicates in table

These questions I explained very well, but they asked some basics:
1. Which location are you working in
2. Which branch of TCS
I didn't prepare well so I got caught; don't make this mistake 😇
I didn't prepare these simple questions well, so be prepared for them when going as fake

Wipro questions
Performance tuning at Informatica level and database level.

SCD 2
Lookup cache
Transaction control
Update strategy
Parameter variable
Parameter file.
How to pass values between sessions in a workflow.
Dense rank.
Department wise second highest
Project explanation.
Roles and responsibilities
