Abintio 2
Abintio 2
file?
Ans) Deflate or compress are the components that can be used
to lower the size of the file.
Shiju
Q2) Can a graph be infinitely run? If yes how?
Ans) A graph can run infinitely by call the .ksh in the end of the
script.
Q3) What meaning has lock in abinitio?
Ans) A graph must be locked in order to give permission to the
developers to edit the object if needed.For eg if any other
developer want to make change in the same object then he ll
get warn that this graph has already been locked by some
other user.This is basically for protection mechanism.
Q4)What is EME?
Ans) EME stands for Enterprise meta environment.It is
basically repository to store all the objects.It is also called
as version controller.It keep track of graphs or other objects.
Q5)What role does xfr plays in Abinitio?
Ans) XFR is basically used to store the mapping.It is useful
because rewriting the code takes time and xfr saves that
efforts.
Q6)What is the difference between phase and checkpoint?
Ans) Phase basically deletes the intermediate file(temporary
files) before a new phase begins which is quite different from
checkpoint.Checkpoint keeps the temporary files till the end of
the graph hence it can start from the last good process.
Q7) How much memory do we need for a graph?
Ans)Some calculations lead to 8 MB plus.MAX_CORE and phase
size of the file.
Q8)How the term Standard environment can be defined?
Ans) The term standard environment is basically used when it
include more than one project i.e private and public.
Q9)What is the difference between DB config and cfg?
Ans) Similarity between both is that they both used in
database connectivity.The difference is that cfg used in
Departition Components
Departition components combine the multiple flow partitions of
data records into single flow as follows:
Concatenate :
Concatenate appends multiple flows of data records one after
the other.
1)It reads all the records from the in port and copies them to
the out port.
2)After reading all the records it will read the records from the
second flow in port and append it after the first flow data
records.
Gather:
Gather combines the data records from multiple flow partitions
arbitarly .
Not key-based
Result ordering is unpredictable
Has no affect on the upstream processing
Most useful method for efficient collection of data from
multiple flows.
Multiple partitions and for repartitioning
Used most frequently.
Merge:
Key-based.
Result ordering is sorted if each input is sorted.
Possibly synchronizes pipelined computation.
May even serialize.
Useful for creating ordered data flows.
Other
than
the
Gather
,
the
Merge
other departitioner of choice.
is
the