BASIC Transformer Stage
BASIC Transformer Stage
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
category in the tool palette in the Transformer shortcut container. The BASIC Transformer stage is similar in appearance and function to the Transformer stage described in Transformer stage. It gives access to BASIC transforms and functions (BASIC is the language supported by the server engine and available in server jobs). For a description of the BASIC functions available see InfoSphere DataStage Server Job Developer Guide. You can only use BASIC transformer stages on SMP systems (not on MPP or cluster systems). Note: If you encounter a problem when running a job containing a BASIC transformer, you could try increasing the value of the DSIPC_OPEN_TIMEOUT environment variable in the Parallel Operator specific category of the environment variable dialog box in the DataStage Administrator (see InfoSphere DataStage Administrator Client Guide). BASIC Transformer stages can have a single input and any number of outputs. BASIC Transformer stage: fast path BASIC Transformer editor components BASIC Transformer stage basic concepts Editing BASIC transformer stages The InfoSphere DataStage expression editor BASIC Transformer stage properties Parent topic: Processing Data
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
2 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Ensure that you have column metadata defined for each of the output links. The easiest way to do this is to drag columns across from the input link. Define the derivation for each of your output columns. You can leave this as a straight mapping from an input column, or explicitly define an expression to transform the data before it is output. Optionally specify a constraint for each output link. This is an expression which input rows must satisfy before they are output on a link. Rows that are not output on any of the links can be output on the otherwise link. Optionally specify one or more stage variables. This provides a method of defining expressions which can be reused in your output columns derivations (stage variables are only visible within the stage). Parent topic: BASIC Transformer stage
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
3 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Paste Find/replace Load column definition Save column definition Column auto-match Input link execution order Output link execution order Parent topic: BASIC Transformer editor components
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
4 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
If you select a link in the link area, its metadata tab is brought to the front automatically. You can edit the grids to change the column meta data on any of the links. You can also add and delete metadata. Parent topic: BASIC Transformer editor components
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
5 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
You can specify such an operation by entering an expression or by selecting a transform to apply to the data. InfoSphere DataStage has many built-in transforms, or you can define your own custom transforms that are stored in the Repository and can be reused as required. The source of an output link column is defined in that column's Derivation cell within the Transformer Editor. You can use the Expression Editor to enter expressions or transforms in this cell. You can also simply drag an input column to an output column's Derivation cell, to pass the data straight through the BASIC Transformer stage. In addition to specifying derivation details for individual output columns, you can also specify constraints that operate on entire output links. A constraint is a BASIC expression that specifies criteria that data must meet before it can be passed to the output link. You can also specify a reject link, which is an output link that carries all the data not output on other links, that is, columns that have not met the criteria. Each output link is processed in turn. If the constraint expression evaluates to TRUE for an input row, the data row is output on that link. Conversely, if a constraint expression evaluates to FALSE for an input row, the data row is not output on that link. Constraint expressions on different links are independent. If you have more than one output link, an input row might result in a data row being output from some, none, or all of the output links. For example, if you consider the data that comes from a paint shop, it could include information about any number of different colors. If you want to separate the colors into different files, you would set up different constraints. You could output the information about green and blue paint on LinkA, red and yellow paint on LinkB, and black paint on LinkC. When an input row contains information about yellow paint, the LinkA constraint expression evaluates to FALSE and the row is not output on LinkA. However, the input data does satisfy the constraint criterion for LinkB and the rows are output on LinkB. If the input data contains information about white paint, this does not satisfy any constraint and the data row is not output on Links A, B or C, but will be output on the reject link. The reject link is used to route data to a table or file that is a "catch-all" for rows that are not output on any other link. The table or file containing these rejects is represented by another stage in the job design. Parent topic: BASIC Transformer stage basic concepts
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
7 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
You can specify routines to be executed before or after the stage has processed the data. For example, you might use a before-stage routine to prepare the data before processing starts. You might use an after-stage routine to send an electronic message when the stage has finished. Parent topic: BASIC Transformer stage basic concepts
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
8 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
in five ways. Editing multiple derivations Specifying before-stage and after-stage subroutines Defining constraints and handling reject links You can define a constraint to define limits for output data. You can also specify reject links. Specifying link order You can specify links to be in a particular order. Defining local stage variables You can declare a stage variable. Parent topic: BASIC Transformer stage
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Using drag-and-drop
Many of the BASIC Transformer stage edits can be made simpler by using the Transformer Editor's drag-and-drop functionality.
Procedure
1. Click the source cell to select it. 2. Click the selected cell again and, without releasing the mouse button, drag the mouse pointer to the desired location within the target link. An insert point appears on the target link to indicate where the new cell will go. 3. Release the mouse button to drop the selected cell.
Results
You can drag multiple columns or derivations. Use the standard Explorer keys when selecting the source column cells, then proceed as for a single cell. You can drag the full column set by dragging the link title.
9 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
You can add a column to the end of an existing derivation by holding down the Ctrl key as you drag the column. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
To use the find/replace facilities, do one of the following: Click the find/replace button on the toolbar Choose find/replace from the link shortcut menu Type Ctrl-F The Find and Replace dialog box appears. It has three tabs: Expression Text. Allows you to locate the occurrence of a particular string within an expression, and replace it if required. You can search up or down, and choose to match case, match whole words, or neither. You can also choose to replace all occurrences of the string within an expression. Columns Names. Allows you to find a particular column and rename it if required. You can search up or down, and choose to match case, match the whole word, or neither. Expression Types. Allows you to find the next empty expression or the next expression that contains an error. You can also press Ctrl-M to find the next empty expression or Ctrl-N to find the next erroneous expression. Note: The find and replace results are shown in the color specified in Tools > Options. Press F3 to repeat the last search you made without opening the Find and Replace dialog box.
10 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Select facilities
If you are working on a complex job where several links, each containing several columns, go in and out of the Transformer stage, you can use the select column facility to select multiple columns. This facility is also available in the Mapping tabs of certain Parallel job stages.
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
11 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
12 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Editor. Select the tab for the link meta data that you want to edit, then use the standard InfoSphere DataStage edit grid controls. The meta data shown does not include column derivations since these are edited in the links area. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
13 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
button in the toolbar. Column auto-match facility Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Procedure
1. Do one of the following: Click the Auto-match button in the Transformer Editor toolbar. Choose Auto-match from the input link header or output link header shortcut menu. TheColumn Auto-Match dialog box appears. 2. Choose the input link and output link that you want to match columns for from the drop down lists. 3. Click Location match or Name match from the Match type area. If you choose Location match, this will set output column derivations to the input link columns in the equivalent positions. It starts with the first input link column going to the first output link column, and works its way down until there are no more input columns left. 4. Click OK to proceed with the auto-matching. Note: Auto-matching does not take into account any data type incompatibility between matched columns; the derivations are set regardless. Parent topic: Defining output column derivations
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's
14 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Whole expression
With this option the whole existing expression for each column is replaced by the replacement value specified.
15 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
For example, when adding a trim() call around each expression of the currently selected column set, having selected the required columns, you can use the following procedure.
Procedure
1. Select the Whole expression option. 2. Enter a replacement value of:
trim($1)
3. Click OK
Results
Where a column's original expression was:
DSLink3.col1
This is applied to the expressions in each of the selected columns. If you need to include the actual text $1 in your expression, enter it as "$$1". Parent topic: Editing multiple derivations
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Part of expression
With this option, only part of each selected expression is replaced rather than the whole expression. The part of the expression to be replaced is specified by a Regular Expression match.
16 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Expression specified must have round brackets around its value. "$1" in the replacement value will then represent that matched text. If the Regular Expression is not surrounded by round brackets, then "$1" will simply be the text "$1". For complex Regular Expression usage, subsets of the Regular Expression text can be included in round brackets rather than the whole text. In this case, the entire matched part of the original expression is still replaced, but "$1", "$2" etc can be used to refer to each matched bracketed part of the Regular Expression specified. The following is an example of the Part of expression replacement. Suppose a selected set of columns have derivations that use input columns from `DSLink3'. For example, two of these derivations could be:
DSLink3.OrderCount + 1 If (DSLink3.Total > 0) Then DSLink3.Total Else -1
You might want to protect the usage of these input columns from null values, and use a zero value instead of the null. Use the following procedure to do this.
Procedure
1. Select the columns you want to substitute expressions for. 2. Select the Part of expression option. 3. Specify a Regular Expression value of:
(DSLink3\.[a-z,A-Z,0-9]*)
Results
From the examples above:
DSLink3.OrderCount + 1
would become
NullToZero(DSLink3.OrderCount) + 1
and
If (DSLink3.Total > 0) Then DSLink3.Total Else -1
would become:
17 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
If the Replace all occurrences option is selected, the second expression will become:
If (NullToZero(DSLink3.Total) > 0) Then NullToZero(DSLink3.Total) Else -1
The replacement value can be any form of expression string. For example in the case above, the replacement value could have been:
(If (StageVar1 > 50000) Then $1 Else ($1 + 100))
would become:
(If (StageVar1 > 50000) Then DSLink3.OrderCount Else (DSLink3.OrderCount + 100)) + 1
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
18 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
If you choose a routine that is defined in the Repository, but which was edited but not compiled, a warning message reminds you to compile the routine when you close the Transformer stage dialog box. If you installed or imported a job, the Before-stage subroutine or After-stage subroutine field might reference a routine that does not exist on your system. In this case, a warning message appears when you close the dialog box. You must install or import the "missing" routine or choose an alternative one to use. A return code of 0 from the routine indicates success, any other code indicates failure and causes a fatal error when the job is run. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
19 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
To catch rows which are rejected from a specific output link, set the Constraint field to linkname.REJECTED. This will be set whenever a row is rejected on the linkname link, whether because the row fails to match a constraint on that output link, or because a write operation on the target fails for that row. Note that such an otherwise link should occur after the output link from which it is defined to catch rejects. To catch rows which caused a write failures on an output link, set the Constraint field to linkname.REJECTEDCODE. The value of linkname.REJECTEDCODE will be non-zero if the row was rejected due to a write failure or 0 (DSE.NOERROR) if the row was rejected due to the link constraint not being met. When editing the Constraint field, you can set return values for linkname.REJECTEDCODE by selecting from the Expression Editor Link Variables > Constants... menu options. These give a range of errors, but note that most write errors return DSE.WRITERROR. In order to set a reject constraint which differentiates between a write failure and a constraint not being met, a combination of the linkname.REJECTEDCODE and linkname.REJECTED flags can be used. For example: To catch rows which have failed to be written to an output link, set the Constraint field to linkname.REJECTEDCODE To catch rows which do not meet a constraint on an output link, set the Constraint field to linkname.REJECTEDCODE = DSE.NOERROR AND linkname.REJECTED To catch rows which have been rejected due a a constraint or write error, set the Constraint field to linkname.REJECTED As a "catch all", the Constraint field can be left blank. This indicates that this otherwise link will catch all rows which have not been successfully written to any of the output links processed up to this point. Therefore, the otherwise link should be the last link in the defined processing order. Any other Constraint can be defined. This will result in the number of rows written to that link (that is, rows which satisfy the constraint) to be recorded in the job log as "rejected rows". Note: Due to the nature of the "catch all" case above, you should only use one reject link whose Constraint field is blank. To use multiple reject links, you should define them to use the linkname.REJECTED flag detailed in the first case above. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
20 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Procedure
1. Do one of the following: Click the output link execution order button on the Transformer Editor toolbar. Choose output link reorder from the background shortcut menu. Click the stage properties button in the Transformer toolbar or choose stage properties from the background shortcut menu and click on the stage page Link Ordering tab. The Link Ordering tab appears: 2. Use the arrow buttons to rearrange the list of links in the execution order required. 3. When you are happy with the order, click OK. Note: Although the link ordering facilities mean that you can use a previous output column to derive a subsequent output column, this is not recommended, and you will receive a warning if you do so. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
21 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Stage Variable button in the Transformer toolbar or choosing Stage Variable from the background shortcut menu. Note: Stage variables are not shown in the output link meta data area at the bottom of the right pane. The table lists the stage variables together with the expressions used to derive their values. Link lines join the stage variables with input columns used in the expressions. Links from the right side of the table link the variables to the output columns that use them.
Procedure
1. Do one of the following: Click the stage properties button in the Transformer toolbar. Choose stage properties from the background shortcut menu. The Transformer Stage Properties dialog box appears. 2. Click the Variables tab on the General page. The Variables tab contains a grid showing currently declared variables, their initial values, and an optional description. Use the standard grid controls to add new variables. Variable names must begin with an alphabetic character (a-z, A-Z) and can only contain alphanumeric characters (a-z, A-Z, 0-9). Ensure that the variable does not use the name of any BASIC keywords.
Results
Variables entered in the Stage Properties dialog box appear in the Stage Variable table in the links pane. You perform most of the same operations on a stage variable as you can on an output column (see Defining Output Column Derivations). A shortcut menu offers the same commands. You cannot, however, paste a stage variable as a new column, or a column as a new stage variable. Parent topic: Editing BASIC transformer stages
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
22 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Complete the names of frequently used variables Validate variable names and the complete expression The Expression Editor can be opened from: Output link Derivation cells Stage variable Derivation cells Constraint dialog box Transform dialog box in the Designer Expression format Entering expressions Completing variable names Validating the expression Exiting the expression editor There are a few ways in which you can exit the expression editor. Configuring the expression editor Parent topic: BASIC Transformer stage
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Expression format
The format of an expression is as follows:
KEY: something_like_this is a token something_in_italics is a terminal, that is, does not break down any further | is a choice between tokens [ is an optional part of the construction "XXX" is a literal token (that is, use XXX not including the quotes) ================================================= expression ::= function_call | variable_name | other_name | constant | unary_expression | binary_expression | if_then_else_expression | substring_expression | "(" expression ")"
23 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
function_call ::= function_name "(" [argument_list] ")" argument_list ::= expression | expression "," argument_list function_name ::= name of a built-in function | name of a user-defined_function variable_name ::= job_parameter name | stage_variable_name | link_variable name other_name ::= name of a built-in macro, system variable, and so on. constant ::= numeric_constant | string_constant numeric_constant ::= ["+" | "-"] digits ["." [digits]] ["E" | "e" ["+" | "-"] digits] string_constant ::= "'" [characters] "'" | """ [characters] """ | "\" [characters] "\" unary_expression ::= unary_operator expression unary_operator ::= "+" | "-" binary_expression ::= expression binary_operator expression binary_operator ::= arithmetic_operator | concatenation_operator | matches_operator | relational_operator | logical_operator arithmetic_operator ::= "+" | "-" | "*" | "/" | "^" concatenation_operator ::= ":" matches_operator ::= "MATCHES" relational_operator ::= " =" |"EQ" | "<>" | "#" | "NE" | ">" | "GT" | ">=" | "=>" | "GE" | "<" | "LT" | "<=" | "=<" | "LE" logical_operator ::= "AND" | "OR" if_then_else_expression ::= "IF" expression "THEN" expression "ELSE" expression substring_expression ::= expression "[" [expression ["," expression] "]" field_expression ::= expression "[" expression "," expression "," expression "]" /* That is, always 3 args
Note: keywords like "AND" or "IF" or "EQ" might be in any case Parent topic: The InfoSphere DataStage expression editor
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Entering expressions
About this task
Whenever the insertion point is in an expression box, you can use the Expression Editor to suggest the next element in your expression. Do this by right-clicking the box, or by clicking the Suggest button to the right of the box. This opens the Suggest Operand or
24 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Suggest Operator menu. Which menu appears depends on context, that is, whether you should be entering an operand or an operator as the next expression element. You will be offered a different selection on the Suggest Operand menu depending on whether you are defining key expressions, derivations and constraints, or a custom transform. The Suggest Operator menu is always the same. Parent topic: The InfoSphere DataStage expression editor
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
25 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Within the Transformer Editor, the invalid expressions are shown in red. (In some cases this might simply mean that the expression does not meet the strict usage pattern rules of the server engine, but will actually function correctly.) Parent topic: The InfoSphere DataStage expression editor
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
26 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
27 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
28 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
BASIC Transformer stage: Partitioning tab Parent topic: BASIC Transformer stage properties
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
29 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
number generator. Round Robin . The records are partitioned on a round robin basis as they enter the stage. Same. Preserves the partitioning already in place. DB2. Replicates the DB2 partitioning method of a specific DB2 table. Requires extra properties to be set. Access these properties by clicking the properties button. Range. Divides a data set into approximately equal size partitions based on one or more partitioning keys. Range partitioning is often a preprocessing step to performing a total sort on a data set. Requires extra properties to be set. Access these properties by clicking the properties button. The following Collection methods are available: (Auto). This is the default method for the Transformer stage. Normally, when you are using Auto mode, InfoSphere DataStage will eagerly read any row from any input partition as it becomes available. Ordered. Reads all records from the first partition, then all records from the second partition, and so on. Round Robin . Reads a record from the first input partition, then from the second partition, and so on. After reaching the last partition, the operator starts over. Sort Merge. Reads records in an order based on one or more columns of the record. This requires you to select a collecting key column from the Available list. The Partitioning tab also allows you to specify that data arriving on the input link should be sorted. The sort is always carried out within data partitions. If the stage is partitioning incoming data the sort occurs after the partitioning. If the stage is collecting data, the sort occurs before the collection. The availability of sorting depends on the partitioning method chosen. Select the check boxes as follows: Perform Sort. Select this to specify that data coming in on the link should be sorted. Select the column or columns to sort on from the Available list. Stable. Select this if you want to preserve previously sorted data sets. This is the default. Unique. Select this to specify that, if multiple records have identical sorting key values, only one record is retained. If stable sort is also set, the first record is retained. If NLS is enabled an additional button opens a dialog box allowing you to select a locale specifying the collate convention for the sort. You can also specify sort direction, case sensitivity, whether sorted as ASCII or EBCDIC, and whether null columns will appear first or last for each column. Where you are using a keyed partitioning method, you can also specify whether the column is used as a key for sorting, for partitioning, or for both. Select the column in the Selected list and right-click to invoke the shortcut menu. Parent topic: BASIC Transformer stage: Input page
30 of 31
9/18/2013 4:50 PM
http://pic.dhe.ibm.com/infocenter/iisinfsv/v8r7/advanced/print.jsp?topic=...
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
Feedback
Release date: 2011-10-01 PDF version of this information: IBM InfoSphere DataStage and QualityStage Parallel Job Developer's Guide
31 of 31
9/18/2013 4:50 PM