Content Installation Guide: Informatica Data Quality (Version 10.0)
Content Installation Guide: Informatica Data Quality (Version 10.0)
0)
This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://
www.stlport.org/doc/ license.html, http://asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://
httpunit.sourceforge.net/doc/ license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/release/
license.html, http://www.libssh2.org, http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, http://fusesource.com/downloads/licenseagreements/fuse-message-broker-v-5-3- license-agreement; http://antlr.org/license.html; http://aopalliance.sourceforge.net/; http://www.bouncycastle.org/licence.html;
http://www.jgraph.com/jgraphdownload.html; http://www.jcraft.com/jsch/LICENSE.txt; http://jotm.objectweb.org/bsd_license.html; . http://www.w3.org/Consortium/Legal/
2002/copyright-software-20021231; http://www.slf4j.org/license.html; http://nanoxml.sourceforge.net/orig/copyright.html; http://www.json.org/license.html; http://
forge.ow2.org/projects/javaservice/, http://www.postgresql.org/about/licence.html, http://www.sqlite.org/copyright.html, http://www.tcl.tk/software/tcltk/license.html, http://
www.jaxen.org/faq.html, http://www.jdom.org/docs/faq.html, http://www.slf4j.org/license.html; http://www.iodbc.org/dataspace/iodbc/wiki/iODBC/License; http://
www.keplerproject.org/md5/license.html; http://www.toedter.com/en/jcalendar/license.html; http://www.edankert.com/bounce/index.html; http://www.net-snmp.org/about/
license.html; http://www.openmdx.org/#FAQ; http://www.php.net/license/3_01.txt; http://srp.stanford.edu/license.txt; http://www.schneier.com/blowfish.html; http://
www.jmock.org/license.html; http://xsom.java.net; http://benalman.com/about/license/; https://github.com/CreateJS/EaselJS/blob/master/src/easeljs/display/Bitmap.js;
http://www.h2database.com/html/license.html#summary; http://jsoncpp.sourceforge.net/LICENSE; http://jdbc.postgresql.org/license.html; http://
protobuf.googlecode.com/svn/trunk/src/google/protobuf/descriptor.proto; https://github.com/rantav/hector/blob/master/LICENSE; http://web.mit.edu/Kerberos/krb5current/doc/mitK5license.html; http://jibx.sourceforge.net/jibx-license.html; https://github.com/lyokato/libgeohash/blob/master/LICENSE; https://github.com/hjiang/jsonxx/
blob/master/LICENSE; https://code.google.com/p/lz4/; https://github.com/jedisct1/libsodium/blob/master/LICENSE; http://one-jar.sourceforge.net/index.php?
page=documents&file=license; https://github.com/EsotericSoftware/kryo/blob/master/license.txt; http://www.scala-lang.org/license.html; https://github.com/tinkerpop/
blueprints/blob/master/LICENSE.txt; http://gee.cs.oswego.edu/dl/classes/EDU/oswego/cs/dl/util/concurrent/intro.html; https://aws.amazon.com/asl/; https://github.com/
twbs/bootstrap/blob/master/LICENSE; https://sourceforge.net/p/xmlunit/code/HEAD/tree/trunk/LICENSE.txt; https://github.com/documentcloud/underscore-contrib/blob/
master/LICENSE, and https://github.com/apache/hbase/blob/master/LICENSE.txt.
This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and Distribution
License (http://www.opensource.org/licenses/cddl1.php) the Common Public License (http://www.opensource.org/licenses/cpl1.0.php), the Sun Binary Code License
Agreement Supplemental License Terms, the BSD License (http:// www.opensource.org/licenses/bsd-license.php), the new BSD License (http://opensource.org/
licenses/BSD-3-Clause), the MIT License (http://www.opensource.org/licenses/mit-license.php), the Artistic License (http://www.opensource.org/licenses/artisticlicense-1.0) and the Initial Developers Public License Version 1.0 (http://www.firebirdsql.org/en/initial-developer-s-public-license-version-1-0/).
This product includes software copyright 2003-2006 Joe WaInes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this
software are subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab.
For further information please visit http://www.extreme.indiana.edu/.
This product includes software Copyright (c) 2013 Frank Balluffi and Markus Moeller. All rights reserved. Permissions and limitations regarding this software are subject
to terms of the MIT license.
See patents at https://www.informatica.com/legal/patents.html.
DISCLAIMER: Informatica LLC provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the implied
warranties of noninfringement, merchantability, or use for a particular purpose. Informatica LLC does not warrant that this software or documentation is error free. The
information provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and documentation is
subject to change at any time without notice.
NOTICES
This Informatica product (the "Software") includes certain drivers (the "DataDirect Drivers") from DataDirect Technologies, an operating company of Progress Software
Corporation ("DataDirect") which are subject to the following terms and conditions:
1. THE DATADIRECT DRIVERS ARE PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.
2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT,
INCIDENTAL, SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT
INFORMED OF THE POSSIBILITIES OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT
LIMITATION, BREACH OF CONTRACT, BREACH OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.
Part Number: DQ-CIG-10000-0001
Table of Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Resources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica My Support Portal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Product Availability Matrixes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Informatica Web Site. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica How-To Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Support YouTube Channel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Marketplace. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Velocity. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Informatica Global Customer Support. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Table of Contents
Alias Street. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
Casing Style. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
Country of Origin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
Country Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
Default Country. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Dual Address Priority. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Element Abbreviation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Execution Instances. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Flexible Range Expansion. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Geocode Data Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Global Max Field Length. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Input Format Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Input Format With Country . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Line Separator. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Matching Alternatives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Matching Extended Archive. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Matching Scope. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Max Result Count. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Mode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Optimization Level. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Output Format Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Output Format With Country. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Preferred Language. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Preferred Script. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Ranges To Expand. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Standardize Invalid Addresses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Tracing Level. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Address Reference Data File Status. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Table of Contents
Preface
The Informatica Content Installation Guide is written for Informatica administrators who are responsible for
installing prebuilt rules and reference data to Informatica Data Quality.
Informatica Resources
Informatica My Support Portal
As an Informatica customer, the first step in reaching out to Informatica is through the Informatica My Support
Portal at https://mysupport.informatica.com. The My Support Portal is the largest online data integration
collaboration platform with over 100,000 Informatica customers and partners worldwide.
As a member, you can:
Search the Knowledge Base, find product documentation, access how-to documents, and watch support
videos.
Find your local Informatica User Group Network and collaborate with your peers.
Informatica Documentation
The Informatica Documentation team makes every effort to create accurate, usable documentation. If you
have questions, comments, or ideas about this documentation, contact the Informatica Documentation team
through email at [email protected]. We will use your feedback to improve our
documentation. Let us know if we can contact you regarding your comments.
The Documentation team updates documentation as needed. To get the latest documentation for your
product, navigate to Product Documentation from https://mysupport.informatica.com.
Informatica Marketplace
The Informatica Marketplace is a forum where developers and partners can share solutions that augment,
extend, or enhance data integration implementations. By leveraging any of the hundreds of solutions
available on the Marketplace, you can improve your productivity and speed up time to implementation on
your projects. You can access Informatica Marketplace at http://www.informaticamarketplace.com.
Informatica Velocity
You can access Informatica Velocity at https://mysupport.informatica.com. Developed from the real-world
experience of hundreds of data management projects, Informatica Velocity represents the collective
knowledge of our consultants who have worked with organizations from around the world to plan, develop,
deploy, and maintain successful data management solutions. If you have questions, comments, or ideas
about Informatica Velocity, contact Informatica Professional Services at [email protected].
Preface
The telephone numbers for Informatica Global Customer Support are available from the Informatica web site
at http://www.informatica.com/us/services-and-training/support-services/global-support-centers/.
Preface
CHAPTER 1
Content Overview, 9
Content Overview
Informatica Data Quality and PowerCenter applications can use rules and reference data to improve data
accuracy and to standardize the appearance of data. Informatica uses the term content to collectively refer to
rules and reference data.
Informatica distributes the following types of content:
Accelerators
Accelerators are content bundles that contain rules, reference tables, content sets, demonstration
mappings, and demonstration data objects. Each accelerator provides solutions to common data quality
issues in a country, region, or industry. The Data Quality Content installer includes the Core accelerator,
which contains general data quality rules. You can purchase additional accelerators separately. For
more information about accelerators, see the Data Quality Accelerator Guide.
Address reference data files
Address reference data files contain information on all valid addresses in a country. The Address
Validator transformation uses address reference data to analyze the quality of the input data that you
select. The transformation compares the input data to the address reference data and fixes any error it
finds in the input data.
You purchase address reference data on an subscription basis. Informatica updates address reference
data files with new postal information at regular intervals. You can download the current address data
files at any time during your subscription period.
Identity population files
Identity population files contain metadata for personal, household, and corporate identities. Population
files also contain algorithms that apply the metadata to input data. The Match transformation and the
Comparison transformation use this data to parse potential identities from input fields.
The Content installer does not include address reference data files or identity population files. You purchase
this content separately. For address reference data, you purchase an annual subscription for a specific
country.
Use the Content installer executable files to install address reference data, identity population, and
accelerator demonstration data. Use Informatica Developer to import accelerator rules, demonstration
mappings, and reference tables to the Model repository and to write reference table data to the reference
data database.
Accelerator_Content
Accelerator_Sources
Installer
Accelerator_Content Directory
The Accelerator_Content directory contains the following Core accelerator components:
Accelerator XML file
The accelerator XML file contains metadata for Model repository objects such as rules, demonstration
mappings, reference data, and data objects. When you use the Developer tool to import the XML file, the
Developer tool adds the objects to the Model repository.
Reference data file
The reference data file .zip file contains multiple reference data files in comma-separated DIC format.
You use the Developer tool to import this .zip file as part of the accelerator XML import process. The
import process converts the reference data files to database tables in the reference data database and
writes metadata for the reference tables to the Model repository.
To use reference data or prebuilt rules in PowerCenter, export them as PowerCenter objects from the
Informatica Data Quality Model repository.
Accelerator_Sources Directory
The Accelerator_Sources directory contains the following Core accelerator component:
Demonstration data file
The demonstration data .zip file contains comma-separated data files that demonstration mappings use
as source data. You use the Content installer to install this .zip file.
Installer Directory
The Installer directory contains the following items:
Content installation files
Content installation files write reference data and data sources in the server directories on Windows and
UNIX platforms. There are GUI, console, and silent installers for each supported operating system. Each
content installer can also write address reference data and identity population files to the file system.
10
Description
Content_installer_server.exe
SilentInstall.bat
Use to run the content installer in silent mode, for example as part of a
scheduled process.
SilentInput.properties
Description
Content_installer_server.bin
SilentInstall.sh
Use to run the content installer in silent mode, for example as part of a
scheduled process.
SilentInput.properties
11
CHAPTER 2
Installing Content
This chapter includes the following topics:
Installation Overview, 12
Installation Prerequisites, 12
Installation Overview
Use Informatica Developer to import accelerator rules, demonstration mappings, and reference tables to the
Model repository and to write reference table data to the reference data database. Use the Content
installation files to install address reference data, identity populations, and accelerator demonstration data.
When you install address reference data files and identity population files, verify that the Integration Service
can access the machine to which you install the files. You install address reference data files and identity
population files to an Informatica domain. Rerun the installer to add files or update existing files.
You import a set of prebuilt Informatica rules or reference data files once to a Model repository and reference
data database. If more than one Developer tool or Analyst tool user imports the rules or data files, the data is
either overwritten each time or installed multiple times to different folders in the same system.
Note: You must install all accelerator reference data to a single project in the Model repository.
Installation Prerequisites
Complete or verify the following prerequisites before you install content.
General Prerequisites
You must install Informatica Data Quality or PowerCenter before you install content for each product.
You must know the paths to the files that you will install. You provide paths to compressed files and to
directories that contain compressed files.
12
To install address reference data, identity population data, or additional accelerators, purchase this content
separately.
Note: Do not select uncompressed files when you run the Content installer.
The Address Validator transformation reads address code lookup data when you configure the
transformation to run in address code lookup mode.
Batch data
Install batch data to perform address validation on a set of address records. Use batch data to verify that
the input addresses are fully deliverable and complete based on the current postal data from the national
mail carrier.
The Address Validator transformation reads batch data when you configure the transformation to run in
batch mode.
CAMEO data
Install CAMEO data to add customer segmentation data to residential address records. Customer
segmentation data indicates the likely income level and lifestyle preferences of the residents at each
address.
The Address Validator transformation reads CAMEO data when you configure the transformation to run
in batch mode or certified mode.
Installation Prerequisites
13
Certified data
Install certified data to verify that address records meet the certification standards that a mail carrier
defines. An address meets a certification standard if contains data elements that can identify a unique
mailbox, such as delivery point data elements. When an address meets a certification standard, the mail
carrier charges a reduced delivery rate.
The following countries define certification standards:
Australia. Certifies mail according to the Address Matching Approval System (AMAS) standard.
Canada. Certifies mail according to the Software Evaluation And Recognition Program (SERP)
standard.
France. Certifies mail according to the National Address Management Service (SNA) standard.
United States. Certifies mail according to the Coding Accuracy Support System (CASS) standard.
The Address Validator transformation reads batch data when you configure the transformation to run in
certified mode.
Geocode data
Install geocode data to add geocodes to address records. Geocodes are latitude and longitude
coordinates.
The Address Validator transformation reads geocode data when you configure the transformation to run
in batch mode or certified mode.
Note: Informatica provides different types of geocode data. If you need arrival point or parcel centroid
geocodes for addresses, you must purchase additional geocode data sets.
Interactive data
Install interactive data to find the complete valid address when an input address is incomplete or when
you are uncertain about the validity of the input address.
The Address Validator transformation reads interactive data when you configure the transformation to
run in interactive mode.
Suggestion list data
Install suggestion list data to find alternative valid versions of a partial address record. Use suggestion
list data when you configure an address validation mapping to process address records one by one in
real time. The Address Validator transformation uses the data elements in the partial address to perform
a duplicate check on the suggestion list data. The transformation returns any valid address that includes
the information in the partial address.
The Address Validator transformation reads suggestion list data when you configure the transformation
to run in suggestion list mode.
Supplementary data
Install supplementary data to add data to an address record that can assist the mail carrier in mail
delivery. Use the supplementary data to add detail about the geographical or postal area that contains
the address. In some countries, supplementary data can provide a unique identifier for a mailbox within
the postal system.
14
Do not run an address validation mapping or session while you install address reference data.
Informatica releases address reference data through its Address Doctor division. Address Doctor works
with national mail carriers to develop the address reference data. When a mail carrier updates its data
records with new information, Address Doctor adds the information to the address reference data files for
the country.
Address Doctor updates address reference data files several times each year. Informatica sends you a
monthly email to notify you that the latest updates are ready for download.
The Content installer writes the population files to the following directory on the Data Integration Service
machine:
[Informatica_installation_directory]/services/DQContent/INFA_Content/identity/default
Before you run the Content installer, verify that the /default/ directory is present. Before you create a
mapping that reads the population files, verify that the Reference Data Location property on the Content
Management Service specifies the parent directory for the /default/ directory.
The Content installer writes the population files to the following directory on the PowerCenter Integration
Service machine:
[Informatica_installation_directory]/services/DQContent/INFA_Content/identity/default
Installation Prerequisites
15
Before you run the Content installer, verify that the /default/ directory is present. Before you run a
session that reads the population files, verify that the IdentityReferenceDataLocation property in the
IDQTx.cfg file specifies the parent directory for the /default/ directory.
The PowerCenter installer writes the IDQTx.cfg file to the following path:
[Informatica_Installation_directory]/server/bin
Earlier versions of PowerCenter read the path to the population files from the SSAPR environment
variable. The PowerCenter Integration Service can read the location of the population files from the
IDQTx.cfg file or from the SSAPR environment variable. By default, the PowerCenter Integration Service
reads the location from the IDQTx.cfg file. If the IDQTx.cfg file does not specify the location, or if the file is
not present, the PowerCenter Integration Service reads the location from the SSAPR environment
variable.
The IDQTx.cfg file and the SSAPR environment variable specify the path to the parent directory of
the /default/ directory. The path does not include the /default/ directory name. The path cannot
contain character spaces.
You can use the current version of the population files with the current versions of Informatica Data
Quality and PowerCenter. To use the current population files with an earlier version of PowerCenter,
install the current version of the Data Quality Integration plug-in to PowerCenter.
Note: When you install the current plug-in on a PowerCenter machine, you cannot import objects from an
older Model repository to the PowerCenter repository. You can continue to use any data quality object that
you imported to the PowerCenter repository before you installed the current plug-in.
IBM DB2
Oracle
16
Verify that the database user account has CREATETAB and CONNECT privileges.
Verify that the database user has SELECT privileges on the SYSCAT.DBAUTH and
SYSCAT.DBTABAUTH tables.
Informatica does not support IBM DB2 table aliases for repository tables. Verify that table aliases have not
been created for any tables in the database.
Set the NPAGES parameter to at least 5000. The NPAGES parameter determines the number of pages in
the tablespace.
Verify that the database user account has CONNECT and CREATE TABLE privileges.
Verify that the database user account has CONNECT and RESOURCE privileges.
Informatica does not support Oracle public synonyms for repository tables. Verify that public synonyms
have not been created for any tables in the database.
2.
3.
4.
5.
17
Windows Installation
Follow these steps to install address reference data, identity population data, or demonstration data files on a
Windows machine.
1.
2.
3.
Enter the path to the root directory of the Informatica server installation. This may be a remote directory.
Browse to this directory if required.
4.
If you are installing address reference data files, enter the path to the server directory where the installer
will write these files.
Browse to this directory if required.
5.
Click Next.
6.
Browse to a compressed reference data file, or browse to a directory that contains reference data files,
and click Next.
You can specify multiple file and directory paths.
7.
8.
UNIX Installation
Follow these steps to install address reference data, identity population data, or demonstration data files on a
UNIX machine.
1.
2.
3.
4.
Enter the path to the root directory of the Informatica server installation.
5.
If you are installing address reference data files, enter the path to the server directory where the installer
will write these files.
6.
Enter the path to a compressed reference data file, or to a directory that contains reference data files.
You can enter multiple file and directory paths in a comma-separated list. Do not include spaces in the
list.
7.
8.
18
Silent Installation
You can run the Content installer in silent mode. You define the installation parameters in the
SilentInput.properties file in the Installer directory. You distribute the directory to any user who will run
the silent installer.
Users run the silent installer file from the Installer directory. On Windows systems, the silent installer file is
SilentInstall.bat. On UNIX systems, the silent installer file is SilentInstall.sh.
Follow these steps to prepare the properties file for silent installation:
1.
2.
Open SilentInput.properties.
3.
Set the following properties for the Informatica domain in which you will use the reference data:
4.
Property
Description
USER_INSTALL_DIR
USER_SELECTIONS
Comma-separated list of reference data files or directories. This list must not
contain spaces.
UID_EXTRACTION_FLAG
Determines if the installer will extract the reference data on the Content CD
image. Set to 1 to extract this data.
AV_EXTRACTION_FLAG
Determines if the installer will extract the address reference data files at the
location defined in AV_INSTALL_DIR. Set to 1 to extract this data.
AV_INSTALL_DIR
The path to the directory that contains the address reference data.
In the Developer tool, connect to the Model repository that contains the destination project for the
metadata.
2.
3.
4.
In the Import dialog box, select Informatica > Import Object Metadata File (Advanced).
5.
Click Next.
6.
Browse to the XML metadata file in the accelerator directory structure, and select the file.
7.
8.
In the Source pane, select the items that appear under the project node.
9.
19
10.
If the repository project contains an object that you want to add, the Developer tool prompts you to
merge the object with the current object. Click Yes to merge the objects.
If the Developer tool prompts you to rename the objects, click No.
If any object remains in the Source pane, use the pointer to move the object to the target project.
11.
Click Next.
12.
Browse to the compressed reference data file in the accelerator directory structure, and select the file.
13.
Click Open.
14.
15.
16.
Click Finish.
In the Developer tool, connect to the Model repository that contains the destination project for the
metadata.
2.
3.
4.
In the Import dialog box, select Informatica > Import Object Metadata File (Advanced).
5.
Click Next.
6.
Browse to the XML metadata file in the accelerator directory structure, and select the file.
7.
8.
In the Source pane, select the items that you want to update in the project. The items appear under the
project node.
9.
10.
11.
12.
13.
Click Next.
The Developer tool calculates the object dependencies.
20
14.
Click Next.
15.
Click Browse to add reference data. Find the compressed reference data file in the accelerator directory
structure, and select the file.
16.
Click Open.
17.
18.
Click the selection arrow in the Target Connection field, and select the reference data database.
19.
Click Finish.
21
CHAPTER 3
Post-Installation Overview, 22
Post-Installation Overview
After you install address reference data for Data Quality or PowerCenter, you must configure the address
reference data properties that the Integration Service uses when it runs an address validation mapping or
session.
You can also verify or edit address reference data settings in the Developer tool.
22
Description
License
License key to activate validation reference data. You might have more than one key, for
example, if you use batch reference data and geocoding reference data. Enter keys as a
comma-delimited list. The property is empty by default.
Reference Data
Location
Location of the address reference data files. Enter the full path to the files. Install all address
reference data files to a single location. The property is empty by default.
23
Property
Description
Full Pre-Load
Countries
List of countries for which all batch, CAMEO, certified, interactive, or supplementary
reference data is loaded into memory before address validation begins. Enter the threecharacter ISO country codes in a comma-separated list. For example, enter DEU,FRA,USA.
Enter ALL to load all data sets. The property is empty by default.
Load the full reference database to increase performance. Some countries, such as the
United States, have large databases that require significant amounts of memory.
Partial Pre-Load
Countries
List of countries for which batch, CAMEO, certified, interactive, or supplementary reference
metadata and indexing structures are loaded into memory before address validation begins.
Enter the three-character ISO country codes in a comma-separated list. For example, enter
DEU,FRA,USA. Enter ALL to partially load all data sets. The property is empty by default.
Partial preloading increases performance when not enough memory is available to load the
complete databases into memory.
No Pre-Load
Countries
Full Pre-Load
Geocoding
Countries
List of countries for which all geocoding reference data is loaded into memory before address
validation begins. Enter the three-character ISO country codes in a comma-separated list. For
example, enter DEU,FRA,USA. Enter ALL to load all data sets. The property is empty by
default.
Load all reference data for a country to increase performance when processing addresses
from that country. Some countries, such as the United States, have large data sets that
require significant amounts of memory.
Partial Pre-Load
Geocoding
Countries
List of countries for which geocoding reference metadata and indexing structures are loaded
into memory before address validation begins. Enter the three-character ISO country codes in
a comma-separated list. For example, enter DEU,FRA,USA. Enter ALL to partially load all
data sets. The property is empty by default.
Partial preloading increases performance when not enough memory is available to load the
complete databases into memory.
No Pre-Load
Geocoding
Countries
List of countries for which no geocoding reference data is loaded into memory before address
validation begins. Enter the three-character ISO country codes in a comma-separated list. For
example, enter DEU,FRA,USA. Default is ALL.
Full Pre-Load
Suggestion List
Countries
List of countries for which all suggestion list reference data is loaded into memory before
address validation begins. Enter the three-character ISO country codes in a commaseparated list. For example, enter DEU,FRA,USA. Enter ALL to load all data sets. The
property is empty by default.
Load the full reference database to increase performance. Some countries, such as the
United States, have large databases that require significant amounts of memory.
Partial Pre-Load
Suggestion List
Countries
List of countries for which the suggestion list reference metadata and indexing structures are
loaded into memory before address validation begins. Enter the three-character ISO country
codes in a comma-separated list. For example, enter DEU,FRA,USA. Enter ALL to partially
load all data sets. The property is empty by default.
Partial preloading increases performance when not enough memory is available to load the
complete databases into memory.
No Pre-Load
Suggestion List
Countries
24
List of countries for which no suggestion list reference data is loaded into memory before
address validation begins. Enter the three-character ISO country codes in a commaseparated list. For example, enter DEU,FRA,USA. Default is ALL.
Property
Description
Full Pre-Load
Address Code
Countries
List of countries for which all address code lookup reference data is loaded into memory
before address validation begins. Enter the three-character ISO country codes in a commaseparated list. For example, enter DEU,FRA,USA. Enter ALL to load all data sets. The
property is empty by default.
Load the full reference database to increase performance. Some countries, such as the
United States, have large databases that require significant amounts of memory.
Partial Pre-Load
Address Code
Countries
List of countries for which the address code lookup reference metadata and indexing
structures are loaded into memory before address validation begins. Enter the threecharacter ISO country codes in a comma-separated list. For example, enter DEU,FRA,USA.
Enter ALL to partially load all data sets. The property is empty by default.
Partial preloading increases performance when not enough memory is available to load the
complete databases into memory.
No Pre-Load
Address Code
Countries
List of countries for which no address code lookup reference data is loaded into memory
before address validation begins. Enter the three-character ISO country codes in a commaseparated list. For example, enter DEU,FRA,USA. Default is ALL.
Preloading
Method
Determines how the Data Integration Service preloads address reference data into memory.
The MAP method and the LOAD method both allocate a block of memory and then read
reference data into this block. However, the MAP method can share reference data between
multiple processes. Default is MAP.
Max Result
Count
Maximum number of addresses that address validation can return in suggestion list mode.
Set a maximum number in the range 1 through 100. Default is 20.
Memory Usage
Number of megabytes of memory that the address validation library files can allocate. Default
is 4096.
Max Address
Object Count
Maximum number of address validation instances to run at the same time. Default is 3. Set a
value that is greater than or equal to the Maximum Parallelism value on the Data Integration
Service.
Max Thread
Count
Maximum number of threads that address validation can use. Set to the total number of cores
or threads available on a machine. Default is 2.
Cache Size
Size of cache for databases that are not preloaded. Caching reserves memory to increase
lookup performance in reference data that has not been preloaded.
Set the cache size to LARGE unless all the reference data is preloaded or you need to reduce
the amount of memory usage.
Enter one of the following options for the cache size in uppercase letters:
- NONE. No cache. Enter NONE if all reference databases are preloaded.
- SMALL. Reduced cache size.
- LARGE. Standard cache size.
Default is LARGE.
SendRight
Report Location
Location to which an address validation mapping writes a SendRight report and any log file
that relates to the report. You generate a SendRight report to verify that a set of New Zealand
address records meets the certification standards of New Zealand Post. Enter a local path on
the machine that hosts the Data Integration Service that runs the mapping.
By default, address validation writes the report file to the bin directory of the Informatica
installation. If you enter a relative path, the Content Management Service appends the path to
the bin directory.
25
By default, the Content Management Service applies the ALL value to the options that indicate no data
preload. If you accept the default options, the Data Integration Service reads the address reference data
from files in the directory structure when the mapping runs.
The address validation process properties must indicate a preload method for each type of address
reference data that a mapping specifies. If the Data Integration Service cannot determine a preload policy
for a type of reference data, it ignores the reference data when the mapping runs.
The Data Integration Service can use a different method to load data for each country. For example, you
can specify full preload for United States suggestion list data and partial preload for United Kingdom
suggestion list data.
The Data Integration Service can use a different preload method for each type of data. For example, you
can specify full preload for United States batch data and partial preload for United States address code
data.
Full preload settings supersede partial preload settings, and partial preload settings supersede settings
that indicate no data preload.
For example, you might configure the following options:
Full Pre-Load Geocoding Countries: DEU
No Pre-Load Geocoding Countries: ALL
The options specify that the Data Integration Service loads German geocoding data into memory and
does not load geocoding data for any other country.
The Data Integration Service loads the types of address reference data that you specify in the address
validation process properties. The Data Integration Service does not read the mapping metadata to
identify the address reference data that the mapping specifies.
Alias Location
Determines whether address validation replaces a valid location alias with the official location name.
A location alias is an alternative location name that the USPS recognizes as an element in a deliverable
address. You can use the property when you configure the Address Validator transformation to validate
United States address records in Certified mode.
26
Description
Off
Official
Replaces any alternative location name or location alias with the official
location name. Default option.
Preserve
Alias Street
Determines whether address validation replaces a street alias with the official street name.
A street alias is an alternative street name that the USPS recognizes as an element in a deliverable address.
You can use the property when you configure the Address Validator transformation to validate United States
address records in Certified mode.
The following table describes the Alias Street options:
Option
Description
Off
Official
Replaces any alternative street name or street alias with the official street
name. Default option.
Preserve
Preserves a valid alternative street name or street alias. If the input street
name is not valid, address validation replaces the name with the official name.
Casing Style
Determines the character case that the transformation uses to write output data.
The following table describes the Casing Style options:
Option
Description
Assign Parameter
Lower
Mixed
Uses the casing style in use in the destination country when it is possible to do
so.
Database
Applies the casing style that the address reference data uses. Default option.
Preserved
Writes the output address in the same case as the input address.
Upper
You can also configure the casing style on the General Settings tab.
27
Parameter Usage
You can use one of the following parameter to specify the casing style:
MIXED. Uses the casing style in use in the destination country when it is possible to do so.
NATIVE. Applies the casing style that the address reference data uses. Default option. Matches the
Database option on the General Settings tab.
NOCHANGE. Writes the output address in the same case as the input address. Matches the Preserved
option on the General Settings tab.
Country of Origin
Identifies the country in which the address records are mailed.
Select a country from the list. The property is empty by default.
Country Type
Determines the format of the country name or abbreviation in Complete Address or Formatted Address Line
port output data. The transformation writes the country name or abbreviation in the standard format of the
country you select.
The following table describes the Country Type options:
28
Option
Country
ISO 2
ISO 3
ISO #
Abbreviation
CN
Canada
DA
DE
Germany
EN
ES
Spain
FI
Finland
FR
France
GR
Greece
Option
Country
IT
Italy
JP
Japan
HU
Hungary
KR
Korea, Republic of
NL
Netherlands
PL
Poland
PT
Portugal
RU
Russia
SA
Saudi Arabia
SE
Sweden
Default Country
Specifies the address reference data set that the transformation uses when an address record does not
identify a destination country.
Select a country from the list. Use the default option if the address records include country information.
Default is None.
You can also configure the default country on the General Settings tab.
Parameter Usage
You can use a parameter to specify the default country. When you create the parameter, enter the
ISO 3166-1 alpha-3 code for the country as the parameter value. When you enter a parameter value, use
uppercase characters. For example, if all address records include country information, enter NONE.
29
The following table describes the options on the Dual Address Priority property:
Option
Description
Delivery service
Validates delivery service data elements in an address, such as post office box
elements.
Postal admin
Validates the address elements required by the local mail carrier. Default
option.
Street
Element Abbreviation
Determines if the transformation returns the abbreviated form of an address element. You can set the
transformation to return the abbreviated form if the address reference data contains abbreviations.
For example, the United States Postal Service (USPS) maintains short and long forms of many street and
locality names. The short form of HUNTSVILLE BROWNSFERRY RD is HSV BROWNS FRY RD. You can select the
Element Abbreviation property when the street or locality values exceed the maximum field length that the
USPS specifies.
The option is cleared by default. Set the property to ON to return the abbreviated address values. The
property returns the abbreviated locality name and locality code when you use the transformation in batch
mode. The property returns the abbreviated street name, locality name, and locality code when you use the
transformation in certified mode.
Execution Instances
Specifies the number of threads that the Data Integration Service tries to create for the current transformation
at run time. The Data Integration Service considers the Execution Instances value if you override the
Maximum Parallelism run-time property on the mapping that contains the transformation. The default
Execution Instances value is 1.
The Data Integration Service considers multiple factors to determine the number of threads to assign to the
transformation. The principal factors are the Execution Instances value and the values on the mapping and
on the associated application services in the domain.
The Data Integration Service reads the following values when it calculates the number of threads to use for
the transformation:
Any Maximum Parallelism value that you set at the mapping level. Default is Auto.
If you override the Maximum Parallelism value at the mapping level, the Data Integration Service attempts to
use the lowest value across the properties to determine the number of threads.
If you use the default Maximum Parallelism value at the mapping level, the Data Integration Service ignores
the Execution Instances value.
The Data Integration Service also considers the Max Address Object Count property on the Content
Management Service when it calculates the number of threads to create. The Max Address Object Count
property determines the maximum number of address validation instances that can run concurrently in a
30
mapping. The Max Address Object Count property value must be greater than or equal to the Maximum
Parallelism value on the Data Integration Service.
Multiple users might run concurrent mappings on a Data Integration Service. To calculate the correct
number of threads, divide the number of central processing units that the service can access by the
number of concurrent mappings.
In PowerCenter, the AD50.cfg configuration file specifies the maximum number of address validation
instances that can run concurrently in a mapping.
When you use the default Execution Instances value and the default Maximum Parallelism values, the
transformation operations are not partitionable.
When you set an Execution Instances value greater than 1, you change the Address Validator
transformation from a passive transformation to an active transformation.
Description
On
Address validation limits the number of addresses that the Ranges to Expand
property adds to the suggestion list. Default option.
Off
Address validation does not limit the number of addresses that the Ranges to
Expand property adds to the suggestion list.
Note: The Address Validator transformation applies the Flexible Range Expansion property in a different way
to every address that it returns to the suggestion list. The transformation does not impose a fixed limit on the
number of expanded addresses in the list. The transformation also considers the Max Result Count property
setting when it calculates the number of expanded addresses to include in the list.
31
Arrival point
Returns the latitude and longitude coordinates of the entrance to a building or parcel of land. Default
option.
You can select the arrival point option for addresses in the following countries:
Austria, Canada, Denmark, Finland, Germany, Hungary, Latvia, Luxembourg, Mexico, Netherlands,
Norway, Slovenia, Sweden, the United Kingdom, and the United States.
When you select the arrival point option for addresses in the United Kingdom, the Address Validator
transformation returns the rooftop geocodes for the address. Rooftop geocodes identify the physical
center of the building that contains the mailbox.
If the Address Validator transformation cannot return arrival point geocodes for an address, the
transformation returns interpolated geocodes.
Parcel centroid
Returns the latitude and longitude coordinates of the geographic center of a parcel of land at ground
level.
You can select the parcel centroid option for addresses in the following countries:
Austria, Canada, Denmark, Finland, Germany, Hungary, Latvia, Luxembourg, Netherlands, Norway,
Slovenia, Sweden, and the United States.
If the Address Validator transformation cannot return parcel centroid geocodes for an address, the
transformation does not return any geocode data.
Standard
Returns the estimated latitude and longitude coordinates of the entrance to the building or parcel of land.
An estimated geocode is also called an interpolated geocode.
The Address Validator transformation uses the nearest available geocodes in the address reference data
to estimate the geocodes for the address.
Parameter Usage
You can use a parameter to specify the geocode type. Enter ARRIVAL_POINT, PARCEL_CENTROID, or
NONE. To return the standard geocodes, enter NONE.
Enter the parameter value in uppercase.
Parameter Usage
You can use a parameter to specify the maximum number of addresses. To set the parameter value, enter an
integer from 0 through 1024.
32
All
Address
Organization
Contact
Organization/Contact
The address includes organization information and contact information.
Organization/Dept
The address includes organization information and department information.
Default is All.
Line Separator
Specifies the delimiter symbol that indicates line breaks in a formatted address.
Select one of the following options:
Carriage return
Comma
Line Feed/LF
Pipe
Semicolon
Space
Tab
Default is semicolon.
You can also configure the line separator on the General Settings tab.
Parameter Usage
You can use a parameter to specify the line separator. The parameter value is case-sensitive. Enter the
parameter value in uppercase characters.
Enter one of the following values:
CR
33
COMMA
LF
PIPE
SEMICOLON
SPACE
TAB
CRLF
Matching Alternatives
Determines whether address validation recognizes alternative place names, such as synonyms or historical
names, in an input address. The property applies to street, locality, and province data.
Note: The Matching Alternatives property does not preserve alternative names in a validated address.
The following table describes the Matching Alternatives options:
Option
Description
All
Recognizes all known alternative street names and place names. Default
option.
Archives only
None
Synonyms only
34
Option
Description
Off
On
Returns the address code for the current version of an out-of-date Japanese
address.
The Matching Extended Archive property uses supplementary data and address code lookup data for Japan.
To apply the property in address validation, configure the transformation to run in address code lookup mode.
Matching Scope
Determines the amount of data that the transformation matches against the address reference data during
address validation.
The following table describes the Matching Scope options:
Option
Description
All
Delivery Point
Validates building and sub-building address data in addition to data that the
Street option validates.
Locality
Street
Validates street address data in addition to data that the Locality option
validates.
Parameter Usage
You can use a parameter to specify the maximum number of addresses. To set the parameter value, enter an
integer from 0 through 100.
Mode
Determines the type of address analysis that the transformation performs. You can also configure the mode
on the General Settings tab of the transformation.
Parameter Usage
You can use a parameter to specify the analysis mode. When you define the parameter, enter the mode as
the default value. The value uses uppercase characters and does not contain spaces.
35
The following table describes the Mode menu options and the corresponding parameter values that you can
set:
Mode
Description
Parameter Value
Batch
BATCH
Certified
CERTIFIED
Country recognition
COUNTRYRECOGNITIO
N
Parse
PARSE
Suggestion list
FASTCOMPLETION
Assign parameter
Optimization Level
Determines how the transformation matches input address data and address reference data. The property
defines the type of match that the transformation must find between the input data and reference data before
it can update the address record.
The following table describes the Optimization Level options:
Option
Description
Narrow
Standard
The transformation parses multiple types of address information from the input
data before it performs validation. When you select the standard option, the
transformation updates an address if it can match multiple input values with
the reference data.
Default is Standard.
Wide
The transformation uses the standard parsing settings and performs additional
parsing operations across the input data. When you select the wide option, the
transformation updates an address if it can match at least one input value with
the reference data. The wide option increases mapping run times.
Parameter Usage
You can use a parameter to specify the optimization level. Enter NARROW, STANDARD, or WIDE. Enter the
parameter value in uppercase.
36
All
Address
Organization
Contact
Organization/Contact
The address includes organization information and contact information.
Organization/Dept
The address includes organization information and department information.
Default is All.
Preferred Language
Determines the languages in which the Address Validator transformation returns an address when the
reference data sets contain data in more than one language.
An address reference data set specifies a default language for each valid address. In some countries, the
default language can vary from one region to another. For example, the transformation returns Canadian
addresses in English or in French according to the province that the address identifies. The address
reference data sets for Canada store address elements in both languages. Use the Preferred Language
property to return the addresses in each case in the non-default language.
Note: An address reference data set might contain some address elements in a non-default language but not
others. If the transformation cannot find an element in the language that the property specifies, the
transformation returns the element in the default language.
The following table describes the options that you can select on the Preferred Language property:
Option
Description
Alternative 1,
Returns each address element in the same language regardless of the default
language for the country or region.
Alternative 2,
Alternative 3
Database
Returns each address in the language that the address reference data
specifies. The address reference data might specify different languages for
addresses in different regions in a country.
Database is the default option.
37
Option
Description
English
Returns the locality names and the province names in English when the
reference data contains the data in English. Returns the other address
elements in the default language of the region to which the address belongs.
Preserve Input
Returns each address in the language that the input address uses.
The transformation preserves the input language when the address reference
data contains the address in the input language.
If the property specifies a language that is not available in the address reference data, the transformation
returns the addresses in the default language.
Language
Alternative 1
Flemish
Alternative 2
French
Alternative 3
German
The following table describes the options and the languages that you can specify for addresses in Canada:
Option
Language
Alternative 1
English
Alternative 2
French
Alternative 3
When you select the Alternative 2 option for Canadian addresses, address validation returns the street
descriptors and the province names in French. Address validation returns all other address elements in
English.
38
The following table describes the options and the languages that you can specify for addresses in
Switzerland:
Option
Language
Alternative 1
German
Alternative 2
French
Alternative 3
Italian
When you select the Alternative 3 option for Swiss addresses, address validation returns the locality names
in Italian. Address validation returns all other address elements in the default language for the region.
Note: If you set a preferred language and preferred script for the output data, verify that the language and the
character code you select are compatible.
Preferred Script
Determines the character set that the Address Validator transformation uses for output data.
The transformation can process a data source that contains data in multiple languages and character sets.
The transformation converts all input data to the Unicode UCS-2 character set and processes the data in the
UCS-2 format. After the transformation processes the data, it converts the data in each address record to the
character set that you specify in the property. The process is called transliteration. By default, the property
uses the character set that the local address reference data uses.
The following table describes the options on the property:
Option
Description
ASCII (Simplified)
ASCII (Extended)
Returns address data in ASCII characters and expands special characters. For
example, transliterates to OE.
Database
Returns address data in the character set of the address reference data that
applies to the address.
Default is Database.
Latin
Latin (Alt)
Postal Admin
Preserve Input
Returns address data in the character set that the input address uses.
Transliteration can use the numeric representations of each character in a character set when it converts
characters for processing. Transliteration can also convert characters phonetically when there is no
39
equivalent numeric representation of a character. If the Address Validator transformation cannot map a
character to UCS-2, it converts the character to a space.
Note: If you set a preferred language and preferred script for the output data, verify that the language and the
character code that you select are compatible.
Ranges To Expand
Determines how the Address Validator transformation returns suggested addresses for a street address that
does not specify a house number. Use the property when the transformation runs in suggestion list mode.
The Address Validator transformation reads a partial or incomplete street address in suggestion list mode.
The transformation compares the address to the address reference data, and it returns all similar addresses
to the end user. If the input address does not contain a house number, the transformation can return one or
more house number suggestions for the street. The Ranges to Expand property determines how the
transformation returns the addresses.
The transformation can return the range of valid house numbers in a single address, or it can return a
separate address for each valid house number. The transformation can also return an address for each
number in the range from the lowest to the highest house number on the street.
The following table describes the options on the property:
Option
Description
All
Address validation returns a suggested address for every house number in the
range of possible house numbers on the street.
None
Address validation returns a single address that identifies the lowest and
highest house numbers in the valid range for the street.
Address validation returns a suggested address for every house number that
the address reference data recognizes as a deliverable address.
Note: Suggestion list mode can use other elements in the address to specify the valid range of street
numbers. For example, a ZIP Code might identify the city block that contains the address mailbox. The
Address Validator transformation can use the ZIP Code to identify the lowest and highest valid house
numbers on the block.
If the transformation cannot determine a house number range within practical limits, the number of suggested
addresses can grow to an unusable size. To restrict the number of addresses that the Ranges to Expand
property generates, set the Flexible Range Expansion property to On.
40
Predirectional and postdirectional elements, such as north, south, east, and west.
State or province names. Standardization returns the abbreviated forms of the names.
Description
Off
On
Parameter Usage
You can assign a parameter to specify the standardization policy for data errors. Enter OFF or ON as the
parameter value. Enter the value in uppercase.
Tracing Level
Sets the amount of detail that is included in the log.
You can configure tracing levels for logs.
Configure the following property on the Advanced tab:
Tracing Level
Amount of detail that appears in the log for this transformation. You can choose terse, normal, verbose
initialization, or verbose data. Default is normal.
Description
Country ISO
The country to which the address reference data file applies. This property shows
the ISO three-character abbreviation for the country.
Expiry Date
The date on which the file is due to be replaced by a newer file. You can use an
address reference data file after its expiry date, but the data in the file may no
longer be accurate.
41
Property
Description
Country Type
The type of address processing that you can perform with the data. You select the
processing type in the Mode option on the General Settings tab. If the mode you
select does not correspond to an address data file on the domain, the address
validation mapping will fail.
The date on which the file license expires. You cannot use the file after the unlock
expiry date.
The date on which the license is effective for the file. You cannot use the file
before the unlock start date.
42
Index
M
Microsoft SQL Server database requirements
reference data warehouse 17
O
Oracle database requirements
reference data warehouse 17
43