Enterprise Data Catalog - Data Sheet - 3238en

1) Informatica Enterprise Data Catalog is an AI-powered data catalog that uses machine learning to automatically catalog and classify data across an enterprise, including both cloud and on-premises data. 2) It provides powerful semantic search and relationship views to help users discover and understand data assets. Users can view metadata, lineage, quality metrics, and collaborate on data. 3) The data catalog automatically associates business terms to technical metadata to add business context to data and help both business and IT users manage metadata.

Uploaded by

Kranti Kumar

Data Sheet

Informatica Enterprise
Data Catalog

Unleash the Power of Data With an Intelligent Data Catalog
Data is the lifeblood of our economy, and data-driven companies turn their data assets into
revenue and profits. The first step in any data-driven digital transformation initiative is to manage
your data as an enterprise asset: take inventory of it, assess its value, and maximize its use, just
like you do with other significant capital and operational investments.

Data is diverse and distributed across many different departments, applications, and data
warehouses and data lakes (some on-premises, others in the cloud), making it a challenge to
know exactly what data you have and where. As data sources proliferate, the data landscape
becomes even more complex.

Informatica® Enterprise Data Catalog is an AI-powered data catalog that provides a
machine-learning-based discovery engine to scan and catalog data assets across the enterprise,
both multi-cloud and on-premises. Enterprise Data Catalog is powered by the CLAIRE® engine,
which provides intelligence by leveraging metadata to deliver recommendations, suggestions,
and automation of data management tasks. This enables IT users to be more productive and
business users to be full partners in the management and use of data.

Informatica Enterprise Data Catalog provides data analysts and IT users with powerful semantic
search and dynamic facets to filter search results, detailed data lineage, profiling statistics,
data quality scorecards, holistic relationship views, data similarity recommendations, and an
integrated business glossary.

Collaboration capabilities leverage subject matter expertise and social curation combined with
the power of AI to guide user experience and automate data curation. Users can quickly find data
and easily manage the life cycle of business terms, definitions, reference data, and more.

With Data Asset Analytics in Enterprise Data Catalog, you get insights on the usage of data within
your organization, enabling you to proactively manage and optimize the value of your data assets.

Benefits
• Automatically catalog and classify all types of data across the enterprise using an
AI-powered catalog
• Provide a metadata system of record for the enterprise with a catalog of catalogs
• Automatically extract the most granular metadata from a wide array of data sources, including
complex enterprise systems
• Find data assets through powerful Google-like semantic search
• Discover and understand your data assets with a holistic view including lineage, relationship
views, and data profiling stats and quality scorecards
• Identify domains and entities with intelligent curation
• Enrich data assets with governed and crowdsourced annotations, ratings, and reviews
• Automatically associate business glossary terms to technical data assets
• Open APIs to integrate into your environment and expose intelligent metadata anywhere
• Measure and optimize the value of your data assets with Data Asset Analytics

Informatica Enterprise Data Catalog is an AI-powered data catalog that provides a
machine-learning-based discovery engine to scan and catalog data assets across the enterprise,
both in the cloud and on-premises.

Key Features
Semantic Search With Intelligent Facets
Find and discover the most relevant datasets for your analysis using powerful semantic search
with intelligent facets. Advanced keyword search with token matching finds the most relevant
data assets, and semantic search is even applied to inferred data domains. Intelligent facets,
based on the search results, allow users to narrow the search to the datasets of interest.

Holistic Relationship Discovery
Get a holistic view of data in a knowledge graph that lets you quickly search, discover, and
understand enterprise data and meaningful data relationships. Automatically discover related
datasets, technical, business, semantic, and usage-based relationships. The holistic data view
shows related datasets, tables, views, data domains, reports, and users. This aids in progressive
discovery of other datasets of interest.

Automated Classifications With Intelligent Domain and Entity Recognition


Automatically classify and identify domains and entities such as customer, product, and order
across all structured and unstructured data assets at the field, column, and table level. This is
a crucial step in the ability for companies to catalog, govern, and extract value from their data
assets. This classified data enables better search, filtering of search results, and business
glossary recommendations. Informatica provides over 60 packaged data domains such as email,
credit card number, social security number, country, city, URL, and company name. Users can
add their own custom domains too. Data assets can be classified using data rules (i.e., columns
with data that matches specific logic defined in the rule) or column name rules (i.e., finds
columns that match column name logic defined in the rule).
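To illustrate the difference between data rules and column name rules, here is a minimal Python sketch. It is not Informatica's implementation: the `DomainRule` class, the regular expressions, and the 80% match threshold are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class DomainRule:
    """Hypothetical rule: a domain name plus optional data and column name logic."""
    domain: str
    data_regex: Optional[str] = None          # logic applied to column values
    column_name_regex: Optional[str] = None   # logic applied to the column name
    min_match_ratio: float = 0.8              # assumed threshold, not a product default


def matches_data_rule(rule: DomainRule, values: List[str]) -> bool:
    """True when enough non-empty values fully match the rule's data regex."""
    if rule.data_regex is None:
        return False
    non_empty = [v for v in values if v]
    if not non_empty:
        return False
    hits = sum(1 for v in non_empty if re.fullmatch(rule.data_regex, v))
    return hits / len(non_empty) >= rule.min_match_ratio


def matches_name_rule(rule: DomainRule, column_name: str) -> bool:
    """True when the column name matches the rule's name regex (case-insensitive)."""
    if rule.column_name_regex is None:
        return False
    return re.search(rule.column_name_regex, column_name, re.IGNORECASE) is not None


def classify_columns(columns: Dict[str, List[str]],
                     rules: List[DomainRule]) -> Dict[str, List[str]]:
    """Assign every domain whose data rule or column name rule matches."""
    return {
        name: [r.domain for r in rules
               if matches_data_rule(r, values) or matches_name_rule(r, name)]
        for name, values in columns.items()
    }


RULES = [
    DomainRule("Email", data_regex=r"[^@\s]+@[^@\s]+\.[^@\s]+",
               column_name_regex=r"e[-_]?mail"),
    DomainRule("Country", column_name_regex=r"country"),
]

SAMPLE = {
    "contact": ["ann@example.com", "bob@example.org", "", "carol@example.net"],
    "COUNTRY_CD": ["US", "DE", "IN"],
    "notes": ["called twice", "n/a"],
}

classified = classify_columns(SAMPLE, RULES)
```

In this sample, `contact` is classified as Email because of its values even though its name gives no hint, while `COUNTRY_CD` is classified as Country by name alone.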

Figure 1: Quickly find datasets with smart semantic search and dynamic facets. View ratings and certified datasets.

Data Lineage and Impact Analysis


Interactively trace data origin through lineage views at any level—from business-friendly, system-
level views that highlight the endpoints to granular views that include all the complex details
in between. A drill-down lineage view expands any lineage path to show granular column- and
metric-level lineage. Users can perform detailed impact analysis on upstream and downstream
data assets.
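Conceptually, impact analysis is a traversal of a directed lineage graph. The sketch below shows one way to do it with a breadth-first search. The `LineageGraph` class and the asset names are hypothetical and do not reflect how the product stores or exposes lineage.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Hypothetical column-level lineage graph: edges point from source to target."""

    def __init__(self):
        self._downstream = defaultdict(set)
        self._upstream = defaultdict(set)

    def add_flow(self, source, target):
        """Record that data flows from `source` into `target`."""
        self._downstream[source].add(target)
        self._upstream[target].add(source)

    @staticmethod
    def _reach(start, edges):
        """Breadth-first search returning every asset reachable from `start`."""
        seen = set()
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for nxt in edges.get(node, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        seen.discard(start)  # guard against cycles leading back to the start
        return seen

    def downstream(self, asset):
        """Assets affected if `asset` changes."""
        return self._reach(asset, self._downstream)

    def upstream(self, asset):
        """Assets that `asset` ultimately originates from."""
        return self._reach(asset, self._upstream)


graph = LineageGraph()
graph.add_flow("crm.customers.email", "staging.customers.email")
graph.add_flow("staging.customers.email", "dw.dim_customer.email")
graph.add_flow("dw.dim_customer.email", "report.churn.contact_email")
graph.add_flow("erp.orders.customer_id", "dw.fact_orders.customer_id")

impacted = graph.downstream("staging.customers.email")
origins = graph.upstream("report.churn.contact_email")
```

Here `impacted` holds the warehouse column and the report field that depend on the staging column, and `origins` traces the report field back to its CRM source.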

Collaboration and Social Curation
Informatica Enterprise Data Catalog empowers data analysts and data scientists to easily find
the most relevant and trusted data for analytics by harnessing the combined power of AI and
human expertise and collaboration. Data owners and subject matter experts can certify datasets.
Data consumers can provide ratings and reviews for datasets enabling social curation of data.

Users can follow datasets of interest and get notified of changes, and a Q&A platform allows
subject matter experts to answer common questions from users. In addition, users can add
custom attributes and annotations to datasets, further enhancing business-IT collaboration and
search results.

Figure 2: Enable collaboration with Q&A capabilities.

Integrated Data Quality


View data profiling statistics, data quality rules, scorecards, and metric groups alongside
technical metadata to understand the quality of data assets before using data for analysis.
Profiling statistics include value distributions, patterns, and data type and data domain inference.
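As a rough illustration of these profiling statistics, the following Python sketch computes a value distribution, character patterns, and an inferred data type for one column. The pattern notation (`9` for a digit, `A` for a letter) and the type inference rules are assumptions for this example, not the product's algorithm.

```python
import re
from collections import Counter


def value_pattern(value):
    """Reduce a value to a shape: digits become 9, letters become A, the rest is kept."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A")
        else:
            out.append(ch)
    return "".join(out)


def infer_type(values):
    """Infer the narrowest type that fits every non-empty value."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "unknown"
    if all(re.fullmatch(r"[+-]?\d+", v) for v in non_empty):
        return "integer"
    if all(re.fullmatch(r"[+-]?\d+(\.\d+)?", v) for v in non_empty):
        return "decimal"
    return "string"


def profile_column(values):
    """Collect simple profiling statistics; empty strings are treated as nulls."""
    non_empty = [v for v in values if v != ""]
    return {
        "row_count": len(values),
        "null_count": len(values) - len(non_empty),
        "distinct_count": len(set(non_empty)),
        "value_distribution": Counter(non_empty).most_common(3),
        "patterns": Counter(value_pattern(v) for v in non_empty).most_common(3),
        "inferred_type": infer_type(non_empty),
    }


ZIP_CODES = ["94063", "10001", "94063", "", "1000A"]
profile = profile_column(ZIP_CODES)
```

The rare `9999A` pattern in this sample is the kind of signal an analyst might review before trusting the column for analysis.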

Automatic Association of Business Glossary Terms


Informatica Enterprise Data Catalog allows for easy import of business glossary assets such as
terms, policies, and classifications from Informatica Axon™ as well as third-party tools. Add rich
business context to the data by automatically associating business terms with the right technical
metadata. This eliminates a tedious manual process and lets business and IT stewards
collaboratively manage business metadata, supported by efficient human workflow automation.

Intelligent Data Similarity


Advanced statistical and machine learning algorithms identify similar data and subsets of data.
This powerful capability helps users find the most relevant and trusted data they need. For
example, a telecom analyst interested in customer churn analysis might query data containing
pre-paid customer activity for the current quarter. Informatica Enterprise Data Catalog can
recommend a cleaner version of the data (substitute data), data containing customer activity
for the previous quarter (union-able data), and a customer detail table to enrich the dataset
(joinable data).
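The three kinds of recommendation in this example can be approximated with simple set-overlap measures. The sketch below is a simplified stand-in for the statistical and machine learning algorithms mentioned above, which this data sheet does not describe. The `relate` function, the 0.6 threshold, and the sample datasets are assumptions.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def containment(a, b):
    """Fraction of the values in `a` that also appear in `b`."""
    a, b = set(a), set(b)
    return len(a & b) / len(a) if a else 0.0


def relate(query, candidate, threshold=0.6):
    """Label `candidate` relative to `query`; each dataset maps column name to a set of values."""
    schema_sim = jaccard(query.keys(), candidate.keys())
    shared = set(query) & set(candidate)
    if schema_sim >= threshold:
        # Same shape: overlapping values suggest a substitute, disjoint values a union.
        value_sim = min(jaccard(query[c], candidate[c]) for c in shared)
        return "substitute" if value_sim >= threshold else "union-able"
    for column in shared:
        # Different shape, but a shared column covers the query's keys.
        if containment(query[column], candidate[column]) >= threshold:
            return "joinable"
    return "unrelated"


current_quarter = {"customer_id": {1, 2, 3, 4}, "minutes": {10, 20, 30, 40}, "plan": {"prepaid"}}
cleaned_copy = {"customer_id": {1, 2, 3, 4}, "minutes": {10, 20, 30}, "plan": {"prepaid"}}
previous_quarter = {"customer_id": {5, 6, 7, 8}, "minutes": {15, 25}, "plan": {"prepaid"}}
customer_detail = {"customer_id": {1, 2, 3, 4, 5, 6, 7, 8}, "name": {"Ann", "Bob"}, "city": {"Oslo"}}
product_list = {"sku": {"A1", "B2"}}

labels = {
    "cleaned_copy": relate(current_quarter, cleaned_copy),
    "previous_quarter": relate(current_quarter, previous_quarter),
    "customer_detail": relate(current_quarter, customer_detail),
    "product_list": relate(current_quarter, product_list),
}
```

A production system would compare value samples or sketches rather than full value sets, but the labels follow the same idea: matching schema and values for substitutes, matching schema with different rows for union-able data, and a shared key for joinable data.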

Data Asset Analytics for Data Value
Data Asset Analytics provides prepackaged reports and dashboards on data asset inventory,
usage, enrichment, level of collaboration, and more. Reports are extensible and can be exported,
enabling data leaders to share business adoption and value metrics with stakeholders. The Automated
Data Value Calculator, a first-of-its-kind capability, allows an enterprise to measure and optimize
the value of its data assets based on key factors that impact data value.

Universal Metadata Connectivity


Enterprise Data Catalog offers deep and broad metadata connectivity that spans on-premises,
hybrid, and multi-cloud environments. Extract metadata from any type of data source such as
databases, data warehouses, cloud data lakes, BI tools, Hadoop clusters, NoSQL, and complex
enterprise systems including legacy and mainframe systems, multi-vendor ETL tools, SQL dialects,
various enterprise applications, and stored procedures.

With Enterprise Data Catalog Advanced Scanners, you can visually inspect every script, procedure,
or process to fully understand its logic and internal data flow. You can obtain a complete column-
level data lineage, including a full inventory of all the potential lineage sources with rich details.
The Advanced Scanners allow you to scan both static and dynamic code, as well as perform
language parsing to obtain automated data lineage.

Data sources supported include:


• Databases/Data warehouses: Oracle, MS SQL Server, SQL Scripts, Sybase ASE, IBM Netezza,
Teradata, JDBC, SAP HANA, SAP BW, SAP BW/4HANA, Snowflake, Stored Procedures
• Big Data: Cloudera Navigator, Hive (Cloudera/Hortonworks/MapR/IBM BigInsights/EMR), HDFS,
Hortonworks Atlas, Cassandra, MongoDB, Kafka, Greenplum
• Mainframes: DB2 z/OS, DB2 i5/OS, COBOL, JCL
• BI and Analytics: SAP BusinessObjects, Tableau, Microsoft Power BI, Cognos, MicroStrategy,
OBIEE, QlikView, Qlik Sense, Microsoft SSRS and SSAS, SAS
• ETL: Informatica PowerCenter®, Informatica Data Engineering Integration, Informatica Intelligent
Cloud Services℠, Informatica Data Integration Hub, Microsoft SSIS, IBM InfoSphere DataStage,
Oracle Data Integrator, Talend Data Integration, AWS Glue
• Business Glossary: Informatica Axon Data Governance, Informatica Business Glossary
• Data Modeling: Erwin Data Modeler, SAP PowerDesigner
• Enterprise Applications: Salesforce, Oracle, Workday, Informatica MDM, SAP ECC, SAP S/4 HANA
• File Systems: Microsoft SharePoint, Microsoft OneDrive, Windows/Linux Filesystems
• File Formats: MS Excel, MS Word, MS PowerPoint, Adobe PDF, Flat Files, CSV, Delimited, XML,
JSON, Avro, Parquet
• Cloud Platforms: AWS S3, AWS Redshift, Azure SQL DB, Azure Synapse Analytics, Azure ADLS,
Azure ADLS Gen 2, Azure Blob, Google Cloud Storage, Snowflake, Google BigQuery

Figure 3: Informatica Enterprise Data Catalog supports universal metadata connectivity.

Self-Service Data Provisioning


After you find the relevant datasets for your analysis, easily move your dataset to the target
of your choice with simple click-through provisioning from within Informatica Enterprise
Data Catalog. You can choose from a broad choice of sources and targets including Amazon
Redshift, Azure Synapse Analytics, Google BigQuery, Snowflake, and BI tools like Tableau.
This capability leverages the integration of Informatica Enterprise Data Catalog with Informatica
Cloud Data Integration.

Metadata APIs to Integrate Into Your Environment


Informatica Enterprise Data Catalog includes REST-based APIs that enable you to integrate it
into your environment and consume catalog content anywhere. Organizations can share any
intelligent metadata—applications, BI reports, and dashboards—with business users. Users can
export and share selected catalog content and associated enrichment metadata.
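A client that consumes catalog content over REST might prepare a search request as sketched below. This data sheet does not document the API, so everything specific here is a placeholder: the host, the `/example/search` path, the `q` and `pageSize` parameters, and the bearer token scheme are assumptions, and the real values should be taken from the product's API reference. The code only builds the request and does not send it.

```python
from urllib.parse import urlencode, urljoin
from urllib.request import Request


def build_search_request(base_url, search_path, query, token, page_size=10):
    """Build (but do not send) a GET request against a hypothetical search endpoint."""
    # Parameter names are placeholders, not the documented API.
    params = urlencode({"q": query, "pageSize": page_size})
    url = urljoin(base_url.rstrip("/") + "/", search_path.lstrip("/")) + "?" + params
    return Request(
        url,
        headers={
            "Accept": "application/json",
            "Authorization": f"Bearer {token}",  # assumed auth scheme
        },
    )


request = build_search_request(
    base_url="https://catalog.example.com",  # placeholder host
    search_path="/example/search",           # placeholder path
    query="customer churn",
    token="REPLACE_ME",
)
```

Passing `request` to `urllib.request.urlopen` would send it. Keeping construction separate from sending makes the URL and headers easy to inspect before pointing the client at a real catalog.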

Tableau Integration for Governed Self-Service Analytics


The Chrome browser plug-in and Tableau extension for Informatica Enterprise Data Catalog
provide two different options for Tableau users to access the full resources of Informatica
Enterprise Data Catalog from within the native Tableau user interface. Without leaving the
Tableau interface, users can leverage an intelligent search bar to find trusted data assets, access
business and technical context, and collaborate with their peers.

Resource-Level Security
Grant user and group read/write permissions at the resource level to allow users to view or edit
custom attributes, perform domain curation, and associate business glossary terms.

Enterprise-Scale Deployments
Informatica Enterprise Data Catalog is built for true enterprise-scale deployments with the
ability to scan tens of millions of datasets across hundreds of data sources. It supports
parallel metadata ingestion and high-speed distributed indexing to quickly update catalog
content and deliver unmatched search performance and fault tolerant high availability for 24x7
implementations. With Spark-based data profiling, you can profile massive amounts of data at
scale to get a deeper understanding of enterprise data.

Unified Administration
Manage and monitor the catalog resources, metadata extract schedules, profiling runs,
and more from one unified admin console. A job control dashboard provides widgets for task
monitoring and resource views. Email alerts assist administrators in proactively responding to
catalog issues.

Figure 4: Understand your data with holistic data relationship views.

Benefits
Intelligently Catalog All Types of Data Across the Enterprise
Informatica Enterprise Data Catalog intelligently discovers many types of data and their
relationships across the enterprise. Pre-built scanners collect metadata from databases, data
warehouses, data lakes, cloud data stores, applications, BI tools, ETL tools, third-party metadata
catalogs, NoSQL, and more. All the metadata is indexed and cataloged in a highly-scalable graph
database architected for fast updates, smart search, and fast queries. As more and more data
is created and propagated throughout the enterprise, similar and duplicate datasets inevitably
arise. Informatica Enterprise Data Catalog leverages advanced statistical and machine learning
algorithms to discover similar data and subsets of data, helping users find the most relevant and
trusted data they need.

Find Data Assets Quickly Through Powerful, Google-Like Semantic Search
Trying to find the data you need across hundreds of enterprise systems may sometimes
seem futile. Only through powerful semantic search built on comprehensive metadata-driven
intelligence and a scalable infrastructure can one even hope to find relevant data. Informatica
Enterprise Data Catalog delivers semantic search with intelligent facets to further refine search
results. Because Informatica uniquely associates business, technical, and operational metadata,
business users can search with business terms to find their data and then browse holistic
relationship views to find related data assets.

Discover and Understand Your Data Assets With Holistic Relationship Views and Lineage
The classic saying, “You can’t manage what you can’t measure,” is true when it comes to
managing data assets. To get the most value from data, you need to understand what you
have, where it came from, how it has changed, and what level of trust you have in the data.
Informatica Enterprise Data Catalog answers all these questions and more with complete
end-to-end summary and detailed lineage, profiling statistics, data quality scorecards, and holistic
relationship views, providing a clear picture of your data.

Enrich Data Assets With Business Context Through Governed and Crowdsourced Annotations
Informatica Enterprise Data Catalog maximizes the reuse and value of data by automatically
classifying enterprise data assets down to the field/column level. To further increase the value of
data, Informatica Enterprise Data Catalog captures the context of who is using the data and for
what purpose, along with crowdsourced tags, annotations, ratings, and reviews. This “wisdom of
crowds” helps to enrich and curate data, making it even more valuable throughout the enterprise.
Informatica Enterprise Data Catalog integrates with Informatica Axon for easy import of business
glossary assets such as business terms, definitions, and policies from Axon. This business
metadata is automatically associated with technical metadata and operational metadata so that
business analysts, data stewards, and other users can quickly find, understand, and collaborate
on data assets.

Gain Insight Into Data Usage, Share Best Practices, and Estimate Asset Value
With Data Asset Analytics in Enterprise Data Catalog, you gain insights into data usage and
users, with visibility into what data assets are in demand, who is using them, and more, enabling
you to discover the most valuable data assets within your enterprise. Visual dashboards and
exportable reports empower data leaders to share best practices, socialize data catalog adoption,
and drive data-driven decision-making. By calculating data asset value according to parameters
you provide, the Automated Data Value Calculator helps you proactively manage and optimize
your most important data assets.

About Informatica
Digital transformation changes expectations: better service, faster delivery, with less cost.
Businesses must transform to stay relevant and data holds the answers. As the world’s leader in
Enterprise Cloud Data Management, we’re prepared to help you intelligently lead, in any sector,
category, or niche. Informatica provides you with the foresight to become more agile, realize
new growth opportunities, or create new inventions. With 100% focus on everything data, we
offer the versatility needed to succeed.

We invite you to explore all that Informatica has to offer and unleash the power of data to drive
your next intelligent disruption.

Learn More
To learn more about Informatica Enterprise Data Catalog, please visit
https://www.informatica.com/products/data-catalog.html.

Worldwide Headquarters 2100 Seaport Blvd., Redwood City, CA 94063, USA Phone: 650.385.5000, Toll-free in the US: 1.800.653.3871
IN06_0721_03238
© Copyright Informatica LLC 2021. Informatica, the Informatica logo, CLAIRE, Axon, and PowerCenter are trademarks or registered trademarks of Informatica LLC in the United States and other
countries. A current list of Informatica trademarks is available on the web at https://www.informatica.com/trademarks.html. Other company and product names may be trade names or trademarks of
their respective owners. The information in this documentation is subject to change without notice and provided “AS IS” without warranty of any kind, express or implied.
