TIBCO Data Virtualization
With data the new competitive battleground, businesses that take advantage of
their data will be the leaders; those that do not will fall behind.
But gaining this advantage is a more difficult technical challenge than ever
as your business requirements are ever-changing and your data is more widely
distributed across on-premises, big data, the Internet of Things (IoT) and the
Cloud. Traditional physical data integration via data warehousing and ETL is often too
time consuming, too rigid, and too costly to support your dynamic business needs.
There has to be a better way. There is, and it’s called data virtualization.

TIBCO® Data Virtualization software lets you integrate data with breakthrough
speed and cost effectiveness. With it, you can build and manage virtualized views
and data services that access, transform, and deliver the data your business requires
to accelerate revenue, reduce costs and risk, improve compliance, and more.

TIBCO Data Virtualization is Java enterprise-grade middleware with a modular
structure that supports all phases of data virtualization development, run-time,
and management.

BENEFITS
ECONOMICAL
Integrate data reliably at a fraction of physical warehousing and ETL time,
cost and rigidity. Evolve rapidly when requirements change.
IMMEDIATE
Deliver up-to-the-minute data as needed, using advanced performance
optimization algorithms and fine-grained security.
BUSINESS-FRIENDLY
Transform native IT structures and syntax into easy-to-understand, IT-curated
data sets sharable via a self-service business directory.
WIDE-RANGING
Access data from distributed data sources including traditional enterprise,
big data, cloud, and IoT. Use it across myriad analytics, self-service,
business intelligence, and transactional applications.
ENTERPRISE GRADE
Support multiple lines of business, hundreds of projects, and thousands of users.

[Figure: TIBCO Data Virtualization architecture. Consumers: Analytics,
Self-Service, Business Intelligence, Transactional Apps. Delivery interfaces:
SQL (ODBC, JDBC, ADO.NET), Web Services (HTTP, REST, SOAP, JSON, OData),
Messaging (JMS). Components shown across the Development, Run Time, and
Management Environments: Business Directory, Discovery, Studio, Federation
Engine, Cost-based Optimizer, Rules-based Optimizer, Object Repository (SQL,
SQL Script, Java, XQuery, XSLT), Security, Caching, Quality, Governance,
Adapters, Manager, Deployment Manager, Monitor, Active Cluster. Data sources:
Packaged Apps, RDBMS, Excel Files, Data Warehouse, OLAP Cubes, Hadoop/Big Data,
XML Docs, Flat Files, Web Services.]
Studio is the agile modeling, development, and resource management tool that
data-oriented developers use to model data, design views and data services, build
transformations, optimize queries, manage resources, and more. Easy to learn
and use, Studio’s graphical modeling environment provides a flexible workspace
where queries are created and tested, as well as a data services repository where
completed objects are published. Studio also offers a rich set of transformations
in addition to an easy-to-use transformation editor. Five languages complement
Studio’s graphical modeling capabilities: SQL, SQL Script, Java, XQuery, and XSLT.

Adapters provide a wide range of data source connectivity for databases, files,
big data, cloud sources, packaged applications, and more. Beyond schema-to-schema
only connectivity, TIBCO Data Virtualization adapters integrate with data
source optimizers to ensure more accurate queries and higher performance. The
Data Source Tool Kit allows you to build additional custom adapters.

COMPLETE DATA VIRTUALIZATION SOLUTION
Beyond TIBCO Data Virtualization software, TIBCO also provides a broad array
of complementary advanced services, training, technical support, customer advisory
program, knowledgebase, and partner offerings, so you get the complete solution
needed to ensure your data virtualization success.

Cost-based and Rules-based Optimizers used by the Federation Engine
(either a massively parallel processing [MPP] execution engine or the classic
pushdown query execution engine) work directly with data source optimizers
to maximize query performance. The Object Repository lets you manage your
data services throughout their lifecycle. Myriad fine-grained Security capabilities,
including authentication, authorization, and encryption, ensure that you deliver
only sanctioned data, no more, no less.
Granular workload management allows you to provide more intelligent
allocation of resources for important workloads. Control memory usage, request
length, row counts, and more, as well as avoid potentially problematic requests.
These controls can be implemented at the object, user group, server, and/or
cluster group level.
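
As a rough illustration of the kind of control described above, the following Python sketch caps the row count and elapsed time of a single request. It is not TIBCO Data Virtualization code or configuration: the names `enforce_limits` and `WorkloadLimitExceeded` and the limit values are hypothetical, chosen only to show the idea.

```python
import time
from typing import Iterable, Iterator


class WorkloadLimitExceeded(Exception):
    """Raised when a request goes past one of its configured limits."""


def enforce_limits(rows: Iterable[tuple], max_rows: int, max_seconds: float) -> Iterator[tuple]:
    """Yield rows until a row-count or elapsed-time limit is crossed."""
    started = time.monotonic()
    for count, row in enumerate(rows, start=1):
        if count > max_rows:
            raise WorkloadLimitExceeded(f"row limit of {max_rows} exceeded")
        if time.monotonic() - started > max_seconds:
            raise WorkloadLimitExceeded(f"time limit of {max_seconds}s exceeded")
        yield row


# Hypothetical limits for one user group; the values are made up.
small_result = list(enforce_limits([(1,), (2,)], max_rows=10, max_seconds=5.0))
```

A request that stays within its limits passes through unchanged, while one that exceeds them is stopped with an exception rather than consuming more resources.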
Flexible Caching options enable higher performance and greater uptime.
Quality helps ensure you deliver the best data possible. And built-in Governance
features provide complete visibility, traceability and control.
Manager is the administrative console used to set up user IDs, passwords, and
security profiles, as well as view logs, check server activity, and more.
Deployment Manager lets you quickly and easily migrate entire projects in
a single step, including their resources, cache settings, server configurations,
security profiles, and more across instances to simplify and automate your
development lifecycle.
Monitor provides a comprehensive, real-time view of your TIBCO Data
Virtualization cluster. Monitor displays all the pertinent system health indicators
required to help your IT operations staff guide corrective actions.
Active Cluster works in conjunction with load balancers to provide high
availability and greater scale to meet your challenging service level agreements.
Active Cluster simplifies complex operations management by automatically
sharing resources, adjusting capacity on demand, and more.
DATASHEET | 3
DEPLOYMENT OPTIONS
TIBCO provides multiple options for deploying TIBCO Data Virtualization. You can
install and run it on-premises, in your private cloud environment, or at a public
cloud provider such as Amazon Web Services (AWS), Google Cloud Platform, or
Microsoft Azure. For AWS, TIBCO also provides TIBCO Data Virtualization on the AWS
Marketplace to simplify and accelerate deployment.
KEY FEATURES
DEVELOPMENT ENVIRONMENT
Tables 1 through 4 summarize key TIBCO Data Virtualization development
capabilities.
Table 1. Business Directory: Access and use data sets via an intuitive,
self-service browser
FEATURE: DESCRIPTION
Search options: Find data sets using simple Google-like searches as well as advanced parameterized searches.
Browse: Find data sets using attributes such as data type, format, category, and more.
Security: See only the data sets you’re supposed to see based on your TIBCO Data Virtualization enforced security profile.
Consumption: Use your favorite analytic/BI tools to consume data sets using Business Directory access information.
Collaboration: Allow business users and IT to interact via comments to improve data quality and utilization.
Business metadata extensions: Add custom definitions, properties, links, and status codes to enrich IT metadata with business metadata that your business users understand.
Business metadata display options: Present business metadata in a shared area, an existing properties tab, or a new custom tab.
Personalization: Receive email updates of changes and comments for data you care about.
Table 2. Modeling and Transformation: Design reusable views and data services
within an agile, high-productivity development environment
FEATURE: DESCRIPTION
Graphical development environment: Graphically model data, design views and data services, build transformations, optimize queries, manage resources, and more.
Agile development methodology: Develop and refine views and data services iteratively.
Introspection: Automatically probe physical data sources and select desired resources. Optionally inspect data sources interactively.
Data discovery: Reveal data relationships across disparate entities using formal keys and fuzzy matching.
Bottom-up modeling: Design views and data services by combining data from disparate systems without worrying about underlying source access and format complexity.
Flexible modeling and transformation methods: Create views and data services graphically or via scripting languages as appropriate, using SQL, SQL Script, XQuery, XSLT, and Java functions.
XPath transformation: Establish arbitrarily complex mappings of XML schema elements to XML output.
JSON querying and transformation: Query and transform JSON data from Web services into a relational format.
Analytic functions: Use a full set of analytic functions such as CORR, COUNT, NTILE, STDDEV, and VARIANCE.
Contract-last design: Define the Java wrapper first and then develop the WSDL.
Views dependency graph: Graphically display dependencies between data sources, views, and data services.
Physical table creation: Create and drop physical tables within a designated data source.
FEATURE: DESCRIPTION
Complete repository: Manage resources such as data sources, views and data services, and procedures throughout their lifecycles.
Public metadata API: Deploy a web services metadata API for easy access and sharing.
Open API: Open access to TIBCO Data Virtualization system libraries to enable custom scripting and orchestration.
FEATURE: DESCRIPTION
Migration GUI: Add project folders directly to version control systems. Check in and check out folders or individual resources to track changes. Roll back folders or resources to prior revisions.
Source control and transport: Manage artifacts and transports from development through production.
RUN-TIME ENVIRONMENT
Table 5. Federated Query Engine: Run optimized queries across data sources
FEATURE: DESCRIPTION
Federation engine: Join and aggregate data that is vertically and horizontally partitioned.
Massively Parallel Processing (MPP) Engine: Enable multi-source, big data scale virtualization workloads that dynamically distribute the work across multiple processors.
Data source: Leverage data source optimizers to ensure query accuracy and maximize query performance.
Cost-based optimizer: Use statistics to create an optimal query plan that reduces unnecessary data flow across the network.
Rule-based optimizer: Allow users to specify exactly how they want to run a particular query.
FEATURE: DESCRIPTION
Granular workload management: Control memory usage, request length, row counts, and more, as well as avoid potentially problematic requests. These controls can be implemented at the object, user group, server, and/or cluster group level.
Alert triggers: Implement resource, event, and user-defined triggers. Use a published API to handle custom Java alerts.
FEATURE: DESCRIPTION
Complete set of join algorithms: Automatically rewrite the query to use the most efficient join strategy (for example, hash join, sort-merge join, distributed semi-join, data-ship join, union-join flip, nested-loop join, and others).
Single-source join grouping: Run data-reducing joins at the data source rather than bringing the data across the network.
Predicate push-down: Push WHERE clause predicates down into the underlying data source to reduce data at the source.
Full and partial aggregate push down: Push aggregate functions down to source when applicable.
Serialization or parallelization of join operators: Determine the proper join order and join algorithms based on estimated cardinality and join results derived from data distribution histograms.
Constraint propagation: Distribute filters to multiple branches of the query plan, allowing data reduction by a single filter to potentially occur in multiple data sources.
Scan multiplexing: Reuse data sets that appear in multiple places in a single query plan.
Empty scan detection: Detect logical conditions that would produce empty data sets, and then eliminate those parts of the query plan prior to processing.
Blocking operator prefetching: Proactively run parts of the query plan that must finish before other parts of the query plan can continue, thereby increasing the overall responsiveness of the query.
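
To make predicate push-down concrete, here is a minimal Python sketch that uses an in-memory SQLite database as a stand-in for a remote data source. It is an illustration of the general technique only, not how the TIBCO Data Virtualization optimizer is implemented; the `orders` table, its data, and the function names are hypothetical.

```python
import sqlite3

# Stand-in for a remote data source; the table and data are made up.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EMEA", 120.0), (2, "APAC", 75.5), (3, "EMEA", 310.0), (4, "AMER", 42.0)],
)


def fetch_without_pushdown(region):
    """Pull every row from the source, then filter in the federation layer."""
    transferred = source.execute("SELECT id, region, amount FROM orders").fetchall()
    kept = [row for row in transferred if row[1] == region]
    return kept, len(transferred)


def fetch_with_pushdown(region):
    """Send the WHERE predicate to the source so only matching rows are transferred."""
    transferred = source.execute(
        "SELECT id, region, amount FROM orders WHERE region = ?", (region,)
    ).fetchall()
    return transferred, len(transferred)


rows_a, moved_a = fetch_without_pushdown("EMEA")
rows_b, moved_b = fetch_with_pushdown("EMEA")
```

Both calls return the same rows, but the second moves only the matching rows out of the source, which is the data reduction the feature description refers to.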
FEATURE: DESCRIPTION
Results streaming: Stream data to consuming applications as results are processed at the underlying sources.
Hybrid memory and disk use: Balance memory and disk use for optimal performance.
Native XML support: Support XML internally for fast parsing and joins.
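
The idea behind results streaming can be sketched in Python with a generator that hands rows to the consumer batch by batch instead of waiting for the full result. This is a generic illustration under assumed names (`stream_rows`, the `readings` table), not the product's internal mechanism.

```python
import sqlite3
from typing import Iterator


def stream_rows(conn: sqlite3.Connection, sql: str, batch_size: int = 2) -> Iterator[tuple]:
    """Yield rows batch by batch instead of materializing the full result."""
    cursor = conn.execute(sql)
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        yield from batch


# Made-up sample data standing in for an underlying source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (sensor TEXT, value REAL)")
conn.executemany("INSERT INTO readings VALUES (?, ?)", [("a", 1.0), ("b", 2.5), ("c", 4.0)])

stream = stream_rows(conn, "SELECT sensor, value FROM readings ORDER BY sensor")
first = next(stream)      # available before the remaining rows are fetched
remaining = list(stream)
```

The consumer can start working on `first` while later batches are still being read from the source.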
FEATURE: DESCRIPTION
Native bulk extract: Use native bulk EXTRACT functions in the source to extract data more efficiently than using SELECTs.
Native bulk load: Use native LOAD functions in the destination to load and refresh caches more efficiently than using INSERTs.
Multi-table caching: Avoid contention on cache refreshes, accelerate refresh speeds, and maintain non-stop availability using multiple tables per cache view.
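
One common way to use multiple tables per cache view is to refresh a standby copy while readers keep using the active one, then switch over. The Python sketch below shows that pattern under stated assumptions; the class `TwoTableCache` and its loader are hypothetical, and the source does not say that TIBCO Data Virtualization implements multi-table caching this way.

```python
from typing import Callable, List


class TwoTableCache:
    """Keeps two copies of a cached view; readers only see the active one."""

    def __init__(self, loader: Callable[[], List[tuple]]):
        self._loader = loader
        self._tables = [loader(), []]   # slot 0 starts as the active copy
        self._active = 0

    def read(self) -> List[tuple]:
        return self._tables[self._active]

    def refresh(self) -> None:
        standby = 1 - self._active
        self._tables[standby] = self._loader()  # load while readers use the other copy
        self._active = standby                  # switch readers over in one step


# Made-up loader that returns a new version of the data on each call.
versions = iter([[("v1",)], [("v2",)]])
cache = TwoTableCache(lambda: next(versions))
before = cache.read()
cache.refresh()
after = cache.read()
```

Because the refresh never writes to the table being read, readers do not contend with the load and the cache stays available throughout.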
Table 8. Data Access: Connect and use data from distributed sources
FEATURE: DESCRIPTION
Databases: Access popular databases via Open Database Connectivity (ODBC) and Java Database Connectivity (JDBC).
Big data: Access Apache Hadoop® through Apache Hive™, Apache Impala®, and Apache HBase®.
NoSQL and cloud databases: Access NoSQL and cloud database sources such as Amazon DynamoDB, Amazon Redshift, Cassandra, and MongoDB.
Web services: Access SOAP over HTTP, XML over HTTP, and Java Message Service (JMS) services. A message pipeline allows interjection of custom logic during the web service request and response.
Collaboration: Access collaboration apps such as email, Google Sheets, and Microsoft SharePoint.
Social media: Access social media sources such as Facebook, LinkedIn, RSS, and Twitter.
Data source tool kit: Access a set of libraries of services that can be imported into your preferred integrated development kit to facilitate and accelerate data adapter creation. Services include database mapping, data type mapping, syntax mapping, and function mapping. Together these services minimize custom code development.
FEATURE: DESCRIPTION
Database objects: Publish views for consumption through ODBC, JDBC, and ADO.NET.
Web services: Publish data services in the form of WSDL for consumption using SOAP or SOAP over JMS. A message pipeline allows interjection of custom logic during the web service request and response.
Representational State Transfer (REST): Publish data services in the REST format. REST CREATE, READ, UPDATE, and DELETE functions are supported.
FEATURE: DESCRIPTION
BI tool integration: Create temporary tables in BI tools to store filters for visualizations or reports.
DDL support for client applications: Support client applications that create tables using DDL statements via JDBC, ODBC, or ADO.NET. CREATE TABLE, CREATE TABLE AS SELECT, and CREATE TEMP TABLE syntax is supported. Multiple physical data source mappings are supported for each published database to avoid conflicts between users when creating tables.
Table 10. Security: Protect sensitive data with fine-grained, multi-level security
FEATURE: DESCRIPTION
Single sign-on: Sign on once to access all integrated data sources and consuming applications.
Column masking: Implement column masking rules to hide, replace, or obfuscate portions of a column’s value depending on a user’s level of access.
SSL over HTTP with support for mutual authentication: Mutually authenticate published services, web services data sources, and Oracle databases. Certificate-based authentication and Web Services Security (WSS) authentication are supported.
Pass-through: Use an existing user ID and password and pass through to TIBCO Data Virtualization for authentication.
Lightweight Directory Access Protocol (LDAP): Use security profiles from LDAP to authenticate user access to protected data sources.
Access management: Use TIBCO Data Virtualization as the system of record for security roles and profiles.
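
A column masking rule of the kind described above can be pictured as a function of the value and the user's level of access. The Python sketch below is purely illustrative: the function `mask_value`, the access level names, and the keep-last-four rule are assumptions, not TIBCO Data Virtualization syntax or defaults.

```python
def mask_value(value: str, access_level: str) -> str:
    """Return the column value as a user with the given access level may see it."""
    if access_level == "full":
        return value                                  # no masking
    if access_level == "partial":
        visible = value[-4:]                          # keep the last four characters
        return "*" * (len(value) - len(visible)) + visible
    return "*" * len(value)                           # hide everything for other levels


# Made-up sample value.
card = "4111111111111111"
full_view = mask_value(card, "full")
partial_view = mask_value(card, "partial")
hidden_view = mask_value(card, "none")
```

The same stored value is therefore presented in full, partially obfuscated, or fully hidden depending on who is asking.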
FEATURE: DESCRIPTION
Data lineage: Trace lineage from multiple data sources to a single data consumer.
Open API: Open access to TIBCO Data Virtualization system libraries to enable custom scripting and orchestration.
Table 12. Data Quality: Help ensure correct and complete data
FEATURE: DESCRIPTION
Standardization and conformation: Create views and data services that conform to agreed standards.
Enrichment and augmentation: Extend views and data services with additional data.
Validation: Validate data sets with users prior to publishing views and data services.
Object reuse: Share views and data services to ensure consistent data definitions.
MANAGEMENT ENVIRONMENT
Tables 13 through 15 summarize key TIBCO Data Virtualization
management capabilities.
FEATURE: DESCRIPTION
Management console options: Access the management console through TIBCO Data Virtualization Studio or a web browser.
FEATURE: DESCRIPTION
Real-time system indicators: Monitor critical system metrics and tune for optimal performance. Metrics include memory usage, query plans for currently running and past requests, data sources, and caches.
Usage metrics: Deliver usage activity detail to your reporting tool of choice via an open API.
Table 14. Active Cluster: Cluster servers to meet reliability, availability and
scalability SLAs
FEATURE: DESCRIPTION
Flexible cluster deployment: Expand capacity and improve availability simply by adding new nodes to an existing cluster or adding new clusters.
Shared cluster cache: Improve overall cluster performance by coalescing redundant data source hits and reducing data latency.
Restore cluster nodes: Restore cluster nodes using Web Manager, the cluster_util script, or the API.
FEATURE: DESCRIPTION
Cache setting migration: Migrate or promote cache table names, caching methods, refresh methods, and cache policies and schedules.
User/group migration: Migrate or promote user and group IDs, security profiles, and other user and group information.
TECHNICAL SPECIFICATIONS
Tables 16 through 18 summarize TIBCO Data Virtualization technical specifications
and standards support.
Table 16. Supported Platforms: Leverage standard operating systems and high
performance servers
TIBCO DATA
VIRTUALIZATION
STUDIO CLIENTS SERVER JVM
• Microsoft Windows
• Solaris
Table 17. Data Source Adapters: Simplify data access while ensuring query
accuracy and performance
• Elasticsearch • Sybase IQ
• IBM DB2
• Informix
• LDAP
• Microsoft Access
• Microsoft Excel
• Microsoft Dynamics GP
• NetSuite CRM
• NetSuite ERP
• Oracle EBS
• Salesforce.com
• Siebel
• HubSpot
• Marketo
• Oracle Eloqua
• Microsoft SharePoint Excel services
• Cassandra • RSS
• Couchbase • Twitter
• Google BigQuery
• HBase
• MongoDB
• Sybase ASE
• Sybase IQ
• Teradata
• Vertica
• Sybase IQ
• Teradata
• Vertica
WEB SERVICES PROTOCOLS   SOFTWARE DEVELOPMENT STANDARDS
• OData • Unicode
• SOAP • J2EE
• WSDL • JNDI
• WSI
• XPath
• XQuery
• XSLT
TIBCO fuels digital business by enabling better decisions and faster, smarter actions through the TIBCO
Connected Intelligence Cloud. From APIs and systems to devices and people, we interconnect everything,
capture data in real time wherever it is, and augment the intelligence of your business through analytical insights.
Thousands of customers around the globe rely on us to build compelling experiences, energize operations, and
propel innovation. Learn how TIBCO makes digital smarter at www.tibco.com.

Global Headquarters
3307 Hillview Avenue
Palo Alto, CA 94304
+1 650-846-1000 TEL
+1 800-420-8450
+1 650-846-1005 FAX
www.tibco.com

©2017–2018, TIBCO Software Inc. All rights reserved. TIBCO, the TIBCO logo, and Enterprise Message Service are trademarks or registered trademarks of TIBCO
Software Inc. or its subsidiaries in the United States and/or other countries. Apache Drill, Hadoop, HBase, Hive, Impala, and Subversion are trademarks of The
Apache Software Foundation in the United States and/or other countries. All other product and company names and marks in this document are the property of
their respective owners and mentioned for identification purposes only.
11/14/18