ORACLE DATA MINING

Aljon E. Antiola
Technological Institute of the Philippines
College of Information Technology Education
A Research Requirement in CTI IT22FB5

INTRODUCTION
Too much data and not enough information: this is a problem facing many businesses and industries. Most businesses have an enormous amount of data with a great deal of information hiding within it, and "hiding" is usually exactly what it is doing. So much data exists that it overwhelms traditional methods of data analysis. Data mining provides a way to get at the information buried in the data. It creates models that find hidden patterns in large, complex collections of data, patterns that sometimes elude traditional statistical approaches because of the large number of attributes, the complexity of the patterns, or the difficulty of performing the analysis.

Making the entire data mining process work in a reproducible and reliable way is challenging; it may involve automation and transfers across servers, data repositories, applications, and tools. For example, some data mining tools require that data be exported from the corporate database and converted to the tool's format, and the data mining results must then be imported back into the database. Removing or reducing these obstacles can enable data mining to be used more frequently to extract more valuable information and, in many cases, to make a significant impact on the bottom line of an enterprise. Data mining in the database makes the data movement required by tools that do not operate in the database unnecessary and makes it much easier to mine up-to-date data. Also, the less data movement, the less time the entire data mining process takes. Data movement can also make data insecure; if data never leaves the database, database security protects it. In summary, data mining in the database provides the following benefits:

Less data movement
More data security
Up-to-date data

BACKGROUND
Data Mining in Database
Data mining projects usually require a significant amount of data collection and data processing before and after model building. Data tables are created by combining many different types and sources of information. Real-world data is often dirty, that is, it includes wrong or missing values, so it must often be cleaned before it can be used. Data is filtered, normalized, sampled, transformed in various ways, and eventually used as input to data mining algorithms. Up to 80% of the effort in a data mining project is often devoted to data preparation. When the data is stored as a table in a database, data preparation can be performed using database facilities. Data mining models have to be built, tested, validated, managed, and deployed in their appropriate application domain environments. The data mining results may need to be post-processed as part of domain-specific computations (for example, calculating estimated risks, expected utilities, and response probabilities) and then stored in permanent databases or data warehouses.
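As one illustration of preparing data with database facilities, the following is a minimal sketch in Oracle SQL. It assumes the credit_card_data table used elsewhere in this paper and two hypothetical numeric columns, income and age, which the original text does not mention. The view name credit_card_prepared is also made up for the example.

```sql
-- Hypothetical columns: income and age are assumed to exist in credit_card_data.
CREATE OR REPLACE VIEW credit_card_prepared AS
SELECT customer_id,
       credit_risk,
       -- Min-max normalization of income to the range [0, 1];
       -- NULLIF avoids division by zero when all incomes are equal.
       (income - MIN(income) OVER ())
         / NULLIF(MAX(income) OVER () - MIN(income) OVER (), 0) AS income_norm,
       -- Equal-width binning of age into 4 buckets over [18, 98);
       -- values below 18 fall in bucket 0 and values of 98 or more in bucket 5.
       WIDTH_BUCKET(age, 18, 98, 4) AS age_bin
FROM credit_card_data
-- Simple cleaning step: drop rows with missing income or age.
WHERE income IS NOT NULL
  AND age IS NOT NULL;
```

Because the result is a view, the cleaned and transformed rows stay inside the database and always reflect the current contents of the underlying table.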

ORACLE DATA MINING

Oracle Data Mining (ODM), an option to Oracle Database 11g Enterprise Edition, enables you to build and deploy applications that deliver predictive analytics and new insights. Application developers can build these applications using ODM's SQL APIs, which automatically mine Oracle data and deploy results in real time throughout the enterprise. Because the data, models, and results remain in the Oracle Database, data movement is eliminated, security is maximized, and information latency is minimized. Oracle Data Mining models can be included in SQL queries and embedded in applications to offer improved business intelligence. Data analysts can quickly access their Oracle data using the Oracle Data Miner 11g Release 2 graphical user interface and explore their data to find patterns, relationships, and hidden insights. Oracle Data Mining provides a collection of in-database data mining algorithms that solve a wide range of business problems. Anyone who can access data stored in an Oracle Database can access Oracle Data Mining results (predictions, recommendations, and discoveries) using Oracle Business Intelligence Solutions.
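To make "models included in SQL queries" concrete, here is a minimal sketch. It assumes a classification model named credit_risk_model has already been built in the current schema and that the credit_card_data table has a customer_id column; both names are taken from the examples in this paper, while the column aliases are invented for the illustration.

```sql
-- Score every row of credit_card_data with an existing classification model.
SELECT customer_id,
       -- Most likely class for this row, computed by the model in the database.
       PREDICTION(credit_risk_model USING *) AS predicted_risk,
       -- Probability the model assigns to that most likely class.
       PREDICTION_PROBABILITY(credit_risk_model USING *) AS probability
FROM credit_card_data
ORDER BY probability DESC;
```

The query returns ordinary rows, so any reporting or business intelligence tool that can issue SQL can consume the predictions without exporting the data.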

HISTORY
Oracle Data Mining was first introduced in 2002, and its releases are named according to the corresponding Oracle Database release:

Oracle Data Mining 9iR2 (9.2.0.1.0, May 2002)
Oracle Data Mining 10gR1 (10.1.0.2.0, February 2004)
Oracle Data Mining 10gR2 (10.2.0.1.0, July 2005)
Oracle Data Mining 11gR1 (11.1, September 2007)
Oracle Data Mining 11gR2 (11.2, September 2009)

Oracle Data Mining is a logical successor of the Darwin data mining toolset developed by Thinking Machines Corporation in the mid-1990s and later distributed by Oracle after its acquisition of Thinking Machines in 1999. However, the product itself is a complete redesign and rewrite from the ground up: while Darwin was a classic GUI-based analytical workbench, ODM offers a data mining development and deployment platform integrated into the Oracle database, along with the GUI.

FUNCTIONALITY
As of release 11gR1, Oracle Data Mining contains the following data mining functions:

Data transformation and model analysis:
    Data sampling, binning, discretization, and other data transformations.
    Model exploration, evaluation, and analysis.
Feature selection (Attribute Importance):
    Minimum description length (MDL).
Classification:
    Naive Bayes (NB).
    Generalized linear model (GLM) for logistic regression.
    Support Vector Machine (SVM).
    Decision Trees (DT).
Anomaly detection:
    One-class Support Vector Machine (SVM).
Regression:
    Support Vector Machine (SVM).
    Generalized linear model (GLM) for multiple regression.
Clustering:
    Enhanced k-means (EKM).
    Orthogonal Partitioning Clustering (O-Cluster).
Association rule learning:
    Itemsets and association rules (AM).
Feature extraction:
    Non-negative matrix factorization (NMF).
Text and spatial mining:
    Combined text and non-text columns of input data.
    Spatial/GIS data.

INPUT SOURCES AND DATA PREPARATION

Most Oracle Data Mining functions accept as input one relational table or view. Flat data can be combined with transactional data through the use of nested columns, enabling mining of data involving one-to-many relationships (e.g. a star schema). The full functionality of SQL can be used when preparing data for data mining, including dates and spatial data. Oracle Data Mining distinguishes numerical, categorical, and unstructured (text) attributes. The product also provides utilities for data preparation steps prior to model building, such as outlier treatment, discretization, normalization, and binning (grouping values into ranges).
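The nested-column idea can be sketched as follows. This is an illustration only: the customers and purchases tables, their columns, and the view name are hypothetical and do not come from the original text. The sketch relies on the DM_NESTED_NUMERICAL object type and the DM_NESTED_NUMERICALS collection type that Oracle Data Mining uses for nested numerical attributes.

```sql
-- Hypothetical tables: customers (one row per customer) and
-- purchases (many rows per customer: product_name, quantity).
CREATE OR REPLACE VIEW customer_purchases_nested AS
SELECT c.customer_id,
       c.age,
       -- Collapse each customer's purchase rows into one nested column,
       -- one (attribute name, numeric value) pair per purchase row.
       CAST(COLLECT(DM_NESTED_NUMERICAL(p.product_name, p.quantity))
            AS DM_NESTED_NUMERICALS) AS purchases
FROM customers c
JOIN purchases p
  ON p.customer_id = c.customer_id
GROUP BY c.customer_id, c.age;
```

Each row of the view is then a single case (one customer) carrying both flat attributes and the one-to-many purchase data, which is the shape a single-table mining function expects.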

GRAPHICAL USER INTERFACE: ORACLE DATA MINER

Oracle Data Mining can be accessed using Oracle Data Miner, a GUI client that provides access to the data mining functions and to structured templates called Mining Activities, which automatically prescribe the order of operations, perform required data transformations, and set model parameters. The user interface also allows the automated generation of Java and/or SQL code associated with the data mining activities. The Java Code Generator is an extension to Oracle JDeveloper. There is also an independent interface, the Spreadsheet Add-In for Predictive Analytics, which enables access to the Oracle Data Mining Predictive Analytics PL/SQL package from Microsoft Excel.

ORACLE DATA MINER 11G RELEASE 2

The free Oracle Data Miner GUI is an extension to Oracle SQL Developer 3.0 that enables data analysts to work directly with data inside the database, explore the data graphically, build and evaluate multiple data mining models, apply Oracle Data Mining models to new data, and deploy Oracle Data Mining's predictions and insights throughout the enterprise. Oracle Data Miner workflows capture and document the user's analytical methodology, automate the in-database predictive analytics process, and can be saved and shared with others.

PL/SQL AND JAVA INTERFACES

Oracle Data Mining provides a native PL/SQL package (DBMS_DATA_MINING) to create, destroy, describe, apply, test, export, and import models. The code below illustrates a typical call to build a classification model:

    BEGIN
      DBMS_DATA_MINING.CREATE_MODEL (
        model_name          => 'credit_risk_model',
        mining_function     => DBMS_DATA_MINING.CLASSIFICATION,
        data_table_name     => 'credit_card_data',
        case_id_column_name => 'customer_id',
        target_column_name  => 'credit_risk',
        settings_table_name => 'credit_risk_model_settings');
    END;

where 'credit_risk_model' is the model name, built for the express purpose of classifying future customers' 'credit_risk', based on training data provided in the table 'credit_card_data', each case distinguished by a unique 'customer_id', with the rest of the model parameters specified through the table 'credit_risk_model_settings'.

Oracle Data Mining also supports a Java API consistent with the Java Data Mining (JDM) standard for data mining (JSR-73), which enables integration with web and Java EE applications and facilitates portability across platforms.
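The settings table named in the CREATE_MODEL call, 'credit_risk_model_settings', is not shown in the original text. The following is a minimal sketch of how such a table might be created and populated. The choice of the Decision Tree algorithm and of automatic data preparation is an assumption made for illustration, not something the paper specifies.

```sql
-- A settings table is a two-column table of setting names and values.
CREATE TABLE credit_risk_model_settings (
  setting_name  VARCHAR2(30),
  setting_value VARCHAR2(4000)
);

BEGIN
  -- Assumption for illustration: build the classifier with Decision Trees.
  INSERT INTO credit_risk_model_settings (setting_name, setting_value)
  VALUES (DBMS_DATA_MINING.ALGO_NAME, DBMS_DATA_MINING.ALGO_DECISION_TREE);

  -- Assumption for illustration: let the database prepare the data
  -- automatically (a setting available from release 11g).
  INSERT INTO credit_risk_model_settings (setting_name, setting_value)
  VALUES (DBMS_DATA_MINING.PREP_AUTO, DBMS_DATA_MINING.PREP_AUTO_ON);

  COMMIT;
END;
/
```

If no algorithm is named in the settings table, the package falls back to a default algorithm for the chosen mining function, so the table only needs rows for the settings one wants to override.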

SQL SCORING FUNCTIONS

As of release 10gR2, Oracle Data Mining contains built-in SQL functions for scoring data mining models. These single-row functions support classification, regression, anomaly detection, clustering, and feature extraction. The code below illustrates a typical usage of a classification model:

    SELECT customer_name
    FROM credit_card_data
    WHERE PREDICTION (credit_risk_model USING *) = 'LOW'
      AND customer_value = 'HIGH';

PMML
In Release 11gR2 (11.2.0.2), ODM supports the import of externally created PMML for some of the data mining models. PMML is an XML-based standard for representing data mining models.

PREDICTIVE ANALYTICS MS EXCEL ADD-IN

The PL/SQL package DBMS_PREDICTIVE_ANALYTICS automates the data mining process, including data preprocessing, model building and evaluation, and scoring of new data. The PREDICT operation is used for predicting target values (classification or regression), while EXPLAIN ranks attributes in order of influence in explaining a target column (feature selection). The new 11g feature PROFILE finds customer segments and their profiles, given a target attribute. These operations can be used as part of an operational pipeline providing actionable results or displayed for interpretation by end users.
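The EXPLAIN and PREDICT operations of DBMS_PREDICTIVE_ANALYTICS can be sketched as below. The sketch reuses the credit_card_data table and its customer_id and credit_risk columns from the earlier examples in this paper. The result table names credit_risk_explain and credit_risk_predict are hypothetical, and they must not already exist when the block runs.

```sql
DECLARE
  v_accuracy NUMBER;  -- receives the predictive confidence reported by PREDICT
BEGIN
  -- Rank the other columns by how much they help explain credit_risk.
  DBMS_PREDICTIVE_ANALYTICS.EXPLAIN(
    data_table_name     => 'credit_card_data',
    explain_column_name => 'credit_risk',
    result_table_name   => 'credit_risk_explain');

  -- Build a model behind the scenes and write a prediction for every case.
  DBMS_PREDICTIVE_ANALYTICS.PREDICT(
    accuracy            => v_accuracy,
    data_table_name     => 'credit_card_data',
    case_id_column_name => 'customer_id',
    target_column_name  => 'credit_risk',
    result_table_name   => 'credit_risk_predict');

  DBMS_OUTPUT.PUT_LINE('Predictive confidence: ' || v_accuracy);
END;
/

-- Inspect the attribute ranking produced by EXPLAIN.
SELECT attribute_name, explanatory_value, rank
FROM credit_risk_explain
ORDER BY rank;
```

Neither call requires choosing an algorithm or preparing the data by hand, which is what makes the package suitable for a spreadsheet front end.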
