0% found this document useful (0 votes)
354 views2 pages

Paper Presentation

Data mining involves using statistical analysis and machine learning techniques to extract patterns from large data sets to discover useful information. It combines techniques like statistical analysis, visualization, induction, and neural networks to explore large amounts of data and discover relationships and patterns that can help solve business problems. Data mining is useful for processing huge amounts of data stored in data warehouses to extract intelligence and knowledge in a timely manner to help with decision making. An integrated data mining architecture combines a data warehouse with data mining and OLAP servers to allow advanced analysis techniques to be fully applied to the stored data to gain insights and improve various business processes.

Uploaded by

Harpreet10
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
354 views2 pages

Paper Presentation

Data mining involves using statistical analysis and machine learning techniques to extract patterns from large data sets to discover useful information. It combines techniques like statistical analysis, visualization, induction, and neural networks to explore large amounts of data and discover relationships and patterns that can help solve business problems. Data mining is useful for processing huge amounts of data stored in data warehouses to extract intelligence and knowledge in a timely manner to help with decision making. An integrated data mining architecture combines a data warehouse with data mining and OLAP servers to allow advanced analysis techniques to be fully applied to the stored data to gain insights and improve various business processes.

Uploaded by

Harpreet10
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 2

ABSTRACT

Determining the relevant data from large data warehouses has always been a killer domain for the
companies, resulting in low quality decisions and tedious functioning of organizational activities.
The solution to this is Data Mining. Data mining develops rules and decision trees for an
organization outputting the statistical analysis of mined factors. Data mining was designed for
exploiting massive amounts of data. Data mining combines techniques including statistical
analysis, visualization, induction, and neural networks to explore large amounts of data and
discover relationships and patterns that shed light on business problems. Data mining and
warehousing provide a technology that enables the decision maker in the corporate sector/govt. to
process this huge amount of data in a reasonable amount of time to extract
intelligence/knowledge in a near real time.

The data warehouse allows the storage of data in a format that facilitates its access, but if the
tools for deriving information and/or knowledge and presenting them in a format that is useful for
decision making are not provided the whole rationale for the existence of the warehouse
disappears.

This paper reveals the research on need for information repositories and discovery of knowledge
and hence the overview of, the so hyped, Data Warehousing and Data Mining. This describes data
mining techniques and also we are giving a brief focus on some data mining algorithms. The
paper highlights the mining process with correlation, variance analysis, forecasting and cluster
analysis and so on.

An Architecture for Data Mining

To best apply these advanced techniques, they must be fully integrated with a data
warehouse as well as flexible interactive business analysis tools. Many data mining tools
currently operate outside of the warehouse, requiring extra steps for extracting,
importing, and analyzing the data. Furthermore, when new insights require operational
implementation, integration with the warehouse simplifies the application of results from
data mining. The resulting analytic data warehouse can be applied to improve business
processes throughout the organization, in areas such as promotional campaign
management, fraud detection, new product rollout, and so on. Figure 1 illustrates an
architecture for advanced analysis in a large data warehouse.
Figure 1 - Integrated Data Mining Architecture

The ideal starting point is a data warehouse containing a combination of internal data
tracking all customer contact coupled with external market data about competitor activity.
Background information on potential customers also provides an excellent basis for
prospecting. This warehouse can be implemented in a variety of relational database
systems: Sybase, Oracle, Redbrick, and so on, and should be optimized for flexible and
fast data access.

An OLAP (On-Line Analytical Processing) server enables a more sophisticated end-user


business model to be applied when navigating the data warehouse. The multidimensional
structures allow the user to analyze the data as they want to view their business –
summarizing by product line, region, and other key perspectives of their business. The
Data Mining Server must be integrated with the data warehouse and the OLAP server to
embed ROI-focused business analysis directly into this infrastructure. An advanced,
process-centric metadata template defines the data mining objectives for specific business
issues like campaign management, prospecting, and promotion optimization. Integration
with the data warehouse enables operational decisions to be directly implemented and
tracked. As the warehouse grows with new decisions and results, the organization can
continually mine the best practices and apply them to future decisions.

This design represents a fundamental shift from conventional decision support systems.
Rather than simply delivering data to the end user through query and reporting software,
the Advanced Analysis Server applies users’ business models directly to the warehouse
and returns a proactive analysis of the most relevant information. These results enhance
the metadata in the OLAP Server by providing a dynamic metadata layer that represents a
distilled view of the data. Reporting, visualization, and other analysis tools can then be
applied to plan future actions and confirm the impact of those plans.

You might also like