0% found this document useful (0 votes)
136 views11 pages

Final Solved DMW Question Bank

Uploaded by

kartikrpatil0023
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
136 views11 pages

Final Solved DMW Question Bank

Uploaded by

kartikrpatil0023
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

Solutions: Data Mining & Warehousing Question Bank

1. Define Data Warehouse.

A Data Warehouse is a central repository of integrated data from multiple sources, designed to support

decision-making and business intelligence.

2. Define Multidimensional data model

This model organizes data into a data cube format, allowing for analysis in multiple dimensions (e.g., time,

product, geography).

3. State difference between data warehouse database and OLTP database

Data warehouse databases are used for analysis and reporting, while OLTP databases are used for

transactional tasks like data entry and retrieval.

4. State Source data component

Source data components are the raw data from various operational systems, external sources, and

databases that are processed for analysis.

5. Define Data Cube Aggregation.

Data Cube Aggregation is the process of summarizing data to compute measures like totals or averages for

efficient querying.

6. Define Data Marts.

Data Marts are subsets of a data warehouse focused on specific business areas, such as sales or marketing.
Solutions: Data Mining & Warehousing Question Bank

7. State steps in KDD.

The steps in Knowledge Discovery in Databases (KDD) are:

1. Data Selection

2. Data Preprocessing

3. Data Transformation

4. Data Mining

5. Interpretation and Evaluation.

8. State steps in data cleaning.

The steps in data cleaning are:

1. Handling missing values

2. Removing duplicate data

3. Resolving inconsistencies

4. Smoothing noisy data.

9. Define Data Smoothing.

Data Smoothing involves reducing noise in data to identify underlying trends and patterns.

10. Define Classification.

Classification is a data mining process that assigns data items to predefined categories based on their

attributes.

11. Define KDD.


Solutions: Data Mining & Warehousing Question Bank

KDD, or Knowledge Discovery in Databases, is the process of discovering useful patterns and knowledge

from large datasets.

12. Define Clustering.

Clustering is a technique in data mining that groups similar data points together based on their

characteristics.

13. Define Distributed Data Mining.

Distributed Data Mining refers to mining data that is stored across multiple locations or databases.

14. Define OLTP.

OLTP (Online Transaction Processing) involves managing transactional data for day-to-day operations in

real-time.

15. State Application of Data Warehousing

Applications of Data Warehousing include:

1. Business Intelligence

2. Decision Support Systems

3. Market Research

4. Sales Analysis.

16. Define Multidimensional Data Models


Solutions: Data Mining & Warehousing Question Bank

Multidimensional Data Models organize data into cubes, allowing analysis across various dimensions like

time, product, and geography.

17. State Source data component.

Source data components include data from transactional systems, flat files, and external sources used to

feed the data warehouse.

18. State data staging component.

Data staging components include tools and processes for data extraction, transformation, and loading (ETL)

into the data warehouse.

19. Define Data reduction.

Data reduction simplifies large datasets by reducing their size without losing critical information, using

techniques like dimensionality reduction or data compression.

20. Define Text Mining.

Text Mining is the process of analyzing unstructured text data to extract meaningful information and patterns.

1. Explain Data Warehouse deployment.

Data warehouse deployment involves making a data warehouse operational. Steps include:

1. Data Integration: Collecting and integrating data from different sources.

2. ETL Process: Extracting, transforming, and loading data into the warehouse.
Solutions: Data Mining & Warehousing Question Bank

3. Physical Design: Setting up hardware and database structures.

4. Testing: Ensuring the data is accurate and meets requirements.

5. Deployment: Making the system available for end-users.

2. Explain challenges in Web Mining

Challenges in Web Mining include:

1. High Volume: Web data is vast and constantly growing.

2. Noise: Web data contains irrelevant information like advertisements.

3. Dynamic Nature: Websites frequently update, requiring continuous monitoring.

4. Privacy Concerns: User data collection must comply with privacy regulations.

3. Describe Data cleaning in data mining

Data cleaning involves identifying and correcting errors in the dataset. It includes:

1. Handling missing values.

2. Removing duplicate data.

3. Resolving inconsistencies.

4. Smoothing noisy data for better analysis.


Solutions: Data Mining & Warehousing Question Bank

1. Define Data Warehouse.

A Data Warehouse is a central repository of integrated data from multiple sources, designed to support

decision-making and business intelligence.

2. Define Multidimensional data model

This model organizes data into a data cube format, allowing for analysis in multiple dimensions (e.g., time,

product, geography).

3. State difference between data warehouse database and OLTP database

Data warehouse databases are used for analysis and reporting, while OLTP databases are used for

transactional tasks like data entry and retrieval.

4. State Source data component

Source data components are the raw data from various operational systems, external sources, and

databases that are processed for analysis.

5. Define Data Cube Aggregation.

Data Cube Aggregation is the process of summarizing data to compute measures like totals or averages for

efficient querying.

6. Define Data Marts.

Data Marts are subsets of a data warehouse focused on specific business areas, such as sales or marketing.
Solutions: Data Mining & Warehousing Question Bank

7. State steps in KDD.

The steps in Knowledge Discovery in Databases (KDD) are:

1. Data Selection

2. Data Preprocessing

3. Data Transformation

4. Data Mining

5. Interpretation and Evaluation.

8. State steps in data cleaning.

The steps in data cleaning are:

1. Handling missing values

2. Removing duplicate data

3. Resolving inconsistencies

4. Smoothing noisy data.

9. Define Data Smoothing.

Data Smoothing involves reducing noise in data to identify underlying trends and patterns.

10. Define Classification.

Classification is a data mining process that assigns data items to predefined categories based on their

attributes.

11. Define KDD.


Solutions: Data Mining & Warehousing Question Bank

KDD, or Knowledge Discovery in Databases, is the process of discovering useful patterns and knowledge

from large datasets.

12. Define Clustering.

Clustering is a technique in data mining that groups similar data points together based on their

characteristics.

13. Define Distributed Data Mining.

Distributed Data Mining refers to mining data that is stored across multiple locations or databases.

14. Define OLTP.

OLTP (Online Transaction Processing) involves managing transactional data for day-to-day operations in

real-time.

15. State Application of Data Warehousing

Applications of Data Warehousing include:

1. Business Intelligence

2. Decision Support Systems

3. Market Research

4. Sales Analysis.

16. Define Multidimensional Data Models


Solutions: Data Mining & Warehousing Question Bank

Multidimensional Data Models organize data into cubes, allowing analysis across various dimensions like

time, product, and geography.

17. State Source data component.

Source data components include data from transactional systems, flat files, and external sources used to

feed the data warehouse.

18. State data staging component.

Data staging components include tools and processes for data extraction, transformation, and loading (ETL)

into the data warehouse.

19. Define Data reduction.

Data reduction simplifies large datasets by reducing their size without losing critical information, using

techniques like dimensionality reduction or data compression.

20. Define Text Mining.

Text Mining is the process of analyzing unstructured text data to extract meaningful information and patterns.

1. Explain Data Warehouse deployment.

Data warehouse deployment involves making a data warehouse operational. Steps include:

1. Data Integration: Collecting and integrating data from different sources.

2. ETL Process: Extracting, transforming, and loading data into the warehouse.

You might also like