Advanced Topics
Advanced Topics
Key Features:
1. Data Distribution: The data is stored at different locations (nodes), which may be geographically
separated.
2. Transparency: Users interact with the system as if it is a single database, unaware of the data
distribution.
3. Autonomy: Each site can manage its own data and operations independently.
5. Fault Tolerance: The system can continue to operate even if one or more sites fail.
Fragmentation: Splitting data into fragments and storing them at different sites.
Advantages:
Example:
A multinational company may store customer data in different regional databases (India, USA, Europe),
but employees can query the data as if it's from a single source.
In summary, a DDBMS enables efficient and seamless management of data across multiple locations,
ensuring consistent access and control.
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 1/8
Here’s a comparison discussing the advantages of Distributed Database Management System
(DDBMS) and Centralized Database Management System (CDBMS):
Queries are processed locally at the nearest site, reducing response time.
Less network traffic for local queries.
3. Scalability
Easy to add more sites or databases without affecting existing system.
5. Local Autonomy
6. Geographical Distribution
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 2/8
Requires fewer hardware and software resources initially.
🔸 Summary Table
Feature DDBMS Advantages CDBMS Advantages
Reliability High (due to redundancy) Low (single point of failure)
Let me know if you’d like a comparison diagram or real-life use cases for each system.
Briefly discuss about object oriented database management system ? differentiate DDBMS and
OODBMS
🔸 Key Features:
Object Storage: Data is stored as objects, not as rows and columns.
Encapsulation: Objects contain both data (attributes) and methods (functions).
Data Storage Format Data stored in tables (relational model) Data stored as objects (like in OOP)
Spread across multiple physical
Data Location Stored centrally or locally as objects
locations
System Architecture Distributed (multiple sites, coordinated) Can be centralized or distributed
Complex Data Limited, unless extended with custom Excellent for complex data types (e.g.
Handling logic multimedia)
Programming Loosely coupled with procedural
Tightly integrated with OOP languages
Integration languages
Not focused on distribution; focused on
Transparency Supports distribution transparency
data modeling
Banking systems, telecom, distributed CAD/CAM, AI, multimedia, scientific
Use Cases
applications applications
✅ In summary:
DDBMS focuses on distributing traditional database systems across locations.
OODBMS focuses on modeling and storing complex data using object-oriented principles.
🔸 Types of Fragmentation
1. Horizontal Fragmentation
Definition: Divides a table into subsets of rows (tuples) based on certain conditions.
2. Vertical Fragmentation
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 4/8
Definition: Divides a table into subsets of columns (attributes).
Each fragment contains a subset of attributes (including a primary key for reconstruction).
Example:
A `Student` table can be split as:
Fragment 1: `StudentID`, `Name`, `DOB`
Example:
Split `Employee` table by location (horizontal), and then split each by personal/professional
attributes (vertical).
🔸 Benefits of Fragmentation
✅ Improved performance through localized data access.
✅ Reduced data transfer cost across network.
✅ Increased availability – only relevant fragments need to be accessed.
✅ Parallel query execution – fragments can be processed simultaneously.
✅ Security and privacy – only certain fragments can be exposed to specific sites/users.
🔸 Challenges in Fragmentation
❌ Complex query processing due to data being spread across sites.
❌ Maintaining consistency across fragments.
❌ Reconstruction of full database if required.
✅ In Summary:
Fragmentation is a core concept in DDBMS that helps optimize performance by splitting the database
into manageable, location-specific fragments. It comes in three types: horizontal, vertical, and hybrid,
each suitable for different application needs.
What is data warehouse ? How it is different from databases . Explain briefly about data mining
Normalization Highly normalized (for storage efficiency) De-normalized (for faster querying)
Users Application developers, users Data analysts, decision-makers
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 6/8
✅ Summary:
A data warehouse is a specialized system for storing and analyzing large volumes of historical
data.
Data mining is the process of extracting meaningful insights from data, often using a data
warehouse as the data source.
Let me know if you need diagrams or real-life examples for better clarity!
Extracts useful information from the content of web pages (text, images, videos).
Analyzes the link structure of the web (like hyperlinks between pages).
Analyzes user behavior by studying web logs, clickstreams, and browsing history.
Example: Understanding how users navigate a website to improve design or suggest content.
2. Search Engines
Improving search results using click behavior
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 7/8
3. Web Personalization
4. Business Intelligence
5. Fraud Detection
✅ Summary
Web mining uses data mining techniques to extract valuable insights from the web's content,
structure, and usage patterns.
It has wide applications in e-commerce, marketing, search engines, social media, and more,
helping organizations make smarter decisions based on web data.
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 8/8