Admtttt

Big data is categorized into structured, unstructured, and semi-structured types, each requiring different storage and processing methods. A Data Mart is a specialized subset of a data warehouse focused on specific business needs, with types including dependent, independent, and hybrid. The star schema is a common data modeling approach that enhances query performance and understanding by organizing data into a central fact table and surrounding dimension tables.

Uploaded by

vinayak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views3 pages

Admtttt

Uploaded by

vinayak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

1)Explain types of big date.

Big data can be categorized into three main types based on its structure: structured,
unstructured, and semi-structured. Each type of data is stored, processed, and analyzed
differently depending on its characteristics. Here's a breakdown:
1. Structured Data : This type of data is organized in a predefined format, typically stored in
rows and columns (like in databases or spreadsheets).
2. Unstructured Data : Data that does not have a specific structure or organized format. It’s
often more complex to process and analyze because it doesn’t fit neatly into traditional
database tables.
3. Semi-Structured Data Definition: Data that doesn’t conform to a rigid structure but still
contains some organizational properties that make it easier to process than unstructured
data.

3) What is a Data Mart.?

Data Mart is a subset of a data warehouse that is focused on a specific business line,
department, or function. It contains a smaller, more specialized collection of data tailored
to the needs of a particular group of users, making it easier to access and analyze relevant
information without dealing with the vast amount of data stored in the enterprise-wide
data warehouse.
Types of Data Marts:
1. Dependent Data Mart:
2. Independent Data Mart:
3. Hybrid Data Mart:

4 )Explain any four advantages of Dala Warehouse.

Data Warehouse Advantages Complete control over the four main areas of data management
systems: - Clean data , Query processing: multiple options ,Indexes: multiple types , Security:
data and access

5 Explain Pivot OI AP operations with example.

1 Since OLAP servers are based on multidimensional view of data, we will discuss OLAP
operations in multidimensional data.
2 Pivot Pivot (also called rotate) is a visualization operation that rotates the data axes in view
to provide an alternative data presentation.
3 a pivot operation where the item and location axes in a 2-D slice are rotated
Q2 1 Write a short note on the star schema with an example.
The most common modeling paradigm, in which the DW contains
1: a large central table(fact table) containing the bulk of the data, with no redundancy and
2. a set of smaller attendant tables (dimension table), one for each dimension.
3. The schema graph resembles a starburst, with the dimension tables displayed in a radial
pattern around the central fact table.
4 Keys in Star schema:
1) Primary Key 2) Foreign Key 3) Surrogate key

Advantages of Star schema:

1) Query Performance: Has limited no. of tables and clear join paths: Query run faster than
OLTP
2) Load performance and administration: Simple structure(Dimension tables and fact table
are separate-load get reduced),
3) Built-in referential integrity: PK of Dimension table is FK in Fact table.
4) Easily Understood: Easy or simple to understand and navigate as a dimensions joined
through only fact table
Q3 1 List end explain basic tasks involved in Data Transfomation.
Now in this section, we will consider specific types of transformation tasks which are most
commonly performed on the extracted data before being moved in the data warehouse.
Format revision
These revisions include changes to the data types and lengths of individual data fields.
Decoding of fields
When the data comes from multiple source systems, the same data items may have been
described by different field values. The most common example is the coding for gender, with
one system using 0 and 1 for male and female, another using M and F, and the other using
male, female.
Data with cryptic codes must also be decoded before being moved in the data warehouse.
Splitting of fields
Earlier legacy systems stored names and addresses in large text fields.
Merging of information This type of data transformation is neither the opposite of the
previous task nor it means merging a number of fields to form a single field; instead, it means
bringing together the relevant information from different data sources.
Character set conversion This type of data transformation is done to the textual data to
convert its character set to an agreed standard character set. Some of the legacy systems on
the mainframes may have the source data in EBCDIC characters while in other source systems
the data may be stored using the ASCII character set. So you need to convert the data from
one character set to the other.
Conversions of units Many companies have global branches. So the sales amount may be
represented in different currencies in different source systems. But before moving the data in
the data warehouse, you need to convert the figures into a common unit of measurement.
Date and time conversion The date and time values also need to be represented in a
standard format.
Summarization This type of transformation is done to derive summarized/aggregate data
from the most granular data. The summarized data will then be loaded in the data warehouse
instead of loading the most granular level of data.
Key restructuring While extracting data from the data sources, you have to form the primary
keys for the fact tables and the dimensions tables. You cannot keep the primary keys of the
source data tables as the primary keys for the fact and dimension tables because the primary
keys of the source data have built-in meaning.

Chapter 1 Database Concept
No ratings yet
Chapter 1 Database Concept
23 pages
CH 10
No ratings yet
CH 10
15 pages
Ba CH02
No ratings yet
Ba CH02
23 pages
Z Data Warehouse Concepts
No ratings yet
Z Data Warehouse Concepts
19 pages
BSC IT TB For 5th Semester (Data Warehousing - 53) Kuvempu University
No ratings yet
BSC IT TB For 5th Semester (Data Warehousing - 53) Kuvempu University
7 pages
Welcome To Data Warehouse Presentation
No ratings yet
Welcome To Data Warehouse Presentation
38 pages
Monitoring and Supporting Data Conversion
No ratings yet
Monitoring and Supporting Data Conversion
5 pages
Unit 2
No ratings yet
Unit 2
144 pages
DM Unit-1
No ratings yet
DM Unit-1
14 pages
Cs 614
No ratings yet
Cs 614
12 pages
Ravi Data Warehousing Concepts Document 1665375367
No ratings yet
Ravi Data Warehousing Concepts Document 1665375367
49 pages
Datagu
No ratings yet
Datagu
20 pages
DW Basics
No ratings yet
DW Basics
24 pages
Introduction To Data Warehousing
No ratings yet
Introduction To Data Warehousing
46 pages
DW Notes
No ratings yet
DW Notes
13 pages
Database Management System: Introduction To DBMS Ms. Deepikkaa.S
No ratings yet
Database Management System: Introduction To DBMS Ms. Deepikkaa.S
45 pages
The Need of Data Analysis
No ratings yet
The Need of Data Analysis
12 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
10 pages
4a - Database Systems
No ratings yet
4a - Database Systems
35 pages
Database Management Systems
No ratings yet
Database Management Systems
44 pages
DMW Lab File Work
No ratings yet
DMW Lab File Work
18 pages
dw4 - Dimension1
No ratings yet
dw4 - Dimension1
75 pages
MIS 385/MBA 664 Systems Implementation With DBMS/ Database Management
No ratings yet
MIS 385/MBA 664 Systems Implementation With DBMS/ Database Management
39 pages
Data Warehousing and Data Mining Unit 1,2,3 Q and A
No ratings yet
Data Warehousing and Data Mining Unit 1,2,3 Q and A
41 pages
Dbms 1
No ratings yet
Dbms 1
87 pages
DW - Chap 5
No ratings yet
DW - Chap 5
5 pages
Database Systems
No ratings yet
Database Systems
9 pages
Monitor and Support Data Conversion
No ratings yet
Monitor and Support Data Conversion
5 pages
Practicalno: 1 Introduction To Database: Data
No ratings yet
Practicalno: 1 Introduction To Database: Data
33 pages
Ch09-Data Design
No ratings yet
Ch09-Data Design
45 pages
DWH Architecture & Concepts
No ratings yet
DWH Architecture & Concepts
37 pages
6 1 DWM 2019 S
No ratings yet
6 1 DWM 2019 S
7 pages
Data Modeling: Agnivesh Kumar
100% (1)
Data Modeling: Agnivesh Kumar
21 pages
Data Base Management Sysytem
No ratings yet
Data Base Management Sysytem
26 pages
Curriculum
No ratings yet
Curriculum
10 pages
Data Warehouse 2
No ratings yet
Data Warehouse 2
33 pages
Kuvempu University Data Warehousing
No ratings yet
Kuvempu University Data Warehousing
6 pages
What Is Data Warehouse?: Explanatory Note
No ratings yet
What Is Data Warehouse?: Explanatory Note
10 pages
Data Warehousing & Data Mining - Study Material
No ratings yet
Data Warehousing & Data Mining - Study Material
27 pages
Data Warehouse Concepts
No ratings yet
Data Warehouse Concepts
11 pages
Database Management: An Introduction
No ratings yet
Database Management: An Introduction
71 pages
Data Warehouse
No ratings yet
Data Warehouse
4 pages
Dimensional Modeling
No ratings yet
Dimensional Modeling
84 pages
DW Basic Questions
No ratings yet
DW Basic Questions
9 pages
Data Warehousing and Data Mining: Sunil Paudel
No ratings yet
Data Warehousing and Data Mining: Sunil Paudel
29 pages
Data Modeling Principles
100% (1)
Data Modeling Principles
21 pages
KDnuggets The Complete Collection of Data Science Cheatsheets
No ratings yet
KDnuggets The Complete Collection of Data Science Cheatsheets
17 pages
Knowledge Discovery Analysis
No ratings yet
Knowledge Discovery Analysis
7 pages
Data Warehousin G Concepts
No ratings yet
Data Warehousin G Concepts
41 pages
Chapter Four - Data Warehouse Design: SATA Technology and Business Collage
No ratings yet
Chapter Four - Data Warehouse Design: SATA Technology and Business Collage
10 pages
Chapter 2
No ratings yet
Chapter 2
79 pages
C Language
No ratings yet
C Language
82 pages
5.data Warehouse
No ratings yet
5.data Warehouse
19 pages
Chapter 9 - Data Design PDF
No ratings yet
Chapter 9 - Data Design PDF
45 pages
Database Management Systems: (Revised by Jiin-Feng Chen, National Chengchi University For Classroom Use)
No ratings yet
Database Management Systems: (Revised by Jiin-Feng Chen, National Chengchi University For Classroom Use)
40 pages
DWH Concepts Interview Q&A
No ratings yet
DWH Concepts Interview Q&A
12 pages
DW Concepts Shiva
No ratings yet
DW Concepts Shiva
32 pages
Data Warehouse Notes
No ratings yet
Data Warehouse Notes
5 pages
Designing The Data Warehouse Aima Second Lecture
No ratings yet
Designing The Data Warehouse Aima Second Lecture
34 pages
Ontological Engineering
No ratings yet
Ontological Engineering
17 pages
Introduction To Structured Programming Topic 1 and 2
No ratings yet
Introduction To Structured Programming Topic 1 and 2
35 pages
DBMS Unit 1 Notes
No ratings yet
DBMS Unit 1 Notes
64 pages
Ai Seminar Report
No ratings yet
Ai Seminar Report
17 pages
9.online Bike Rental
No ratings yet
9.online Bike Rental
27 pages
Library Automation - An Introduction
No ratings yet
Library Automation - An Introduction
7 pages
Ai 102
No ratings yet
Ai 102
34 pages
Application Representation
No ratings yet
Application Representation
89 pages
DWM
No ratings yet
DWM
64 pages
Chapter 1: Introduction: Database System Concepts, 7 Ed
No ratings yet
Chapter 1: Introduction: Database System Concepts, 7 Ed
37 pages
Attributes &entities
No ratings yet
Attributes &entities
15 pages
OSS MCQs
No ratings yet
OSS MCQs
16 pages
CAB430 Practical Week9 - 2025
No ratings yet
CAB430 Practical Week9 - 2025
11 pages
Class X It QP CH-8,9
No ratings yet
Class X It QP CH-8,9
2 pages
System Design Deep Dive
No ratings yet
System Design Deep Dive
16 pages
Informatica: The Powercenter/Powermart
No ratings yet
Informatica: The Powercenter/Powermart
3 pages
ResuCraft Resume Builder Using NLP
No ratings yet
ResuCraft Resume Builder Using NLP
8 pages
Big Data With Hadoop and Spark - 2023-25
No ratings yet
Big Data With Hadoop and Spark - 2023-25
4 pages
The AI Behind Watson - The Technical Article - AAAI
No ratings yet
The AI Behind Watson - The Technical Article - AAAI
52 pages
ASSIGNMENT 2 (Business Analytics For Managers)
No ratings yet
ASSIGNMENT 2 (Business Analytics For Managers)
5 pages
21BCAD5C01 IDA Module 1 Notes
No ratings yet
21BCAD5C01 IDA Module 1 Notes
24 pages
Proposal PHP
No ratings yet
Proposal PHP
2 pages
Bca Curricullum
No ratings yet
Bca Curricullum
5 pages
Predicting Consumer Behavior in E-Commerce Using Recommendation Systems
No ratings yet
Predicting Consumer Behavior in E-Commerce Using Recommendation Systems
8 pages
Yigao Fang: Education
No ratings yet
Yigao Fang: Education
1 page
Whittaker NewConceptsKingdoms 1969
No ratings yet
Whittaker NewConceptsKingdoms 1969
12 pages
Yogendra Sharma Resume - AIML
No ratings yet
Yogendra Sharma Resume - AIML
3 pages
Nishant Kumar Resume
No ratings yet
Nishant Kumar Resume
1 page
Database And Computer Management: SERIES 1, #3
From Everand
Database And Computer Management: SERIES 1, #3
Elias Mutegi
No ratings yet
Basic Concepts in Data Structures
From Everand
Basic Concepts in Data Structures
K.Meenendranath Reddy
No ratings yet
Databases: System Concepts, Designs, Management, and Implementation
From Everand
Databases: System Concepts, Designs, Management, and Implementation
Jonathan Rigdon
No ratings yet

Admtttt

Uploaded by

Admtttt

Uploaded by

1)Explain types of big date.

3) What is a Data Mart.?

4 )Explain any four advantages of Dala Warehouse.

5 Explain Pivot OI AP operations with example.

Advantages of Star schema:

You might also like