0% found this document useful (0 votes)
13 views14 pages

ASSIGNMENT 6 and 7

The document discusses two types of databases: Relational Database Management Systems (RDBMS) which use structured tables and SQL, and Non-Relational Database Management Systems (NoSQL) which handle unstructured data with flexible schemas. It also covers the capabilities of Power BI in handling big data, the role of stakeholders, and the initial steps in data analysis including data preparation, cleaning, and transformation. Additionally, it explains how to replace values and create tables in Power BI.

Uploaded by

boy629709
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views14 pages

ASSIGNMENT 6 and 7

The document discusses two types of databases: Relational Database Management Systems (RDBMS) which use structured tables and SQL, and Non-Relational Database Management Systems (NoSQL) which handle unstructured data with flexible schemas. It also covers the capabilities of Power BI in handling big data, the role of stakeholders, and the initial steps in data analysis including data preparation, cleaning, and transformation. Additionally, it explains how to replace values and create tables in Power BI.

Uploaded by

boy629709
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 14

ASSIGNMENT 6&7

1.What is two type of database?


Ans: a) Rational Database (RDBMS)-The data
store is a structured tables with rows and
columns.
They use SQL(Structured Query Language) for
querying and managing data. Example include
MySQL,
PostgreSQL, and Microsoft SQL Server.
b) Non-Rational Database (NoSQL or NR DBMS)-
These database don’t use traditional table-based
structures. They are flexible and can handle
unstructured or semi-structured data types like
documents , key-value pairs, or graphs. Example
include MongoDB, Cassandra and Redis.

2.What is the difference between RDBMS and NR


DBMS?
Ans: The main differences between RDBMS
(Relational Database Management
System) and non-RDBMS (NoSQL databases) are:
SIDDARTH SINGH (BCA DS+AI)
202310101360182
1. Data Model
RDBMS: Uses a tabular data model where data is
structured into
tables with rows and columns. Relationships
between tables are
established using foreign keys.
Non-RDBMS: Uses various data models such as
key-value pairs,
document stores, wide-column stores, or graph
databases. These
models allow for more flexible and dynamic
schemas compared to
RDBMS.
2. Schema Flexibility:
RDBMS: Typically requires a predefined schema
where tables and
columns must be defined before data can be
inserted. Changes to the
schema can be complex and often require
downtime.
Non-RDBMS: Offers more flexibility with schema
design. Many
NoSQL databases support schemaless or flexible
schema approaches,

SIDDARTH SINGH (BCA DS+AI)


202310101360182
allowing developers to change the structure of
the data without
downtime.
2. Scalability:
RDBMS: Traditionally, scaling an RDBMS vertically
(adding more
CPU, RAM to a single server) can be costly and
has limits. Horizontal
scaling (adding more servers) is more complex
and often requires
sharding or partitioning.
Non-RDBMS: Designed to scale horizontally by
adding more servers
to distribute the load. This makes them suitable
for handling large
volumes of data and high throughput.
3. Query Language:
RDBMS: Uses SQL (Structured Query Language)
for defining,
querying, and manipulating data.

Non-RDBMS: Does not always use SQL; each type


of NoSQL

SIDDARTH SINGH (BCA DS+AI)


202310101360182
database has its own query language optimized
for its data model
(e.g., MongoDB uses BSON queries, Cassandra
uses CQL).
5. ACID Compliance:
RDBMS: Emphasizes ACID (Atomicity,
Consistency, Isolation,
Durability) properties to ensure data integrity,
reliability, and
transaction support.
Non-RDBMS: Provides varying levels of ACID
compliance
depending on the database type. Some NoSQL
databases sacrifice
certain ACID properties (like consistency) for
scalability and
performance gains.
6. Use Cases:
RDBMS: Well-suited for applications that require
complex queries,
transactions, and strong consistency guarantees.
Commonly used in
traditional business applications, financial
systems, and applications

SIDDARTH SINGH (BCA DS+AI)


202310101360182
where data relationships are well-defined.
Non-RDBMS: Ideal for applications with large-
scale distributed data,
real-time web applications, caching, and
situations where flexibility,
scalability, and high availability are prioritized
over strict consistency.

Ques-3 What is big data in power BI?


Ans: In Power BI, "big data"
refers to the capability of handling and
analyzing large volumes of data efficiently. Power
BI has several features and
components that enable working with big data:
1. Direct Query and Live Connection:
Power BI allows you to connect directly to data
sources such as SQL
Server, Azure SQL Database, and Azure Synapse
Analytics (formerly
SQL Data Warehouse) using Direct Query or Live
Connection mode.
This enables Power BI to query data directly from
these sources in
SIDDARTH SINGH (BCA DS+AI)
202310101360182
real-time without importing the data into the
Power BI dataset, which
is beneficial for
1 large datasets
2. Composite Models
3. Aggregations:
4. Data flows and Azure Data Lake Storage Gen2.
5. Performance Tuning

Ques-4 What is Apache Hadoop?


Ans Apache Hadoop is an open-source software
framework used for
distributed storage and processing of large
datasets across clusters of
computers. It provides a scalable, fault-tolerant
infrastructure that allows
organizations to store and process vast amounts
of data efficiently.

Ques-5 What do you mean by stakeholders in


power

SIDDARTH SINGH (BCA DS+AI)


202310101360182
BI?
Ans: Stakeholders refer to individuals or groups
who have an interest in or are
affected by the data analysis and insights
produced using Power BI reports and
dashboards. These stakeholders can include:
1. Business Users: Executives, managers, and
analysts who use Power BI to make data-driven
decisions, monitor key performance indicators
(KPIs), and
gain insights into business operations.
2. Data Analysts and Data Scientists:
Professionals responsible for creating
and maintaining Power BI reports, performing
data analysis, and deriving
actionable insights from data.
3. IT Administrators: Personnel responsible for
managing the Power BI
environment, including data governance,
security, and infrastructure
management.
4. End Users: Employees who consume reports
and dashboards created in

SIDDARTH SINGH (BCA DS+AI)


202310101360182
Power BI to perform their day-to-day tasks and
activities.
5. Customers and Clients: External parties who
may benefit from insights
generated through Power BI reports and
dashboards, such as in customer
analytics or service optimization.

Ques-6 Before analyzing the data what is the first


step
you take in data analysis in power BI?
Ans: The first step typically involves data
preparation. This step is crucial as it
ensures that the data is clean, structured, and
ready for analysis. Here are the
key tasks involved in data preparation in Power
BI:
1 Data Collection:
i)Identify and gather the relevant data sources
that you need for analysis.
ii)These could be databases, spreadsheets, web
sources, or cloud services.
2 Data Loading:
SIDDARTH SINGH (BCA DS+AI)
202310101360182
i)Import or connect to your data sources within
Power BI using its various
data connectors (e.g., SQL Server, Excel, CSV,
Azure services).
ii)Power BI allows you to import data into its own
internal data model or
establish Direct Query connections for real-time
analysis.
3)Data Cleaning and Transformation:
i) Cleanse your data to address issues such as
missing values, duplicates, and
inconsistencies.
ii) Perform transformations to prepare the data
for analysis, such as:
Data type conversion: Ensuring data types are
appropriate for
analysis.
Filtering: Removing irrelevant or incomplete data.
Merging and appending: Combining data from
multiple sources.
Calculations: Adding new columns or measures
based on existing data.
Splitting: Separating combined fields into distinct
columns.

SIDDARTH SINGH (BCA DS+AI)


202310101360182
4) Data Modeling:
5) Data Enrichment:
6) Data Validation:

Ques-7 How to replace value and how to create a


table
in power BI?
AnsAns In Power BI, you can replace values
within your data and create tables in
your reports using Power Query Editor and the
Data view respectively. Here’s
how you can do both:
Replace Values in Power BI
To replace values in Power BI, follow these steps
using Power Query Editor:
1. Load Data into Power Query Editor:
o Open your Power BI Desktop file.
o Go to the "Home" tab and
click on "Transform
data"
or "Edit Queries". This
opens the Power Query Editor.

SIDDARTH SINGH (BCA DS+AI)


202310101360182
2. Select the Column:
o In the Power Query Editor, navigate to the table
containing the
column where you want to replace values.
o Click on the column header to select it.
3. Replace Values:
o With the column selected, go to the
"Transform" tab in the
Power Query Editor.
o Click on "Replace Values"
in the toolbar.
4. Specify Replace Values:
o In the "Replace Values"
dialog box, enter the old value you want
to replace in the "Value to
Find" box.
o Enter the new value you want to replace it with
in the "Replace
with"
box.
o Click "OK" to apply the
replacement.
5. Close and Apply:

SIDDARTH SINGH (BCA DS+AI)


202310101360182
o After replacing values as needed, click on
"Close & Apply"
in
the Power Query Editor to apply the changes and
load the data into Power BI.
Create a Table in Power BI
To create a table in Power BI where you can
manually input data or import
from external sources:
1. Open Power BI Desktop:
o Launch Power BI Desktop.
2. Navigate to Data View:
o Click on the "Data" view
from the left navigation pane in Power
BI Desktop.
3. Add New Table:
In the "Data" view, click on
the "Modeling" tab
at the top.
o Click on "New Table" in
the "Tables" group.
4. Define Table Schema:
o In the formula bar that appears, start defining
your table schema. For
SIDDARTH SINGH (BCA DS+AI)
202310101360182
example, to create a table named
"Sales Data" with columns
"Date",
"Product", and
"Sales Amount", you would
write:
Sales Data = Table(
{
{#date(2023, 1, 1), "Product
A", 1000},
{#date(2023, 1, 2), "Product
B", 1500},
{#date(2023, 1, 3), "Product
A", 800}
},
{"Date",
"Product", "Sales
Amount"}
)
Replace the sample data with your actual data or
use M code functions
to fetch data dynamically.
5. Enter Data Manually:

SIDDARTH SINGH (BCA DS+AI)


202310101360182
o Alternatively, you can enter data manually by
clicking on the table
icon in the "Fields" pane,
which creates an empty table. Then, click
in each cell to input data directly.
6. Close and Apply:
o After creating your table and entering data,
click on "Close &
Apply" in the Power Query Editor to
save your changes and load the table
into BI.

SIDDARTH SINGH (BCA DS+AI)


202310101360182

You might also like