The document discusses two types of databases: Relational Database Management Systems (RDBMS) which use structured tables and SQL, and Non-Relational Database Management Systems (NoSQL) which handle unstructured data with flexible schemas. It also covers the capabilities of Power BI in handling big data, the role of stakeholders, and the initial steps in data analysis including data preparation, cleaning, and transformation. Additionally, it explains how to replace values and create tables in Power BI.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
13 views14 pages
ASSIGNMENT 6 and 7
The document discusses two types of databases: Relational Database Management Systems (RDBMS) which use structured tables and SQL, and Non-Relational Database Management Systems (NoSQL) which handle unstructured data with flexible schemas. It also covers the capabilities of Power BI in handling big data, the role of stakeholders, and the initial steps in data analysis including data preparation, cleaning, and transformation. Additionally, it explains how to replace values and create tables in Power BI.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 14
ASSIGNMENT 6&7
1.What is two type of database?
Ans: a) Rational Database (RDBMS)-The data store is a structured tables with rows and columns. They use SQL(Structured Query Language) for querying and managing data. Example include MySQL, PostgreSQL, and Microsoft SQL Server. b) Non-Rational Database (NoSQL or NR DBMS)- These database don’t use traditional table-based structures. They are flexible and can handle unstructured or semi-structured data types like documents , key-value pairs, or graphs. Example include MongoDB, Cassandra and Redis.
2.What is the difference between RDBMS and NR
DBMS? Ans: The main differences between RDBMS (Relational Database Management System) and non-RDBMS (NoSQL databases) are: SIDDARTH SINGH (BCA DS+AI) 202310101360182 1. Data Model RDBMS: Uses a tabular data model where data is structured into tables with rows and columns. Relationships between tables are established using foreign keys. Non-RDBMS: Uses various data models such as key-value pairs, document stores, wide-column stores, or graph databases. These models allow for more flexible and dynamic schemas compared to RDBMS. 2. Schema Flexibility: RDBMS: Typically requires a predefined schema where tables and columns must be defined before data can be inserted. Changes to the schema can be complex and often require downtime. Non-RDBMS: Offers more flexibility with schema design. Many NoSQL databases support schemaless or flexible schema approaches,
SIDDARTH SINGH (BCA DS+AI)
202310101360182 allowing developers to change the structure of the data without downtime. 2. Scalability: RDBMS: Traditionally, scaling an RDBMS vertically (adding more CPU, RAM to a single server) can be costly and has limits. Horizontal scaling (adding more servers) is more complex and often requires sharding or partitioning. Non-RDBMS: Designed to scale horizontally by adding more servers to distribute the load. This makes them suitable for handling large volumes of data and high throughput. 3. Query Language: RDBMS: Uses SQL (Structured Query Language) for defining, querying, and manipulating data.
Non-RDBMS: Does not always use SQL; each type
of NoSQL
SIDDARTH SINGH (BCA DS+AI)
202310101360182 database has its own query language optimized for its data model (e.g., MongoDB uses BSON queries, Cassandra uses CQL). 5. ACID Compliance: RDBMS: Emphasizes ACID (Atomicity, Consistency, Isolation, Durability) properties to ensure data integrity, reliability, and transaction support. Non-RDBMS: Provides varying levels of ACID compliance depending on the database type. Some NoSQL databases sacrifice certain ACID properties (like consistency) for scalability and performance gains. 6. Use Cases: RDBMS: Well-suited for applications that require complex queries, transactions, and strong consistency guarantees. Commonly used in traditional business applications, financial systems, and applications
SIDDARTH SINGH (BCA DS+AI)
202310101360182 where data relationships are well-defined. Non-RDBMS: Ideal for applications with large- scale distributed data, real-time web applications, caching, and situations where flexibility, scalability, and high availability are prioritized over strict consistency.
Ques-3 What is big data in power BI?
Ans: In Power BI, "big data" refers to the capability of handling and analyzing large volumes of data efficiently. Power BI has several features and components that enable working with big data: 1. Direct Query and Live Connection: Power BI allows you to connect directly to data sources such as SQL Server, Azure SQL Database, and Azure Synapse Analytics (formerly SQL Data Warehouse) using Direct Query or Live Connection mode. This enables Power BI to query data directly from these sources in SIDDARTH SINGH (BCA DS+AI) 202310101360182 real-time without importing the data into the Power BI dataset, which is beneficial for 1 large datasets 2. Composite Models 3. Aggregations: 4. Data flows and Azure Data Lake Storage Gen2. 5. Performance Tuning
Ques-4 What is Apache Hadoop?
Ans Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets across clusters of computers. It provides a scalable, fault-tolerant infrastructure that allows organizations to store and process vast amounts of data efficiently.
Ques-5 What do you mean by stakeholders in
power
SIDDARTH SINGH (BCA DS+AI)
202310101360182 BI? Ans: Stakeholders refer to individuals or groups who have an interest in or are affected by the data analysis and insights produced using Power BI reports and dashboards. These stakeholders can include: 1. Business Users: Executives, managers, and analysts who use Power BI to make data-driven decisions, monitor key performance indicators (KPIs), and gain insights into business operations. 2. Data Analysts and Data Scientists: Professionals responsible for creating and maintaining Power BI reports, performing data analysis, and deriving actionable insights from data. 3. IT Administrators: Personnel responsible for managing the Power BI environment, including data governance, security, and infrastructure management. 4. End Users: Employees who consume reports and dashboards created in
SIDDARTH SINGH (BCA DS+AI)
202310101360182 Power BI to perform their day-to-day tasks and activities. 5. Customers and Clients: External parties who may benefit from insights generated through Power BI reports and dashboards, such as in customer analytics or service optimization.
Ques-6 Before analyzing the data what is the first
step you take in data analysis in power BI? Ans: The first step typically involves data preparation. This step is crucial as it ensures that the data is clean, structured, and ready for analysis. Here are the key tasks involved in data preparation in Power BI: 1 Data Collection: i)Identify and gather the relevant data sources that you need for analysis. ii)These could be databases, spreadsheets, web sources, or cloud services. 2 Data Loading: SIDDARTH SINGH (BCA DS+AI) 202310101360182 i)Import or connect to your data sources within Power BI using its various data connectors (e.g., SQL Server, Excel, CSV, Azure services). ii)Power BI allows you to import data into its own internal data model or establish Direct Query connections for real-time analysis. 3)Data Cleaning and Transformation: i) Cleanse your data to address issues such as missing values, duplicates, and inconsistencies. ii) Perform transformations to prepare the data for analysis, such as: Data type conversion: Ensuring data types are appropriate for analysis. Filtering: Removing irrelevant or incomplete data. Merging and appending: Combining data from multiple sources. Calculations: Adding new columns or measures based on existing data. Splitting: Separating combined fields into distinct columns.
SIDDARTH SINGH (BCA DS+AI)
202310101360182 4) Data Modeling: 5) Data Enrichment: 6) Data Validation:
Ques-7 How to replace value and how to create a
table in power BI? AnsAns In Power BI, you can replace values within your data and create tables in your reports using Power Query Editor and the Data view respectively. Here’s how you can do both: Replace Values in Power BI To replace values in Power BI, follow these steps using Power Query Editor: 1. Load Data into Power Query Editor: o Open your Power BI Desktop file. o Go to the "Home" tab and click on "Transform data" or "Edit Queries". This opens the Power Query Editor.
SIDDARTH SINGH (BCA DS+AI)
202310101360182 2. Select the Column: o In the Power Query Editor, navigate to the table containing the column where you want to replace values. o Click on the column header to select it. 3. Replace Values: o With the column selected, go to the "Transform" tab in the Power Query Editor. o Click on "Replace Values" in the toolbar. 4. Specify Replace Values: o In the "Replace Values" dialog box, enter the old value you want to replace in the "Value to Find" box. o Enter the new value you want to replace it with in the "Replace with" box. o Click "OK" to apply the replacement. 5. Close and Apply:
SIDDARTH SINGH (BCA DS+AI)
202310101360182 o After replacing values as needed, click on "Close & Apply" in the Power Query Editor to apply the changes and load the data into Power BI. Create a Table in Power BI To create a table in Power BI where you can manually input data or import from external sources: 1. Open Power BI Desktop: o Launch Power BI Desktop. 2. Navigate to Data View: o Click on the "Data" view from the left navigation pane in Power BI Desktop. 3. Add New Table: In the "Data" view, click on the "Modeling" tab at the top. o Click on "New Table" in the "Tables" group. 4. Define Table Schema: o In the formula bar that appears, start defining your table schema. For SIDDARTH SINGH (BCA DS+AI) 202310101360182 example, to create a table named "Sales Data" with columns "Date", "Product", and "Sales Amount", you would write: Sales Data = Table( { {#date(2023, 1, 1), "Product A", 1000}, {#date(2023, 1, 2), "Product B", 1500}, {#date(2023, 1, 3), "Product A", 800} }, {"Date", "Product", "Sales Amount"} ) Replace the sample data with your actual data or use M code functions to fetch data dynamically. 5. Enter Data Manually:
SIDDARTH SINGH (BCA DS+AI)
202310101360182 o Alternatively, you can enter data manually by clicking on the table icon in the "Fields" pane, which creates an empty table. Then, click in each cell to input data directly. 6. Close and Apply: o After creating your table and entering data, click on "Close & Apply" in the Power Query Editor to save your changes and load the table into BI.
GmyEuBOavOruWjWEjpv8a7KMRnfi04rjGoWUUvcGijmgJbFE5VEWSJa5XXiLxzmEPn8k9QUZ9s9-5SHXnTxcOMAb4JazziTRiwYJPAdrlaDU5Efh14QNknaUuvzQde-Microsoft Power BI Certification Course