0% found this document useful (0 votes)

11 views6 pages

Assignment Python

The document provides an overview of key concepts in data analysis, including statistics, data visualization, structured and unstructured data, and the use of tools like Power BI and Python for data processing. It discusses the importance of statistical methods, logistic regression, and the differences between correlation and causation. Additionally, it outlines how to create visualizations and analyze data effectively using various techniques and tools.

Uploaded by

philomath Math

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views6 pages

Assignment Python

Uploaded by

philomath Math

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

University name - Calcutta University

Course name - Business analyst

Date - 14.08.2024
Name – ISHA BISWAS
Registration no. - CMO23

1.Seaborn :

Statistical
Statistics is a branch of math focused on collecting, organizing, and understanding numerical data. It
involves analyzing and interpreting data to solve real-life problems, using various quantitative
models. Some view statistics as a separate scientific discipline rather than just a branch of math. It
simplifies complex tasks and offers clear insights into regular activities. Statistics finds applications in
diverse fields like weather forecasting, stock market analysis, insurance, betting, and data science.
The statistics. mean() method calculates the mean (average) of the given data set. Tip: Mean = add
up all the given values, then divide by how many values there are.

Data Visualization
Data visualization is the discipline of trying to understand data by placing it in a visual context so that
patterns, trends, and correlations that might not otherwise be detected can be exposed. Data
visualization is the graphical representation of information and data. By using visual elements like
charts, graphs, and maps, data visualization tools provide an accessible way to see and understand
trends, outliers, and patterns in data.

Structured data
Structured data is data that has a standardized format for efficient access by software and humans
alike. It is typically tabular with rows and columns that clearly define data attributes. Computers can
effectively process structured data for insights due to its quantitative nature.
Here are examples of structured data systems:
* Excel files
* SQL databases
* Point-of-sale data
* Web form results
* Search engine optimization (SEO) tags
* Product directories
* Inventory control * Reservation system

Unstructured data
These files have a delimiter and either fixed or variable width where the missing values are
represented as blanks in between the delimiters. But sometimes we get data where the lines are not
fixed width, or they are just HTML, image or pdf files. Such data is known as unstructured data.
Unstructured data has an internal structure but does not contain a predetermined data model or
schema. It can be textual or non-textual. It can be human-generated or machine-generated. One of
the most common types of unstructured data is text.
2.Data visualization with tableau
What is Power BI used for in data visualization?
Power BI is a business analytics service by Microsoft used for data visualization, business intelligence,
and data analysis. It helps users: 1. Connect to various data sources
2. Create interactive visualizations (reports and dashboards)
3. Explore and analyze data 4. Share insights with others Power
BI is used for:
1. Data visualization: Create interactive charts, tables, maps, and more to represent data.
2. Business intelligence: Analyze data to inform business decisions.
3. Data mining: Discover patterns and trends in data.
4. Reporting: Create and share reports with others.
5. Dash boarding: Create custom dashboards for real-time monitoring.
6. Data storytelling: Present data insights in a clear and compelling way.
Power BI offers various features, including:
1. Data connectors (e.g., Excel, SQL, Azure)
2. Data modeling and transformation
3. Visualizations (e.g., charts, tables, maps)
4. Interactivity (e.g., filters, drill-downs)
5. Collaboration and sharing
6. Artificial intelligence (AI) and machine learning (ML) capabilities Power BI is used across various
industries and departments, such as:
1. Sales and marketing
2. Finance and accounting
3. Operations and supply chain
4. Human resources
5. Healthcare and life sciences
By using Power BI, organizations can gain insights, make data-driven decisions, and drive business
success.

How do you create a basic bar chart in Power BI?

1. Open Power BI: Launch the Power BI application on your computer or access it online.

2. Load Data: Connect to your data source (e.g., Excel file, database) or use a built-in sample dataset.

3. Create a New Visual: Click the "Visualizations" icon (a bar chart symbol) in the left sidebar and
select "Bar chart" from the dropdown menu.

4. Drag Fields: Drag the field you want to display on the x-axis (categories) to the "Axis" area.

5. Drag Values: Drag the field you want to display on the y-axis (values) to the "Values" area.

6. Customize: Adjust the chart's appearance by using the "Format" options (e.g., colors, font sizes,
titles).
7. Analyze: Explore your data by interacting with the bar chart (e.g., hover, click, filter).
What are filters in Power BI, and how do they help in data analysis?
In Power BI, filters are a way to narrow down data to a specific subset, allowing users to focus on
relevant information and gain insights. Filters help in data analysis by:

1. Reducing data volume: Filters exclude irrelevant data, making it easier to analyze and visualize.
2. Focusing on specific segments: Filters enable analysis of specific groups, such as regions, products,
or time periods.
3. Identifying trends and patterns: By applying filters, users can discover trends and patterns within
specific data segments.
4. Drilling down into details: Filters allow users to drill down into detailed data, enabling deeper
analysis.
5. Creating targeted visualizations: Filters help create visualizations that show specific data, making it
easier to communicate insights.

By applying filters, users can:

1. Analyze specific business scenarios
2. Identify areas for improvement
3. Track key performance indicators (KPIs)
4. Create targeted reports and dashboards
5. Gain deeper insights into their data
Filters are a powerful feature in Power BI, enabling users to extract valuable insights from their data
and make informed decisions.

3. INTEGRATING PYTHON WITH TABLEAU :

How can Python be used to enhance data analysis in Power BI?

Python is a very useful programming language for data analysis purposes, data science and machine
learning. With Python, you can import, transform, analyses, and visualize data from various sources
in different formats. It also boasts multiple libraries with advanced functions and algorithms for data
processing.
Microsoft Power BI is an interactive data analysis and visualization tool used for BI (business
intelligence). With Power BI, you can quickly and easily connect to, model, explore, and share data,
as well as create personalized, interactive visual reports that offer valuable insights about your
business.

Python integration with Power BI is limited to two main functionalities: data integration and analysis,
so Python can only be used in Power BI for sourcing data and creating custom visualizations.

In this article, we will show you how to:

Install and configure the Python and Power BI environment.
Use Python to import and transform data in Power BI.
Create custom visualizations using Seaborn and Matplotlib in Power BI.
Use Pandas to handle datasets in Power BI.
Reuse your existing Python source code in Power BI.
Understand the limitations of using Python in Power BI.
Use Kaggle, an open databank.
What is the process for connecting Python scripts in Power BI?
To connect Python scripts to Power BI, follow these steps:
Install Python and Required Packages: Ensure Python is installed on your system along with the
necessary packages (e.g., pandas, numpy). You can install packages using pip:

Copy code pip install pandas

numpy Enable Python Support in
Power BI:

Open Power BI Desktop.

Go to File > Options and settings > Options.
Under Global > Python scripting, specify the path to your Python executable.
Load Data Using Python Script:

In Power BI Desktop, go to Home > Get Data > More.

Select Other > Python script and click Connect.
Enter your Python script in the dialog box that appears. This script should include code to
import libraries, read data, and prepare it for Power BI. Run and Transform Data:

Power BI will execute the Python script and load the data as a Data Frame.
You can then use Power BI's data transformation tools to clean and shape the data as needed.
Visualize Data:

Use Power BI's visualization tools to create reports and dashboards based on the data processed
by your Python script. Refresh Data:

Ensure that your Python environment and scripts are properly configured to handle data refreshes if
you are using scheduled refreshes in the Power BI Service.
This process integrates Python scripts into Power BI for advanced data processing and analysis.

How do you execute Python code in a Power BI dashboard?

To run your Python script:
In the Home group of the Power BI Desktop ribbon, select Get data. In the Get Data dialog box,
select Other > Python script, and then select Connect.

4.ANALYTICS FOUNDATION USING STATISTICAL

METHODS:
What is the purpose of using statistical methods in analytics?
Statistical methods in analytics are used to collect, analyse, interpret, and present data. They help
identify patterns, relationships, and trends, enabling informed decision-making and accurate
predictions in various fields.

Data Summarization
Inference
Hypothesis Testing
Modelling Relationships
Risk Assessment
Optimizing Processes
Prediction and Forecasting
How do you calculate the mean and standard deviation of a data set?

To calculate the mean and standard deviation of a dataset, follow these steps:

1. Calculate the Mean (Average):

Step 1: Sum all the values in the dataset.
2. Calculate the Standard Deviation:
Step 1: Calculate each data point's deviation from the mean by subtracting the mean from each
value.
Step 2: Square each of these deviations.

Step 5: Take the square root of the variance to get the standard deviation. 𝑛 Step 4: Divide the
Step 3: Sum all the squared deviations.

sum of squared deviations by the number of data points n to get the variance.
What is the difference between correlation and causation in the statistical analysis?

A concise comparison between correlation and causation in a table format:

•Aspect
•Correlation
•Causation

•Definition - Measures the strength and direction of a relationship between two variables. -
Indicates that one event directly causes another.
•Implication -Shows association but does not imply one variable causes the other. -
Demonstrates that changes in one variable result from changes in another.
•Example - Ice cream sales and drowning incidents are correlated. -
Smoking causes lung cancer.
•Interpretation - Useful for identifying potential relationship -
Critical for establishing cause-effect relationships.
•Limitation -Can be misleading if misinterpreted as causation.
-Requires rigorous testing to confirm.

5.LOGISTIC REGRESSION:

What is logistic regression used for in data analysis?

Logistic regression is a data analysis technique that uses mathematics to find the relationships
between two data factors. It then uses this relationship to predict the value of one of those factors
based on the other. The prediction usually has a finite number of outcomes, like yes or no.

How do you interpret the output of a logistic regression model?

Standard interpretation of the ordered logit coefficient is that for a one unit increase in the predictor,
the response variable level is expected to change by its respective regression coefficient in the
ordered log-odds scale while the other variables in the model are held constant.
What is the difference between logistic regression and linear regression?
Linear Regression and Logistic Regression are both statistical models used for prediction, but they
differ in their approach and application:

Linear Regression:

1. Continuous Outcome: Predicts a continuous outcome variable (y) based on one or more predictor
variables (x).
2. Linear Relationship: Assumes a linear relationship between the predictors and the outcome.
3. Equation: y = β0 + β1x + ε (where β0 is the intercept, β1 is the slope, and ε is the error term)
4. Assumptions: Linearity, independence, homoscedasticity, normality, and no multicollinearity.
5. Example: Predicting house prices based on features like size, location, and number of bedrooms.

Logistic Regression:

1. Binary Outcome: Predicts a binary outcome variable (y) based on one or more predictor variables
(x).
2. Non-Linear Relationship: Assumes a non-linear relationship between the predictors and the
outcome (using the logistic function).
3. Equation: p(y=1) = 1 / (1 + e^(-z)) (where z = β0 + β1x and p(y=1) is the probability of the positive
outcome)
4. Assumptions: Independence, no multicollinearity, and linearity in the logit (not the original
variables).
5. Example: Predicting whether a customer will churn (yes/no) based on features like usage,
demographics, and satisfaction.

Osint Complete Resources
No ratings yet
Osint Complete Resources
43 pages
Pure+Moderation Brochure+General+2020+
No ratings yet
Pure+Moderation Brochure+General+2020+
20 pages
A11 BW Manual
100% (1)
A11 BW Manual
220 pages
Data Visualization Introduction-presentation (1)
No ratings yet
Data Visualization Introduction-presentation (1)
44 pages
Python in Power BI - Unleash The - Hayden Van Der Post
100% (5)
Python in Power BI - Unleash The - Hayden Van Der Post
475 pages
Data Visualization With Power BI
No ratings yet
Data Visualization With Power BI
49 pages
2-2 Report 2 PDF
No ratings yet
2-2 Report 2 PDF
12 pages
Ideas Bid - Dta Lab 2
No ratings yet
Ideas Bid - Dta Lab 2
19 pages
Power Bi Introduction & DV Lab Exp-1
No ratings yet
Power Bi Introduction & DV Lab Exp-1
106 pages
Power BI Guide
No ratings yet
Power BI Guide
46 pages
PowerBI Session 2 Notes
No ratings yet
PowerBI Session 2 Notes
12 pages
Power BI Guide
100% (2)
Power BI Guide
46 pages
Power BI PDF
No ratings yet
Power BI PDF
34 pages
Power BI Session 1
100% (2)
Power BI Session 1
34 pages
2-2 Report 2
No ratings yet
2-2 Report 2
15 pages
Data+Science+Meets+Power+BI+Transforming+Data+into+Insights+-+Learning+Overview
No ratings yet
Data+Science+Meets+Power+BI+Transforming+Data+into+Insights+-+Learning+Overview
22 pages
Lab Manual 05
No ratings yet
Lab Manual 05
33 pages
3 Data Visualization With PowerBI
No ratings yet
3 Data Visualization With PowerBI
20 pages
2 Customizing Power BI & Data Connection
No ratings yet
2 Customizing Power BI & Data Connection
7 pages
Power BI Tutorial
No ratings yet
Power BI Tutorial
15 pages
Unit 5
No ratings yet
Unit 5
29 pages
INFS5700 T2 2025 Week 2 Lecture Slides
No ratings yet
INFS5700 T2 2025 Week 2 Lecture Slides
57 pages
Data Analysis With Python, SQL &
No ratings yet
Data Analysis With Python, SQL &
18 pages
EXPERT at EXCEL - Power BI - A STE - Daniel Reed
No ratings yet
EXPERT at EXCEL - Power BI - A STE - Daniel Reed
142 pages
Dta102 Bid102
No ratings yet
Dta102 Bid102
42 pages
Introduction To Power BI
No ratings yet
Introduction To Power BI
13 pages
Power Bi
No ratings yet
Power Bi
45 pages
Power BI Overview
No ratings yet
Power BI Overview
16 pages
Powerbi Intro
No ratings yet
Powerbi Intro
9 pages
Lesson 3power BI
No ratings yet
Lesson 3power BI
25 pages
Iv Year Soc
No ratings yet
Iv Year Soc
32 pages
Introduction To Power BI
No ratings yet
Introduction To Power BI
4 pages
Power BI Bible
100% (6)
Power BI Bible
396 pages
Power BI Guide
No ratings yet
Power BI Guide
2 pages
Artificial Inteligence - 2
No ratings yet
Artificial Inteligence - 2
19 pages
PowerBI Pre Read
No ratings yet
PowerBI Pre Read
4 pages
Basic - 7 - Introduction To Power BI
No ratings yet
Basic - 7 - Introduction To Power BI
42 pages
COE201 Lab 1
No ratings yet
COE201 Lab 1
48 pages
Power BI
No ratings yet
Power BI
3 pages
MBA IV Semester Business Intelligence PracticalLab Question Bank 2024
No ratings yet
MBA IV Semester Business Intelligence PracticalLab Question Bank 2024
20 pages
DA PDF
No ratings yet
DA PDF
20 pages
File-1745922343420-Intro To Power Bi
No ratings yet
File-1745922343420-Intro To Power Bi
25 pages
AI (Module-02)
No ratings yet
AI (Module-02)
16 pages
Microsoft Power BI
No ratings yet
Microsoft Power BI
58 pages
Introduction To Power BI
No ratings yet
Introduction To Power BI
26 pages
Module 5
No ratings yet
Module 5
12 pages
Power BI Question Bank
No ratings yet
Power BI Question Bank
40 pages
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
No ratings yet
Get Data With Power BI Desktop: Angeles University Foundation College of Computer Studies
35 pages
Power BI Q&A
No ratings yet
Power BI Q&A
10 pages
Report 1
No ratings yet
Report 1
56 pages
Power Bi - HTT
No ratings yet
Power Bi - HTT
20 pages
Navigating PBI Damico Day1
No ratings yet
Navigating PBI Damico Day1
56 pages
Power BI Tutorial For Beginners - DataCamp
No ratings yet
Power BI Tutorial For Beginners - DataCamp
26 pages
Power BI Class Notes
No ratings yet
Power BI Class Notes
18 pages
PowerBI Training
No ratings yet
PowerBI Training
47 pages
Intro 2 Power BI
No ratings yet
Intro 2 Power BI
18 pages
Power Bl_Easy Steps_Pro Results
No ratings yet
Power Bl_Easy Steps_Pro Results
170 pages
DV 1 28
No ratings yet
DV 1 28
28 pages
Diploma in Microsoft Power BI For Beginners
No ratings yet
Diploma in Microsoft Power BI For Beginners
17 pages
Coco Cola Stock
No ratings yet
Coco Cola Stock
88 pages
Business Intelligence Project
No ratings yet
Business Intelligence Project
5 pages
Sales Project by Isha Biswas, (Cmo23) - 20240827 - 205144 - 0000
No ratings yet
Sales Project by Isha Biswas, (Cmo23) - 20240827 - 205144 - 0000
24 pages
Assignment 1 by ISHA BISWAS, (CMO23) - 20240827 - 210420 - 0000
No ratings yet
Assignment 1 by ISHA BISWAS, (CMO23) - 20240827 - 210420 - 0000
3 pages
Slitting and Rewinding of Aluminium Coils With Installing of Slitting Line Machine On Build Own and Operate Basis Corrigendum
No ratings yet
Slitting and Rewinding of Aluminium Coils With Installing of Slitting Line Machine On Build Own and Operate Basis Corrigendum
2 pages
Use-Cases Project
No ratings yet
Use-Cases Project
10 pages
USA BATCH IIi
No ratings yet
USA BATCH IIi
92 pages
Identifying Main Idea
No ratings yet
Identifying Main Idea
6 pages
Hitachi
No ratings yet
Hitachi
7 pages
Module1 DSDV
No ratings yet
Module1 DSDV
95 pages
Character Reference
No ratings yet
Character Reference
2 pages
PES MTech Brochure
No ratings yet
PES MTech Brochure
12 pages
Java Notes Module 4 3rd Year
No ratings yet
Java Notes Module 4 3rd Year
24 pages
Catalogo Edwards
100% (1)
Catalogo Edwards
8 pages
General Terminal Commands::cd:pwd
No ratings yet
General Terminal Commands::cd:pwd
19 pages
Foreword: 3GPP TS 36.101 V16.6.0 (2020-06) 32 Release 16
No ratings yet
Foreword: 3GPP TS 36.101 V16.6.0 (2020-06) 32 Release 16
645 pages
CWT-UWD-SD RS485 Ultrasonic Wind Speed and Direction Sensor Manual
100% (2)
CWT-UWD-SD RS485 Ultrasonic Wind Speed and Direction Sensor Manual
5 pages
Clang Integration
No ratings yet
Clang Integration
12 pages
Wricef Inventory Template-1
No ratings yet
Wricef Inventory Template-1
14 pages
Mio-5377r DS (100223) 20231002134454
No ratings yet
Mio-5377r DS (100223) 20231002134454
2 pages
Thesis Asset Management Client Login
100% (2)
Thesis Asset Management Client Login
4 pages
03U0095EN
No ratings yet
03U0095EN
20 pages
OWASP SCP Quick Reference Guide - en-US
No ratings yet
OWASP SCP Quick Reference Guide - en-US
17 pages
6298 Schematics List
No ratings yet
6298 Schematics List
2 pages
SOFTWARE MANUAL DesignStudioReference RevT
No ratings yet
SOFTWARE MANUAL DesignStudioReference RevT
936 pages
Answers: Exercise 1.1
No ratings yet
Answers: Exercise 1.1
17 pages
REAKTOR 6 What Is New English 072220
No ratings yet
REAKTOR 6 What Is New English 072220
34 pages
24p Syed Akhmal Syed Jamalil
No ratings yet
24p Syed Akhmal Syed Jamalil
38 pages
Jumat 10 Feb 2023 Diterima Sarana
No ratings yet
Jumat 10 Feb 2023 Diterima Sarana
1,535 pages
Week 04 Data Base Design: Database System
No ratings yet
Week 04 Data Base Design: Database System
47 pages
Control Engineering Completion
No ratings yet
Control Engineering Completion
20 pages
FUJITSU Mainboard D3401-B ATX: Data Sheet
No ratings yet
FUJITSU Mainboard D3401-B ATX: Data Sheet
6 pages
3a-105230 PBR 33 RH
No ratings yet
3a-105230 PBR 33 RH
1 page

Assignment Python

Uploaded by

Assignment Python

Uploaded by

University name - Calcutta University

Course name - Business analyst

How do you create a basic bar chart in Power BI?

By applying filters, users can:

3. INTEGRATING PYTHON WITH TABLEAU :

How can Python be used to enhance data analysis in Power BI?

In this article, we will show you how to:

Copy code pip install pandas

Open Power BI Desktop.

In Power BI Desktop, go to Home > Get Data > More.

How do you execute Python code in a Power BI dashboard?

4.ANALYTICS FOUNDATION USING STATISTICAL

1. Calculate the Mean (Average):

A concise comparison between correlation and causation in a table format:

What is logistic regression used for in data analysis?

How do you interpret the output of a logistic regression model?

You might also like