0% found this document useful (0 votes)
18 views7 pages

23MMAT16 Suba 1

Uploaded by

JOEL KISHOR J H
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views7 pages

23MMAT16 Suba 1

Uploaded by

JOEL KISHOR J H
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

PYTHON FOR DATA

SCIENCE
INTERNSHIP REPORT
By
S.
Subhasini
Reg No.23083206511012016

Under the guidance of


Dr. Karthika
Banu
Inspire Softech
Solutions
Chennai
CONTENTS

1. WORKING WITH MICROSOFT EXCEL


2. PYTHON ESSENTIALS
3. WORKING WITH NUMPY AND PANDAS
4. DATA SCIENCE IN REAL TIME
APPLICATIONS
5. DATA VISUALIZATION
6. DATA SCIENCE COMPONENTS
WORKING WITH MICROSOFT EXCEL
Excel is a spreadsheet application developed by microsoft widely used
for data organization, analysis and visualization.

KEY FEATURES

• Spreadsheet layout
• Using formulas and functions(e.g., SUM, AVERAGE)
• Data analysis and visualization
• Pivot tables and pivot chats
• Conditional formatting

OPERATORS IN EXCEL

• Arithmetic operator
• Comparision or Relational operator
• Logical operator
PYTHON ESSENTIALS
Python is a widely-used programming language. It is a popular programming
language that's used for a variety of tasks, including data analysis, machine
learning and web development.

CONCEPTS OF ESSENTIAL PYTHON

Variables and operators


• Containers that stores data values
• Addition, Subtraction, Multiplication, Division
Control Structures
• Conditional Statement (if/else)
• Loops(for, while)
Data Structure
• Lists
• Sets
Modules and Packages
• Importing modules
• Creating and using packages
File Input/Output
• Reading and writing text files
• CSV,ISON, and other file formats
WORKING WITH NUMPY AND PANDAS
Pandas and NumPy are both Python libraries that are widely used for data analysis and manipulation.

NumPy Essentials Pandas Essentials

 Data Structures
 Arrays
• Create
• Series(1D labelled array)
• Operate on multidimensional arrays. • Data frame(2D labelled array)

 Data Types  Data Manipulation


• Understand NumPy data types (e.g., in. float, • Filtering, sorting, grouping and merging data
complex) • Handling missing data
 Array Indexing  Data Analysis
• Access and modify array elements. • Statistical function(e.g., mean)
• Data visualization integration
 Array Operations
• Perform element-wise Operations (e.g.,  Data Input/Output
arithmetic, comparison) • Reading and Writing CSV, Excel, JSON and other file
format
​Key Functions​​of NumPy and Pandas
• 'Numpy.array()'​​
• 'Pandas pivot_table()'
DATA SCIENCE IN REAL TIME APPLICATION
Data science plays a critical role in sales and marketing by providing insights
into customer behaviour, preference and trends. It uses various tools and
techniques to extract meaningful data from raw data. Data science is also known
as the future of artificial intelligence.

DATA SCIENCE APPLICATIONS

Healthcare
• Predictive analytics for diseases diagnosis and patient outcomes
• Personalized medicine and treatment recommendations
Gaming
• Data science improves online gaming experiences
Fintech
• Data science creates credit reports and financial profile
Transportation
• Route optimization and traffic prediction
• Real-time fleet management and logistic
Sports
• Player and team performance analysis
• Game outcome prediction and strategy optimization
DATA VISUALIZATION
Data visualization is the process of creating graphical representations
of data to the understand complex data insights, Identify patterns and
trends, Communication findings effectively, Support data-driven decision-
making.

TYPES OF DATA VISUALIZATION

Matplotlib
• Matplotlib is an easy-to-use, low-level data visualization library. It
consists of various plots like scatter plot, line plot, histogram, etc.
Seaborn
• Seaborn is a Python data visualization library based on Matplotib. It
provides a high-level interface for creating attractive graphs.
Bokeh
• Bokeh is the ideal tool to build dashboard and charts quickly with
interactivity.
Plotly
• Plotly creates more beautiful plots.
Benefits
• Improved understanding, Enhanced decision-making, better
communication, Identification of trends and patterns.

You might also like