0% found this document useful (0 votes)

28 views30 pages

Module4 DataAnalyticsLanguages

Uploaded by

Bhumika Kukade

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views30 pages

Module4 DataAnalyticsLanguages

Uploaded by

Bhumika Kukade

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 30

Module 4:

Data Analytics Languages--

Python

31/07/2024 Slide 1
History

• Python created by Guido van Rossum in the

Netherlands in 1990
• Popular programming language
• Widely used in industry and academia
• Simple, intuitive syntax
• Rich library
• Two versions in existence today Python 2 and
Python 3
eLahe Technologies 2020
31/07/2024 2
www.elahetech.com
Interpreted Language
• Python is an interpreted language as opposed
to being compiled
• An interpreter reads a high level program and
executes it
• A compiler translates the program into an
executable object code first which is
subsequently executed

eLahe Technologies 2020

31/07/2024 3
www.elahetech.com
Numpy

• NumPy is the fundamental package for scientific

computing with Python. It contains among other
things:
• a powerful N-dimensional array object
• sophisticated (broadcasting) functions
• tools for integrating C/C++ and Fortran code
• useful linear algebra, Fourier transform, and random
number capabilities

eLahe Technologies 2020

31/07/2024 4
www.elahetech.com
Matplotlib

• Matplotlib is a Python 2D plotting library

which produces publication quality figures in
a variety of hardcopy formats and interactive
environments across platforms.

eLahe Technologies 2020

31/07/2024 5
www.elahetech.com
pandas

• pandas is an open source, BSD-licensed

library providing high-performance, easy-to-
use data structures and data analysis tools
for Python

eLahe Technologies 2020

31/07/2024 6
www.elahetech.com
Python Regex

31/07/2024 Slide 7
Regular Expressions

In computing, a regular expression, also referred to as

"regex" or "regexp", provides a concise and flexible
means for matching strings of text, such as particular
characters, words, or patterns of characters. A regular
expression is written in a formal language
that can be interpreted by a regular expression
processor.

http://en.wikipedia.org/wiki/Regular_expression

31/07/2024 8
Python Regular Expressions
^ Matches the beginning of a line
$ Matches the end of the line
. Matches any character
\s Matches whitespace
\S Matches any non-whitespace character
* Repeats a character zero or more times
*? Repeats a character zero or more times (non-greedy)
+ Repeats a chracter one or more times
+? Repeats a character one or more times (non-greedy)
[aeiou] Matches a single character in the listed set
[^XYZ] Matches a single character not in the listed set
[a-z0-9] The set of characters can include a range
( Indicates where string extraction is to start
) Indicates where string extraction is to end

31/07/2024 9
The Regular Expression Module
• Before you can use regular expressions in your
program, you must import the library using
"import re"
• You can use re.search() to see if a string matches a
regular expression similar to using the find()
method for strings
• You can use re.findall() extract portions of a string
that match your regular expression similar to a
combination of find() and slicing: var[5:10]

31/07/2024 10
Wild-Card Characters

• The dot character matches any character

• If you add the asterisk character, the character is
"any number of times"
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain

31/07/2024 11
Wild-Card Characters

• The dot character matches any character

• If you add the asterisk character, the character is
"any number of times"
Match the start of the line Many times
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain
Match any character

31/07/2024 12
Wild-Card Characters

• Depending on how "clean" your data is and the

purpose of your application, you may want to
narrow your match down a bit
Match the start of the line Many times
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain
Match any character

31/07/2024 13
Greedy Matching

• The repeat characters (* and +) push outward in both

directions (greedy) to match the largest possible string
One or more
>>> import re characters
>>> x = 'From: Using the : character'
>>> y = re.findall('^F.+:', x)
>>> print y
^F.+:
['From: Using the :']
First character in the Last character in the
Why not 'From:'? match is an F match is a :

31/07/2024 14
Non-Greedy Matching

• Not all regular expression repeat codes are greedy!

If you add a ? character - the + and * chill outOne
a bit...
or more
>>> import re characters but
>>> x = 'From: Using the : character' not greedily
>>> y = re.findall('^F.+?:', x)
>>> print y
^F.+?:
['From:']
First character in the Last character in the
match is an F match is a :

31/07/2024 15
Python Slicing

31/07/2024 Slide 16
String Slices
• >>>fruit = “apple”
• >>>fruit[1:3]
• >>>’pp’
• >>>fruit[1:]
• >>>’pple’
• >>>fruit[:4]
• >>>’appl’
• >>>fruit[:]
• >>>’apple’

31/07/2024 17
List Slices
• >>>b
• [3, 4, 5, 6]
• >>>b[0:3]
• [3,4,5]
• b[0:j] with j > 3 and b[0:] are same
• >>>b[:2]
• [3,4]

31/07/2024 18
List Slices
• >>>b[2:2]
• []
• b[i:j:k] is a subset of b[i:j] with elements
picked in steps of k
• >>>b=[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
• >>>b[0:10:3]
• [1, 4, 7]

31/07/2024 19
NumPy array slicing
• 1-d array slicing and indexing is similar to
Python lists
• import numpy as np
• arr1=np.array([1,2,5,6,4,3])
• arr1[2:4]=99

• arr1
• Out[8]: array([ 1, 2, 99, 99, 4, 3])
eLahe Technologies 2020
31/07/2024 20
www.elahetech.com
NumPy array slicing

• Slicing in ndarrays is different from Python lists in that

data is not copied
• Slices are views on the original array!
• arr2=arr1[2:4]

• arr2[0]=88

• arr1
• Out[13]: array([ 1, 2, 88, 99, 4, 3])

eLahe Technologies 2020

31/07/2024 21
www.elahetech.com
Sets

31/07/2024 Slide 22
in and notin
• >>>setA= {1,3,5,7}
• >>>3 in setA
• True
• >>>3 not in setA
• False
• >>>4 not in setA
• True

31/07/2024 23
Subset
• >>>setA= {1,3,5,7}
• >>>setB= {1, 3, 5, 7, 9}
• >>>setC = {1,3,5,9,10}
• >>>setA issubset setB
• True
• >>> setA issubset setC
• False

31/07/2024 24
Superset
• >>>setA= {1,3,5,7}
• >>>setB= {1, 3, 5, 7, 9}
• >>>setC = {1,3,5,9,10}
• >>>setA issuperset setB
• False
• >>> setB issuperset setA
• True
• >>> setC issuperset setA
• False

31/07/2024 25
Set Union

• >>>setA= {1,3,5,7}
• >>>setB= {7, 5, 9}
• >>>setA.union(setB)
• {1,3,5,7,9}
• >>>setA | setB
• {1, 3, 5, 7, 9}

31/07/2024 26
Set Intersection

• >>>setA= {1,3,5,7}
• >>>setB= {7, 5, 9}
• >>>setA.intersection(setB)
• {5,7}
• >>>setA & setB
• {5, 7}

31/07/2024 27
Dictionaries

31/07/2024 Slide 28
Dictionaries

>>>
• Lists index their entries >>> purse = dict() >>>purse['money'] =
12
based on the position >>> purse['candy'] = 3
in the list >>> purse['tissues'] = 75
>>> print(purse)
• Dictionaries are like {'money': 12, 'tissues': 75, 'candy': 3}
bags - no order >>> print(purse['candy'])
3
• So we index the things >>> purse['candy'] = purse['candy'] + 2
we put in the dictionary >>> print(purse)
{'money': 12, 'tissues': 75, 'candy': 5}
with a “lookup tag”
Comparing Lists and
Dictionaries
Dictionaries are like lists except that they use keys instead of
numbers to look up values

>>> lst = list() >>> ddd = dict()

>>> lst.append(21) >>> ddd['age'] = 21
>>> lst.append(183) >>> ddd['course'] = 182
>>> print(lst) >>> print(ddd)
[21, 183] {'course': 182, 'age': 21}
>>> lst[0] = 23 >>> ddd['age'] = 23
>>> print(lst) >>> print(ddd)
[23, 183] {'course': 182, 'age': 23}

Python Refcard
100% (5)
Python Refcard
2 pages
Python 201 - (Slightly) Advanced Python Topics
No ratings yet
Python 201 - (Slightly) Advanced Python Topics
69 pages
Python Day - 10
No ratings yet
Python Day - 10
11 pages
Python Cheat Sheet Dataquest PDF
No ratings yet
Python Cheat Sheet Dataquest PDF
5 pages
Python For Data Science
100% (1)
Python For Data Science
4 pages
Python Cheat Sheet Intermediate
No ratings yet
Python Cheat Sheet Intermediate
1 page
Python Cheatsheet
100% (1)
Python Cheatsheet
1 page
3.III-Regular Expression Part-I & II 2022-23
No ratings yet
3.III-Regular Expression Part-I & II 2022-23
14 pages
Day2.2 DataAnalyticsLanguages
No ratings yet
Day2.2 DataAnalyticsLanguages
100 pages
Python Programming Language
No ratings yet
Python Programming Language
10 pages
Learn Python and Automate Network Tasks: Build Your Own Apps
100% (1)
Learn Python and Automate Network Tasks: Build Your Own Apps
10 pages
Python Progr Module 3 - 6th EC by 21EC643
No ratings yet
Python Progr Module 3 - 6th EC by 21EC643
24 pages
Py Regex
No ratings yet
Py Regex
50 pages
Module5 RegularExpressions
No ratings yet
Module5 RegularExpressions
10 pages
MLPA 1-7 Chapter
No ratings yet
MLPA 1-7 Chapter
117 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
17 - Regular Expression
No ratings yet
17 - Regular Expression
20 pages
Regular Expression
No ratings yet
Regular Expression
39 pages
Untitled
No ratings yet
Untitled
53 pages
Introduction To Python Programming
No ratings yet
Introduction To Python Programming
9 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
Python Module-3 Notes (21EC646) - Final
No ratings yet
Python Module-3 Notes (21EC646) - Final
37 pages
Sundeep Agarwal Understanding Python Re Gex
No ratings yet
Sundeep Agarwal Understanding Python Re Gex
228 pages
Python Re
No ratings yet
Python Re
101 pages
5A - Regex
No ratings yet
5A - Regex
32 pages
X - Table of Contents
No ratings yet
X - Table of Contents
5 pages
Regular
No ratings yet
Regular
9 pages
Day3.3 StringManipulation
No ratings yet
Day3.3 StringManipulation
43 pages
Module 3 Regular Expressions
No ratings yet
Module 3 Regular Expressions
8 pages
UNIT4
No ratings yet
UNIT4
67 pages
Module3 RegularExpressions
No ratings yet
Module3 RegularExpressions
8 pages
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
No ratings yet
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
17 pages
Python Ultimate Guide
100% (1)
Python Ultimate Guide
10 pages
Slicing and Indexing
No ratings yet
Slicing and Indexing
16 pages
Data Science Report
No ratings yet
Data Science Report
126 pages
BITypes Notes
No ratings yet
BITypes Notes
7 pages
Re Expression 19 and 20
No ratings yet
Re Expression 19 and 20
26 pages
Python Refcard
100% (1)
Python Refcard
2 pages
Unit-3 Python
No ratings yet
Unit-3 Python
72 pages
Unit 3 Python
No ratings yet
Unit 3 Python
72 pages
9python Simple Character Matches
No ratings yet
9python Simple Character Matches
19 pages
Regular Expressions
100% (1)
Regular Expressions
15 pages
Python Variables Collections
No ratings yet
Python Variables Collections
19 pages
Session22 To 24 PYTHON COLAB
No ratings yet
Session22 To 24 PYTHON COLAB
128 pages
PP - Module-3 Notes
No ratings yet
PP - Module-3 Notes
56 pages
A Winter Training Report On Automation Using Python
No ratings yet
A Winter Training Report On Automation Using Python
29 pages
Pythoncheatsheet: Dunder Methods
No ratings yet
Pythoncheatsheet: Dunder Methods
14 pages
Unit 2
No ratings yet
Unit 2
69 pages
Lec 2
No ratings yet
Lec 2
58 pages
13B RegExp
No ratings yet
13B RegExp
38 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
Python - Slide 5
No ratings yet
Python - Slide 5
42 pages
A Winter Training Report On Automation Using Python
No ratings yet
A Winter Training Report On Automation Using Python
30 pages
Python Mid 1 Scheme
No ratings yet
Python Mid 1 Scheme
12 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Storage Class Specifiers
No ratings yet
Storage Class Specifiers
22 pages
Form - Ant Design
No ratings yet
Form - Ant Design
1 page
Eloquent JavaScript
No ratings yet
Eloquent JavaScript
3 pages
2 - Class Objects - Access Specifier - C++
No ratings yet
2 - Class Objects - Access Specifier - C++
37 pages
NOTES
No ratings yet
NOTES
10 pages
Value Type - Reference Type
No ratings yet
Value Type - Reference Type
13 pages
Oop Concepts
No ratings yet
Oop Concepts
32 pages
Anukul Java 4
No ratings yet
Anukul Java 4
2 pages
DS Lab Manual - MyCEM (2022)
No ratings yet
DS Lab Manual - MyCEM (2022)
54 pages
12th Computer Science
No ratings yet
12th Computer Science
5 pages
PBL Report On Java
No ratings yet
PBL Report On Java
28 pages
Assignment On C
No ratings yet
Assignment On C
4 pages
Selenium Notes
No ratings yet
Selenium Notes
2 pages
To Program in Java
No ratings yet
To Program in Java
10 pages
C++ Computer Science by Christopher Topalian
No ratings yet
C++ Computer Science by Christopher Topalian
57 pages
Ahmed CP Assignment#2
No ratings yet
Ahmed CP Assignment#2
9 pages
1.a Numpy Code
No ratings yet
1.a Numpy Code
2 pages
Module 01 1102
No ratings yet
Module 01 1102
4 pages
Chapter 4
No ratings yet
Chapter 4
24 pages
Chap10 - Array
No ratings yet
Chap10 - Array
59 pages
Lab 5 PF
No ratings yet
Lab 5 PF
8 pages
Arrays C Example Programs
No ratings yet
Arrays C Example Programs
11 pages
Java Fullstack Codenera
No ratings yet
Java Fullstack Codenera
6 pages
Fullstack-Developer 20240527123203 40
No ratings yet
Fullstack-Developer 20240527123203 40
11 pages
Delegates Part 13
No ratings yet
Delegates Part 13
3 pages
The Lua Integration Guide
No ratings yet
The Lua Integration Guide
36 pages
II B.Tech Java Mid Paper-2
No ratings yet
II B.Tech Java Mid Paper-2
4 pages
Python (Programming Language)
No ratings yet
Python (Programming Language)
37 pages
Machine Problem Set
No ratings yet
Machine Problem Set
6 pages
Session5 Notes-1
No ratings yet
Session5 Notes-1
7 pages

Module4 DataAnalyticsLanguages

Uploaded by

Module4 DataAnalyticsLanguages

Uploaded by

Module 4:

Data Analytics Languages--

• Python created by Guido van Rossum in the

eLahe Technologies 2020

• NumPy is the fundamental package for scientific

eLahe Technologies 2020

• Matplotlib is a Python 2D plotting library

eLahe Technologies 2020

• pandas is an open source, BSD-licensed

eLahe Technologies 2020

In computing, a regular expression, also referred to as

• The dot character matches any character

• The dot character matches any character

• Depending on how "clean" your data is and the

• The repeat characters (* and +) push outward in both

• Not all regular expression repeat codes are greedy!

• Slicing in ndarrays is different from Python lists in that

eLahe Technologies 2020

>>> lst = list() >>> ddd = dict()

You might also like