Week 9 - Introduction To Numpy - Part 2

This document provides an introduction to NumPy programming concepts including:
1. Creating your own NumPy universal functions (ufuncs) using the frompyfunc() method.
2. Working with Boolean arrays including counting True entries, checking if any/all values are True, and aggregating counts along axes.
3. Exploring fancy indexing which allows accessing multiple array elements using arrays of indices. Fancy indexing can also be used to modify values.
4. NumPy functions for sorting arrays including np.sort(), np.argsort(), and np.partition(). Axis arguments allow sorting along rows or columns.
5. Searching arrays using np.where() and np.searchsorted(), and filtering arrays

Uploaded by

God is Good

Data Science Programming

Introduction to NumPy – Part 2

Week 9
Program Studi Teknik Informatika
Fakultas Teknik – Universitas Surabaya
Create Your Own ufunc
• To create your own ufunc, define a function as you would any normal
Python function, then convert it into a NumPy ufunc with the
frompyfunc() method.
• The frompyfunc() method takes the following arguments:
– function - the name of the function.
– inputs - the number of input arguments (arrays).
– outputs - the number of output arrays.

# Create your own ufunc for addition
import numpy as np

def myadd(x, y):
    return x + y

myadd = np.frompyfunc(myadd, 2, 1)
print(myadd([1, 2, 3, 4], [5, 6, 7, 8])) # Output: [6 8 10 12]
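As a minimal sketch of what frompyfunc() produces (the names myadd_ufunc and result_int are chosen here for illustration only), the returned object is a genuine np.ufunc, but its results have dtype object, so a cast back to a numeric type is often useful:

```python
import numpy as np

def myadd(x, y):
    return x + y

# Wrap the Python function: 2 inputs, 1 output
myadd_ufunc = np.frompyfunc(myadd, 2, 1)

result = myadd_ufunc([1, 2, 3, 4], [5, 6, 7, 8])
print(type(myadd_ufunc))   # <class 'numpy.ufunc'>
print(result.dtype)        # object

# Cast to a numeric dtype before doing further numerical work
result_int = result.astype(np.int64)
print(result_int.tolist())  # [6, 8, 10, 12]
```

Because the wrapped function still runs as ordinary Python code for every element, a ufunc built this way is convenient for broadcasting but should not be expected to run as fast as NumPy's built-in ufuncs.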
Working with Boolean Arrays
Counting entries
• Given a Boolean array, there are a host of useful operations
you can do.
• To count the number of True entries in a Boolean array,
np.count_nonzero is useful

x = np.random.randint(10, size=(3, 4))
print(x) # values are random, so the output differs between runs

# how many values less than 6?
print(np.count_nonzero(x < 6))

• Another way to get at this information is to use np.sum; in this
case, False is interpreted as 0, and True is interpreted as 1
print(np.sum(x < 6))
Counting entries
• The benefit of sum() is that like with other NumPy aggregation
functions, this summation can be done along rows or columns as well.
# how many values less than 6 in each row?
print(np.sum(x < 6, axis=1)) # Output: [3 2 3]

• If we’re interested in quickly checking whether any or all the values are
true, we can use (you guessed it) np.any() or np.all()
# are there any values greater than 8?
print(np.any(x > 8)) # Output: True

# are all values less than 10?
print(np.all(x < 10)) # Output: True

# are all values in each row less than 8?
print(np.all(x < 8, axis=1)) # Output: [False False True]
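Since the slides use a random array, the printed numbers cannot be reproduced exactly. The sketch below repeats the same operations on a fixed array (the values are chosen here purely for illustration) so that every result can be checked by hand:

```python
import numpy as np

# Fixed example values (an assumption, not the array from the slides)
x = np.array([[5, 0, 3, 3],
              [7, 9, 3, 5],
              [2, 4, 7, 6]])

mask = x < 6                       # Boolean array, same shape as x
total = np.count_nonzero(mask)     # 8 values are less than 6
per_row = np.sum(mask, axis=1)     # [4 2 2]
per_col = np.sum(mask, axis=0)     # [2 2 2 2]

any_above_8 = np.any(x > 8)            # True, because of the 9
all_below_10 = np.all(x < 10)          # True
rows_below_8 = np.all(x < 8, axis=1)   # [ True False  True]

print(total, per_row, per_col)
print(any_above_8, all_below_10, rows_below_8)
```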
Fancy Indexing
Exploring Fancy Indexing
• Fancy indexing is like the simple indexing we’ve already seen, but we pass
arrays of indices in place of single scalars.
• This allows us to very quickly access and modify complicated subsets of an
array’s values.
• Fancy indexing is conceptually simple: it means passing an array of indices to
access multiple array elements at once.
• For example, consider the following array

x = np.random.randint(100, size=10)
print(x) # Output: [97 2 8 94 77 38 18 49 91 50]
• Suppose we want to access three different elements. We can pass a single list
or array of indices to obtain the result
ind = [3, 7, 4]
print(x[ind]) # Output: [94 49 77]
Exploring Fancy Indexing
• With fancy indexing, the shape of the result reflects the shape of the
index arrays rather than the shape of the array being indexed.
ind = np.array([[3, 7],[4, 5]])
print(x[ind])
# Output:
# [[94 49]
#  [77 38]]

• Fancy indexing also works in multiple dimensions. Consider the
following array.
X = np.arange(12).reshape((3, 4))
print(X)
# Output:
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

• Like with standard indexing, the first index refers to the row, and the
second to the column
row = np.array([0, 1, 2])
col = np.array([2, 1, 3])
print(X[row, col]) # Output: [ 2 5 11]
Exploring Fancy Indexing
• Notice that the first value in the result is X[0, 2], the second is X[1, 1],
and the third is X[2, 3].
• If we combine a column vector and a row vector within the indices, we
get a two-dimensional result.
print(X[row[:, np.newaxis], col])
# Output:
# [[ 2  1  3]
#  [ 6  5  7]
#  [10  9 11]]

• For even more powerful operations, fancy indexing can be combined with the
other indexing schemes.
# We can combine fancy and simple indices
print(X[2, [2, 0, 1]]) # Output: [10 8 9]

# We can also combine fancy indexing with slicing
print(X[1:, [2, 0, 1]])
# Output:
# [[ 6  4  5]
#  [10  8  9]]
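The difference between pairing the indices and combining a column vector with a row vector comes from broadcasting of the index arrays. A minimal sketch, reusing X, row, and col from the slides (the names pairs, grid, and index_shape are introduced here for illustration):

```python
import numpy as np

X = np.arange(12).reshape((3, 4))
row = np.array([0, 1, 2])
col = np.array([2, 1, 3])

# Same-shape index arrays are paired element by element
pairs = X[row, col]                      # X[0, 2], X[1, 1], X[2, 3]

# A (3, 1) column of rows broadcast against a (3,) row of columns
grid = X[row[:, np.newaxis], col]

# The result shape is the broadcast shape of the index arrays
index_shape = np.broadcast(row[:, np.newaxis], col).shape

print(pairs.shape, grid.shape, index_shape)   # (3,) (3, 3) (3, 3)
```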
Modifying Values with Fancy Indexing
• Just as fancy indexing can be used to access parts of an array, it can
also be used to modify parts of an array.
• For example, imagine we have an array of indices and we’d like to set
the corresponding items in an array to some value.
x = np.arange(10)
i = np.array([2, 1, 8, 4])
x[i] = 99
print(x) # Output: [ 0 99 99 3 99 5 6 7 99 9]

• We can use any assignment-type operator for this. For example:
x[i] -= 10
print(x) # Output: [ 0 89 89 3 89 5 6 7 89 9]
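One caveat worth a short sketch: when the index array contains repeated indices, an in-place operator such as += is applied only once per position rather than once per occurrence. NumPy's np.add.at performs the unbuffered version. The arrays below are example values chosen for illustration:

```python
import numpy as np

i = np.array([2, 3, 3, 4, 4, 4])   # indices 3 and 4 are repeated

# In-place operator: each repeated index is incremented only once
x = np.zeros(10, dtype=int)
x[i] += 1
print(x.tolist())   # [0, 0, 1, 1, 1, 0, 0, 0, 0, 0]

# np.add.at accumulates once per occurrence of each index
y = np.zeros(10, dtype=int)
np.add.at(y, i, 1)
print(y.tolist())   # [0, 0, 1, 2, 3, 0, 0, 0, 0, 0]
```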
Sorting, Searching, and Filtering
Sorting
• This section covers algorithms related to sorting values in NumPy
arrays.
• For example, a simple selection sort repeatedly finds the minimum
value from a list, and makes swaps until the list is sorted.
• We can code this in just a few lines of Python.
def selection_sort(x):
    for i in range(len(x)):
        swap = i + np.argmin(x[i:])
        (x[i], x[swap]) = (x[swap], x[i])
    return x

x = np.array([2, 1, 4, 3, 5])
print(selection_sort(x)) # Output: [1 2 3 4 5]
Sorting
• The selection sort is useful for its simplicity, but is much too slow to be
useful for larger arrays.
• For a list of N values, it requires N loops, each of which does on the
order of N comparisons to find the swap value.
• In terms of the “big-O” notation often used to characterize these
algorithms, selection sort averages O(N^2).
• If you double the number of items in the list, the execution time will go
up by about a factor of four.
• Although Python has built-in sort and sorted functions to work with lists, we
won’t discuss them here because NumPy’s np.sort function turns out to be
much more efficient and useful for our purposes.
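The factor-of-four claim can be checked without timing anything by counting comparisons. The sketch below is a plain-Python selection sort written for this illustration (count_comparisons is a hypothetical helper, not part of NumPy); it performs exactly N(N-1)/2 comparisons:

```python
def count_comparisons(values):
    """Selection-sort a copy of values; return (sorted_list, comparisons)."""
    data = list(values)
    comparisons = 0
    for i in range(len(data)):
        smallest = i
        # Scan the unsorted tail for the minimum
        for j in range(i + 1, len(data)):
            comparisons += 1
            if data[j] < data[smallest]:
                smallest = j
        data[i], data[smallest] = data[smallest], data[i]
    return data, comparisons

sorted_100, c100 = count_comparisons(range(100, 0, -1))
sorted_200, c200 = count_comparisons(range(200, 0, -1))

print(c100, c200)     # 4950 19900
print(c200 / c100)    # roughly 4: doubling N quadruples the work
```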
Sorting
• By default np.sort uses a quicksort algorithm with O(N log N) average cost, though
mergesort and heapsort are also available.
• For most applications, the default quicksort is more than sufficient.
x = np.array([2, 1, 4, 3, 5])
print(np.sort(x)) # Output: [1 2 3 4 5]

• A related function is argsort, which instead returns the indices of the
sorted elements.
x = np.array([2, 1, 4, 3, 5])
i = np.argsort(x)
print(i) # Output: [1 0 3 2 4]

• The first element of that result gives the index of the smallest element, the
second value gives the index of the second smallest, and so on.
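A common use of these indices is to reorder the array itself, or a second array that lines up with it, via fancy indexing. A small sketch, where the scores and names arrays are made-up example data:

```python
import numpy as np

x = np.array([2, 1, 4, 3, 5])
i = np.argsort(x)

# Indexing with the argsort result reproduces np.sort(x)
print(x[i].tolist())        # [1, 2, 3, 4, 5]

# Hypothetical parallel arrays: sort names by their scores
scores = np.array([72, 95, 60, 88])
names = np.array(["Ana", "Budi", "Citra", "Dewi"])
order = np.argsort(scores)

print(order.tolist())          # [2, 0, 3, 1]
print(names[order].tolist())   # ['Citra', 'Ana', 'Dewi', 'Budi']
```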
Sorting
• A useful feature of NumPy’s sorting algorithms is the ability to sort
along specific rows or columns of a multidimensional array using the
axis argument.
X = np.random.randint(0, 10, (4, 6))
print(X) # values are random, so the output differs between runs

# sort each column of X
print(np.sort(X, axis=0))

# sort each row of X
print(np.sort(X, axis=1))
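Because the random array cannot be reproduced here, the following sketch uses a small fixed array (example values only) to show what the axis argument does. Note that each row or column is sorted independently, so any relationship between values in the same row or column is lost:

```python
import numpy as np

# Fixed example values chosen for illustration
X = np.array([[3, 1, 2],
              [0, 5, 4]])

by_column = np.sort(X, axis=0)   # sort down each column
by_row = np.sort(X, axis=1)      # sort across each row

print(by_column.tolist())   # [[0, 1, 2], [3, 5, 4]]
print(by_row.tolist())      # [[1, 2, 3], [0, 4, 5]]
```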
Sorting
• Sometimes we’re not interested in sorting the entire array, but simply want to
find the K smallest values in the array.
• NumPy provides this in the np.partition function.
• np.partition takes an array and a number K; the result is a new array with the
smallest K values to the left of the partition, and the remaining values to the
right, in arbitrary order.

x = np.array([7, 2, 3, 1, 6, 5, 4])
print(np.partition(x, 3)) # Output: [2 1 3 4 6 5 7]

• Note that the first three values in the resulting array are the three smallest in
the array, and the remaining array positions contain the remaining values.
• Within the two partitions, the elements have arbitrary order.
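Since the order inside each partition is arbitrary, code should rely only on the partition property and not on the exact printed arrangement. A minimal sketch of checking that property (variable names are illustrative):

```python
import numpy as np

x = np.array([7, 2, 3, 1, 6, 5, 4])
k = 3
part = np.partition(x, k)

# The first k entries are the k smallest values, in some order
smallest_k = np.sort(part[:k])
print(smallest_k.tolist())   # [1, 2, 3]

# The element at index k is in its final sorted position
print(int(part[k]))          # 4

# np.argpartition returns indices instead of values
idx = np.argpartition(x, k)
print(np.sort(x[idx[:k]]).tolist())   # [1, 2, 3]
```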
Searching
• We can search an array for a certain value and return the indexes
where it matches. To search an array, use the np.where() function.
arr = np.array([1, 2, 3, 4, 5, 4, 4])
x = np.where(arr == 4)
print(x) # Output: (array([3, 5, 6], dtype=int64),)

• Another example: Find the indexes where the values are even or
odd
arr = np.array([1, 2, 3, 4, 5, 6, 7, 8])
x = np.where(arr%2 == 0)
y = np.where(arr%2 == 1)
print(x) # Output: (array([1, 3, 5, 7], dtype=int64),)
print(y) # Output: (array([0, 2, 4, 6], dtype=int64),)
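The trailing comma in the printed output shows that np.where returns a tuple with one index array per dimension. A short sketch of unpacking that tuple (the 2-D array named grid is an example added here for illustration):

```python
import numpy as np

arr = np.array([1, 2, 3, 4, 5, 4, 4])
result = np.where(arr == 4)

# One-dimensional input: a tuple holding a single index array
positions = result[0]
print(positions.tolist())     # [3, 5, 6]
print(arr[result].tolist())   # [4, 4, 4]

# Two-dimensional input: one index array per axis
grid = np.array([[1, 4],
                 [4, 2]])
rows, cols = np.where(grid == 4)
print(rows.tolist(), cols.tolist())   # [0, 1] [1, 0]
```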
Searching
• The np.searchsorted() function performs a binary
search in a sorted array, and returns the index where the specified value
would be inserted to maintain the sort order.
arr = np.array([2, 7, 9, 12, 12])

# The number 8 should be inserted at index 2 to keep the array sorted.
# By default, the search starts from the left.
print(np.searchsorted(arr, 8)) # Output: 2

# Find the index where the value 10 should be inserted, starting from the right.
print(np.searchsorted(arr, 10, side='right')) # Output: 3

# Find the indexes where the values 2, 4, 6, and 11 should be inserted.
# The return value is an array: [0 1 1 3] containing the four indexes,
# where 2, 4, 6, 11 would be inserted in the original array to maintain the order.
print(np.searchsorted(arr, [2, 4, 6, 11])) # Output: [0 1 1 3]
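The side argument only makes a difference when the searched value is already present in the array. A minimal sketch using the same arr (the variable names are illustrative), followed by an insertion that confirms the array stays sorted:

```python
import numpy as np

arr = np.array([2, 7, 9, 12, 12])

# For a value that already occurs, 'left' and 'right' disagree
left = np.searchsorted(arr, 12)                  # before the existing 12s
right = np.searchsorted(arr, 12, side='right')   # after the existing 12s
print(int(left), int(right))   # 3 5

# Inserting at the returned index keeps the array sorted
pos = np.searchsorted(arr, 8)
inserted = np.insert(arr, pos, 8)
print(inserted.tolist())       # [2, 7, 8, 9, 12, 12]
```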
Filtering
• Getting some elements out of an existing array and creating a
new array out of them is called filtering.
• In NumPy, you filter an array using a boolean index list.
arr = np.array([41, 42, 43, 44])
x = arr[[True, False, True, False]]
print(x) # Output: [41 43]

• The example above returns [41 43]. Why? Because the new
array contains only the values where the filter array had the
value True, in this case, index 0 and 2.
Filtering
• Another example:
# Create a filter array that will return only even elements from the original array
arr = np.array([1, 2, 3, 4, 5, 6, 7])
filter_arr = arr % 2 == 0
newarr = arr[filter_arr]
print(filter_arr) #Output: [False True False True False True False]
print(newarr) #Output: [2 4 6]

# Create a filter array that will return only values higher than 42
arr = np.array([41, 42, 43, 44])
filter_arr = arr > 42
newarr = arr[filter_arr]
print(filter_arr) #Output: [False False True True]
print(newarr) #Output: [43 44]
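Filter arrays can also be combined. As a sketch, NumPy uses the element-wise operators & (and), | (or), and ~ (not) for this, and each condition needs its own parentheses because these operators bind more tightly than comparisons. The variable names below are illustrative:

```python
import numpy as np

arr = np.array([41, 42, 43, 44])

# Both conditions must hold
even_and_above_41 = arr[(arr > 41) & (arr % 2 == 0)]
print(even_and_above_41.tolist())   # [42, 44]

# Either condition may hold
ends_only = arr[(arr < 42) | (arr > 43)]
print(ends_only.tolist())           # [41, 44]

# Negate a filter
not_even = arr[~(arr % 2 == 0)]
print(not_even.tolist())            # [41, 43]
```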
Questions??
Exercise
• Create NRP_Nickname_ExWeek9.ipynb file.

Question 1
Create a 5×2 integer array from the range 100 to 200 so that the
difference between consecutive elements is 10. Here is an example of
what it looks like:
[[100 110]
 [120 130]
 [140 150]
 [160 170]
 [180 190]]
Exercise
Question 2
The following is a NumPy array.
np.array([[11, 22, 33], [44, 55, 66], [77, 88, 99]])
Return an array of the items in the second column of all
rows. Here is the expected display:
[22 55 88]
Exercise
Question 3
The following is a NumPy array.
np.array([[3, 6, 9, 12], [15, 18, 21, 24], [27, 30, 33, 36],
[39, 42, 45, 48], [51, 54, 57, 60]])
Return an array made of the odd rows and even columns of the given array. Here is
the expected display:
Exercise
Question 4
Add the following two NumPy arrays
arrayOne = np.array([[5, 6, 9], [21 ,18, 27]])
arrayTwo = np.array([[15 ,33, 24], [4 ,7, 1]])
Then modify the resulting array by calculating the square root of each
element. Here is the expected display:
