0% found this document useful (0 votes)

17 views49 pages

UNIT-03 Numpy

NumPy is a fundamental package for numerical computing in Python, providing a multidimensional array object (ndarray) and functions for efficient element-wise computations. It allows for batch operations on data without the need for explicit loops, enhancing performance and memory efficiency. The document covers the creation, manipulation, and arithmetic operations of ndarrays, as well as data types and indexing techniques.

Uploaded by

megha210103

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views49 pages

UNIT-03 Numpy

Uploaded by

megha210103

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 49

UNIT-03

Numpy basics arrays and vectorized computation

INTRODUCTION
• NumPy, short for Numerical Python, has long been a cornerstone of numerical computing in Python.

• NumPy is a basic package for scientific computing with Python and especially for data analysis.

NumPy contains, among other things:

• A fast and efficient multidimensional array object ndarray.

• Functions for performing element-wise computations with arrays or mathematical operations between
arrays

• Tools for reading and writing array-based datasets to disk

• Linear algebra operations, Fourier transform, and random number generation

• A mature C API to enable Python extensions and native C or C++ code to access NumPy’s data structures
and computational facilities.
• one of its primary uses in data analysis is as a container for data to be passed between algorithms and
libraries.
INTRODUCTION
• One of the reasons NumPy is so important for numerical computations in Python is because it is designed for
efficiency on large arrays of data. There are a number of reasons for this:

1. NumPy internally stores data in a contiguous block of memory, independent of other built-in Python objects.
NumPy’s library of algorithms written in the C language can operate on this memory without any type
checking or other overhead. NumPy arrays also use much less memory than built-in Python sequences.

2. NumPy operations perform complex computations on entire arrays without the need for Python for loops.
The NumPy ndarray:
A multidimensional array object
• One of the key features of NumPy is its N-dimensional array object, or ndarray, which is a fast, flexible container for large datasets in
Python. Arrays enable you to perform mathematical operations on whole blocks of data using similar syntax to the equivalent
operations between scalar elements.

• To give you a flavor of how NumPy enables batch computations with similar syntax to scalar values on built-in Python objects, I first
import NumPy and generate a small array of random data:

• In [12]: import numpy as np

• # Generate some random data

• In [13]: data = np.random.randn(2, 3)

• In [14]: data

• Out[14]:

• array([[-0.2047, 0.4789, -0.5194],

• [-0.5557, 1.9658, 1.3934]])

In [16]: data + data
Out[16]:array([[-0.4094, 0.9579, -1.0389], [-1.1115, 3.9316, 2.7868]])
In the first example, all of the elements have been multiplied by 10. In the second, the
corresponding values in each “cell” in the array have been added to each other.
First import NumPy and generate a small array of random data:

In [12]: import numpy as np

# Generate some random data
In [13]: data = np.random.randn(2, 3) An ndarray is a generic
In [14]: data multidimensional container for
Out[14]: homogeneous data; that is, all
array([[-0.2047, 0.4789, -0.5194], of the elements must be the same
[-0.5557, 1.9658, 1.3934]]) type. Every array has a shape, a tuple
I then write mathematical operations indicating the size of each dimension,
with data: and a dtype, an object describing the
In [15]: data * 10 data type of the array:
In [17]: data.shape
Out[15]: Out[17]: (2, 3)
array([[ -2.0471, 4.7894, -5.1944], In [18]: data.dtype
[ -5.5573, 19.6578, 13.9341]]) Out[18]: dtype('float64')
In [16]: data + data
Out[16]:

array([[-0.4094, 0.9579, -1.0389],

[-1.1115, 3.9316, 2.7868]])
The NumPy ndarray:
A multidimensional array object

• An ndarray is a generic multidimensional container for homogeneous data; that is, all of the
elements must be the same type.

• Every array has a shape, a tuple indicating the size of each dimension, and a dtype, an
object describing the data type of the array:

• >>> a = np.array([1, 2, 3])

• >>> a

• array([1, 2, 3])

• >>> type(a)

• <type 'numpy.ndarray'>
The NumPy ndarray:
A multidimensional array object

• In order to know the associated dtype to the just created ndarray, you have to use the dtype attribute.
• The data type is stored in a special dtype metadata object

• >>> a.dtype

• dtype('int32')
• >>> a.ndim
• 1
• >>> a.size
• 3
Creating ndarrays
• To define a new ndarray, the easiest way is to use the array() function, passing a Python list containing the elements to be included in it
as an argument.

• Example:
• In [19]: data1 = [6, 7.5, 8, 0, 1]
• In [20]: arr1 = np.array(data1)
• In [21]: arr1
• Out[21]: array([ 6. , 7.5, 8. , 0. , 1. ])
• But the use of arrays can be easily extended to the case with several dimensions. For example, if you define a two-dimensional array
2x2:
• >>> b = np.array([[1.3, 2.4],[0.3, 4.1]])
• >>> b.dtype
• dtype('float64')
• >>> b.ndim
• 2
• >>> b.size
• 4
• >>> b.shape
• (2L, 2L)
• This array has rank 2, since it has two axis, each of length 2.
Creating ndarrays
• In addition to np.array, there are a number of other functions for creating new arrays.
• As examples, zeros and ones create arrays of 0s or 1s, respectively, with a given length or shape.
• empty creates an array without initializing its values to any particular value.
• To create a higher dimensional array with these methods, pass a tuple for the shape:
• In [25]: np.empty((2, 3, 2))
• Out[25]:
• array([[[ 4.94065646e-324, 4.94065646e-324],
• [ 3.87491056e-297, 2.46845796e-130],
• [ 4.94065646e-324, 4.94065646e-324]],

• [[ 1.90723115e+083, 5.73293533e-053],
• [ -2.33568637e+124, -6.70608105e-012],
• [ 4.42786966e+160, 1.27100354e+025]]])
Data Types for ndarrays
• The data type or dtype is a special object containing the information (or metadata, data about data) the ndarray needs to
interpret a chunk of memory as a particular type of data:

• In [33]: arr1 = np.array([1, 2, 3], dtype=np.float64)

• In [34]: arr2 = np.array([1, 2, 3], dtype=np.int32)

• In [35]: arr1.dtype

• Out[35]: dtype('float64')

• Additionally NumPy provides types of its own. numpy.int32, numpy.int16, and numpy.float64 are some examples.

• dtypes are a source of NumPy’s flexibility for interacting with data coming from other systems.

• The numerical dtypes are named the same way: a type name, like float or int, followed by a number indicating the number
of bits per element.

• A standard doubleprecision floating-point value (what’s used under the hood in Python’s float object) takes up 8 bytes or
64 bits. Thus, this type is known in NumPy as float64.
Data Types for ndarrays

• The type of the array can also be explicitly specified at creation time:
• >>>c = np.array([[1, 2], [3, 4]], dtype=complex)
• >>>c
• >>>array([[1.+0.j, 2.+0.j],
• [3.+0.j, 4.+0.j]])
Data Types for ndarrays

• You can explicitly convert or cast an array from one dtype to another using ndarray’s astype
method:
• In [37]: arr = np.array([1, 2, 3, 4, 5])
• In [38]: arr.dtype
• Out[38]: dtype('int64')
• In [39]: float_arr = arr.astype(np.float64)
• In [40]: float_arr.dtype
• Out[40]: dtype('float64')
Data Types for ndarrays

• In this example, integers were cast to floating point. If I cast some floating-point numbers
to be of integer dtype, the decimal part will be truncated:

• In [41]: arr = np.array([3.7, -1.2, -2.6, 0.5, 12.9, 10.1])

• In [42]: arr

• Out[42]: array([ 3.7, -1.2, -2.6, 0.5, 12.9, 10.1])

• In [43]: arr.astype(np.int32)

• Out[43]: array([ 3, -1, -2, 0, 12, 10], dtype=int32)

Data Types for ndarrays

• If you have an array of strings representing numbers, you can use astype to
convert them to numeric form:

• In [44]: numeric_strings = np.array(['1.25', '-9.6', '42'], dtype=np.string_)

• In [45]: numeric_strings.astype(float)

• Out[45]: array([ 1.25, -9.6 , 42. ])

Printing Arrays

• When you print an array, NumPy displays it in a similar way to nested lists, but with the
following layout:
• the last axis is printed from left to right,
• the second-to-last is printed from top to bottom,
• the rest are also printed from top to bottom, with each slice separated from the next by
an empty line.
• One-dimensional arrays are then printed as rows, bidimensionals as matrices and
tridimensionals as lists of matrices.
• >>>a = np.arange(6) # 1d array
• >>>print(a)
• [0 1 2 3 4 5]
• >>>b = np.arange(12).reshape(4, 3) # 2d array
• >>>print(b)
• [[ 0 1 2]
• [ 3 4 5]
• [ 6 7 8]
• [ 9 10 11]]
• >>>c = np.arange(24).reshape(2, 3, 4) # 3d array
• >>>print(c)
• [[[ 0 1 2 3]
• [ 4 5 6 7]
• [ 8 9 10 11]]

• [[12 13 14 15]
• [16 17 18 19]
• [20 21 22 23]]]
Arithmetic with NumPy Arrays
• Arrays are important because they enable you to express batch operations on data without writing any for loops. NumPy users
call this vectorization. Any arithmetic operations between equal-size arrays applies the operation element-wise:
• >>>a = np.array([20, 30, 40, 50])
• >>>b = np.arange(4)
• >>>b
• >>>array([0, 1, 2, 3])
• >>>c = a - b
• >>>c
• >>>array([20, 29, 38, 47])
• >>>b**2
• >>>array([0, 1, 4, 9])
• >>>10 * np.sin(a)
• >>>array([ 9.12945251, -9.88031624, 7.4511316 , -2.62374854])
• >>>a < 35
• array([ True, True, False, False])
Arithmetic with NumPy Arrays

• In [51]: arr = np.array([[1., 2., 3.], [4., 5., 6.]])

• In [52]: arr
• Out[52]:array([[ 1., 2., 3.],
• [ 4., 5., 6.]])

• In [53]: arr * arr

• Out[53]:array([[ 1., 4., 9.],
• [ 16., 25., 36.]])
• In [54]: arr - arr
• Out[54]: array([[ 0., 0., 0.],
• [ 0., 0., 0.]])
Arithmetic with NumPy Arrays
• Comparisons between arrays of the same size yield boolean arrays:
• In [57]: arr2 = np.array([[0., 4., 1.], [7., 2., 12.]])

• In [58]: arr2
• Out[58]:
• array([[ 0., 4., 1.],
• [ 7., 2., 12.]])

• In [59]: arr2 > arr

• Out[59]:
• array([[False, True, False],
• [ True, False, True]], dtype=bool)
• Operations between differently sized arrays is called broadcasting.
Basic Indexing and Slicing
• NumPy array indexing is a rich topic, as there are many ways you may want to select a subset of your data
or individual elements. One-dimensional arrays are simple:
• In [60]: arr = np.arange(10)
• In [61]: arr
• Out[61]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

• In [62]: arr[5]
• Out[62]: 5

• In [63]: arr[5:8]
• Out[63]: array([5, 6, 7])

• In [64]: arr[5:8] = 12
• In [65]: arr
• Out[65]: array([ 0, 1, 2, 3, 4, 12, 12, 12, 8, 9])
Fancy Indexing
• Fancy indexing is a term adopted by NumPy to • To select out a subset of the rows in a particular order,
describe indexing using integer arrays. you can simply pass a list or
• Suppose we had an 8 × 4 array:
• ndarray of integers specifying the desired order:
• In [117]: arr = np.empty((8, 4))
• In [120]: arr[[4, 3, 0, 6]]
• In [118]: for i in range(8):
• Out[120]:
• .....: arr[i] = i
• array([[ 4., 4., 4., 4.],
• In [119]: arr
• [ 3., 3., 3., 3.],
• Out[119]:
• [ 0., 0., 0., 0.],
• array([[ 0., 0., 0., 0.],
• [ 6., 6., 6., 6.]])
• [ 1., 1., 1., 1.],
• Using negative indices selects rows from the end:
• [ 2., 2., 2., 2.],
• In [121]: arr[[-3, -5, -7]]
• [ 3., 3., 3., 3.],
• Out[121]:
• [ 4., 4., 4., 4.],
• array([[ 5., 5., 5., 5.],
• [ 5., 5., 5., 5.],
• [ 3., 3., 3., 3.],
• [ 6., 6., 6., 6.],
• [ 7., 7., 7., 7.]])
• [ 1., 1., 1., 1.]])
Fancy Indexing
• Passing multiple index arrays does something slightly different; it selects a one dimensional array of elements
corresponding to each tuple of indices:
• In [122]: arr = np.arange(32).reshape((8, 4))
• In [123]: arr
• Out[123]:
• array([[ 0, 1, 2, 3],
• [ 4, 5, 6, 7],
• [ 8, 9, 10, 11],
• [12, 13, 14, 15],
• [16, 17, 18, 19],
• [20, 21, 22, 23], Here the red color
elements represents the
• [24, 25, 26, 27], position ,0,3,1,2 location
• [28, 29, 30, 31]]) elements will be fetched
from 1,5,7,2 rows
• In [124]: arr[[1, 5, 7, 2], [0, 3, 1, 2]]
• Out[124]: array([ 4, 23, 29, 10])
• Here the elements (1, 0), (5, 3), (7, 1), and (2, 2) were selected. Regardless of how many dimensions the array
Fancy Indexing

• The behavior of fancy indexing in this case is a bit different from what some users might have expected
(myself included), which is the rectangular region formed by selecting a subset of the matrix’s rows and
columns. Here is one way to get that:

• In [125]: arr[[1, 5, 7, 2]][:, [0, 3, 1, 2]]

Red color numbers indicates the
• Out[125]: position which means the new
elements will be arranged in this
format
• array([[ 4, 7, 5, 6],

• [20, 23, 21, 22],

• [28, 31, 29, 30],

• [ 8, 11, 9, 10]])

• Keep in mind that fancy indexing, unlike slicing, always copies the data into a new array.
Transposing Arrays and Swapping Axes
• Transposing is a special form of reshaping that similarly returns a view on the underlying data without copying
anything.
• Arrays have the transpose method and also the special T attribute:
• In [126]: arr = np.arange(15).reshape((3, 5))
• In [127]: arr
• Out[127]:
• array([[ 0, 1, 2, 3, 4],
• [ 5, 6, 7, 8, 9],
• [10, 11, 12, 13, 14]])
• In [128]: arr.T
• Out[128]:
• array([[ 0, 5, 10],
• [ 1, 6, 11],
• [ 2, 7, 12],
• [ 3, 8, 13],
• [ 4, 9, 14]])
Transposing Arrays and Swapping Axes
• Simple transposing with .T is a special case of swapping axes.
• ndarray has the method swapaxes, which takes a pair of axis numbers and switches the indicated axes to rearrange the data:
• In [135]: arr
• Out[135]:
• array([[[ 0, 1, 2, 3],
• [ 4, 5, 6, 7]],
• [[ 8, 9, 10, 11],
• [12, 13, 14, 15]]])
• In [136]: arr.swapaxes(1, 2)
• Out[136]:
• array([[[ 0, 4],
• [ 1, 5],
• [ 2, 6],
• [ 3, 7]],
• [[ 8, 12], swapaxes similarly returns a view on the data without making a copy..
• [ 9, 13],
• [10, 14],
• [11, 15]]])
Universal Functions: Fast Element-Wise Array
Functions
• A universal function, or ufunc, is a function that performs element-wise operations on data in ndarrays.
• You can think of them as fast vectorized wrappers for simple functions that take one or more scalar
values and produce one or more scalar results.
• Many ufuncs are simple element-wise transformations, like sqrt or exp:
• In [137]: arr = np.arange(10)
• In [138]: arr
• Out[138]: array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
• In [139]: np.sqrt(arr)
• Out[139]:
• array([ 0. , 1. , 1.4142, 1.7321, 2. , 2.2361, 2.4495,
• 2.6458, 2.8284, 3. ])
• In [140]: np.exp(arr) These are referred to as unary ufuncs.
• Out[140]:
• array([ 1. , 2.7183, 7.3891, 20.0855, 54.5982,
• 148.4132, 403.4288, 1096.6332, 2980.958 , 8103.0839])
Universal Functions: Fast Element-Wise Array
Functions
• Others, such as add or maximum, take two arrays (thus, binary ufuncs) and return a single array as the result:
• In [141]: x = np.random.randn(8)
• In [142]: y = np.random.randn(8)
• In [143]: x
• Out[143]:
• array([-0.0119, 1.0048, 1.3272, -0.9193, -1.5491, 0.0222, 0.7584,
• -0.6605])
• In [144]: y
• Out[144]:
• array([ 0.8626, -0.01 , 0.05 , 0.6702, 0.853 , -0.9559, -0.0235,
• -2.3042])
• In [145]: np.maximum(x, y)
• Out[145]:
• array([ 0.8626, 1.0048, 1.3272, 0.6702, 0.853 , 0.0222, 0.7584,
• -0.6605])
Array-Oriented Programming with
Arrays
• Using NumPy arrays enables you to express many kinds of data
processing tasks as concise array expressions that might otherwise
require writing loops.
• This practice of replacing explicit loops with array expressions is
commonly referred to as vectorization.
• In general, vectorized array operations will often be one or two (or
more) orders of magnitude faster than their pure Python equivalents,
with the biggest impact in any kind of numerical computations.
• As a simple example, suppose we wished to evaluate the function
sqrt(x^2 + y^2) across a regular grid of values.
• The np.meshgrid function takes two 1D arrays and produces two 2D
matrices corresponding to all pairs of (x, y) in the two arrays:
In [155]: points = np.arange(-5, 5, 0.01) # 1000
equally spaced points
In [156]: xs, ys = np.meshgrid(points, points)
In [157]: ys
Out[157]:
array([[-5. , -5. , -5. , ..., -5. , -5. , -5. ],
[-4.99, -4.99, -4.99, ..., -4.99, -4.99, -4.99],
[-4.98, -4.98, -4.98, ..., -4.98, -4.98, -4.98],
...,
[ 4.97, 4.97, 4.97, ..., 4.97, 4.97, 4.97],
[ 4.98, 4.98, 4.98, ..., 4.98, 4.98, 4.98],
[ 4.99, 4.99, 4.99, ..., 4.99, 4.99, 4.99]])
• Now, evaluating the function is a matter of writing the same
expression you would write with two points:
In [158]: z = np.sqrt(xs ** 2 + ys ** 2)
In [159]: z
Out[159]:
array([[ 7.0711, 7.064 , 7.0569, ..., 7.0499, 7.0569,
7.064 ],
[ 7.064 , 7.0569, 7.0499, ..., 7.0428, 7.0499,
7.0569],
[ 7.0569, 7.0499, 7.0428, ..., 7.0357, 7.0428,
7.0499],
...,
[ 7.0499, 7.0428, 7.0357, ..., 7.0286, 7.0357,
7.0428],
[ 7.0569, 7.0499, 7.0428, ..., 7.0357, 7.0428,
7.0499],
[ 7.064 , 7.0569, 7.0499, ..., 7.0428, 7.0499,
7.0569]])
• In [160]: import matplotlib.pyplot as
plt
• In [161]: plt.imshow(z, cmap=plt.cm.gray);
plt.colorbar()
• Out[161]: <matplotlib.colorbar.Colorbar at
0x7f715e3fa630>
• In [162]: plt.title("Image plot of $\sqrt{x^2
+ y^2}$ for a grid of values")
• Out[162]: <matplotlib.text.Text at
0x7f715d2de748>

See Figure 4-3. Here I used the matplotlib

function imshow to create an image plot
from a two-dimensional array of function
values.
Expressing Conditional Logic as
Array Operations
• The numpy.where function is a vectorized version of the ternary
expression x if condition else y. Suppose we had a boolean array and
two arrays of values:
In [165]: xarr = np.array([1.1, 1.2, 1.3, 1.4, 1.5])
In [166]: yarr = np.array([2.1, 2.2, 2.3, 2.4, 2.5])
In [167]: cond = np.array([True, False, True, True,
False])
Suppose we wanted to take a value from xarr whenever the corresponding value in
cond is True, and otherwise take the value from yarr. A list comprehension doing
this might look like:

In [168]: result = [(x if c else y)

.....: for x, y, c in zip(xarr, yarr, cond)]
In [169]: result
Out[169]: [1.1000000000000001, 2.2000000000000002, 1.3, 1.3999999999999999,
2.5]
This has multiple problems. First, it will not be very fast for large arrays (because all
the work is being done in interpreted Python code). Second, it will not work with
multidimensional arrays. With np.where you can write this very concisely:

In [170]: result = np.where(cond, xarr, yarr)

In [171]: result
Out[171]: array([ 1.1, 2.2, 1.3, 1.4, 2.5])
• The second and third In [172]: arr = np.random.randn(4, 4)
arguments to np.where In [173]: arr
don’t need to be arrays; Out[173]:
array([[-0.5031, -0.6223, -0.9212, -
one or both of them can be 0.7262],
scalars. [ 0.2229, 0.0513, -1.1577, 0.8167],
• A typical use of where in [ 0.4336, 1.0107, 1.8249, -0.9975],
data analysis is to produce [ 0.8506, -0.1316, 0.9124, 0.1882]])
a new array of values In [174]: arr > 0
Out[174]:
based on another array. array([[False, False, False, False],
• Suppose you had a matrix [ True, True, False, True],
of randomly generate data [ True, True, True, False],
and you wanted to replace [ True, False, True, True]], dtype=bool)
all positive values with 2 In [175]: np.where(arr > 0, 2, -2)
Out[175]:
and all negative values array([[-2, -2, -2, -2],
with –2. This is very easy to [ 2, 2, -2, 2],
do with np.where:
[ 2, 2, 2, -2],
[ 2, -2, 2, 2]])
You can combine scalars and arrays when using np.where. For
example, I can replace
all positive values in arr with the constant 2 like so:
In [176]: np.where(arr > 0, 2, arr) # set only positive values to 2
Out[176]:
array([[-0.5031, -0.6223, -0.9212, -0.7262],
[ 2. , 2. , -1.1577, 2. ],
[ 2. , 2. , 2. , -0.9975],
[ 2. , -0.1316, 2. , 2. ]])
The arrays passed to np.where can be more than just equal-sized
arrays or scalars.
Mathematical and Statistical
Methods
• A set of mathematical functions that compute statistics
about an entire array or about the data along an axis are
accessible as methods of the array class.
• You can use aggregations (often called reductions) like sum,
mean, and std (standard deviation) either by calling the
array instance method or using the top-level NumPy
function.
Here I generate some normally distributed random
data and compute some aggregate
statistics:
Functions like mean and sum take an optional axis
argument that computes the statistic
In [177]: arr = np.random.randn(5, 4) over the given axis, resulting in an array with one
In [178]: arr fewer dimension
Out[178]:
array([[ 2.1695, -0.1149, 2.0037, 0.0296],
[ 0.7953, 0.1181, -0.7485, 0.585 ], In [182]: arr.mean(axis=1)
[ 0.1527, -1.5657, -0.5625, -0.0327], Out[182]: array([ 1.022 , 0.1875, -
0.502 , -0.0881, 0.3611])
[-0.929 , -0.4826, -0.0363, 1.0954],
In [183]: arr.sum(axis=0)
[ 0.9809, -0.5895, 1.5817, -0.5287]]) Out[183]: array([ 3.1693, -2.6345,
In [179]: arr.mean() 2.2381, 1.1486])
Out[179]: 0.19607051119998253
In [180]: np.mean(arr)
Out[180]: 0.19607051119998253
In [181]: arr.sum()
Out[181]: 3.9214102239996507
Here, arr.mean(1) means “compute mean across the columns” where
arr.sum(0)
means “compute sum down the rows.”
Other methods like cumsum and cumprod do not aggregate, instead
producing an array
of the intermediate results:
In [184]: arr = np.array([0, 1, 2, 3, 4, 5, 6, 7])
In [185]: arr.cumsum()
Out[185]: array([ 0, 1, 3, 6, 10, 15, 21, 28])
In multidimensional arrays, accumulation functions like cumsum return
an array of In [188]: arr.cumsum(axis=0)
the same size, but with the partial aggregates computed along the
Out[188]:
array([[ 0, 1, 2],
indicated axis [ 3, 5, 7],
according to each lower dimensional slice:[ 9, 12, 15]])
In [186]: arr = np.array([[0, 1, 2], [3, 4, 5], [6, 7, 8]]) In [189]: arr.cumprod(axis=1)
In [187]: arr Out[189]:
Out[187]: array([[ 0, 0, 0],
array([[0, 1, 2], [ 3, 12, 60],
[3, 4, 5], [ 6, 42, 336]])
Table 4-5. Basic array statistical methods
Methods for Boolean Arrays
Boolean values are coerced to 1 (True) and 0 (False) in the preceding
methods. Thus,
sum is often used as a means of counting True values in a boolean array:
In [190]: arr = np.random.randn(100)
In [191]: (arr > 0).sum() # Number of positive values
Out[191]: 42
There are two additional methods, any and all, useful especially for boolean
arrays.
any tests whether one or more values in an array is True, while all checks if
every
value is True:
In [192]: bools = np.array([False, False, True, False])
In [193]: bools.any()
Out[193]: True
In [194]: bools.all()
Out[194]: False
These methods also work with non-boolean arrays, where non-zero elements
evaluate
Sorting
Like Python’s built-in list type, NumPy arrays can be sorted in-place with the
sort
method:
In [195]: arr = np.random.randn(6)
In [196]: arr
Out[196]: array([ 0.6095, -0.4938, 1.24 , -0.1357, 1.43 , -0.8469])
In [197]: arr.sort()
In [198]: arr
Out[198]: array([-0.8469, -0.4938, -0.1357, 0.6095, 1.24 , 1.43 ])
You can sort each one-dimensional section of values in a multidimensional
array inplace
along an axis by passing the axis number to sort:
In [199]: arr = np.random.randn(5, 3)
In [200]: arr
Out[200]:
array([[ 0.6033, 1.2636, -0.2555],
[-0.4457, 0.4684, -0.9616],
[-1.8245, 0.6254, 1.0229],
[ 1.1074, 0.0909, -0.3501],
[ 0.218 , -0.8948, -1.7415]])
In [201]: arr.sort(1)
In [202]: arr
Out[202]:
array([[-0.2555, 0.6033, 1.2636],
[-0.9616, -0.4457, 0.4684],
[-1.8245, 0.6254, 1.0229],
[-0.3501, 0.0909, 1.1074],
[-1.7415, -0.8948, 0.218 ]])
The top-level method np.sort returns a sorted copy of an array
instead of modifying
the array in-place. A quick-and-dirty way to compute the
quantiles of an array is to
sort it and select the value at a particular rank:
In [203]: large_arr = np.random.randn(1000)
In [204]: large_arr.sort()
In [205]: large_arr[int(0.05 * len(large_arr))] # 5% quantile
Out[205]: -1.5311513550102103

Technical Brief Stats Concepts 19c
No ratings yet
Technical Brief Stats Concepts 19c
27 pages
Develop Disaster Recovery Back-Up Procedures and Recovery Instructions
No ratings yet
Develop Disaster Recovery Back-Up Procedures and Recovery Instructions
4 pages
NumPy Notes
No ratings yet
NumPy Notes
13 pages
11.1.06 Warman Pump Shaft Seals
100% (2)
11.1.06 Warman Pump Shaft Seals
12 pages
It, Culture, and The Society
78% (9)
It, Culture, and The Society
17 pages
Assessing Marketing Information Needs
100% (1)
Assessing Marketing Information Needs
34 pages
Numpy Numerical Python - Unit3
No ratings yet
Numpy Numerical Python - Unit3
69 pages
Unit 2
No ratings yet
Unit 2
15 pages
Unit 3
No ratings yet
Unit 3
42 pages
Unit 3
No ratings yet
Unit 3
37 pages
Numpy: Usage For Data Analysis Operations
No ratings yet
Numpy: Usage For Data Analysis Operations
20 pages
Num Py
No ratings yet
Num Py
31 pages
Tentative NumPy Tutorial
No ratings yet
Tentative NumPy Tutorial
30 pages
Data Science Handwritten Notes - 3
No ratings yet
Data Science Handwritten Notes - 3
26 pages
Numpy
No ratings yet
Numpy
54 pages
NumPy Quickstart
No ratings yet
NumPy Quickstart
26 pages
Unit8 DataAnalyticsandVisualizationpdf 2023 10 17 09 16 46
No ratings yet
Unit8 DataAnalyticsandVisualizationpdf 2023 10 17 09 16 46
64 pages
Numerical Python Numpy
No ratings yet
Numerical Python Numpy
28 pages
Num Py
No ratings yet
Num Py
46 pages
Analitical Reseach
No ratings yet
Analitical Reseach
15 pages
Unit III Python
No ratings yet
Unit III Python
42 pages
Python Day 7
No ratings yet
Python Day 7
10 pages
Unit 3 Numpy
No ratings yet
Unit 3 Numpy
23 pages
Python Sem V Portion 2
No ratings yet
Python Sem V Portion 2
29 pages
Numpy ML - AI
No ratings yet
Numpy ML - AI
135 pages
Numpy Matplot
No ratings yet
Numpy Matplot
14 pages
Day 3-Numpy Basics - Jupyter Notebook
No ratings yet
Day 3-Numpy Basics - Jupyter Notebook
8 pages
03 Numpy
No ratings yet
03 Numpy
30 pages
Module Numpy
No ratings yet
Module Numpy
67 pages
NumPy Library and Function
No ratings yet
NumPy Library and Function
129 pages
Int246 L1
No ratings yet
Int246 L1
25 pages
Numpy Python
No ratings yet
Numpy Python
36 pages
Unit 4 Final
No ratings yet
Unit 4 Final
100 pages
Print
No ratings yet
Print
296 pages
Python Module 5
No ratings yet
Python Module 5
43 pages
Unit - Iii
No ratings yet
Unit - Iii
79 pages
PYTHON UNIT-5 Part-B
No ratings yet
PYTHON UNIT-5 Part-B
3 pages
Unit 2
No ratings yet
Unit 2
21 pages
Numpy
No ratings yet
Numpy
64 pages
CSE488 Lab3 Numpy
No ratings yet
CSE488 Lab3 Numpy
14 pages
Numpy Full
100% (1)
Numpy Full
40 pages
Numpy @CodeProgrammer
No ratings yet
Numpy @CodeProgrammer
64 pages
11 Arrays
No ratings yet
11 Arrays
12 pages
Python Presentation 3
No ratings yet
Python Presentation 3
44 pages
Numpy, Pandas and Matplotlib
No ratings yet
Numpy, Pandas and Matplotlib
60 pages
Unit 3
No ratings yet
Unit 3
56 pages
Python 5th Sem
No ratings yet
Python 5th Sem
33 pages
Introduction To Numpy: Aniruddh Kadam Reg No-12109237 Lovely Professional University
100% (1)
Introduction To Numpy: Aniruddh Kadam Reg No-12109237 Lovely Professional University
84 pages
Numpy Tutorial
No ratings yet
Numpy Tutorial
13 pages
Numpy
No ratings yet
Numpy
27 pages
Numpy - Basics
No ratings yet
Numpy - Basics
18 pages
Numpy User
No ratings yet
Numpy User
565 pages
Numpy User
No ratings yet
Numpy User
486 pages
Scientific Computing
No ratings yet
Scientific Computing
24 pages
NumPy
No ratings yet
NumPy
39 pages
Numpyintro PDF
No ratings yet
Numpyintro PDF
17 pages
Unit Iii Using Numpy
No ratings yet
Unit Iii Using Numpy
23 pages
NumPy Notes
No ratings yet
NumPy Notes
15 pages
FALLSEM2023-24 CSI3007 ETH VL2023240104352 2023-09-27 Reference-Material-I
No ratings yet
FALLSEM2023-24 CSI3007 ETH VL2023240104352 2023-09-27 Reference-Material-I
47 pages
Numpy User
No ratings yet
Numpy User
502 pages
Numpy in Pandas
No ratings yet
Numpy in Pandas
30 pages
Python Lectures
No ratings yet
Python Lectures
29 pages
NumPy and NumPy Arrays
No ratings yet
NumPy and NumPy Arrays
5 pages
Unit 05
No ratings yet
Unit 05
26 pages
Unit 04 Pandas
No ratings yet
Unit 04 Pandas
46 pages
QUEUES ADT (Programs)
No ratings yet
QUEUES ADT (Programs)
15 pages
Unit-03 Notes
No ratings yet
Unit-03 Notes
11 pages
Physical Layer in Network: University of Technology Computer Science Department
No ratings yet
Physical Layer in Network: University of Technology Computer Science Department
12 pages
Vaishnavi Report FSWD
No ratings yet
Vaishnavi Report FSWD
40 pages
WinterTech Inventions Volume 1
No ratings yet
WinterTech Inventions Volume 1
14 pages
Digital Progress and Trends Report 2023
No ratings yet
Digital Progress and Trends Report 2023
177 pages
COSEC Reports Time Attendance
No ratings yet
COSEC Reports Time Attendance
74 pages
Novo9 Spark User Manual
No ratings yet
Novo9 Spark User Manual
34 pages
The New Standard: High Speed Stability
No ratings yet
The New Standard: High Speed Stability
16 pages
APT1608EC
No ratings yet
APT1608EC
4 pages
SHC Tracker Mar 2013
No ratings yet
SHC Tracker Mar 2013
8 pages
DBMS Interview Questions: Click Here
No ratings yet
DBMS Interview Questions: Click Here
26 pages
Network License Plate Camera Manual
No ratings yet
Network License Plate Camera Manual
24 pages
Evax-Hmx PDF
No ratings yet
Evax-Hmx PDF
4 pages
Property Database (Market Value Finder)
No ratings yet
Property Database (Market Value Finder)
4 pages
Pneumatic Rotary Actuator SIRCA NEW
No ratings yet
Pneumatic Rotary Actuator SIRCA NEW
16 pages
H3C S5560X-EI Series Converged Gigabit Switches: Product Overview
No ratings yet
H3C S5560X-EI Series Converged Gigabit Switches: Product Overview
16 pages
Notebook LM Masterclass - Ebook
No ratings yet
Notebook LM Masterclass - Ebook
63 pages
Btech Cut Off For Ipu University
No ratings yet
Btech Cut Off For Ipu University
8 pages
(EM) FWC ICT 2025 1st Term Paper With Scheme-1
No ratings yet
(EM) FWC ICT 2025 1st Term Paper With Scheme-1
21 pages
Unit 3 P4 Functions
No ratings yet
Unit 3 P4 Functions
32 pages
COF - International Cyber Olympiad 2025
No ratings yet
COF - International Cyber Olympiad 2025
12 pages
Phases of A Compiler
No ratings yet
Phases of A Compiler
6 pages
MCA 2024 Syllabus Sorted
No ratings yet
MCA 2024 Syllabus Sorted
25 pages
3HAC065036 OM OmniCore-en
No ratings yet
3HAC065036 OM OmniCore-en
284 pages
Here Are Some Cases of ERP Implementation:: ERP Case Find Out About #1: Cadbury - Success
No ratings yet
Here Are Some Cases of ERP Implementation:: ERP Case Find Out About #1: Cadbury - Success
5 pages
ThirdYear CSBS-2023
No ratings yet
ThirdYear CSBS-2023
87 pages

UNIT-03 Numpy

Uploaded by

UNIT-03 Numpy

Uploaded by

UNIT-03

Numpy basics arrays and vectorized computation

NumPy contains, among other things:

• A fast and efficient multidimensional array object ndarray.

• Tools for reading and writing array-based datasets to disk

• Linear algebra operations, Fourier transform, and random number generation

• In [12]: import numpy as np

• # Generate some random data

• In [13]: data = np.random.randn(2, 3)

• array([[-0.2047, 0.4789, -0.5194],

• [-0.5557, 1.9658, 1.3934]])

In [12]: import numpy as np

array([[-0.4094, 0.9579, -1.0389],

• >>> a = np.array([1, 2, 3])

• In [33]: arr1 = np.array([1, 2, 3], dtype=np.float64)

• In [34]: arr2 = np.array([1, 2, 3], dtype=np.int32)

• In [41]: arr = np.array([3.7, -1.2, -2.6, 0.5, 12.9, 10.1])

• Out[42]: array([ 3.7, -1.2, -2.6, 0.5, 12.9, 10.1])

• Out[43]: array([ 3, -1, -2, 0, 12, 10], dtype=int32)

• In [44]: numeric_strings = np.array(['1.25', '-9.6', '42'], dtype=np.string_)

• Out[45]: array([ 1.25, -9.6 , 42. ])

• In [51]: arr = np.array([[1., 2., 3.], [4., 5., 6.]])

• In [53]: arr * arr

• In [59]: arr2 > arr

• In [125]: arr[[1, 5, 7, 2]][:, [0, 3, 1, 2]]

• [20, 23, 21, 22],

• [28, 31, 29, 30],

See Figure 4-3. Here I used the matplotlib

In [168]: result = [(x if c else y)

In [170]: result = np.where(cond, xarr, yarr)

You might also like