Python NumPy
Python NumPy
Overview
NumPy (short for Numerical Python) is a powerful open-source Python library used for
numerical and scientific computing. It provides support for multi-dimensional arrays,
mathematical functions, linear algebra, random number generation, and more. NumPy
forms the foundation of many other data science libraries, including Pandas, SciPy, scikit-learn,
and TensorFlow.
python
CopyEdit
import numpy as np
arr = np.array([1, 2, 3])
print(arr)
2. Broadcasting
Automatically expands smaller arrays during arithmetic operations to match the shape of
larger arrays.
python
CopyEdit
a = np.array([1, 2, 3])
b = 2
print(a + b) # Output: [3 4 5]
3. Vectorization
Replace loops with array expressions, making operations faster and more readable.
python
CopyEdit
arr = np.array([1, 2, 3, 4])
print(arr * 2) # Output: [2 4 6 8]
4. Mathematical Functions
NumPy provides a wide variety of mathematical functions like np.mean(), np.sum(),
np.std(), np.exp(), and np.sin().
5. Linear Algebra
Support for matrix operations, eigenvalues, determinants, and more via numpy.linalg.
python
CopyEdit
A = np.array([[1, 2], [3, 4]])
B = np.linalg.inv(A)
6. Random Module
For generating random numbers, simulating distributions, or performing Monte Carlo
simulations.
python
CopyEdit
rand_nums = np.random.rand(3)
7. Performance
NumPy is implemented in C and optimized for speed. It is much faster than using Python
lists for numerical operations.
Use Cases
Conclusion
NumPy is essential for any work involving numerical data in Python. Its speed, efficiency, and
extensive functionality make it a must-know library for data scientists, engineers, and
researchers. Learning NumPy is typically the first step before moving on to higher-level libraries
like Pandas, SciPy, or TensorFlow.