Elc Report
Elc Report
submitted by
KOENA (1024030831)
B.TECH (COE)
PANDAS
PANDAS is a software library written for the Python programming language for data
manipulation and analysis. It was released on 11th January 2008.
Pandas library is built on top of Numpy library, it is expected that Numpy will work with Pandas.
Key features:
It is a DataFrame object that is quick and effective, with both standard and custom
indexing.
The library is used for reshaping and turning of the informational indexes.
It is used to align the data and integrate the missing data.
Process a variety of data sets in various formats, such as matrix data, heterogeneous
tabular data, and time series.
It incorporates with different libraries like SciPy, and scikit-learn.
Application:
Data Cleaning and Preprocessing: Handle missing values, convert data types,remove duplicates
and reshape data .
Data Exploration: Perform descriptive statistics, filter/subset data, group and aggregate and
sort data .
Data Visualization: Create plots using Pandas or integrate with Matplotlib and Seaborn.
Machine Learning: Engineer features,prepare data and integrate with ML libraries like Scikit-
learn and TensorFlow .
Time Series Analysis: Work with timestamps,resample data and perform time-specific
operations .
NUMPY
NumPy is the fundamental package for scientific computing in Python. It is a Python library that
provides a multidimensional array object, various derived objects and an assortment of
routines for fast operations on arrays
NumPy arrays facilitate advanced mathematical and other types of operations on large
numbers of data
Key features:
Application:
Data Creation: Generate arrays using functions like arrange(), zeros(),ones() and random().
TensorFlow
TensorFlow is an open-source software library for high performance numerical computation. Its
flexible architecture allows easy deployment of computation across a variety of platforms
(CPUs, GPUs, TPUs), and from desktops to clusters of servers to mobile and edge devices.
It is a library which is released on 9 November 2015 and developed by Google Brain Team. It is
entirely based on Python programming language and use for numerical computation and data
flow, which makes machine learning faster and easier.
Key features:
Application:
Model Building: Create machine learning models using Sequential API or Functional API.
Model Deployment: Export models for deployment with TensorFlow Serving or TensorFlow
Lite.
Custom Operations: Write custom layers, loss functions, and optimizers for specific tasks.
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive
visualizations in Python. Matplotlib makes easy things easy and hard things possible.
It was released in 2003 by John Hunter, it enables users to graphically represent data,
facilitation easier analysis and understanding.
Key features:
Application:
Diverse Plotting: Create various plots like line, scatter, bar, and histograms.
Customization: Enhance visuals with tailored titles, labels, legends, and colors.
Export Options: Save high-quality plots in formats like PNG, PDF, and SVG.
NUMPY
A PROJECT REPORT
submitted by
KOENA (1024030831)
B.TECH (COE)
INTRODUCTION
WHAT IS NUMPY?
NUMPY is an open-source python library which provides a multidimensional array
object, various derived objects and an assortment of routines for fast operations
on arrays (including mathematical, logical, shape manipulation, sorting, selecting
etc.). NUMPY easily integrates with other libraries like pandas, matplotlib, and
SciPy. Use of NUMPY is high prevalent in data science training projects and
mathematical fields.
PANDAS is a powerful, open-source python library that is used for data analysis
and manipulation. Built on top of NumPy, PANDAS provides data structures and
operations like Series and DataFrame for efficiently handling of structured data
sets (eg: TABLES) very easily. Widely used in the fields of Data Science, Artificial
Intelligence and Machine learning and various other places where data
manipulation is required.
FEATURES
Multi-dimensional arrays
NumPy supports multi-dimensional arrays, masked arrays, and matrices.
Mathematical functions
Vectorized operations
Broadcasting
Open source
Interoperable
NumPy integrates with other libraries and tools in the Python ecosystem, such as SciPy,
Matplotlib, pandas, and scikit-learn.
Efficient
Easy to use
NumPy has a clear and intuitive syntax, making it accessible for both beginners and
experienced developers.
APPLICATION
NUMPY is widely used in various domains
Data Science: It’s used to handle many numerical analyses and functions.
Machine Learning: NUMPY helps in multiple mathematical tasks and helps
in data manipulation.
Finance Modelling: Time series and risk analysis, Excel integration and Data
visualization.
Scientific Research: Statistical analysis, interdisciplinary research and
statistical analysis.
Web Analytics: Mathematical optimization and linear programming.
REFERENCES
https://www.geeksforgeeks.org/python-numpy/
https://www.altexsoft.com/blog/numpy-pros-and-cons/
https://numpy.org/doc/stable/user/whatisnumpy.html
https://medium.com/@techwithjulles/introduction-to-numpy-advantages-
and-applications-a1977b288ca5