0% found this document useful (0 votes)
8 views2 pages

Chapter-5 DS

Chapter 5 introduces essential tools and libraries used in Data Science, including TensorFlow for machine learning, Matplotlib for data visualization, and Pandas for data manipulation. It also covers NumPy and SciPy for numerical and scientific computing, Scrapy for web data extraction, Scikit-learn for machine learning models, and PyGame for game development. Each library is briefly described, highlighting its primary functions and applications.

Uploaded by

trexwarrior92
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views2 pages

Chapter-5 DS

Chapter 5 introduces essential tools and libraries used in Data Science, including TensorFlow for machine learning, Matplotlib for data visualization, and Pandas for data manipulation. It also covers NumPy and SciPy for numerical and scientific computing, Scrapy for web data extraction, Scikit-learn for machine learning models, and PyGame for game development. Each library is briefly described, highlighting its primary functions and applications.

Uploaded by

trexwarrior92
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Chapter-5

Tools and Libraries


Tools and Libraries: Introduction to popular libraries and technologies used,
in Data Science like Pandas, NumPy, Sci-kit Learn, etc.

1. TensorFlow: TensorFlow is a free and open-source software library for


machine learning and artificial intelligence. It can be used across a range of
tasks but has a particular focus on training and inference of deep neural
networks. It was developed by the Google Brain team for Google's internal
use in research and production.

2. Matplotlib: Matplotlib is a multi-platform data visualization library built on


NumPy arrays and designed to work with the broader SciPy stack. It was
introduced by John Hunter in 2002. One of the greatest benefits of
visualization is that it allows us visual access to huge amounts of data in easily
digestible visuals. Matplotlib consists of several plots like line, bar, scatter,
histogram, etc.

3. Pandas: Pandas are built on top of two core Python libraries—matplotlib for
data visualization and NumPy for mathematical operations. Pandas acts as a
wrapper over these libraries, allowing you to access many of matplotlib and
NumPy's methods with less code.

4. Numpy: NumPy (Numerical Python) is an open-source Python library that's


used in almost every field of science and engineering. It's the universal
standard for working with numerical data in Python, and it's at the core of the
scientific Python and PyData ecosystems.

5. Scipy: SciPy is an open-source Python library that's used in almost every field
of science and engineering optimization, stats, and signal processing. Like
NumPy, SciPy is open source so we can use it freely. SciPy was created by
NumPy's creator Travis Olliphant.
6. Scrapy: Scrapy is a comprehensive open-source framework and is among the
most powerful libraries used for web data extraction. Scrapy natively
integrates functions for extracting data from HTML or XML sources using
CSS and XPath expressions.

7. Scikit-learn: Scikit-Learn, also known as sklearn is a python library to


implement machine learning models and statistical modelling. Through scikit-
learn, we can implement various machine learning models for regression,
classification, clustering, and statistical tools for analyzing these models.

8. PyGame: Pygame is a cross-platform set of Python modules designed for


writing video games. It includes computer graphics and sound libraries
designed to be used with the Python programming language.

Chapter Ends…

You might also like