Expt-1 Dav

The document outlines a lab course on Data Analytics and Visualization, focusing on libraries in Python and R. It details key libraries such as NumPy, Pandas, and TensorFlow for Python, and dplyr, ggplot2, and Shiny for R, highlighting their features and applications. The document also includes theory questions aimed at understanding the differences between Python and R in data analytics.

Uploaded by

prabhugaurav54

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Lab Code: Course Code: 24AIMLPCC601 Class: T.E./AIML/DS

Lab Name: Data Analytics and Visualisation

AIM: Getting introduced to data analytics libraries in Python and R.

Theory Questions:
1. What are the key differences between Python and R for data analytics?
2. What are the most popular libraries for data analytics in both languages?
3. What is the difference between NumPy arrays and pandas DataFrames?
4. How do I read and write data in R?

Data analytics libraries in Python:

1. NumPy

NumPy is a free, open-source Python library for numerical computing on large arrays and
multi-dimensional matrices. The N-dimensional array (ndarray) is the main object in NumPy. Its
dimensions are called axes, and the number of axes is called the rank (available as the ndim
attribute).
Key Features:

• N-dimensional array objects

• Broadcasting functions

• Linear algebra, Fourier transforms, and random number capabilities
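
The features listed above can be tried out in a few lines. This is a minimal sketch with made-up values. The variable names (a, M, b, rng) are illustrative assumptions and do not come from the lab sheet.

```python
import numpy as np

# A 2-D array has two axes, so its rank (ndim) is 2.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])
print(a.ndim, a.shape)

# Broadcasting: the 1-D vector of column means (shape (3,)) is
# stretched across both rows of the (2, 3) array.
col_means = a.mean(axis=0)
centered = a - col_means

# Linear algebra: solve the linear system M @ x = b.
M = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([3.0, 5.0])
x = np.linalg.solve(M, b)

# Random numbers from a seeded generator, so runs are reproducible.
rng = np.random.default_rng(seed=0)
samples = rng.normal(size=5)
```

After centering, every column of the array has mean zero, which is a quick way to check that broadcasting did what was intended.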

2. Pandas

Pandas is a free, open-source Python library for data analysis and data handling. It is well suited to quick
data manipulation, data aggregation, reading and writing data, and basic plotting (built on Matplotlib).

Key Features:
• DataFrame manipulation

• Grouping, joining, and merging datasets

• Time series data handling

• Data cleaning and wrangling
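
The sketch below illustrates these features and also touches on theory question 3: a NumPy array holds a single data type and is indexed by position, while a DataFrame adds labels and allows a different type per column. All data, column names, and the `heads` table are hypothetical.

```python
import numpy as np
import pandas as pd

# A NumPy array: one dtype for every element, indexed by position.
scores = np.array([[82, 91],
                   [75, 68],
                   [90, 88]])

# A DataFrame: labelled columns, and each column may have its own dtype.
df = pd.DataFrame(scores, columns=["maths", "physics"])
df["branch"] = ["AIML", "DS", "AIML"]  # a string column next to numeric ones

# Data cleaning: simulate a missing value, then fill it with the column mean.
df["physics"] = df["physics"].astype(float)
df.loc[1, "physics"] = np.nan
df["physics"] = df["physics"].fillna(df["physics"].mean())

# Grouping and aggregation: mean score per branch.
branch_means = df.groupby("branch")[["maths", "physics"]].mean()

# Merging: attach a second table on the shared "branch" key.
heads = pd.DataFrame({"branch": ["AIML", "DS"], "head": ["A", "B"]})
merged = df.merge(heads, on="branch", how="left")
```

Printing `df.dtypes` shows the mixed column types, which a single NumPy array cannot hold without falling back to a generic object type.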

3. Seaborn

Seaborn is a powerful Python data visualization library built on top of Matplotlib, designed to make it
easier to create attractive and informative statistical graphics. Seaborn is widely used by data scientists
due to its ease of use, intuitive syntax, and integration with Pandas, which allows seamless plotting
directly from DataFrames.
Key Features:

• High-level interface for drawing statistical plots

• Supports themes for better aesthetics

• Integrates with Pandas DataFrames

4. TensorFlow

TensorFlow is a free, open-source, end-to-end platform with a wide variety of tools, libraries, and
resources for machine learning and artificial intelligence. You can build and train machine learning
models with high-level APIs such as Keras, and TensorFlow provides multiple levels of abstraction so
you can choose the one your model needs.
Key Features:

• Support for distributed training

• High-level APIs (Keras) for quick prototyping

• Deployable on multiple platforms, including mobile and cloud

5. PyTorch

PyTorch is an open-source deep learning framework that has gained immense popularity among
researchers and developers due to its flexibility and speed. PyTorch offers an intuitive interface and
dynamic computation capabilities, making it a go-to choice for many machine learning practitioners.
Key Features:

• Dynamic computational graph

• Strong community support and active development

• Great for research and production-level applications

6. Scikit-learn

Scikit-learn is a free, open-source machine learning library for Python. It is written mainly in
Python, with some core algorithms written in Cython to improve performance.
Key Features:
• Implements regression, classification, clustering, and more

• Cross-validation, hyperparameter tuning, and pipeline building

• Easy integration with NumPy and Pandas.
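
The following sketch combines the three features above: a classifier, a pipeline, and cross-validated hyperparameter tuning. The choice of the bundled iris dataset, logistic regression, and the candidate values for C are assumptions made for illustration only.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Load a small built-in dataset as NumPy arrays.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Pipeline: scale the features, then fit a classifier.
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(max_iter=1000)),
])

# Hyperparameter tuning with 5-fold cross-validation over C.
param_grid = {"clf__C": [0.1, 1.0, 10.0]}
search = GridSearchCV(pipe, param_grid, cv=5)
search.fit(X_train, y_train)

# Accuracy of the best pipeline on the held-out test set.
test_accuracy = search.score(X_test, y_test)
print(search.best_params_, test_accuracy)
```

Putting the scaler inside the pipeline means it is refitted on each training fold, so no information from the validation folds leaks into preprocessing.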

Data analytics libraries in R:

1. dplyr

One of the most widely used libraries for data manipulation, dplyr streamlines working with data frames
and allows users to perform various data wrangling operations. It provides a set of core functions that
make data wrangling faster and more intuitive. These functions can also be combined with the
group_by() function to perform operations on grouped data.
Key Features of dplyr:

• mutate(): Adds new columns based on existing data, allowing for easy feature engineering.

• select(): Picks specific columns by name, making it easy to focus on the most relevant data.

• filter(): Filters rows based on logical conditions, enabling you to subset your data quickly.

• summarise(): Reduces a dataset to summary statistics, great for aggregation and descriptive analysis.

• arrange(): Orders rows based on column values, simplifying sorting.

Best for: Data wrangling, filtering, and summarization
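
Because this lab compares Python and R, it may help to see the same verbs expressed in pandas. The sketch below is a Python analogue, not dplyr itself. The data and column names are hypothetical, and the mapping in the comments is approximate.

```python
import pandas as pd

df = pd.DataFrame({
    "name": ["a", "b", "c", "d"],
    "dept": ["x", "x", "y", "y"],
    "salary": [100, 200, 300, 400],
})

result = (
    df.assign(bonus=lambda d: d["salary"] * 0.1)      # like mutate()
      .loc[:, ["dept", "salary", "bonus"]]            # like select()
      .query("salary > 100")                          # like filter()
      .groupby("dept", as_index=False)                # like group_by()
      .agg(mean_salary=("salary", "mean"),
           total_bonus=("bonus", "sum"))              # like summarise()
      .sort_values("mean_salary", ascending=False)    # like arrange()
)
print(result)
```

The method chain plays the role that the pipe operator plays in dplyr code: each step receives the table produced by the previous one.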

2. ggplot2

ggplot2 is an R data visualization library based on the Grammar of Graphics. It can create bar
charts, pie charts, histograms, scatterplots, plots with error bars, and more through a high-level
API, and it lets you add several components or layers to a single visualization. Once ggplot2 has
been told which variables to map to which aesthetics in the plot, it does the rest of the work, so
the user spends less time building visualizations and more time interpreting them.
Key Features:

• Easily combine different elements (geoms, stats, scales) in a single plot.

• ggplot2 provides a flexible framework for styling and customizing plots.

• Automatically maps data to visual properties like size, color, and shape.

• Easily create multiple plots based on a factor variable, making it simple to visualize subgroup differences.

Best for: Creating complex, customizable plots

3. Esquisse

Esquisse is an R data visualization tool that lets you build plots interactively on top of the
ggplot2 package. You can create scatter plots, histograms, line charts, bar charts, box plots, and
other ggplot2 chart types, then export the graphs or retrieve the code that produces them. Its
drag-and-drop interface makes it easy to use and popular even among beginners.
Key Features:

• Drag-and-drop functionality for easy chart creation

• Supports multiple chart types (scatter, bar, line, etc.)

• Export visualizations and view underlying code

Best for: Easy and quick visualizations for beginners

4. Shiny

Shiny is an R package for building interactive web applications directly from R. It combines R with
modern web technology, so you can create web applications without special web development skills.
With Shiny you can embed applications in R documents, host standalone applications on a webpage, or
build web-based visualization dashboards. A Shiny app can be deployed to the cloud or to your own
servers under an open-source or commercial license.
Key Features:

• Build interactive web apps easily

• Embed apps in R documents or host on the web

• Extend functionality with HTML, CSS, and JavaScript

Best for: Building interactive dashboards and web apps

5. mlr3

mlr3 is an R package created specifically for machine learning. With mlr3 you can implement various
supervised and unsupervised machine learning methods, such as classification, regression, support
vector machines, random forests, nearest neighbors, naive Bayes, decision trees, and clustering. It
also connects to OpenML, an online platform for sharing machine learning data and experiments.
Key Features:

• Supports a wide range of machine learning models

• Integration with OpenML for online resources

• Improved functionality over its predecessor, mlr

Best for: Implementing machine learning algorithms with hyperparameter tuning

6. Lubridate

Lubridate is an R library focused on making date-time data easy to handle. Working with date-time
data in base R can be frustrating because the commands are unintuitive and can change depending on
the type of date-time object. Lubridate also adds time span classes (durations, periods, and
intervals) that help with arithmetic on dates and times.
Key Features:

• Simplifies date-time manipulation with intuitive functions

• Handles components like seconds, minutes, and years easily

• Offers time span classes for mathematical operations

Best for: Parsing, manipulating, and converting date-time formats
