Submitted by: Pawan Yadav, Roll No. (18PT1-17)

1. Study the ggplot function within R

Just as the grammar of language helps us construct meaningful sentences out of words, the Grammar of
Graphics helps us to construct graphical figures out of different visual elements. This grammar gives us a
way to talk about parts of a plot: all the circles, lines, arrows, and words that are combined into a diagram
for visualizing data. Originally developed by Leland Wilkinson, the Grammar of Graphics was adapted
by Hadley Wickham to describe the components of a plot, including:

• the data being plotted
• the geometric objects (circles, lines, etc.) that appear on the plot
• a set of mappings from variables in the data to the aesthetics (appearance) of the geometric
objects
• a statistical transformation used to calculate the data values used in the plot
• a position adjustment for locating each geometric object on the plot
• a scale (e.g., range of values) for each aesthetic mapping used
• a coordinate system used to organize the geometric objects
• the facets or groups of data shown in different plots

Wickham further organizes these components into layers, where each layer has a single geometric object,
statistical transformation, and position adjustment. Following this grammar, you can think of each plot as
a set of layers of images, where each image’s appearance is based on some aspect of the data set.

Altogether, this grammar lets us describe what plots look like using a standard vocabulary.
And just as tidyr and dplyr provide efficient tools for data transformation and manipulation, ggplot2
provides an efficient way to build plots out of these components.

In order to create a plot, you:

1. Call the ggplot() function, which creates a blank canvas.
2. Specify aesthetic mappings, which define how variables map to visual aspects of the plot. In
the example below, the displ and hwy variables of the mpg data set are mapped to the x- and y-axes.
3. Add layers of geometric objects that will show up on the plot. Here, geom_point() adds a
layer that uses points (dots) as the geometric shapes that represent the data.

# load ggplot2, which also provides the mpg data set
library(ggplot2)

# create canvas
ggplot(mpg)

# variables of interest mapped
ggplot(mpg, aes(x = displ, y = hwy))

# data plotted
ggplot(mpg, aes(x = displ, y = hwy)) +
  geom_point()
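
Because each geom is its own layer, further layers can be stacked on the same canvas. The following is a minimal sketch, not part of the original assignment; the choice of geom_smooth() with a loess fit is an assumption made only to illustrate a second layer with a different statistical transformation.

```r
library(ggplot2)

# Layer 1: points drawn from the raw data values.
# Layer 2: a smoothed trend line computed from the same data
#          (loess is an illustrative choice, not prescribed by the text).
p <- ggplot(mpg, aes(x = displ, y = hwy)) +
  geom_point() +
  geom_smooth(method = "loess", formula = y ~ x, se = FALSE)

# Each geom_*() call contributed one layer to the plot object.
length(p$layers)

# Printing the object draws the plot.
print(p)
```

Both layers inherit the x and y mappings given in ggplot(), so the trend line is fitted to the same variables the points show.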

Aesthetic Mappings

The aesthetic mappings take properties of the data and use them to influence visual characteristics, such
as position, color, size, shape, or transparency. Each visual characteristic can thus encode an aspect of the
data and be used to convey information.

All aesthetics for a plot are specified in the aes() function call (each geom layer can also have its
own aes specification). For example, we can add a mapping from the class of the cars to a color
characteristic:

ggplot(mpg, aes(x = displ, y = hwy, color = class)) +
  geom_point()
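
A mapping can also be given inside a single geom so that it applies to that layer only. The sketch below is an illustration under that assumption: the linear trend line (method = "lm") is a hypothetical addition, chosen to show that a color mapping placed in geom_point() does not split the trend line by class.

```r
library(ggplot2)

p <- ggplot(mpg, aes(x = displ, y = hwy)) +
  # color is mapped in this layer only, so only the points are colored by class
  geom_point(aes(color = class)) +
  # this layer inherits x and y but not color, so it draws one overall line
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE)

print(p)
```

Had color = class been placed in the ggplot() call instead, every layer would inherit it and geom_smooth() would fit one line per class.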

Specifying Geometric Shapes

Building on these basics, ggplot2 can be used to build almost any kind of plot you may want. These plots
are declared using functions that follow from the Grammar of Graphics.

The most obvious distinction between plots is what geometric objects (geoms) they include.
ggplot2 supports a number of different types of geoms, including:

• geom_point for drawing individual points (e.g., a scatter plot)
• geom_line for drawing lines (e.g., for line charts)
• geom_smooth for drawing smoothed lines (e.g., for simple trends or approximations)
• geom_bar for drawing bars (e.g., for bar charts)
• geom_histogram for drawing binned values (e.g., a histogram)
• geom_polygon for drawing arbitrary shapes
• geom_map for drawing polygons in the shape of a map (you can access the data to use for these
maps by using the map_data() function)

Each of these geometries uses the aesthetic mappings supplied, although the specific visual
properties that the data map to vary from geom to geom.
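
As a brief sketch of how the choice of geom changes the plot, the example below draws a bar chart and a histogram from the same mpg data. The variable names bars and hwy_hist and the binwidth of 2 are illustrative assumptions.

```r
library(ggplot2)

# Bar chart: geom_bar() counts the rows in each class by default,
# so only an x mapping is needed.
bars <- ggplot(mpg, aes(x = class)) +
  geom_bar()

# Histogram: geom_histogram() bins a continuous variable
# (binwidth = 2 is an arbitrary choice for illustration).
hwy_hist <- ggplot(mpg, aes(x = hwy)) +
  geom_histogram(binwidth = 2)

print(bars)
print(hwy_hist)
```

In both cases the bar heights come from a count computed by the geom's default statistical transformation rather than from a column in the data.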

ggplot2 also provides functions for the other aspects of a plot:

• Statistical Transformations
• Position Adjustments
• Managing Scales
• Coordinate Systems
• Facets
• Labels & Annotations
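
The sketch below touches several of these aspects in one plot. It is an illustration only: the jitter position, the "Set1" palette, faceting by class, and the label text are all assumptions chosen for the example, not settings taken from the assignment.

```r
library(ggplot2)

p <- ggplot(mpg, aes(x = displ, y = hwy, color = drv)) +
  geom_point(position = "jitter") +        # position adjustment: spread overlapping points
  scale_color_brewer(palette = "Set1") +   # scale: choose the colors used for drv
  coord_cartesian(ylim = c(10, 45)) +      # coordinate system: set the visible y range
  facet_wrap(~ class) +                    # facets: one panel per vehicle class
  labs(                                    # labels
    title = "Engine size and highway mileage",
    x = "Engine displacement (litres)",
    y = "Highway miles per gallon",
    color = "Drive train"
  )

print(p)
```

Each line after geom_point() leaves the data and the layer unchanged and only alters how the layer is presented.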

Other Visualization Libraries

ggplot2 is easily the most popular library for producing data visualizations in R. That said,
ggplot2 is used to produce static visualizations: unchanging “pictures” of plots. Static plots are
great for explanatory visualizations: visualizations that are used to communicate some information
or, more commonly, an argument about that information. All of the above visualizations have been
ways for us to explain and demonstrate an argument about the data (e.g., the relationship between
car engines and fuel efficiency). Data visualizations can also be highly effective for exploratory
analysis, in which the visualization is used as a way to ask and answer questions about the data
(rather than to convey an answer or argument). While it is perfectly feasible to do such
exploration on a static visualization, many explorations are better served by interactive
visualizations, in which the user can select and change the view and presentation of the data in
order to understand it.
While ggplot2 does not directly support interactive visualizations, there are a number of additional R
libraries that provide this functionality, including:

• ggvis is a library that uses the Grammar of Graphics (similar to ggplot2), but for interactive
visualizations.
• plotly is an open-source library for developing interactive visualizations. It provides a number of
“standard” interactions (pop-up labels, drag to pan, select to zoom, etc.) automatically. Moreover,
it is possible to take a ggplot2 plot and wrap it in Plotly in order to make it interactive. Plotly has
many examples to learn from, though its documentation is less effective.
• htmlwidgets provides a way to utilize a number of JavaScript interactive visualization libraries.
JavaScript is the programming language used to create interactive websites (HTML files), and so
is highly specialized for creating interactive experiences.
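
Wrapping a ggplot2 plot in Plotly can be sketched as follows. This assumes the plotly package is installed; ggplotly() is the plotly function that converts a ggplot object, and the variable names are illustrative.

```r
library(ggplot2)
library(plotly)

# Build the static plot first, exactly as before.
static_plot <- ggplot(mpg, aes(x = displ, y = hwy, color = class)) +
  geom_point()

# Convert it into an interactive widget with hover labels, pan, and zoom.
interactive_plot <- ggplotly(static_plot)

# In an interactive session, printing the object opens it in the viewer or browser.
interactive_plot
```

The static plot is left untouched, so the same object can still be printed or saved as a regular image.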

2. Run a word count on the H. G. Wells collection and plot the result

3. Study the “tm” package and its usage for all possible operations within word processing and
sentiment analysis

The tm package was created by Ingo Feinerer and enables novice researchers (like me) to harness the
power of R without an in-depth understanding of the programming language. With this understanding
in mind, let’s explore some of the practical applications of the tm package.

The tm package uses the Corpus as its main structure. A corpus is simply a collection of documents,
but like most things in R, the corpus has specific attributes that enable certain types of analysis. Corpora
in R exist in two forms:

• Volatile Corpus (VCorpus) is a temporary object within R and is the default when assigning
documents to a corpus.
• Permanent Corpus (PCorpus) is a permanent object that can be stored outside of R.

Compared to the volatile corpus, the corpus encapsulated by a permanent corpus object is not
destroyed if the corresponding R object is released.

Within the corpus constructor, x must be a Source object, which abstracts the input location. tm
provides a set of predefined sources, e.g., DirSource, VectorSource, or DataframeSource, which handle
a directory, a vector interpreting each component as a document, or data-frame-like structures (like
CSV files), respectively. Except for DirSource, which is designed solely for directories on a file
system, and VectorSource, which only accepts (character) vectors, most other implemented sources can
take connections as input (a character string is interpreted as a file path). getSources() lists the
available sources, and users can create their own sources.

The second argument, readerControl, has to be a list with the named components reader and language.
The first component, reader, constructs a text document from the elements delivered by a source. The
tm package ships with several readers (e.g., readPlain(), readPDF(), readDOC(), ...). See getReaders()
for an up-to-date list of available readers. Each source has a default reader, which can be
overridden. E.g., for DirSource the default just reads in the input files and interprets their content
as text. The second component, language, sets the texts’ language (preferably using ISO 639-2 codes).

In the case of a permanent corpus, a third argument, dbControl, has to be a list with the named
components dbName, giving the filename holding the sourced-out objects (i.e., the database), and
dbType, holding a valid database type as supported by the package filehash. Activated database
support reduces the memory demand; however, access gets slower, since each operation is limited by
the hard disk’s read and write capabilities.
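
The constructor call described above can be sketched as follows. The two example sentences are made-up placeholder documents, and language = "en" is an assumption for the example; readPlain is passed explicitly only to show where a reader goes.

```r
library(tm)

# Hypothetical input: each element of the character vector becomes one document.
docs <- c("The Time Machine is a novel by H. G. Wells.",
          "The War of the Worlds describes a Martian invasion.")

# VectorSource wraps the vector as a Source object; readerControl names
# the reader and the language of the texts.
corpus <- VCorpus(VectorSource(docs),
                  readerControl = list(reader = readPlain, language = "en"))

# The corpus holds one text document per vector element.
length(corpus)
as.character(corpus[[1]])

# List the sources and readers that the installed tm version provides.
getSources()
getReaders()
```

A PCorpus call takes the same first two arguments plus dbControl, and additionally needs the filehash package to be installed.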

Some of the key features of the tm package are:

• Corpus Transformations: one of the best features of the tm package is the ability to transform
text into workable data without a great deal of code. To do this, we can use the Transformations
that are available in the tm package. To see the available Transformations, enter
getTransformations() in the console.
• Data Import
• Data Export
• Inspecting Corpora
• Filters
• Metadata Management
• Creating Term-Document Matrices
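
A minimal sketch of transformations followed by a term-document matrix is given below. The two sentences are made-up placeholder documents, and the particular sequence of cleaning steps is one common choice rather than a required one.

```r
library(tm)

# Hypothetical input documents.
docs <- c("The Time Traveller built a machine.",
          "The machine travelled through time!")
corpus <- VCorpus(VectorSource(docs))

# Apply transformations with tm_map(); base R functions such as tolower
# must be wrapped in content_transformer().
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Rows are terms, columns are documents, cells are term frequencies.
tdm <- TermDocumentMatrix(corpus)
inspect(tdm)

# Total count of each term across all documents, most frequent first.
term_counts <- sort(rowSums(as.matrix(tdm)), decreasing = TRUE)
term_counts
```

The resulting term_counts vector is a plain named numeric vector, so it can be turned into a data frame and plotted like any other data.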

4. Examine case study on Garrettgman

5. Convert transcript into a table for 'Mann ke Baat'
