Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame

This document provides an introduction to performing basic statistical analysis and data manipulation in R. It explains how to read data from a CSV file into a data frame, access columns of the data frame, calculate summary statistics like the mean and median, add or subtract values from columns, sort the data frame, select subsets of the data, perform linear regression, and plot regression lines. The document also provides some example code and functions to carry out these tasks like read.csv(), $, mean(), order(), lm(), plot(), and abline().

Uploaded by

moonlightsonata

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views

Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame

Uploaded by

moonlightsonata

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

mydata <- read.csv("nameofthedatafile.

csv")

What we do is just read the data file and assign the data to an R internal object called 'mydata'
This is technically called a data frame. Type mydata to visualise your data frame and see the
columns.
To access one of the columns of the data frame, just use the \$ operator:
mydata$column1

Will retrieve column1 in your data. Easy, isn't it?

Now you can apply several statistical functions to it. For instance, if you want to know the mean
of column1, type
mean(mydata$column1)

This will work if you don't have NA's in your data. But maybe there are some NA's, and then the
function won't work. But we can fix it by telling R to ignore the NA's
mean(mydata$column1, na.rm=TRUE)

That's it. To calculate a standard deviation it's the same procedure, but this time use the sd()
function. As you might reckon, the median is computed with the median() function. All easypeasy.
You can also see a bunch of summary statistics of an object with the summary() function:
summary(mydata)

If you want to add a constant to each of the values of your column1, say 1500, simply type
1500+mydata$column1

The same applies to all the other arithmetic operations.

Sorting a data frame
Ah, this is kind of a nightmare for many beginning users, but there's no need to worry, actually.
Let's see.
The key function here is order() With order you can define which column will be the ordering
index, and tell R to sort in ascending or descending order. The syntax is straightforward:
mysorteddata <- mydata[order(mydata$column1),]

If you want it sorted in descending order, just add a minus sign before the column name.

Now type mysorteddata and see the new data frame: it's now sorted by column1! Cool.
Now, let's say you want to select only the 15 first rows (those with lower column1 values). Use
the head() function:
low <- head(mysorteddata, 15)

And if you want to select only the 15 last rows (those with higher column1 values,) use the tail()
function instead:
high <- tail(mysorteddata, 15)

Then you can apply statistical functions as normal on the new 'low' and 'high' data frames.
Linear regression
Here our buddy is the lm() function. A linear regression comes like this: y=+x. In R it
would be like:
lm(y ~ x, somedata)

Where y and x are columns of a dataframe called 'somedata'. To see the results of the regression,
use our old friend summary():
summary(lm(y ~ x, somedata))

Is this really this easy? Yes.

And if you want to plot your new regression, use the plot() funtion:
plot(x=somedata$x, y=somedata$y)

Suddenly, RStudio will show the plot. To add the regression line, use the abline function (type it
immediately after the above command, just separated by an enter line):
plot(x=somedata$x, y=somedata$y)
abline(lm(y ~ x, somedata))

And there you have it, guys! Here's all you need to get through this week's homework. If you
want to know more about R and statistics, check out my blog mathsuser.blogspot.com. It's full of
loads of cool stuff.
Last tip: As a side remark, I had trouble with the illiteracy rate questions. Just remember that
illiteracy rate is the same as 100 minus the literacy rate.
Good luck!

Les Expressions Idiomatiques en Français
No ratings yet
Les Expressions Idiomatiques en Français
7 pages
Misty Rain - Piano
100% (3)
Misty Rain - Piano
4 pages
Web Developer Resume
0% (1)
Web Developer Resume
3 pages
R Course Own English HS
No ratings yet
R Course Own English HS
70 pages
Broomspatial
No ratings yet
Broomspatial
31 pages
R Tutorial #1: Applied Econometrics (Econ3005)
No ratings yet
R Tutorial #1: Applied Econometrics (Econ3005)
21 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
Basic Statistics
No ratings yet
Basic Statistics
66 pages
Lesson 7 - The Data Frame
No ratings yet
Lesson 7 - The Data Frame
7 pages
R Commands
No ratings yet
R Commands
18 pages
Basic R Commands For Data Analysis
No ratings yet
Basic R Commands For Data Analysis
7 pages
Brief Introduction To R Kaustav Banerjee: Decision Sciences Area, IIM Lucknow
No ratings yet
Brief Introduction To R Kaustav Banerjee: Decision Sciences Area, IIM Lucknow
7 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Introduction to R for Business Analytics(1)
No ratings yet
Introduction to R for Business Analytics(1)
7 pages
R study material I
No ratings yet
R study material I
8 pages
Econometrics I - R Summary (Maite Cabeza-Gutes)
No ratings yet
Econometrics I - R Summary (Maite Cabeza-Gutes)
77 pages
Chapter 03 Wrangling
No ratings yet
Chapter 03 Wrangling
40 pages
R Tutorial
No ratings yet
R Tutorial
15 pages
Lab1 411 Eman Yahya 7773225
No ratings yet
Lab1 411 Eman Yahya 7773225
16 pages
Daur Unit 2
No ratings yet
Daur Unit 2
28 pages
6 Working With Data Frames in R
No ratings yet
6 Working With Data Frames in R
8 pages
RSTUDIO
No ratings yet
RSTUDIO
44 pages
Time Series Analysis With R - Part I
No ratings yet
Time Series Analysis With R - Part I
23 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
R
No ratings yet
R
13 pages
R Programming-1
No ratings yet
R Programming-1
6 pages
R Cheatsheet Base R
No ratings yet
R Cheatsheet Base R
2 pages
R Tutorial
No ratings yet
R Tutorial
15 pages
BigData_BCom-Unit-4
No ratings yet
BigData_BCom-Unit-4
9 pages
Lab 1
No ratings yet
Lab 1
26 pages
Introduction To R: Arin Basu MD MPH Dataanalytics
No ratings yet
Introduction To R: Arin Basu MD MPH Dataanalytics
33 pages
R Advbeginner v5
No ratings yet
R Advbeginner v5
73 pages
R Studio Cheat Sheet
No ratings yet
R Studio Cheat Sheet
6 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
Module - 4 (R Training) - Basic Stats & Modeling
No ratings yet
Module - 4 (R Training) - Basic Stats & Modeling
15 pages
Chap 1
No ratings yet
Chap 1
32 pages
Beginner Guide To R and R Studio V1
No ratings yet
Beginner Guide To R and R Studio V1
27 pages
Module III
No ratings yet
Module III
53 pages
Stats Lab1
No ratings yet
Stats Lab1
11 pages
Teaching Notes of R
No ratings yet
Teaching Notes of R
78 pages
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
No ratings yet
Problem Set 1: Introduction To R - Solutions With R Output: 1 Install Packages
24 pages
Lab0 R Tutorial EHS
No ratings yet
Lab0 R Tutorial EHS
9 pages
All Codes
No ratings yet
All Codes
10 pages
Getting Started With R
No ratings yet
Getting Started With R
155 pages
CH 03
No ratings yet
CH 03
42 pages
Essential R
No ratings yet
Essential R
261 pages
R - Tutorial: Matrices Are Vectors
No ratings yet
R - Tutorial: Matrices Are Vectors
13 pages
R Exercise 1 - Introduction To R For Non-Programmers
No ratings yet
R Exercise 1 - Introduction To R For Non-Programmers
9 pages
UL2
No ratings yet
UL2
2 pages
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
No ratings yet
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
58 pages
Apunts BLOC 1 Estadística
No ratings yet
Apunts BLOC 1 Estadística
15 pages
R Manual PDF
No ratings yet
R Manual PDF
78 pages
DSCI 100 Cheat Sheet
No ratings yet
DSCI 100 Cheat Sheet
3 pages
Visual Statistics Use R!
50% (2)
Visual Statistics Use R!
388 pages
Visual Statistics Use R PDF
No ratings yet
Visual Statistics Use R PDF
388 pages
Visual Statistics Use R
No ratings yet
Visual Statistics Use R
451 pages
Data manipulation in R
No ratings yet
Data manipulation in R
5 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
4/5 (2)
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
The Definitive Guide To Dopamine Fasting 2.0 - The Hot Silicon Valley Trend
100% (1)
The Definitive Guide To Dopamine Fasting 2.0 - The Hot Silicon Valley Trend
16 pages
What You Draw Is Good Enough
No ratings yet
What You Draw Is Good Enough
32 pages
1-2kyuu Matome Kanji-Goi Shuu
No ratings yet
1-2kyuu Matome Kanji-Goi Shuu
125 pages
Piotr Kaczmarek (CV)
No ratings yet
Piotr Kaczmarek (CV)
1 page
CH 3.3 - INSTRUCTION SET OF 8085
No ratings yet
CH 3.3 - INSTRUCTION SET OF 8085
65 pages
nTopCL
No ratings yet
nTopCL
16 pages
New Delhi Institute of Management PGDM (G) / PGDM (M) / PGDM (F) PGDM 2019-21 - Semester-II End Term Examination - March/April, 2020
No ratings yet
New Delhi Institute of Management PGDM (G) / PGDM (M) / PGDM (F) PGDM 2019-21 - Semester-II End Term Examination - March/April, 2020
2 pages
XML With Ata Spec 2000
No ratings yet
XML With Ata Spec 2000
16 pages
Musicplayer - Docx Usining Python
No ratings yet
Musicplayer - Docx Usining Python
45 pages
TCS ASPIRE Assignments On Database, JAVA, UNIX With Solutions
0% (2)
TCS ASPIRE Assignments On Database, JAVA, UNIX With Solutions
4 pages
Access C How to Program 7th Edition Deitel Solutions Manual All Chapters Immediate PDF Download
100% (12)
Access C How to Program 7th Edition Deitel Solutions Manual All Chapters Immediate PDF Download
53 pages
Erlang Tutorial
No ratings yet
Erlang Tutorial
7 pages
A Computational Framework For Combinatorial Optimization Problems
No ratings yet
A Computational Framework For Combinatorial Optimization Problems
4 pages
School Education Department Villupuram District
No ratings yet
School Education Department Villupuram District
1 page
40 Microsoft Excel Interview Questions and Answers (2024)
100% (1)
40 Microsoft Excel Interview Questions and Answers (2024)
12 pages
Cucumber Notes
No ratings yet
Cucumber Notes
18 pages
SMA 2276 Assignment I PDF
No ratings yet
SMA 2276 Assignment I PDF
2 pages
CP1E CPU Unit Hardware Users Manual
No ratings yet
CP1E CPU Unit Hardware Users Manual
246 pages
Deep Learning - Handwritten Digit Recognition Using Python
No ratings yet
Deep Learning - Handwritten Digit Recognition Using Python
46 pages
Python for Bioinformatics 2nd Edition Sebastian Bassi - The ebook in PDF/DOCX format is ready for download now
No ratings yet
Python for Bioinformatics 2nd Edition Sebastian Bassi - The ebook in PDF/DOCX format is ready for download now
78 pages
All Basic Principles and Concept of Databases
No ratings yet
All Basic Principles and Concept of Databases
83 pages
Drawing in SwiftUI
No ratings yet
Drawing in SwiftUI
12 pages
Bresenham'S Ellipse Drawing Algorithm: / REGION 1
No ratings yet
Bresenham'S Ellipse Drawing Algorithm: / REGION 1
3 pages
Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling
No ratings yet
Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling
18 pages
Igor Pontes: Data Scientist - Head of Marketing Intelligence
No ratings yet
Igor Pontes: Data Scientist - Head of Marketing Intelligence
5 pages
24CSE401
No ratings yet
24CSE401
41 pages
JZOS Batch Launcher and Toolkit Function in IBM SDK For ZOS, Java Technology Edition, Version 8 Installation and User's Guide
100% (1)
JZOS Batch Launcher and Toolkit Function in IBM SDK For ZOS, Java Technology Edition, Version 8 Installation and User's Guide
60 pages
Tibco Ems - LB&FT
50% (2)
Tibco Ems - LB&FT
18 pages
Introduction to Python
No ratings yet
Introduction to Python
22 pages
Macros and Macro Processors
100% (1)
Macros and Macro Processors
6 pages
Artificial Intelligence and Machine Learning
No ratings yet
Artificial Intelligence and Machine Learning
426 pages
CSE31 - Lab #6 - MIPS Programming
No ratings yet
CSE31 - Lab #6 - MIPS Programming
3 pages

Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame

Uploaded by

Mydata - Read - CSV ("Nameofthedatafile - CSV") : Sorting A Data Frame

Uploaded by

mydata <- read.csv("nameofthedatafile.

Will retrieve column1 in your data. Easy, isn't it?

The same applies to all the other arithmetic operations.

Is this really this easy? Yes.

You might also like