0% found this document useful (0 votes)

13 views6 pages

Import Xls Sas Code

The document discusses various statistical options and techniques that can be used with the PROC MEANS procedure in SAS. It explains how to generate summary statistics, perform group analysis, save output to a dataset, and more. Statistical options like N, NMISS, MEAN, STD, MIN, MAX are described.

Uploaded by

Nik Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views6 pages

Import Xls Sas Code

Uploaded by

Nik Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 6

/* Download the dataset file */

filename mydata temp;

proc http
url="https://github.com/deepanshu88/Datasets/raw/master/UploadedFiles/Test.xls"
method="GET"
out=mydata;
run;

/* Import */
proc import
file=mydata
out=test replace
dbms=xls;
run;

Proc Means Data = test;

Var q1 - q5;
Run;

Statistical Option Description

N Number of observations
NMISS Number of missing observations
MEAN Arithmetic average
STD Standard Deviation
MIN Minimum
MAX Maximum
SUM Sum of observations
MEDIAN 50th percentile
P1 1st percentile
P5 5th percentile
P10 10th percentile
P90 90th percentile
P95 95th percentile
P99 99th percentile
Q1 First Quartile
Q3 Third Quartile
Other Statistical Options
Statistical Option Description
VAR Variance
RANGE Range
USS Uncorr. sum of squares
CSS Corr. sum of squares
STDERR Standard Error
T Student’s t value for testing Ho: md = 0
PRT P-value associated with t-test above
SUMWGT Sum of the WEIGHT variable values
QRANGE Quartile range

How to See Specific Statistics

Suppose you want to see only two statistics - number of non-missing values and
number of missing values.

Proc Means Data = test N NMISS;

Var q1 - q5 ;
Run;
N refers to number of non-missing values and NMISS implies number of missing
values.

NMISS option in PROC MEANS

Tips : Add NOLABELS option to delete Label column in the PROC MEAN table.

Proc Means data = test N NMISS NOLABELS;

Var q1 - q5;
Run;
Group Analysis using PROC MEANS
Suppose you want to group or classify the analysis by Age. You can use the CLASS
statement to accomplish this task. It is equivalent to GROUP BY in SQL.

Proc Means data = test N NMISS NOLABELS;

Class Age;
Var q1 - q5;
Run;
Group analysis using PROC MEANS
You can use NONOBS option to delete N Obs column from the Proc Means table.

Proc Means data = test N NMISS NOLABELS NONOBS;

Class Age;
Var q1 - q5;
Run;
How to use Format in Proc Means
First, you need to create an user defined format.

Proc Format;
Value Age
1 = 'Less than 25'
2 = '25-34'
3 = '35-43'
4 = '44-50'
5 = '51-59'
6 = '60 or more';
Run;
Add FORMAT statement to use user defined format in PROC MEANS.

Proc Means data = test N MEAN;

Class Age;
Format Age Age.;
Var q1 - q5;
Run;
How to change Sorting Order
The DESCENDING option to the right of the slash in the first CLASS statement
instructs PROC MEANS to analyze the data in DESCENDING order of the values of Age.

Proc Means Data = test;

Class Age / descending;
Var q1 - q5 ;
Run;
Instead of displaying the results in "sort order" of the values of the
Classification Variable (s) you specified in the CLASS Statement, order the results
by frequency order using the ORDER=FREQ option in the CLASS Statement.

Proc Means Data = test N;

Class Age / Order = FREQ;
Var q1 - q5 ;
Run;
You can order the results by user-defined format of a variable specified in the
CLASS statement using the ORDER=FORMATTED option in the CLASS Statement.

Proc Means data = test N MEAN;

Class Age / Order = formatted;
Format Age Age.;
Var q1 - q5;
Run;
Custom formats in PROC MEANS
Note : If you specify CLASS statement without VAR statement, it classifies the
analysis by all numeric variables in your data set.

Grouping and Output in Separate Tables

Suppose you want to analyze variables Q1 - Q5 by variable AGE and want the output
of each levels of AGE in separate tables. You can use BY statement to accomplish
this task. See the example below-

Make sure you sort the data before using BY statement.

proc sort data= test;

by age;
run;
proc means data = test;
by age;
var q1 - q5 ;
run;
Difference between CLASS and BY statement
The CLASS statement returns analysis for a grouping (classification) variable in a
single table whereas BY statement returns the analysis for a grouping variable in
separate tables. Another difference is CLASS statement does not require the
classification variable to be pre-sorted whereas BY statement demands sorting.

Difference between CLASS and BY statement in PROC MEANS

Save Output in a Dataset
You can use NOPRINT option to tell SAS not to print output in output window.

Proc Means data = test NOPRINT;

Class Age / Order = formatted;
Format Age Age.;
Var q1 - q5;
Output out = readin mean= median = /autoname;
Run;
In the above code, readin is a data set in which output will be stored. The MEAN=
MEDIAN= options tells SAS to generate mean and median in the output dataset. The
AUTONAME Option automatically assigns unique variable names in the Output Data Set
“holding” the statistics requested in the OUTPUT statement.

You can use AUTOLABEL option to automatically assigns unique label names in the
Output Data Set “holding” the statistics requested in the OUTPUT statement.

Proc Means Data = test noprint;

Class Age ;
Var q1 q2;
Output out=F1 mean= / autoname autolabel;
Run;

You can specify variables for which you want summary statistics to be saved in a
output data set.

Proc Means Data = test noprint;

Class Age ;
Var q1 q2;
Output out=F1 mean(q1)= median(q2)= / autoname;
Run;
You can give custom names to variables stored in a output data set.

Proc Means Data = test noprint;

Class Age;
Var q1 - q5 ;
Output out=F1 mean=_mean1-_mean5 median=_median1-_median5;
Run;
DROP = , KEEP = option
We can use DROP and KEEP options to remove or keep some specific variables.

Proc Means Data = test noprint;

Class Age;
Var q1 - q5 ;
Output out=F1 (drop = _type_ _freq_) mean=_mean1-_mean5 median=_median1-_median5;
Run;
WHERE Statement
The WHERE statement is used to filter or subset data. In the code below, we are
filtering on variable Q1 and telling SAS to keep only those observations in which
value of Q1 is greater than 1.

Proc Means Data = test noprint;

Where Q1 > 1;
Class Age;
Var q1 - q5 ;
Output out=F1(drop= _FREQ_) mean= median= / autoname;
Run;
Like WHERE statement, we can use WHERE= OPTION to filter data. See the following
program -

Proc Means Data = test (Where=( Q1 > 1)) noprint;

Class Age;
Var q1 - q5 ;
Output out=F1(drop= _FREQ_) mean= median= / autoname;
Run;
Grouping by Two or More Variables
When two ore more variables are included in the CLASS statement, PROC MEANS returns
3 levels of classification which is shown in the _TYPE_ variable. Suppose we are
specifying variables AGE BU in the CLASS statement. SAS first returns mean and
median of variables Q1-Q5 by BU. It is the first level of classification which can
be filtered by using WHERE = ( _TYPE_ = 1). The same analysis by AGE is shown
against _TYPE_ = 2. When _TYPE_ = 3, SAS returns analysis by both the variables AGE
and BU.

Proc Means Data = test noprint;

Class Age BU;
Var q1 - q5 ;
Output out=F1 (where=(_type_=1) drop= AGE _FREQ_) mean= median= / autoname;
Output out=F2 (where=(_type_=2) drop= BU _FREQ_) mean= median= / autoname;
Output out=F3 (where=(_type_=3) drop= _FREQ_) mean= median= / autoname;
Run;
Using the NWAY option instructs PROC MEANS to output only observations with the
highest value of _TYPE_ to the new data set it is creating.

Proc Means Data = test nway noprint;

Class Age;
Var q1 - q5 ;
Output out=F1 mean=_mean1-_mean5 median=_median1-_median5;
Run;
By default, PROC MEANS will analyze the numeric analysis variables at all possible
combinations of the values of the classification variables. With the TYPES
statement, only the analyses specified in it are carried out by PROC MEANS.

Proc Means Data = test noprint;

Class Age BU Q1;
Types()
Age * BU
Age * BU * Q1;
Var q1 - q5;
Output out=F1 mean=_mean1-_mean5 max=_median1-_median5;
Run;
DESCENDTYPES Option : Orders rows/observations in the output data set by descending
value of _TYPE_.

Proc Means Data = test DESCENDTYPES noprint;

Class Age;
Var q1 - q5 ;
Output out=F1 mean=_mean1-_mean5 median=_median1-_median5;
Run;
Multiple CLASS Statements
Multiple CLASS statement permit user control over how the levels of the
classification variables are portrayed or written out to new data sets created by
PROC MEANS. It means any one of the classification variable can be displayed in
descending order.

Proc Means Data = test noprint;

Class Age / descending;
Class BU;
Var q1 - q5 ;
Output out=F1 mean=_mean1-_mean5 max=_median1-_median5;
Run;
Identifying Extreme Values
The IDGROUP options tells SAS to calculate the N largest and smallest values of the
variable specified in the VAR statement. The OUT[2] argument within IDGROUP option
means we want two extreme values to output.

data sales;
input products $ revenue;
datalines;
ProductA 100
ProductA 200
ProductA 300
ProductA 150
ProductA 250
ProductB 350
ProductB 200
ProductB 300
ProductB 400
;
run;

proc means data=sales noprint nway;

class products;
var revenue;
output out= myoutput
idgroup (max(revenue) out[2] (revenue)=maxrev)
idgroup (min(revenue) out[2] (revenue)=minrev)
sum= mean= /autoname;
run;
Sample T-Test using PROC MEANS
With PROC MEANS, we can perform hypothesis testing using sample t-test.

Null Hypothesis - Population Mean of Q1 is equal to 0

Alternative Hypothesis - Population Mean of Q1 is not equal to 0.

proc means data = test t prt;

var Q1;
run;
The PRT option returns p-value which implies lowest level of significance at which
we can reject null hypothesis. Since p-value is less than 0.05, we can reject the
null hypothesis and concludes that mean is significantly different from zero.

Difference between PROC MEANS and PROC FREQ

PROC MEANS is used to calculate summary statistics such as mean, count etc of
numeric variables. It requires at least one numeric variable whereas Proc Freq does
not have such limitation. In other words, if you have only one character variable
to analyse, PROC FREQ is the procedure to use.

Sas Cheat Sheet
No ratings yet
Sas Cheat Sheet
3 pages
Applied Statistics and The SAS Programming 5th Edition
0% (2)
Applied Statistics and The SAS Programming 5th Edition
44 pages
ADM-SHS-StatProb-Q3-M10-Illustrating A Normal Random Variable and Its Characteristics
100% (1)
ADM-SHS-StatProb-Q3-M10-Illustrating A Normal Random Variable and Its Characteristics
27 pages
PHC 6052 SAS Skills
No ratings yet
PHC 6052 SAS Skills
52 pages
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet
NP5
No ratings yet
NP5
19 pages
Kalman Filter
No ratings yet
Kalman Filter
14 pages
Topic: Generating Reports
No ratings yet
Topic: Generating Reports
15 pages
Descriptive Statistics Using SAS
No ratings yet
Descriptive Statistics Using SAS
10 pages
Proc Freq
No ratings yet
Proc Freq
57 pages
Lecture 9 Working With Grouped or Sorted obs-Chapter11-StepByStepProgrammingBaseSAS
No ratings yet
Lecture 9 Working With Grouped or Sorted obs-Chapter11-StepByStepProgrammingBaseSAS
28 pages
SAS Slides 8: BASE SAS Statistics Procedures
100% (1)
SAS Slides 8: BASE SAS Statistics Procedures
24 pages
Sas 101
No ratings yet
Sas 101
17 pages
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
No ratings yet
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
10 pages
Sas 101
No ratings yet
Sas 101
17 pages
SAS Programming by Example (14) : Chapter 14 Efficiency Making Your Programs More Efficient
No ratings yet
SAS Programming by Example (14) : Chapter 14 Efficiency Making Your Programs More Efficient
9 pages
Sas 101
No ratings yet
Sas 101
17 pages
Proc Summary
No ratings yet
Proc Summary
19 pages
SAS Notes Part 7
No ratings yet
SAS Notes Part 7
8 pages
SAS Procedures
No ratings yet
SAS Procedures
8 pages
Base Programming Ref Sheet
No ratings yet
Base Programming Ref Sheet
4 pages
Unit Iii Sas Procedures
No ratings yet
Unit Iii Sas Procedures
27 pages
Guido's Guide To PROC MEANS - A Tutorial For Beginners Using The SAS® System
No ratings yet
Guido's Guide To PROC MEANS - A Tutorial For Beginners Using The SAS® System
11 pages
Sas 101
No ratings yet
Sas 101
17 pages
Proc Means
No ratings yet
Proc Means
22 pages
Sas 101
No ratings yet
Sas 101
17 pages
c164 Biva Exp2
No ratings yet
c164 Biva Exp2
21 pages
Add Names For The Following Examples in The Practice Questionnaire: Serial No., Section A. Question 1, Section A. Question 2
No ratings yet
Add Names For The Following Examples in The Practice Questionnaire: Serial No., Section A. Question 1, Section A. Question 2
6 pages
Introduction To Sas Procedures: 1
100% (2)
Introduction To Sas Procedures: 1
73 pages
PROC MEANS Freq Corr Regression Annova
No ratings yet
PROC MEANS Freq Corr Regression Annova
60 pages
Proc Summaery Print
No ratings yet
Proc Summaery Print
11 pages
An Introduction To Data Analysis Using IBM SPSS, 1st Edition ISBN 1032891793, 9781032891798 Direct Ebook Download
No ratings yet
An Introduction To Data Analysis Using IBM SPSS, 1st Edition ISBN 1032891793, 9781032891798 Direct Ebook Download
15 pages
Week 12 - Data Analysis
No ratings yet
Week 12 - Data Analysis
83 pages
Sas
No ratings yet
Sas
19 pages
Sorting Through The Features of Proc SORT
No ratings yet
Sorting Through The Features of Proc SORT
44 pages
Sas Proc Summary and Proc Format
No ratings yet
Sas Proc Summary and Proc Format
7 pages
Sas 201
No ratings yet
Sas 201
17 pages
Chapter 6 - Evaluating Quantitative Data
No ratings yet
Chapter 6 - Evaluating Quantitative Data
21 pages
Introduction To Tables and Graphs in SAS
No ratings yet
Introduction To Tables and Graphs in SAS
8 pages
Here Is The Output Produced by The Proc Print Statement Above
No ratings yet
Here Is The Output Produced by The Proc Print Statement Above
6 pages
Sascheatsheet 170401221255
100% (1)
Sascheatsheet 170401221255
29 pages
Sas Week3 Summary - Explore
No ratings yet
Sas Week3 Summary - Explore
4 pages
S A S Guide
No ratings yet
S A S Guide
33 pages
The MEANS/SUMMARY Procedure: Getting Started: Arthur L. Carpenter California Occidental Consultants, Anchorage, AK
No ratings yet
The MEANS/SUMMARY Procedure: Getting Started: Arthur L. Carpenter California Occidental Consultants, Anchorage, AK
10 pages
Advanced Analytics Using SAS
No ratings yet
Advanced Analytics Using SAS
14 pages
BRM File
No ratings yet
BRM File
55 pages
Correlation: Type Informat Name What It Does
No ratings yet
Correlation: Type Informat Name What It Does
6 pages
17 Biostat
No ratings yet
17 Biostat
22 pages
Stsa 3732 Sas Notes 1
No ratings yet
Stsa 3732 Sas Notes 1
9 pages
Sas Programming
No ratings yet
Sas Programming
30 pages
Unit5 A
No ratings yet
Unit5 A
104 pages
Sas 201
No ratings yet
Sas 201
17 pages
Lecture. MidTerm
No ratings yet
Lecture. MidTerm
49 pages
Analytics
No ratings yet
Analytics
4 pages
W3 Syntax Review
No ratings yet
W3 Syntax Review
4 pages
David Franklin, SAS Programmer - Consultant - Useful SAS Tips and Other Code
No ratings yet
David Franklin, SAS Programmer - Consultant - Useful SAS Tips and Other Code
11 pages
RM Lab Main File BBA Project
No ratings yet
RM Lab Main File BBA Project
100 pages
SAS Aid
No ratings yet
SAS Aid
19 pages
Efficiency Techniques and Methods Kelley Weston Q2 2009
No ratings yet
Efficiency Techniques and Methods Kelley Weston Q2 2009
46 pages
SET Where Label Rename Format
No ratings yet
SET Where Label Rename Format
10 pages
Advanced SAS Interview Questions You'll Most Likely Be Asked
From Everand
Advanced SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Java Programming Tutorial With Screen Shots & Many Code Example
From Everand
Java Programming Tutorial With Screen Shots & Many Code Example
Desmond Ohwofosirai
No ratings yet
Excel Techniques
From Everand
Excel Techniques
Online Trainees
2/5 (1)
Worksheet 1 GRADE 12
No ratings yet
Worksheet 1 GRADE 12
2 pages
Descriptive Statistics
100% (1)
Descriptive Statistics
17 pages
Geometric Mean & Harmonic Mean
No ratings yet
Geometric Mean & Harmonic Mean
9 pages
Modul 2 - Placement - Data - Muhammad Zayyinul Bahri Ashfahani - 105012300206 - AB-47-10
No ratings yet
Modul 2 - Placement - Data - Muhammad Zayyinul Bahri Ashfahani - 105012300206 - AB-47-10
17 pages
Module 16 - Analyzing Data - 2
No ratings yet
Module 16 - Analyzing Data - 2
37 pages
Averages
No ratings yet
Averages
75 pages
1 - Particle Size and Distribution Analaysis
No ratings yet
1 - Particle Size and Distribution Analaysis
52 pages
Maths-Class-X-Chapter-13-Statistics-Practice (DPP) Answers
No ratings yet
Maths-Class-X-Chapter-13-Statistics-Practice (DPP) Answers
9 pages
Understanding Your Situational Judgement Test (SJT) Score: SJT As A Measure of Meeting The Person Specification
No ratings yet
Understanding Your Situational Judgement Test (SJT) Score: SJT As A Measure of Meeting The Person Specification
3 pages
John Cod - Coding Languages - SQL, Linux, Python, Machine Learning. The Step-By-Step Guide For Beginners
No ratings yet
John Cod - Coding Languages - SQL, Linux, Python, Machine Learning. The Step-By-Step Guide For Beginners
472 pages
Summary Measures..
No ratings yet
Summary Measures..
35 pages
Physics 4AL - Manual - v18 - 0
No ratings yet
Physics 4AL - Manual - v18 - 0
112 pages
2017 ACTL2131 Exercises
No ratings yet
2017 ACTL2131 Exercises
171 pages
6thgrade Math I Can Statements
No ratings yet
6thgrade Math I Can Statements
155 pages
Assignment 3 Research Methodlogy 20040621068 PDF
No ratings yet
Assignment 3 Research Methodlogy 20040621068 PDF
2 pages
Suggested Reading: General Statistics Books
No ratings yet
Suggested Reading: General Statistics Books
12 pages
Measurements and Statistics Test
No ratings yet
Measurements and Statistics Test
6 pages
Chapter IV Mathematics in Modern World
No ratings yet
Chapter IV Mathematics in Modern World
3 pages
Chapter 4: Standardized Scores and The Normal Distribution
No ratings yet
Chapter 4: Standardized Scores and The Normal Distribution
9 pages
NSM 4
No ratings yet
NSM 4
3 pages
10 Questions BBA (Stat - 1) (19 Pages)
No ratings yet
10 Questions BBA (Stat - 1) (19 Pages)
28 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
Python For Engineering and Scientific Computing 1st Edition Veit Steinkamp Download
0% (1)
Python For Engineering and Scientific Computing 1st Edition Veit Steinkamp Download
36 pages
Summary Psychological Assessment and Theory Creating and Using Psychological Tests
No ratings yet
Summary Psychological Assessment and Theory Creating and Using Psychological Tests
48 pages
Script Namo
No ratings yet
Script Namo
28 pages
Term 1 11 Ashish Kumar
No ratings yet
Term 1 11 Ashish Kumar
6 pages

Import Xls Sas Code

Uploaded by

Import Xls Sas Code

Uploaded by

/* Download the dataset file */

filename mydata temp;

Proc Means Data = test;

Statistical Option Description

How to See Specific Statistics

Proc Means Data = test N NMISS;

NMISS option in PROC MEANS

Proc Means data = test N NMISS NOLABELS;

Proc Means data = test N NMISS NOLABELS;

Proc Means data = test N NMISS NOLABELS NONOBS;

Proc Means data = test N MEAN;

Proc Means Data = test;

Proc Means Data = test N;

Proc Means data = test N MEAN;

Grouping and Output in Separate Tables

Make sure you sort the data before using BY statement.

proc sort data= test;

Difference between CLASS and BY statement in PROC MEANS

Proc Means data = test NOPRINT;

Proc Means Data = test noprint;

Proc Means Data = test noprint;

Proc Means Data = test noprint;

Proc Means Data = test noprint;

Proc Means Data = test noprint;

Proc Means Data = test (Where=( Q1 > 1)) noprint;

Proc Means Data = test noprint;

Proc Means Data = test nway noprint;

Proc Means Data = test noprint;

Proc Means Data = test DESCENDTYPES noprint;

Proc Means Data = test noprint;

proc means data=sales noprint nway;

Null Hypothesis - Population Mean of Q1 is equal to 0

proc means data = test t prt;

Difference between PROC MEANS and PROC FREQ

You might also like