100% found this document useful (1 vote)

286 views8 pages

Effective Use of SQL in SAS Programming

Structured Query Language (SQL) is a data manipulation tool of which many SAS programmers are unaware, or not comfortable. Using fewer lines of code and improved performance, SQL can accomplish the same goal as many SAS data steps. This paper gives a brief introduction on the subject of relational databases and SQL syntax followed by a variety of tips on how to use SQL effectively in SAS programming.

Uploaded by

sakawdin_004409

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

286 views8 pages

Effective Use of SQL in SAS Programming

Uploaded by

sakawdin_004409

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

NESUG 2008 Programming Beyond the Basics

Effective Use of SQL in SAS Programming

Yi Zhao
Merck & Co. Inc., Upper Gwynedd, Pennsylvania

INTRODUCTION

Structured Query Language (SQL) is a data manipulation tool of which many SAS®
programmers are unaware, or not comfortable. Using fewer lines of code as well as
achieving improved performance, SQL can accomplish the same goal as many SAS data
steps. This paper gives a brief introduction on the subject of relational databases and SQL
syntax followed by a variety of tips on how to use SQL effectively in SAS programming.

RELATIONAL DATABASE

SQL is primarily designed as a programming language to work with relational databases.

Many of the features of SQL are directly related to database activities such as retrieving
data, updating or deleting data, and so on. In relational databases, relations or tables are
associated to each other by primary keys and foreign keys. Primary keys are used to
identify each row in a table uniquely and foreign keys are used to maintain the integrity
of the database. Many of the SQL primary keys and foreign keys are similar to variables
used in the SAS by-merge data step. A database schema is used to describe the structure
and relationship among tables. Using SQL gives the ability to check the schema to find
common variables between tables and variable attributes such as data type, format, etc.
To make data retrieval or updating more efficient, SQL can create and use a database
index. This is similar to SAS Proc SQL where we could create and store an index within
a dataset when working with large datasets. Using SQL, views or virtual tables can be
created to manipulate data in exactly the same way as they are created in SAS Proc SQL.
In summary, knowing the basics of SQL in relational database can help SAS
programmers develop better SAS code.

SQL BASICS

SQL – Structured Query Language - developed by IBM in the early 1970s, is a standard
interactive and programming language for querying, modifying data, and managing
databases. The basic syntax is shown in the following example:

Select [Link],
d.treat_cd,
a.exam_val
From demos d,
assy a
Where [Link] = [Link]
Group by d.treat_cd
Order by [Link];

1
NESUG 2008 Programming Beyond the Basics

Although SQL is both an ANSI and an ISO standard, many database products support
SQL with proprietary extensions to the standard language such as Oracle SQL, SQL
Server, MySQL, and so on. Proc SQL is the SAS version of SQL. Proc SQL adopts most
of the standard SQL features with additional SAS ingredients such as dataset options,
SAS functions, etc. As a result, SAS SQL has the power of regular SQL and many SAS
special add-on features.

TERMINOLOGY

To help less-experienced SAS programmers better understand the different terms used by
database SQL programmers and SAS programmers, a comparison of these terms is
displayed in Table 1 below:

Table 1: Comparison of SQL Terminology

SAS Term Database Term SQL Term

Dataset Relation Table
Observation Tuple Row
Variable Attribute Column
Merge Join Join
Missing value NULL NULL

USE OF SQL IN SAS

SAS uses SQL in two different ways – Where statement and Proc SQL. Where statement
is one of the most commonly used SAS statements. The concept and syntax, however,
were originally adopted from SQL - this is one example that SAS is a powerful language
that imports and mixes syntax from other languages.

Proc SQL is the main tool within SAS to use SQL. While Proc SQL is a SAS procedure,
it performs many functions similar to those found within SAS data steps. Often, for data
manipulation, data step or Proc SQL can be used either individually or interchangeably.
Four major areas which describe the effective use of SQL in SAS Proc SQL are outlined
in the following sections.

I. Access Relational Database

In SAS, there are two approaches to access relational databases. One is the LIBNAME
Statement and the other is the pass-through facility. Below is an example of the pass-
through facility. The code is to read a demographic table from an Oracle database and
output all those allocated subjects.

2
NESUG 2008 Programming Beyond the Basics

Proc sql;
connect to odbc (dsn=&dsn uid=&uid pwd=&pwd);
create table demo as
select *
from connection to odbc
(select distinct allocation_number subjid,
visit_number vt_num,
age
from std_demos
where allocation_number is not null
);
disconnect from odbc;
quit;

Programming Tips:

• Get login credentials from interactive Window input for security reasons.
• Do not use multiple joins to retrieve data - it is more efficient if multiple
CREATE TABLE statements are used.
• If possible, avoid the use of ORDER BY to speed up execution.
• Use index if available.

II. Create Macro Variables Using the Into Clause

SAS programmers often use %LET or SAS function CALL SYMPUT() to create macro
variables. The following is an example:

Data _null_;
set dup nobs=obs;
call symput(‘totdup', compress(put(obs, best.)));

There is an alternative approach to achieving the same result by using the following SQL
procedure:

Proc sql noprint;

select count(*) into : totdup
from dup;

The Into clause stores the value of one or more columns in macro variable(s) for use later
in another Proc SQL query or SAS statement - below is an example:

Proc sql noprint;

select count (distinct treat_cd) into : tot_trt
from sero_all;
select distinct treat_cd into :_trt1 - :_trt&tot_trt
from sero_all;

3
NESUG 2008 Programming Beyond the Basics

quit;

The above code creates a macro variable &TOT_TRT to store the total number of
treatment groups, creates macro variables &TRT1, &TRT2 …, and stores the names of
treatment groups in them. The total number of macro variables is determined by the value
in &TOT_TRT.

Following is another example using an automatic macro variable &SQLOBS:

Proc sql noprint;
create table count_by as
select distinct (&byvar) from datadir.&inds;
select &byvar into :byv1 - :byv&sqlobs from count_by;
quit;

Programming Tips:

• &Sqlobs is an automatic macro variable created by SAS to store the number of

observations in a dataset. It is similar to _null_ in data step.
• Use option Noprint to prevent printing to the SAS list.
• No need to repeat Proc SQL for each SQL statement.
• Separate variables with a comma, not a space.
• Use Distinct to select unique observations.
• Use Quit, not Run, at the end.

III. Merge (Join) Tables

The biggest advantage of a SQL join is that there is no need for sorting and renaming
which is especially useful when dealing with large datasets. The following is
corresponding code for a by-merge data step and SQL join:

Merge (Join)
Proc sort data = one; Proc sql;
By subjid; Create table three as
Select *
Proc sort data = two (rename = (an_num = From one, two
subjid)); Where [Link] = two.an_num;
By subjid; Quit;

Data three;
Merge one two;
By subjid;
Run;

There are two kinds of joins in SQL: inner join and outer join. An inner join returns a
result table for all the rows in a table that have one or more matching rows in the other

4
NESUG 2008 Programming Beyond the Basics

table(s). The example above is an implied inner join and can be re-written with specific
inner join key words as shown below:

Inner Join
Proc sort data = one; Proc sql;
By subjid; Create table three as
Select *
Proc sort data = two (rename = (an_num = From one INNER JOIN two
subjid)); ON [Link] = two.an_num;
By subjid; Quit;

Data three;
Merge one(in=a) two(in=b);
By subjid;
If a and b;
Run;

Outer joins are inner joins that have been augmented with rows that did not match with
any row from the other table in the joins. The three types of outer joins are left, right, and
full join. Below are examples of outer joins:

Left Join
Proc sort data = one; Proc sql;
By subjid; Create table three as
Select *
Proc sort data = two (rename = (an_num = From one LEFT JOIN two
subjid)); ON [Link] = two.an_num;
By subjid; Quit;

Data three;
Merge one(in=a) two(in=b);
By subjid;
If a;
Run;

5
NESUG 2008 Programming Beyond the Basics

Right Join
Proc sort data = one; Proc sql;
By subjid; Create table three as
Select *
Proc sort data = two (rename = (an_num = From one RIGHT JOIN two
subjid)); ON [Link] = two.an_num
By subjid; Quit;

Data three;
Merge one(in=a) two(in=b);
By subjid;
If b;
Run;

A full outer join, specified with the keywords FULL JOIN and ON, returns all the rows
from all the tables regardless of whether they match. The full outer join is rarely used in
the real world.

IV. Transform Data

SQL is used frequently for creating, renaming new variables, and ordering output.
Suppose we have the following task at hand:
• Create a new variable new_v1 by concatenating v1 and v2
• Create a new variable new_v2 as the sum of v3
• Rename v4 and v5 as out4 and out5
• Only output new_v1, new_v2, out4, out5, v3 and in that particular order in the
output dataset

Here is the code:

Proc sql;
create table new as
select v1 || v2 as new_v1, sum(v3) as new_v2, v4 as out4, v5 as out5, v3
from old;
quit;

Programming Tips:

• SAS dataset options such as keep, drop, rename and SAS functions can be used
within Proc SQL. Here is an example:

%let label=This is the label;

Proc sql;
create table one (label="&label" drop=subject_no center) as
select *
from tx t1,

6
NESUG 2008 Programming Beyond the Basics

scores t2
where t1.subject_no=input(substr(t2.subject_id,5),8.) and
[Link]=input(substr(t2.subject_id,1,3),8.);
quit;

• Use of a sub query or in-line view.

A query-expression is called subquery when used in WHERE or HAVING clauses. It

is nested as part of another query-expression. An in-line view is a special subquery
used in the FROM clause. This can be used in situations such as identifying those
patients who are older than the average age of all patients and who experienced an
Adverse Event.

Select subjid, birth_dt, age, gender

from std_demos
where age >
(select avg(age)
from std_demos)
and subjid in
(select distinct subjid
from std_ae)
order by subjid;

• Use of set operators like UNION, INTERSECT, EXCEPT

• Should avoid Cartesian product which is similar to SAS merge without by
variable(s)

CONCLUSION

• Proc SQL is more powerful and efficient than SAS data steps in certain cases,
with fewer lines of code.
• SQL is a basic tool for many job functions that involve working with databases.
Mastering SQL could result in project (or job) opportunities and enhance career
growth.
• Proc SQL must be used wisely or it can become complicated and inefficient..
• In summary, Proc SQL is an excellent alternative to non-SQL Base SAS, making
it worth the programmers' time to explore its use.

REFERENCES

Feng, Ying “Tips for Using SQL: When to Use and How?"
Proceedings of the 18th Annual NorthEast SAS Users Group Conference,
POS12, 2005.

SAS and all other SAS Institute Inc. product or service names are registered trademarks
or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA

7
NESUG 2008 Programming Beyond the Basics

registration. Other brand and product names are trademarks of their respective
companies.

AUTHOR CONTACT INFORMATION

Yi Zhao
Senior Scientific Programming Analyst
Merck Research Laboratories
UG1CD-38
PO Box 1000
North Wales, PA 19454
Phone: 267-305-7672
Email: yi_zhao@[Link]

SQL Techniques for Data Analysts
100% (1)
SQL Techniques for Data Analysts
7 pages
Advanced SAS PROC SQL Interview Guide
No ratings yet
Advanced SAS PROC SQL Interview Guide
27 pages
Ten Good Reasons To Learn Sas Software'S SQL Procedure: Sigurd W. Hermansen, Westat, Rockville, MD
No ratings yet
Ten Good Reasons To Learn Sas Software'S SQL Procedure: Sigurd W. Hermansen, Westat, Rockville, MD
5 pages
Introduction to PROC SQL
No ratings yet
Introduction to PROC SQL
10 pages
Introduction To Data Management and Programming in SAS
No ratings yet
Introduction To Data Management and Programming in SAS
105 pages
Interview
No ratings yet
Interview
34 pages
Statistical Graphics Procedures by Example Effective Graphs Using SAS
No ratings yet
Statistical Graphics Procedures by Example Effective Graphs Using SAS
370 pages
SAS Sort Accum Total
No ratings yet
SAS Sort Accum Total
74 pages
Introduction To Sas Procedures: 1
100% (2)
Introduction To Sas Procedures: 1
73 pages
SAS Macro Interview Guide
No ratings yet
SAS Macro Interview Guide
14 pages
PROC SQL Vs FEDSQL Summary
No ratings yet
PROC SQL Vs FEDSQL Summary
1 page
SAS Export
No ratings yet
SAS Export
35 pages
Come September
No ratings yet
Come September
1 page
SAS Enterprise Guide
No ratings yet
SAS Enterprise Guide
48 pages
Epoch Forumotions in t67 General Sas Macro Interview Questio
No ratings yet
Epoch Forumotions in t67 General Sas Macro Interview Questio
8 pages
SAS Interview Prep Guide
100% (2)
SAS Interview Prep Guide
3 pages
Using - SASeg - Effectively - Proc SQL Good One Read at Home
No ratings yet
Using - SASeg - Effectively - Proc SQL Good One Read at Home
30 pages
SAS Slides 7: Match Merging With Datastep
No ratings yet
SAS Slides 7: Match Merging With Datastep
22 pages
PROC SQL in Clinical Trials
No ratings yet
PROC SQL in Clinical Trials
6 pages
Tips and Techniques For The SAS Programmer
No ratings yet
Tips and Techniques For The SAS Programmer
19 pages
Read Text: Sas My Code
No ratings yet
Read Text: Sas My Code
6 pages
Sas fAQ'S 1
No ratings yet
Sas fAQ'S 1
114 pages
Carpenter's Complete Guide To SAS Macro
100% (2)
Carpenter's Complete Guide To SAS Macro
407 pages
SAS Manipulate Datasets
No ratings yet
SAS Manipulate Datasets
32 pages
SAS Data Step Coding FAQs
No ratings yet
SAS Data Step Coding FAQs
112 pages
SAS DO Loops and Arrays Guide
100% (1)
SAS DO Loops and Arrays Guide
118 pages
A Complete Tutorial On SAS Macros For Faster Data Manipulation
No ratings yet
A Complete Tutorial On SAS Macros For Faster Data Manipulation
36 pages
Base SAS Interview Questions
No ratings yet
Base SAS Interview Questions
10 pages
Top 25 SAS Interview Questions
No ratings yet
Top 25 SAS Interview Questions
2 pages
Macro Chapter2
No ratings yet
Macro Chapter2
42 pages
SAS 9.4 Graph Template Language Reference
No ratings yet
SAS 9.4 Graph Template Language Reference
1,454 pages
Statistical Graphics Procedures by Example Effective Graphs Using SAS by Sanjay Matange, Dan Heath
No ratings yet
Statistical Graphics Procedures by Example Effective Graphs Using SAS by Sanjay Matange, Dan Heath
371 pages
Lauren Haworth, Genentech, Inc., South San Francisco, CA: ODS RTF: The Basics and Beyond
No ratings yet
Lauren Haworth, Genentech, Inc., South San Francisco, CA: ODS RTF: The Basics and Beyond
19 pages
SAS Interview Questions
100% (1)
SAS Interview Questions
25 pages
(Pronounced ": History of Sas SAS
No ratings yet
(Pronounced ": History of Sas SAS
141 pages
SAS Interview Questions and Answers
No ratings yet
SAS Interview Questions and Answers
107 pages
SAS Macro Programming Guide
No ratings yet
SAS Macro Programming Guide
7 pages
SAS Output Delivery System
No ratings yet
SAS Output Delivery System
96 pages
SQL Notes
No ratings yet
SQL Notes
96 pages
SAS Interview Questions: Click Here
No ratings yet
SAS Interview Questions: Click Here
31 pages
Iefbr14 Iebgener Idcams Iebcopy Iebupdte Iebcompar Sort: Ashok Kumar Kumaresan
No ratings yet
Iefbr14 Iebgener Idcams Iebcopy Iebupdte Iebcompar Sort: Ashok Kumar Kumaresan
13 pages
Learning SAS by Example A Programmers Guide Answers
50% (2)
Learning SAS by Example A Programmers Guide Answers
42 pages
SAS Data Merging Techniques Guide
100% (1)
SAS Data Merging Techniques Guide
1 page
SQL Query Operators & Clauses Guide
No ratings yet
SQL Query Operators & Clauses Guide
3 pages
SAS Data Handling Guide
No ratings yet
SAS Data Handling Guide
26 pages
SAS Formats and Informats
No ratings yet
SAS Formats and Informats
378 pages
SAS Macro for FTP File Listing
No ratings yet
SAS Macro for FTP File Listing
9 pages
SAS SUGI Paper
No ratings yet
SAS SUGI Paper
12 pages
SAS Programming Interview Guide
No ratings yet
SAS Programming Interview Guide
12 pages
PROC REPORT for Clinical Data
No ratings yet
PROC REPORT for Clinical Data
43 pages
Lesson 1 PG2
No ratings yet
Lesson 1 PG2
47 pages
SAS Macro Variables Guide
No ratings yet
SAS Macro Variables Guide
59 pages
PROC SQL - The Dark Side of SAS ?: Kirsty Lauderdale, PRA International, Victoria, BC
No ratings yet
PROC SQL - The Dark Side of SAS ?: Kirsty Lauderdale, PRA International, Victoria, BC
5 pages
Merge
0% (1)
Merge
16 pages
Sas SQL
No ratings yet
Sas SQL
5 pages
Routine SAS SQL
No ratings yet
Routine SAS SQL
161 pages
Simple and Commplex Queries
No ratings yet
Simple and Commplex Queries
9 pages
0.0 - Hypothesis Testing - AA
No ratings yet
0.0 - Hypothesis Testing - AA
13 pages
Introduction To Using Proc SQL Sas
No ratings yet
Introduction To Using Proc SQL Sas
7 pages
Advanced SAS Programming Syntax Reference Guide
No ratings yet
Advanced SAS Programming Syntax Reference Guide
6 pages
NetBrain Quick Setup Guide AWS
No ratings yet
NetBrain Quick Setup Guide AWS
47 pages
SteppII Device Configuration Guide
No ratings yet
SteppII Device Configuration Guide
2 pages
Commvault Corporate Overview
No ratings yet
Commvault Corporate Overview
22 pages
DrayTek CompanyProfile
No ratings yet
DrayTek CompanyProfile
39 pages
OID Mikrotik
No ratings yet
OID Mikrotik
27 pages
VLSI LAB Manual 2021 Regulation
No ratings yet
VLSI LAB Manual 2021 Regulation
91 pages
UEFIBIOS ToolsForHPbusinessdesktops
No ratings yet
UEFIBIOS ToolsForHPbusinessdesktops
4 pages
1e 3PP Musg 01015
No ratings yet
1e 3PP Musg 01015
5 pages
Install MacPorts on Mac OS X
No ratings yet
Install MacPorts on Mac OS X
3 pages
Wa0000.
No ratings yet
Wa0000.
95 pages
Week 07
No ratings yet
Week 07
13 pages
Zenon Editor Manual
No ratings yet
Zenon Editor Manual
132 pages
Nmon Analyser V34a
No ratings yet
Nmon Analyser V34a
9 pages
Mobistel Cynus F10
No ratings yet
Mobistel Cynus F10
2 pages
Microsoft Office 2003 Setup (0001) - Task (0001)
No ratings yet
Microsoft Office 2003 Setup (0001) - Task (0001)
60 pages
11 Ias It Notes Unit 1
No ratings yet
11 Ias It Notes Unit 1
84 pages
OPUS-QUAD Update Guide en
No ratings yet
OPUS-QUAD Update Guide en
4 pages
Smart Classroom IB-TTS4 AIO English
No ratings yet
Smart Classroom IB-TTS4 AIO English
15 pages
System Security Status: Computer Profile Summary
No ratings yet
System Security Status: Computer Profile Summary
6 pages
1 Introduction To Computing Technology
No ratings yet
1 Introduction To Computing Technology
46 pages
Sinamics S120 - Basic Drive Commissioning With Startdrive
No ratings yet
Sinamics S120 - Basic Drive Commissioning With Startdrive
27 pages
AIO Windows
No ratings yet
AIO Windows
6 pages
S32G PFE Software Overview
No ratings yet
S32G PFE Software Overview
12 pages
OceanStor Dorado 6.x & OceanStor 6.x Host Connectivity Guide For Windows
No ratings yet
OceanStor Dorado 6.x & OceanStor 6.x Host Connectivity Guide For Windows
119 pages
Unit-4 (Notes) OS
No ratings yet
Unit-4 (Notes) OS
39 pages
HP Color Laserjet 9500 Service Manual
No ratings yet
HP Color Laserjet 9500 Service Manual
536 pages
Pig Latin Users Guide
No ratings yet
Pig Latin Users Guide
13 pages
Truenas Community Hardware Guide: 2021-01 Edition Revision 2A) Maintained by Ericloewe of The Truenas Forums
No ratings yet
Truenas Community Hardware Guide: 2021-01 Edition Revision 2A) Maintained by Ericloewe of The Truenas Forums
20 pages
Evolve Teams Enterprise Voice Implementation Checklist
No ratings yet
Evolve Teams Enterprise Voice Implementation Checklist
3 pages
Codename One: A Lightweight Mobile Framework
No ratings yet
Codename One: A Lightweight Mobile Framework
18 pages

Effective Use of SQL in SAS Programming

Uploaded by

Effective Use of SQL in SAS Programming

Uploaded by

NESUG 2008 Programming Beyond the Basics

Effective Use of SQL in SAS Programming

SQL is primarily designed as a programming language to work with relational databases.

Table 1: Comparison of SQL Terminology

SAS Term Database Term SQL Term

USE OF SQL IN SAS

I. Access Relational Database

II. Create Macro Variables Using the Into Clause

Proc sql noprint;

Proc sql noprint;

Following is another example using an automatic macro variable &SQLOBS:

• &Sqlobs is an automatic macro variable created by SAS to store the number of

III. Merge (Join) Tables

IV. Transform Data

Here is the code:

%let label=This is the label;

• Use of a sub query or in-line view.

A query-expression is called subquery when used in WHERE or HAVING clauses. It

Select subjid, birth_dt, age, gender

• Use of set operators like UNION, INTERSECT, EXCEPT

AUTHOR CONTACT INFORMATION

You might also like