0% found this document useful (0 votes)

120 views13 pages

Quick Hits - My Favorite SAS Tricks: Marje Fecht, Prowerk Consulting

Uploaded by

sai nadh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

120 views13 pages

Quick Hits - My Favorite SAS Tricks: Marje Fecht, Prowerk Consulting

Uploaded by

sai nadh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Paper 1459-2014

Quick Hits - My Favorite SAS® Tricks

Marje Fecht, Prowerk Consulting

ABSTRACT
Are you time-poor and code-heavy?

It's easy to get into a rut with your SAS code and it can be time-consuming to spend your time learning and
implementing improved techniques.

This presentation is designed to share quick improvements that take 5 minutes to learn and about the same time to
implement. The quick hits are applicable across versions of SAS and require only BASE SAS knowledge.

Included are:
- simple macro tricks
- little known functions that get rid of messy coding
- dynamic conditional logic
- data summarization tips to reduce data and processing
- generation data sets to improve data access and rollback.

INTRODUCTION
Many SAS programmers learn SAS “by example” - using the programs they have inherited or a co-worker’s
examples. Some examples are better than others! Sometimes, the examples we learn from reflect coding
practices from earlier SAS versions, prior to the introduction of the large function library and language extensions that
exist today.

When you are tasked with producing results, there isn’t always time to learn and implement new features. This paper
shares quick tricks that are easy to learn and implement.

CODE REDUCTION
The SAS function library grows with each release of SAS and provides built-in functionality for accomplishing many
common tasks. To help you relate the described functionality to your existing code, “before and after” examples are
included.
CONCATENATION
If you join strings of data together, then you have likely used the concatenation operator ( || ) as well as other
operators to obtain the desired results. Concatenation and managing delimiters and blanks can be frustrating and
may involve a lot of steps including:
• TRIM
• LEFT
• STRIP
• ||
• Adding delimiters.

The old way of joining the contents of the variables n (numeric), with a, b, c (all character) might look like:

old = put(n,1.) || ' ' || trim(a) || ' ' || trim(b) || ' ' || c;

Unfortunately the above code won’t always produce a pleasing result, and thus could require even more complication.
Consider the case when the variable a contains all blanks. This would result in 3 blanks in a row in the variable old
since trim(a) would reduce to a single blank and then add the additional blanks used as delimiters. Furthermore, a
LEFT or STRIP function would be needed to insure that leading blanks don’t cause issues.

-1–
Marje Fecht, Prowerk Consulting Ltd, 2013
The SAS 9 family of CAT functions reduces complexity when concatenating strings!

• CAT concatenates multiple strings in one function call

• CATT - same as CAT but also TRIMs
• CATS - same as CAT but also STRIPs leading and trailing blanks
• CATX - same as CATS but you can specify a delimiter.

Using CATX, the above example would be reduced to:

new = CATX ( ' ', n , a, b, c);

Note:
• If any of n, a, b, c are blank, CATX will not include the blanks and is smart enough to therefore not include
the delimiters.
• If any arguments are numeric, the CAT functions will handle the numeric to character conversion without
LOG messages.

CONDITIONAL CONCATENATION
Consider an example where you have a series of indicators that represent marketing channels used in a campaign.
You want to build a single string (see channels_CATX ) showing all channels used for each client. For this
example, we start with 4 indicators in the data set.

ch_dm ch_on ch_cc ch_st Desired Result

channels_CATX
Y Y Y Y DM_ON_CC_ST
N Y Y N ON_CC
Y N N Y DM_ST
N Y Y Y ON_CC_ST

Example 1 - Solution 1:
A classic solution would be to create a variable that corresponds to each indicator with either a blank or the two letter
code that is desired for the result Then concatenate the 4 new variables.

/* Create text field for each channel */

if ch_dm = 'Y' then dm = 'DM';
if ch_on = 'Y' then on = 'ON';
if ch_cc = 'Y' then cc = 'CC';
if ch_st = 'Y' then st = 'ST';

chann_1 = dm || '_' || on || '_' || cc || '_' || st;

This solution is
• wordy since it requires creating an extra set of variables
• problematic
o extra underscores in result since delimiters are included regardless of “missing data”
o TRIM or STRIP could be needed if variables had differing length strings.

ch_dm ch_on ch_cc ch_st chann_1

Y Y Y Y DM_ON_CC_ST
N Y Y N _ON_CC_
Y N N Y DM_ _ _ST
N Y Y Y _ON_CC_ST

2
Marje Fecht, Prowerk Consulting Ltd, 2014
Example 1 - Solution 2:
Improve upon Solution 1 by using the CATX function. A LENGTH statement is needed since CATX returns a string of
length 200.

/*** Using CATX – ignores missing values and handles STRIP ***/
length channels_CATX $12;
channels_CATX = CATX ( '_' , dm , on , cc , st);

ch_dm ch_on ch_cc ch_st chann_1 channels_CATX

Y Y Y Y DM_ON_CC_ST DM_ON_CC_ST
N Y Y N _ON_CC_ ON_CC
Y N N Y DM_ _ _ST DM_ST
N Y Y Y _ON_CC_ST ON_CC_ST

Solution 2 still uses the extra 4 variables but it behaves when some values are missing, so you don’t need COMPRESS
or special logic.

CONDITIONAL ASSIGNMENT OF VALUES - SIMPLE

In the above concatenation example, fields were created conditional on the values for a set of indicators. Conditional
assignments are often accomplished with if-then-else logic (or SQL CASE-WHEN coding). You can consolidate
sets of if-then-else code with a single function call using the IFC or IFN functions.

IFC (IFN) returns a character (numeric) value based on whether an expression is

• true result of expression NOT 0 and NOT . (missing)
• false result of expression is 0
• missing result of expression is . (missing)

IFC (IFN) is coded as

IFC (expression-to-evaluate , true-result , false-result , missing-result)

Consider the statements

if ch_dm = 'Y' then dm = 'DM';
else dm = ' ';

This logic can be rewritten as

dm = IFC( ch_dm = 'Y' , 'DM' , ' ');

Note that
• the expression ch_dm = 'Y' can ONLY result in TRUE or FALSE and thus two result arguments are provided
• the result 'DM' is returned only when ch_dm = 'Y'
• any value of ch_dm other than 'Y' results in a blank since the expression is FALSE.

3
Marje Fecht, Prowerk Consulting Ltd, 2014
Example 1 - Solution 3:
Improve upon Example 1, Solution 2 by removing the need to create the extra 4 variables. Note that as with most
function calls in SAS, IFC can easily be embedded within CATX.

/* NO NEED TO CREATE EXTRA TEXT VARIABLES */

length channels_IFC_CATX $12;
channels_IFC_CATX = CATX(
'_' /* note delimeter of underscore */
, IFC( ch_dm = 'Y' , 'DM' , ' ')
, IFC( ch_on = 'Y' , 'ON' , ' ')
, IFC( ch_cc = 'Y' , 'CC' , ' ')
, IFC( ch_st = 'Y' , 'ST' , ' ')
) ;

ch_dm ch_on ch_cc ch_st channels_IFC_CATX

Y Y Y Y DM_ON_CC_ST
N Y Y N ON_CC
Y N N Y DM_ST
N Y Y Y ON_CC_ST

CONDITIONAL ASSIGNMENT OF VALUES – MORE COMPLEX

Conditionally assigning values with IFC / IFN is powerful but you need to be careful with how the function works.

Example 2: Assign course status based on course grade.

Using the value of grade, assign a status of Pass for grades of at least 70, Fail for grades that are less than 70 but
not missing, and Pending for grades that are still outstanding (missing).

Example 2 - Solution 1
If grade ge 70 then status = "Pass";
else if grade = . then status = "Pending";
else status = "Fail";
This coding works fine but is wordier than necessary.

Example 2 - Solution 2
Recall that IFC returns a character value based on whether an expression is
• true ( not 0 or . )
• false ( 0 )
• or missing ( . ).

Length Status $7; /IFC defaults the result to length 200/

Status = IFC( grade ge 70 ,
"Pass" , /* TRUE */
"Fail" , /* FALSE */
"Pending" /* Missing */
);

The previous example will only result in Pass and Fail, since the logical expression will only result in True (1) or
False(0). BEWARE!! It is helpful to note that when SAS evaluates an expression, a value of ZERO denotes FALSE
and any other numeric non-missing values other than zero denotes TRUE.

4
Marje Fecht, Prowerk Consulting Ltd, 2014
Example 2 - Solution 3
Correct the above logic by using a mechanism to insure that a missing value of grade will result in a missing value
for the expression. A solution I commonly employ is to multiply the logical expression ( grade ge 70 ) by the
variable being evaluated ( grade ).
For example, for

grade * (grade ge 70)

the expression is
• TRUE when grade ge 70
since grade * (grade ge 70) = grade * 1 = grade  TRUE
• FALSE when grade lt 70 and grade NE .
since grade * (grade ge 70) = grade * 0 = 0  FALSE
• MISSING when grade = .
since grade * (grade ge 70) = . * 0 = .  MISSING.

The corrected logic is:

Length Status $7;

Status = IFC( grade * (grade ge 70) ,
"Pass" , /* TRUE */
"Fail" , /* FALSE = 0 */
"Pending" /* Missing */
);

MORE FUNCTIONS FOR YOUR TOOLKIT

Many SAS programmers write beautifully coded logic to handle functionality that already exists in the SAS Function
Library. Before you embark down that fruitless path, review the SAS online documentation to see what functions are
already available for you.

A few more of my favorite (and more obscure) functions include:

• TRIMN  removes trailing blanks – returns null for values that are all blank

• COUNT, COUNTC  counts # of occurrences of a string or Character

• INDEX, INDEXC, INDEXW  locates position of first occurrence of string, Character, or Word

• LENGTH (min=1) , LENGTHN (min=0)  position of last non-blank.

Note: LENGTHN returns 0 for values that are all blank

• LARGEST ( k , var1, var2, …)  kth largest non-missing value

• SMALLEST ( k , var1, var2, …)  kth smallest non-missing value

• PROPCASE  handles upper / lower case to assist with Proper Names and Addresses, etc.

5
Marje Fecht, Prowerk Consulting Ltd, 2014
CODE GENERALIZATION
If you suffer from copy-paste syndrome or if every request seems to require a “from scratch” build, then you need to
work on changing your approach!! Before you begin writing a program, think about what could change about this
request in the future? If the request is time-specific or if it focuses on just a subset of the available population,
chances are good that you can generalize the code so that it can be easily adapted to future requests. Think also
about what have I written before that could be leveraged now?

BEST PRACTICES: REUSABLE CODE

According to wikipedia,

In computer science and software engineering, reusability is the likelihood that a segment of source code
can be used again to add new functionalities with slight or no modification.

Reusable code modules

• reduce implementation time
• increase the likelihood that prior testing and use has eliminated bugs
• localizes code modifications when a change in implementation is required.

My programs commonly include a comment line of /***** NO CHANGES BELOW HERE *****/
since I pass parameters to my source code and use generalized coding practices with
• User Defined Macro variables for “static” information
• Data-Driven values via metadata or functions for “dynamic” information
• Generalized location / naming / etc to enable easy changes
• File names and locations that are parameterized
• System locations, options, settings that are generalized and parameterized.

My source code remains “un-touched”, unless upgrades are implemented, which then roll-out to all programs that call
the source code.

CODE GENERALIZATION: LOGS AND OUTPUT

To generalize file locations and names, such as logs and output files, a typical example would
• Use macro variables to supply path and project ID information
• Specify the stage of development (Development, Test, Production)
• Use macro and SAS functions to determine the date and time and then format them for the file naming
o Note: the N in yymmddN8. requests NO separators ( - , / , etc) so that just an 8 digit string is produced
o Note: the compress removes the : from the time value
• PROC PRINTTO is used to route the log so that it remains available for perusal

%let requestID = PROJECT974;

%let filepath = mygroup\myprojectfiles ;
%let stage = prod; ** could be devl, test, prod **;

... <other program logic – following parameter def'n> ...

%let dir = \&filepath.\&requestID.;

%let filename = &requestID._extract1;
%let datetime =
%sysfunc(compress(%sysfunc(today(),yymmddN8.)_%sysfunc(time(),hhmm6.)
, ': '));
** route Log and Listing to permanent location **;
proc printto
log ="&dir.\logs\&stage.\&filename._&datetime..log"
print="&dir.\output\&stage.\&filename._&datetime..lst";
run;
... < other program logic > ...
** reroute to default locations **;
proc printto; run;
6
Marje Fecht, Prowerk Consulting Ltd, 2014
The resulting “Versioned” file names would be

• PROJECT974_20120507_1329.log
• PROJECT974_20120507_1329.lst

CODE MODULARIZATION: SOURCE CODE AND DRIVER PROGRAMS

Once you have generalized your code to make it readily available for re-use, you need a mechanism for separating
the source modules from request-specific input. My approach is to utilize driver programs that contain all of the
parameters and other input, and that call the appropriate source modules.

*** Driver Program – specify parameters;

/* NO CHANGES below here */

*** run standard extract and reporting programs;
%include 'ClaimsReportExtract_20110310.sas';
%include 'ClaimsReportOutput_20110113.sas';

One problem occurs with the above approach…

When the source module changes, you have to find ALL the drivers that call it to change the version date specified in
the %include. This could involve searching 100’s of drivers.

Solution:
Create a program that calls the latest version of the source, and call that “control program” from your drivers.

Contents of Control Program: ClaimsReportExtract_CurrentPgm.sas

*** call latest version of source program;

%include 'ClaimsReportExtract_20110310.sas';

Now, when source changes, your drivers always call the most current version.

*** Driver Program – specify correct parameters;

%let prestart = 01SEP2011;
%let prestop = 31DEC2011;
%let poststart = 01JAN2012;
%let poststop = 30JUN2012;
%let title = "January 2012 Introduction of Claims Changes";
%let plans = ('78','7X','7R','55','85','18');
%let codes = ('015', '119', '214');
*** run standard reporting programs;
%include 'ClaimsReportExtract_CurrentPgm.sas';
%include 'ClaimsReportOutput_CurrentPgm.sas';

Helpful Hint:
• There is no limit to the # of %include statements you can use
• If you want the code from a %include to display in your log, use the SOURCE2 option
• In addition to %include, reusable modules may be
• Macros
• Format Libraries.
7
Marje Fecht, Prowerk Consulting Ltd, 2014
CODE GENERALIZATION : DYNAMIC CODE
Reporting and analytics requests frequently revolve around lists of dates, campaigns, products, departments, etc.
Thus, generalizing your code to dynamically create and accept LISTS is worth the effort. If a list can be
programmatically generated, you avoid manual input and thus the introduction of typos and errors.

Example: Extract and summarize data for all marketing within a given date range and focus.

Solution 1: Create a generalized program that expects a list of all campaign codes for the date range with the
focus of interest. Manually input the list of campaign codes within the dates of interest.

%let codes =
2010307ABC
,2010337ABC
,2011003ABC
... Etc ...
;

Problem: Someone has to manually locate the campaign codes of interest and correctly input them for the program.

CREATE DYNAMIC LISTS

If the campaign codes follow a pattern or if additional metadata are available to assist you in generating the list, use
that information programmatically.

Solution 2: Use SQL to build a macro variable ( codes ) that contains a comma-delimited list of all of the campaign
codes
• with campaign “drop” between a specified begin and end date
• that end in ‘ABC’ since that identifies the campaign focus.

proc sql noprint;

/** include database connection, if needed **/
select distinct cmpgn_code
into :codes separated by ','

from mktg_metadata

where
drop_dt between %str(%')&start%str(%')
and %str(%')&end%str(%')
and substr(cmpgn_code , 8 , 3) = 'ABC'
;

%let NumCodes = &SQLOBS; /** # of rows returned from SQL query **/

quit;

%put NumCodes = &NumCodes;

%put Campaign Codes = &codes;

The resulting comma delimited list can be used

• as an IN list for subsetting
• as input to a macro loop for processing the data
• for variable and data set naming.

8
Marje Fecht, Prowerk Consulting Ltd, 2014
DYNAMIC VARIABLE NAMING
Suppose you need to summarize the latest 3 months of data, and you want monthly variables representing the totals
for each month of data.

You want the variable names to reflect the month, such as

• TotAmt_1205 represents Current Month (May2012)
• TotAmt_1204 represents one month ago (Apr2012)
• TotAmt_1203 represents two months ago (Mar2012)

You plan to use SQL with CASE to create the monthly amounts, and you don’t want to manually intervene with the
program. Instead, you want the program to determine the current month and generate the latest three months of data
and names.

The SQL clause to bucket the monthly data might look like:

sum(case when txn_date between "&M1_beg"d and "&M1_end"d

then txn_amt
else 0
end ) as Tot_Amt_&M1

You need macro variables for:

• Beginning Date of each month, in SAS Date format (ddMMMyy) – M1_beg
• Ending Date of each Month, in SAS Date format (ddMMMyy) – M1_end
• YYMM for each month (as a suffix for the variable names) – M1

INTNX – move in intervals

To dynamically generate the dates above, you need the ability to determine the current date and then move in
increments of months back from today. Additionally, you need to be able to identify the first and last day of the month
(without worrying about the nuances of the calendar). The INTNX function is one of the most versatile of the SAS
date functions, enabling you to move forward and backward from a date and time using the interval of your choice,
such as month, quarter, day, week, etc.

The INTNX function increments dates by intervals:

INTNX ( interval, from, n < , alignment > ) ;
o interval - interval name eg: 'MONTH', 'DAY', 'YEAR'
o from - a SAS date value (for date intervals) or datetime value (for datetime intervals)
o n - number of intervals to increment from the from value
o alignment - alignment of resulting SAS date, within the interval. Eg: BEGINNING, MIDDLE, END.

Example: Create 3 macro variables that contain the current and two previous months in the format: yymm
%let M0 = %sysfunc( today() , yymmN4.);

%let M1 = %sysfunc( intnx( MONTH ,

%sysfunc( today() ) ,
-1) , yymmN4.); /** go back one month from today **/

%let M2 = %sysfunc( intnx( MONTH ,

%sysfunc( today() ) ,
-2) , yymmN4.); /** go back two months from today **/

Note: when INTNX is used in %sysfunc, do not use quotes for the arguments of INTNX.

The above code produces three macro variables with values such as
• M0 = 1205
• M1 = 1204
• M2 = 1203
9
Marje Fecht, Prowerk Consulting Ltd, 2014
Example: Create 2 macro variables per month that contain the 1st and last day of the month, in SAS date format.
%let M2_beg = %sysfunc( intnx( MONTH
, %sysfunc( today() ) , -2 , B) /** return Beginning of month **/
, date9.);

%let M2_end = %sysfunc( intnx( MONTH

, %sysfunc( today() ) , -2 , E) /** return End of month **/
, date9.);

Note: Use whatever date format is appropriate for your data. For the example data, a SAS Date value is used.

If you are inclined to copy-paste and you expand this for the three months requested, the below code creates 3 macro
variables per month to create the variable suffix (yymm) and the beginning and end date ranges for each month. This
works but can obviously be improved upon.

%let M0 = %sysfunc( today() , yymmN4.);

%let M0_beg = %sysfunc( intnx( MONTH , %sysfunc( today() ) , 0 , B)
, date9.);
%let M0_end = %sysfunc( intnx( MONTH , %sysfunc( today() ) , 0 , E)
, date9.);

%let M1 = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -1)

, yymmN4.);
%let M1_beg = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -1 , B)
, date9.);
%let M1_end = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -1 , E)
, date9.);

%let M2 = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -2)

, yymmN4.);
%let M2_beg = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -2 , B)
, date9.);
%let M2_end = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -2 , E)
, date9.);

The 3 macro variables for each of the 3 months could be used to compute the monthly totals in code such as:

select
sum(case when txn_date between
"&M0_beg"d and "&M0_end"d
then txn_amt else 0 end ) as Tot_Amt_&m0

,sum(case when txn_date between

"&M1_beg"d and "&M1_end"d
then txn_amt else 0 end ) as Tot_Amt_&m1

,sum(case when txn_date between

"&M2_beg"d and "&M2_end"d
then txn_amt else 0 end ) as Tot_Amt_&m2

from amts

where txn_date between "&M2_beg"d and "&M0_end"d

10
Marje Fecht, Prowerk Consulting Ltd, 2014
Helpful Hint:
The previous code assumed that SAS Date Values are needed (ddMMMyy).

Suppose your query is for a database table with

• dates stored as yyyy-mm-dd
• single quotes are needed to surround date values (not ").
Change
• format of dates in macro variables  yymmddD10.
• change the double quotes to single quotes: %str(%')&M0_Beg%str(%')

DYNAMIC CODE GENERATION

The above examples do provide dynamic code and it DOES work. But, what if we now want 12 months of results?

COPY – PASTE syndrome. . .

Notice that the code could easily be generated in a macro loop as long as MaxMonth (the maximum number of
monthly computations) is defined.

Developing the appropriate code requires planning and understanding of the desired outcome.

What Macro variables and SAS variables are needed?

Macro Variable Name Value SAS Variable Name

M0 1205 Tot_Amt_1205
M0_Beg 1May2012
M0_End 31May2012
M1 1204 Tot_Amt_1204
M1_Beg 1Apr2012

M1_End 30Apr2012
. . . . . .
M11 1106 Tot_Amt_1106
M11_Beg 01Jun2011
M11_End 30Jun2011

11
Marje Fecht, Prowerk Consulting Ltd, 2014
Example: Create a macro to generate the Macro Variable Names and Values, using as input only today’s date and
the number of months desired ( MaxMonth ).

%let MaxMonth=11; / Months start at ZERO /

%macro Monthly;
%do Num = 0 %to &MaxMonth;
/*** use %global if macro variables needed outside macro ***/
%global M&Num. M&Num._beg M&Num._end;

/ create suffix for variable names /

%let M&Num. = %sysfunc( intnx( MONTH ,%sysfunc( today() )
, -&Num.) , yymmN4.);

/ create the date corresponding to beginning of month /

%let M&Num._beg = %sysfunc( intnx( MONTH , %sysfunc( today() )
, -&Num. , B) , date9.);

/ create the date corresponding to end of month /

%let M&Num._end = %sysfunc( intnx( MONTH , %sysfunc( today() )
, -&Num. , E) , date9.);

%end;
%mend Monthly;

%Monthly
%put _user_; /** write all user defined macro variables and values to LOG **/

The above example dynamically generates the macro variables needed for the extensible solution. In a similar macro
%DO loop, the SQL to create the case statements could be accomplished.

SPACE MANAGEMENT - BEST PRACTICES

REUSE SPACE !!
Your use (or abuse) of SAS Work Space could crash other jobs (including yours)! If you are processing large data
files, please delete intermediate data sets when they are no longer needed. For example, once joins are complete
with other data, delete the intermediate data sets that comprised the join.

PROC DATASETS is handy to manage your SAS datasets including deleting files, modifying attributes, changing
names, etc.

proc datasets lib = work

memtype = data details;
/** delete old data **/
delete ChqReport_1a
ChqReport_1b
ChqReport_1c ;
quit;

Note: Before deleting intermediate data, you should confirm there are no error conditions, etc.

12
Marje Fecht, Prowerk Consulting Ltd, 2014
MINIMIZE THE AMOUNT OF DATA YOU READ
You do not have to read data to change many data set attributes. Many SAS programmers rely on the DATA step to
handle changes to variable attributes (labels, formats, renaming, etc.). However, the DATA step reads every record
which is problematic if your data include millions of records. Instead, the following PROC DATASETS example
• changes a data set name
• changes variables names
• assigns a variable format.
No data values are read!

proc datasets lib = project memtype = data details;

/** rename dataset **/
change ChqReport = ChqReport_&lastMonth ;
/** change variable attributes in ChqExtract **/
modify ChqExtract;
rename txns = Transactions
Date = Txn_Date;
format TotalAmt Dollar12.;
quit;

What about SAS Data sets?

When creating permanent SAS data sets, a date-time stamp in the name may be problematic for processing from
other programs and processes (eg: Excel Pivots). Instead, consider generation data sets.

Each data set in a SAS generation data group has the same member name (data set name) but has a different
version number. Every time the SAS data set is updated, a new generation data set is created and the version
numbers of the older versions are incremented. The DEFAULT version is called the base version, and is the most
recent version of the data. What are the advantages of using generation data sets?
• The most current version of the generation data group is referenced using the base data set name. Thus,
downstream programs do not need to worry about date-time stamps in the data set name.
• Older versions of the data are available for PROC COMPARE testing when validating the data.
• You don’t have to remember to save the data before creating a new version .

For further information on creating and using generation data sets, see the genmax and gennum data set options in
SAS online documentation. Or, reference Lisa Eckler’s SAS Global Forum Paper on Generation Data Sets:
http://support.sas.com/resources/papers/proceedings12/051-2012.pdf .

CONCLUSION
Deadlines and deliverables leave little time for learning new tricks and making changes to existing programs. But
there are improvements that can be made that save you time and headaches.

This paper provides techniques you may want to consider to update and improve your programs. In the long run, the
changes could generalize your code and decrease your maintenance efforts.

CONTACT INFORMATION AND RELATED TOPICS

Related papers are available at:
http://www.sascommunity.org/wiki/Presentations:Marje_Papers_and_Presentations

Your comments and questions are valued and encouraged. Contact the author at:

Marje Fecht, Prowerk Consulting

[email protected]

SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the
USA and other countries. ® indicates USA registration. Other brand and product names are trademarks of their respective
companies.

13
Marje Fecht, Prowerk Consulting Ltd, 2014

0 Git t24 Documentation PDF
100% (2)
0 Git t24 Documentation PDF
347 pages
Topic 2 - Problem Solving Concepts For The Computer
No ratings yet
Topic 2 - Problem Solving Concepts For The Computer
41 pages
Comp:P2
No ratings yet
Comp:P2
483 pages
Characater Functions
No ratings yet
Characater Functions
8 pages
Latest Algorithm Design Using Pseudocode
No ratings yet
Latest Algorithm Design Using Pseudocode
28 pages
Analyzing Information Using Ict
100% (3)
Analyzing Information Using Ict
13 pages
Oracle Iprocurement Set Up
No ratings yet
Oracle Iprocurement Set Up
25 pages
Aksa Lte NW Assessment
100% (2)
Aksa Lte NW Assessment
43 pages
Chapter 18 SB Answers
100% (1)
Chapter 18 SB Answers
9 pages
01 - CVP Comprehensive Call Flows
100% (2)
01 - CVP Comprehensive Call Flows
49 pages
Selection Control Structure
No ratings yet
Selection Control Structure
18 pages
CE 205-MATLAB For Civil Engineers Irfan Turk Fatih University, 2013-14
No ratings yet
CE 205-MATLAB For Civil Engineers Irfan Turk Fatih University, 2013-14
14 pages
B2B E-Commerce - Nobo - IT - Proposal
100% (1)
B2B E-Commerce - Nobo - IT - Proposal
36 pages
C Programming
No ratings yet
C Programming
212 pages
Dictm Aom 20091
100% (1)
Dictm Aom 20091
110 pages
SAS Functions by Example - Herman Lo
100% (1)
SAS Functions by Example - Herman Lo
18 pages
An Introduction To SAS Character Functions (Including Some New SAS 9 Functions)
No ratings yet
An Introduction To SAS Character Functions (Including Some New SAS 9 Functions)
48 pages
3 Functions
No ratings yet
3 Functions
43 pages
Sas Functions Pocketref
No ratings yet
Sas Functions Pocketref
171 pages
6 Matlab
No ratings yet
6 Matlab
72 pages
1738576707-Paper 2 Pseudocode Basics-1
No ratings yet
1738576707-Paper 2 Pseudocode Basics-1
76 pages
Matlab
No ratings yet
Matlab
72 pages
2013 Seminar Part1, Part2, Stracture Answer (English)
No ratings yet
2013 Seminar Part1, Part2, Stracture Answer (English)
12 pages
C Notes
No ratings yet
C Notes
172 pages
Squillace Erros
No ratings yet
Squillace Erros
57 pages
Sas Functions Pocketref
No ratings yet
Sas Functions Pocketref
171 pages
JIO FI Manual
100% (1)
JIO FI Manual
17 pages
04 Slide
No ratings yet
04 Slide
40 pages
Functions
No ratings yet
Functions
19 pages
Solution Manual For Problem Solving and Programming Concepts 9 E 9th Edition 132492644 PDF
0% (1)
Solution Manual For Problem Solving and Programming Concepts 9 E 9th Edition 132492644 PDF
6 pages
LBSIM Business Analytics Slides - Day 8
No ratings yet
LBSIM Business Analytics Slides - Day 8
38 pages
My Document
No ratings yet
My Document
37 pages
All 37 Functions
No ratings yet
All 37 Functions
33 pages
COBOLDay 2
No ratings yet
COBOLDay 2
75 pages
Cambridge International AS & A Level: Computer Science 9618/22
No ratings yet
Cambridge International AS & A Level: Computer Science 9618/22
28 pages
MSBTE Papers
No ratings yet
MSBTE Papers
28 pages
Transforming Data With SAS Functions
No ratings yet
Transforming Data With SAS Functions
48 pages
Chapter 4 Programming
No ratings yet
Chapter 4 Programming
56 pages
REXX Basics
No ratings yet
REXX Basics
38 pages
SAS Commands
No ratings yet
SAS Commands
13 pages
Stansys: Software Solutions
No ratings yet
Stansys: Software Solutions
19 pages
Base Five 08
No ratings yet
Base Five 08
19 pages
Class 5 Notes
No ratings yet
Class 5 Notes
33 pages
BVSS 2020 P1
No ratings yet
BVSS 2020 P1
14 pages
Abap 7.4
No ratings yet
Abap 7.4
16 pages
New Functions in SAS 9
No ratings yet
New Functions in SAS 9
7 pages
CS2
No ratings yet
CS2
10 pages
Imelda C. Go, Lexington County School District One, Lexington, SC
No ratings yet
Imelda C. Go, Lexington County School District One, Lexington, SC
4 pages
Base Interview 2
No ratings yet
Base Interview 2
10 pages
Functions: String Function
No ratings yet
Functions: String Function
27 pages
Phuse 2017: Geetha Kesireddi, Gce Solutions Inc, Hyderabad, India
No ratings yet
Phuse 2017: Geetha Kesireddi, Gce Solutions Inc, Hyderabad, India
8 pages
DATALINES, Sequential Files, CSV, HTML and More - Using INFILE and INPUT Statements To Introduce External Data Into The SAS System
No ratings yet
DATALINES, Sequential Files, CSV, HTML and More - Using INFILE and INPUT Statements To Introduce External Data Into The SAS System
18 pages
8 PseudocodePython
No ratings yet
8 PseudocodePython
9 pages
Phishing, Pharming, Vishing and Smishing
100% (1)
Phishing, Pharming, Vishing and Smishing
2 pages
8 PseudocodePython
No ratings yet
8 PseudocodePython
9 pages
Computer Awareness 1 PDF
No ratings yet
Computer Awareness 1 PDF
17 pages
Computer Science Paper 2 MS by Aqib Khan
No ratings yet
Computer Science Paper 2 MS by Aqib Khan
9 pages
8 PseudocodePython
No ratings yet
8 PseudocodePython
7 pages
Functtion in Base Sas and Proc SQL
No ratings yet
Functtion in Base Sas and Proc SQL
12 pages
SNR Digital Solutions 20 Ea Mock MC Question
No ratings yet
SNR Digital Solutions 20 Ea Mock MC Question
12 pages
9608 s15 Pre 21
No ratings yet
9608 s15 Pre 21
8 pages
A Few Alternatives To IF-THEN-ELSE
No ratings yet
A Few Alternatives To IF-THEN-ELSE
2 pages
Paper 233-30 An Introduction To SAS® Character Functions (Including Some New SAS®9 Functions) Ronald Cody, Ed.D
No ratings yet
Paper 233-30 An Introduction To SAS® Character Functions (Including Some New SAS®9 Functions) Ronald Cody, Ed.D
15 pages
SAS Chapter 03
No ratings yet
SAS Chapter 03
6 pages
217-2007 3 PDF
No ratings yet
217-2007 3 PDF
17 pages
Comp 12 Half
No ratings yet
Comp 12 Half
3 pages
Vxrail Simulator
No ratings yet
Vxrail Simulator
4 pages
Module Summary: Preparing Data: Reading and Filtering Data
No ratings yet
Module Summary: Preparing Data: Reading and Filtering Data
3 pages
The Best of Cheesy, SleazSAS Tricksy SAS Tricks
No ratings yet
The Best of Cheesy, SleazSAS Tricksy SAS Tricks
2 pages
Html5 and Css3
100% (1)
Html5 and Css3
88 pages
Running E-Business Suite On Exadata
No ratings yet
Running E-Business Suite On Exadata
37 pages
Privacy Tools v19.84 Secure Open List: Ubuntu Touch: Android Alternative For Phones and Tablets
No ratings yet
Privacy Tools v19.84 Secure Open List: Ubuntu Touch: Android Alternative For Phones and Tablets
84 pages
Service Quotas: User Guide
No ratings yet
Service Quotas: User Guide
19 pages
Privacy Preserving Data Mining Thesis PDF
100% (3)
Privacy Preserving Data Mining Thesis PDF
4 pages
Bria 3 Dial Plan Guide R1
No ratings yet
Bria 3 Dial Plan Guide R1
8 pages
Gitam: Department of Computer Science & Engineering and Department of Information Technology
No ratings yet
Gitam: Department of Computer Science & Engineering and Department of Information Technology
5 pages
1 What Is A Pivot Table
No ratings yet
1 What Is A Pivot Table
6 pages
Build Your Own Windows Server IT Lab PDF
No ratings yet
Build Your Own Windows Server IT Lab PDF
15 pages
Li Fi Technology
No ratings yet
Li Fi Technology
15 pages
Open Research Online: Integrating Web Services Into Data Intensive Web Sites
No ratings yet
Open Research Online: Integrating Web Services Into Data Intensive Web Sites
9 pages
Revit Shortcuts Cheat Sheet
No ratings yet
Revit Shortcuts Cheat Sheet
1 page
Quanser Qbot - Manual
No ratings yet
Quanser Qbot - Manual
23 pages
RCA 5-Why's Template
No ratings yet
RCA 5-Why's Template
2 pages
200 Assignment
No ratings yet
200 Assignment
2 pages
Final Essay 15% ENGLISH FOR ENGINEERS V
No ratings yet
Final Essay 15% ENGLISH FOR ENGINEERS V
2 pages
Spring 2023 Assignment 1 (CS301p)
No ratings yet
Spring 2023 Assignment 1 (CS301p)
3 pages
Authorizations and Roles (BPM)
No ratings yet
Authorizations and Roles (BPM)
6 pages
C Programming
From Everand
C Programming
Netra
No ratings yet
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)

Quick Hits - My Favorite SAS Tricks: Marje Fecht, Prowerk Consulting

Uploaded by

Quick Hits - My Favorite SAS Tricks: Marje Fecht, Prowerk Consulting

Uploaded by

Paper 1459-2014

Quick Hits - My Favorite SAS® Tricks

• CAT concatenates multiple strings in one function call

Using CATX, the above example would be reduced to:

new = CATX ( ' ', n , a, b, c);

ch_dm ch_on ch_cc ch_st Desired Result

/*** Create text field for each channel ***/

chann_1 = dm || '_' || on || '_' || cc || '_' || st;

ch_dm ch_on ch_cc ch_st chann_1

ch_dm ch_on ch_cc ch_st chann_1 channels_CATX

CONDITIONAL ASSIGNMENT OF VALUES - SIMPLE

IFC (IFN) returns a character (numeric) value based on whether an expression is

IFC (IFN) is coded as

Consider the statements

This logic can be rewritten as

/*** NO NEED TO CREATE EXTRA TEXT VARIABLES ***/

ch_dm ch_on ch_cc ch_st channels_IFC_CATX

CONDITIONAL ASSIGNMENT OF VALUES – MORE COMPLEX

Example 2: Assign course status based on course grade.

Length Status $7; /*IFC defaults the result to length 200*/

grade * (grade ge 70)

The corrected logic is:

Length Status $7;

MORE FUNCTIONS FOR YOUR TOOLKIT

A few more of my favorite (and more obscure) functions include:

• COUNT, COUNTC  counts # of occurrences of a string or Character

• LENGTH (min=1) , LENGTHN (min=0)  position of last non-blank.

• LARGEST ( k , var1, var2, …)  kth largest non-missing value

• SMALLEST ( k , var1, var2, …)  kth smallest non-missing value

BEST PRACTICES: REUSABLE CODE

Reusable code modules

CODE GENERALIZATION: LOGS AND OUTPUT

%let requestID = PROJECT974;

... <other program logic – following parameter def'n> ...

%let dir = \&filepath.\&requestID.;

CODE MODULARIZATION: SOURCE CODE AND DRIVER PROGRAMS

*** Driver Program – specify parameters;

/*** NO CHANGES below here ***/

One problem occurs with the above approach…

Contents of Control Program: ClaimsReportExtract_CurrentPgm.sas

*** call latest version of source program;

*** Driver Program – specify correct parameters;

CREATE DYNAMIC LISTS

proc sql noprint;

%put NumCodes = &NumCodes;

The resulting comma delimited list can be used

You want the variable names to reflect the month, such as

sum(case when txn_date between "&M1_beg"d and "&M1_end"d

You need macro variables for:

INTNX – move in intervals

The INTNX function increments dates by intervals:

%let M1 = %sysfunc( intnx( MONTH ,

%let M2 = %sysfunc( intnx( MONTH ,

%let M2_end = %sysfunc( intnx( MONTH

%let M0 = %sysfunc( today() , yymmN4.);

%let M1 = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -1)

%let M2 = %sysfunc( intnx( MONTH , %sysfunc( today() ) , -2)

,sum(case when txn_date between

,sum(case when txn_date between

where txn_date between "&M2_beg"d and "&M0_end"d

Suppose your query is for a database table with

DYNAMIC CODE GENERATION

COPY – PASTE syndrome. . .

What Macro variables and SAS variables are needed?

Macro Variable Name Value SAS Variable Name

%let MaxMonth=11; /** Months start at ZERO **/

/** create suffix for variable names **/

/** create the date corresponding to beginning of month **/

/** create the date corresponding to end of month **/

SPACE MANAGEMENT - BEST PRACTICES

proc datasets lib = work

proc datasets lib = project memtype = data details;

What about SAS Data sets?

CONTACT INFORMATION AND RELATED TOPICS

Marje Fecht, Prowerk Consulting

You might also like

/* Create text field for each channel */

/* NO NEED TO CREATE EXTRA TEXT VARIABLES */

Length Status $7; /IFC defaults the result to length 200/

/* NO CHANGES below here */

%let MaxMonth=11; / Months start at ZERO /

/ create suffix for variable names /

/ create the date corresponding to beginning of month /

/ create the date corresponding to end of month /