Prior Classes 2021

WUSS Online Classes 2021 – PRIOR CLASSES

In lieu of an annual conference, WUSS is offering an extensive menu of online training classes taught by seasoned industry experts throughout 2021. The classes listed below have already taken place. Please click on each class title for a detailed description of the course and information about the instructors.

These are past classes – for information about UPCOMING CLASSES, please click here!

Date Course Title (click for description) Instructor(s)
APRIL 2021
Apr 26 SAS + R Part 1: Connecting SAS and R in Your Data Science Workflow Hunter Glanz
Apr 28 SAS + R Part 2: Using R Shiny to Make Your Data Wrangling and Visualization Interactive Hunter Glanz
JUNE 2021
Jun 17 Why You Are Using PROC GLM Too Much (and What You Should Be Using Instead) Part 1 Deanna Schreiber-Gregory and Peter Flom
Jun 18 Why You Are Using PROC GLM Too Much (and What You Should Be Using Instead) Part 2 Deanna Schreiber-Gregory and Peter Flom
Jun 22 How Sick is my Cohort of Patients? A General Approach to Identifying Chronic Conditions Patricia Ferido
Jun 25 Elementary Logistic Regression with Predictive Modeling Bruce Lund
SEPTEMBER 2021
Sep 2 Introduction to the SQL Procedure Kirk Paul Lafler
Sep 3 PROC SQL Programming: Beyond the Basics Using SAS® Kirk Paul Lafler
Sep 17 Python Programming Seminar – Basics for Statistical Programmers and Statisticians Kevin Lee
Sep 20 Advanced DATA Step Programming Techniques Josh Horstman
Sep 22 SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 1 Tasha Chapman
Sep 24 Python Programming Seminar – Advanced with Machine Learning Kevin Lee
NOVEMBER 2021
Nov 5 SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 1 – ENCORE Tasha Chapman
Nov 15 SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 2 Tasha Chapman
Nov 17 SAS Macro Quoting: Learning the Skills of a Macro Developer Russ Lavery

Course Descriptions

SAS + R Part 1: Connecting SAS and R in Your Data Science Workflow

Hunter Glanz
Monday, April 26, 2021, 10:00am-2:00pm Pacific Daylight Time

As robust statistical software packages, SAS and R boast a great number of tools for addressing all of your data-related needs. While there exists large overlap in what they provide, today’s statistical and data science problems increasingly involve multiple software packages. After all, if you have access to all of these tools then why not explore how they can improve your workflow! In this class we will explore the complete workflow of cleaning a dataset, exploring it, visualizing it using a combination of SAS and R.

SAS + R Part 2: Using R Shiny to Make Your Data Wrangling and Visualization Interactive

Hunter Glanz
Wednesday, April 28, 2021, 10:00am-2:00pm Pacific Daylight Time

While both SAS and R include a rich suite of tools for working with your data, there often exists a collection of tasks and activities that get repeated with every new dataset. Traditionally such repetition could be addressed by building macros or functions. R Shiny enhances this process by making your data work interactive! Not only can this save you some code and work, but it provides a way for consumers of your work to do all of your cool data science-y things without needing to know how to program. In this class we will build our very own basic shiny applications using R.

Why You Are Using PROC GLM Too Much (and What You Should Be Using Instead) Part 1

Deanna Schreiber-Gregory and Peter Flom
Thursday, June 17, 2021, 10:00am-2:00pm Pacific Daylight Time

The general linear model (linear regression and ANOVA) is one of the most commonly used statistical methods. However, the GLM makes assumptions and sometimes these assumptions are violated. There are many techniques that can be used to deal with various violations, and there are SAS PROCs to implement these. These include: Quantile regression, Robust regression, Cubic splines and other forms of splines, Multivariate adaptive regression splines (MARS), Regression trees, Multilevel models, Ridge Regression, LASSO, and Elastic Nets, among other methods. Covered PROCs include QUANTREG, ROBUSTREG, ADAPTIVEREG and MIXED.

Part 1: Intro, assumptions, diagnosing violations, quantile regression (QUANTREG), MARS (ADAPTIVEREG) and splines (TRANSREG).

Why You Are Using PROC GLM Too Much (and What You Should Be Using Instead) Part 2

Deanna Schreiber-Gregory and Peter Flom
Friday, June 18, 2021, 10:00am-2:00pm Pacific Daylight Time

The general linear model (linear regression and ANOVA) is one of the most commonly used statistical methods. However, the GLM makes assumptions and sometimes these assumptions are violated. There are many techniques that can be used to deal with various violations, and there are SAS PROCs to implement these. These include: Quantile regression, Robust regression, Cubic splines and other forms of splines, Multivariate adaptive regression splines (MARS), Regression trees, Multilevel models, Ridge Regression, LASSO, and Elastic Nets, among other methods. Covered PROCs include QUANTREG, ROBUSTREG, ADAPTIVEREG and MIXED.

Part 2: Ridge regression (REG), Lasso and elastic nets (GLMSELECT), and multilevel models (MIXED and GLIMMIX).

How Sick is my Cohort of Patients? A General Approach to Identifying Chronic Conditions

Patricia Ferido
Tuesday, June 22, 2021, 10:00am-2:00pm Pacific Daylight Time

With the COVID-19 pandemic, the need for evidence-based healthcare research has become increasingly apparent. Even before the recent health crisis, the volume of available data on healthcare had been growing exponentially. Claims data and electronic health records provide rich insight into the health status of patients and the care provided by health care systems. Successfully uncovering these insights, however, requires an understanding of the data, as well as standardized and validated methods of analysis. This class will provide an overview of best practices when working with claims data, specifically Medicare claims data. Topics covered will include: the general structure of claims data, how best to use that information to identify disease cohorts, different approaches for measuring the health status of patients (e.g., Charlson Comorbidity Index, the Elixhauser Comorbidity Index, Hierarchical Condition Category Coding, etc.), and a deep dive into the Chronic Condition Warehouse (CCW) algorithms. Finally, the class will conclude with the workshopping of a SAS Macro that applies CCW-like rules to any dataset that resembles insurance claims or electronic health records with a full-picture of diagnoses and procedures from patient medical visits. The macro package includes the CCW validated algorithms (the default option), but also has the flexibility for the user to apply the algorithm to a different set of diagnoses and procedures. The user can either implement variations of the CCW-definitions or identify entirely new conditions, so long as they can be implemented using diagnosis or procedure codes, claim types, and CCW-like rules. After taking the class, students will have an understanding of key factors to consider in disease cohort analysis and will have direct experience using this package to identify diseases in simulated data.

Elementary Logistic Regression with Predictive Modeling

Bruce Lund
Friday, June 25, 2021, 10:00am-2:00pm Pacific Daylight Time

This class presents light theory, supported by simulations, for understanding binary logistic regression models using SAS®. This discussion of logistic regression begins at the beginning. No prior experience is assumed.

Once the basics of logistic regression are introduced, the class focuses on using logistic models in predictive modeling on large datasets. Examples from credit risk and automotive marketing are given. The class will be less focused on explanatory models as would arise in the bio-sciences.
Topics include: Logistic regression versus other methods; Likelihood function and maximum likelihood estimators; Statistics for predictor and overall model fit; Screening, binning, transforming of predictors (including weight of evidence coding); Discussion of multicollinearity; Predictor selection methods using PROC LOGISTIC, HPLOGISTIC, HPGENSELECT including best subsets, stepwise with sbc/aic, Lasso; Model validation and assessment including c statistic, R-squares classification error, and lift charts in the context of training, cross-validation, and validation samples.

Class uses BASE SAS and SAS/STAT. No usage of Viya or Enterprise Miner.

Introduction to the SQL Procedure

Kirk Paul Lafler
Thursday, September 2, 2021, 10:00am-2:00pm Pacific Daylight Time

This introductory-level course provides SAS® users with a solid foundation with the Base-SAS software’s SQL procedure. Attendees learn core concepts and features on how PROC SQL can be used to interact and access data stored in SAS datasets (or tables); use PROC SQL to retrieve, subset, order, and group data and results; understand the similarities and differences between DATA step merges and joins; perform simple inner and outer join operations to shape results; produce “quality” output and results using PROC SQL and Output Delivery System (ODS); create new tables; create and modify data in tables; and manage the tables themselves.

PROC SQL Programming: Beyond the Basics Using SAS®

Kirk Paul Lafler
Friday, September 3, 2021, 10:00am-2:00pm Pacific Daylight Time

PROC SQL Programming: Beyond the Basics Using SAS® provides SAS® and SQL users with core concepts, features and techniques on how to effectively use PROC SQL. Attendees learn how to use PROC SQL to access data in SAS datasets (tables); review essential programming tasks including retrieving, execution order of the SELECT clauses, subsetting, ordering, and grouping data; construct logic scenarios with case expressions; explore one-to-one, one-to-many, and many-to-many data relationships; understand the similarities and differences between DATA step merges and joins; create inner and outer join constructs as well as apply set operators to combine two or more tables together; use summary (statistical) functions to aggregate data; use three different approaches to create new tables; interface PROC SQL and the macro facility to create single-value and multi-value (list) macro variables; apply a number of query debugging techniques to help detect coding errors, warnings and other messages; and scale SQL queries for improved performance.

Python Programming Seminar – Basics for Statistical Programmers and Statisticians

Kevin Lee
Friday, September 17, 2021, 10:00am-2:00pm Pacific Daylight Time

Python is one of the most popular language nowadays. Python can be used to build just about anything, and it is a great language for back-end web development, data analysis, scientific computing, machine learning and many more.

The seminar is intended for Statistical Programmers and Statisticians who are familiar with SAS programming. It is not easy for programmers and biostatisticians to learn new language alone. The seminar will provide basic concept and foundation of Python programming, and the seminar will provide its comparison and similarity with SAS programming. Therefore, Statistical Programmers and Statisticians have easier time to understand how Python programming works.

The basic Python Programming seminar will cover basic Python programming. It is recommended for those who has a little or no experience in Python programming. It will help SAS programmers and statisticians how to start Python programming and how to use Jupyter Notebook/Lab (the most popular python platform).

Agenda for the seminar: Python Programming Seminar – Basics

  • Introduction to Python for statistical programmers and statisticians
  • Jupyter Notebook (Python programming platform) download and implementation
  • Python Variables Type: Number, String, Lists, Dictionaries, Arrays, Data Frames
  • Simple variable manipulation – If & For statements
  • Python Function development and comparison with SAS Macro
  • Import external Modules/Functions
  • Reading and writing external data (excel, SAS datasets, Images)
  • Data manipulation using Python
  • Introduction of NumPy and Array
  • Introduction of Pandas and DataFrame: DataFrame vs SAS datasets
  • Basic data manipulation – merge, sort, variables drop/addition
  • Create SDTM DM dataset using SAS raw datasets

Advanced DATA Step Programming Techniques

Josh Horstman
Monday, September 20, 2021, 10:00am-2:00pm Pacific Daylight Time

To solve complex coding problems with the SAS® DATA step, one must go beyond a basic understanding of the individual statements. You need to understand how the various statements interact with each other and how their options can be leveraged to build DATA step code that provides innovative solutions to the toughest of problems. Based on Art Carpenter’s book, Carpenter’s Guide to Innovative SAS® Techniques, this class is a must for the DATA step programmer who wants to take his or her programs to the ‘next’ level.

Topics include working across multiple observations using look-ahead and Look-back techniques, employing the DOW loop, taking advantage of double SET statements, working with hash objects, performing table lookups, using arrays to transpose data from columns to rows and back again, evaluating complex expressions, applying data set options, adopting new DATA step functions (and old function with new options), and more.

This course is designed to be taken by a student who has a basic understanding of the DATA step and its primary statements. The material will focus on advanced topics that will give the student a deeper understanding of the operation of the DATA step. Through examples, students will be exposed to innovative techniques for solving difficult programming problems.

SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 1

Tasha Chapman
Wednesday, September 22, 2021, 10:00am-2:00pm Pacific Daylight Time

SAS Essentials is a three-part instructor-led course that provides a thorough introduction to the basics of SAS programming including DATA steps, PROC steps, and the Output Delivery System. In these classes we focus entirely on coding, providing a fundamental education in how SAS thinks and unlocking the power to use the incredible versatility of SAS code. Whether you’re entirely new to SAS, new to coding, or just want to brush up on the fundamentals, these classes are for you.

Part 1: DATA steps and data manipulation (or How to Train Your SAS Datasets)

Together we’ll walk through the fundamental building blocks of a SAS program with a deep dive into DATA steps and data manipulation, including SAS libraries, conditional processing, functions, and more.

Python Programming Seminar – Advanced with Machine Learning

Kevin Lee
Friday, September 24, 2021, 10:00am-2:00pm Pacific Daylight Time

The advanced Python programming seminar will cover more advanced Python programming. It is recommended for those who took morning seminar or last year’s Python course, or for those who have some knowledge, but want to learn more advanced Python programming. This seminar will also cover Machine Learning implementation using Python.

Agenda for advanced Python programming seminar

  • Simple review of basic Python Programming seminar
  • Metadata analysis (PROC CONTENT)
  • Advanced Programming – transpose, remove duplicate record, group-by
  • Statistical Analysis – Pair t-test, Fisher Exact Test, Survival Analysis
  • Data visualization – Scatter Plot, Histogram, Kaplan Meier Curves
  • Machine Learning Introduction – concepts and theory
  • Machine Learning Algorithm – Regression, Logistic Regression, Decision Tree
  • Deep Learning Algorithm – Deep Neural Network (DNN), Convolutional Neural Network (CNN), Recurrent Neural Network (RNN)
  • Python Machine Learning modules – Sklearn, Tensorflow, Keras
  • Python Machine Learning workshop using image data

Through the seminars, programmers and statisticians will be able to learn the following:

  • Deeper Understanding of Python programming
  • Jupyter Notebook download and experience
  • Real time Python coding exercise
  • Difference and similarity with SAS programming
  • Data Manipulation and analysis in Python
  • Machine Learning programming in Python





SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 1 – ENCORE

Tasha Chapman
Friday, November 5, 2021, 10:00am-2:00pm Pacific Time

SAS Essentials is a three-part instructor-led course that provides a thorough introduction to the basics of SAS programming including DATA steps, PROC steps, and the Output Delivery System. In these classes we focus entirely on coding, providing a fundamental education in how SAS thinks and unlocking the power to use the incredible versatility of SAS code. Whether you’re entirely new to SAS, new to coding, or just want to brush up on the fundamentals, these classes are for you.

Part 1: DATA steps and data manipulation (or How to Train Your SAS Datasets)

Together we’ll walk through the fundamental building blocks of a SAS program with a deep dive into DATA steps and data manipulation, including SAS libraries, conditional processing, functions, and more.





SAS Essentials (or How I Learned to Stop Worrying and Love Code) Part 2

Tasha Chapman
Monday, November 15, 2021, 10:00am-2:00pm Pacific Time

SAS Essentials is a three-part instructor-led course that provides a thorough introduction to the basics of SAS programming including DATA steps, PROC steps, and the Output Delivery System. In these classes we focus entirely on coding, providing a fundamental education in how SAS thinks and unlocking the power to use the incredible versatility of SAS code. Whether you’re entirely new to SAS, new to coding, or just want to brush up on the fundamentals, these classes are for you.

Part 2: PROC steps and basic reporting (or How to Succeed in SAS Without Really Trying)

This course will feature a showcase of the most common reporting procedures, including MEANS, FREQ, PRINT, TABULATE, and REPORT. We’ll also walk through the Output Delivery System and how it can be used to build professional reports the easy way.





SAS Macro Quoting: Learning the Skills of a Macro Developer

Russ Lavery
Wednesday, November 17, 2021, 10:00am-2:00pm Pacific Time

Attending this seminar that will give people the skills to move from being a macro writer to macro developer by understanding macro quoting. It works through more macro quoting examples than exist in all other online materials – combined.

This seminar focuses on using examples, and animated PowerPoints of the internal workings of the SAS system, to make words in the documentation clear and understandable. This seminar is effective for several reasons. Firstly, pictures/maps of the system are easier to understand than words when establishing relationships among system components. Secondly, a programmer must understand the sequence, and timing, of the steps in macro quoting in order to write/debug macros. This is best communicated as a series of “detailed moving images” that show the states of different parts of the system.

The moving graphical presentation has several advantages:

  • The picture of the process allows a reader to “check their understanding”. If the picture agrees with an attendee’s understanding of the words, the concept has been understood. If the attendee wants to ask questions, the map/picture allows the attendee to ask very specific/focused question and get answers that address their point of confusion.
  • This graphic presentation of the material helps bridge language barriers (I’ve done this in China 3 times). When English skills are lacking, the picture provides a second channel of communication.
  • The macro quoting process is complex and understanding/learning the process requires a student “hold the state of the system in their memory.” Using pictures of the system greatly reduces the mental complexity (human memory needs) of the learning process.

Meet the Instructors

Tasha Chapman has been using SAS since 2003 and an active member of the SAS User Community since 2006. Throughout her career she has championed continuing education and professional development, presented trainings and workshops at professional conferences across the country, sat as an executive committee member on nationally recognized associations, and founded two organizations dedicated to educating colleagues about research, data, and statistics. She was the chair of the Western Users of SAS Software Conference and Educational Forum in 2017.

Tasha received a Master’s Degree in Psychology, with an emphasis on personality assessment and psychometrics from the University of California, Riverside. Since 2005 she has worked for the State of Oregon as a lead research analyst providing data and policy analysis for Oregon OSHA, the Oregon Child Welfare Program, and the Oregon Health Authority. She lives in Salem, Oregon with her two adorable kids, Nadia and Atlas.

Patricia Ferido is a Senior Research Programmer at the Leonard D. Schaeffer Center for Health Policy and Economics where she analyzes medical data for research on dementia care and treatment. Prior to joining the Schaeffer Center, she worked as an economics litigation consultant specializing in the analysis of labor data for wage and hour litigation. She holds a BA in both Economics and International Development Studies from UCLA and is pursuing a Masters in Public Policy Data Science at USC.
Peter Flom is a retired independent statistical consultant who worked with graduate students and researchers in the social, medical and behavioral sciences. He has been using SAS for over 20 years and has given talks at SAS Global Forum and many local and regional SAS user groups.
Hunter Glanz is an Associate Professor of Statistics and Data Science at California Polytechnic State University (Cal Poly, San Luis Obispo). He received a BS in Mathematics and a BS in Statistics from Cal Poly, San Luis Obispo followed by an MA and PhD in Statistics from Boston University. He maintains a passion for machine learning and statistical computing, and enjoys advancing education efforts in these areas. In particular, Cal Poly’s courses in R, SAS, and Python give him the opportunity to connect students with exciting data science topics amidst a firm grounding in communication of statistical ideas. Hunter serves on numerous committees and organizations dedicated to delivering cutting edge statistical and data science content to students and professionals alike. In particular, the ASA’s DataFest event at UCLA has been an extremely rewarding experience for the teams of Cal Poly students Hunter has had the pleasure of advising.
Josh Horstman is an independent statistical programmer based in Indianapolis with 20 years’ experience using SAS in the life sciences industry. He specializes in analyzing clinical trial data, and his clients have included major pharmaceutical corporations, biotech companies, and research organizations. A SAS Certified Advanced Programmer, Josh loves coding and is a frequent presenter at SAS Global Forum and various regional and local SAS users’ group. Josh holds a bachelor’s degree in mathematics and computer science, and a master’s degree in statistics from Colorado State University.
Kirk Paul Lafler is an entrepreneur, consultant, programmer and educator, and has been a SAS user since 1979. Kirk is a lecturer and adjunct professor at San Diego State University; an advisor and adjunct professor at the University of California San Diego Extension; and teaches dozens of SAS, SQL, Excel, R and Python courses, seminars, workshops, and webinars to users around the world. As the author of several books including PROC SQL: Beyond the Basics Using SAS, Third Edition (SAS Press. 2019) along with hundreds of papers and articles on a variety of SAS topics; Kirk has been selected as an Invited speaker, educator, keynote and section leader at SAS conferences and meetings worldwide; and is the recipient of 25 “Best” contributed paper, hands-on workshop (HOW), and poster awards.
Russ Lavery is a frequent and multiple-award winning presenter at SAS and other programming conferences. He has been the technical reviewer for five books by SAS press and has lectured all over the U.S. in Europe and in Asia.
Kevin Lee is a Data Scientist, Machine Learning Leader/Instructor/Evangelist in Pharmaceutical Industry. Currently, Kevin is Assistant Vice President of AI/Machine Learning Consultant at Genpact and teaches Machine Learning/Python/CDISC/Oncology courses at conferences and university. Kevin has been a big advocate in leadership and innovative technologies, with which Kevin wants to innovate Pharmaceutical Industry. Kevin earned an M.S. in Applied Statistics at Villanova University following a B.S. from University of Pennsylvania.
Bruce Lund is a statistical modeling consultant and trainer. For 15 years he was a statistical and modeling consultant for OneMagnify of Detroit. Before OneMagnify, he was the customer database manager at Ford Motor Company and a mathematics professor at University of New Brunswick, Canada. He has a mathematics PhD from Stanford University. Bruce Lund has presented at SAS Global Forum, SAS AnalyticsX, ASA CSP, and at regional SAS user group conferences.
Deanna Schreiber-Gregory is a Lead Research Statistician and Data Manager on contract through the Henry M Jackson Foundation for the Advancement of Military Medicine to the Department of Defense in Bethesda, MD. She is also an Independent Consultant for Statistics, Research Methods, and Data Management in the private sector through Juxdapoze, LLC. Deanna has an MS in Health and Life Science Analytics, a BS in Statistics, and a BS in Psychology. Deanna has presented as a contributed and invited speaker at over 50 local, regional, national, and global SAS user group conferences since 2011.