WUSS 2023 Classes

WUSS will once again be offering a full menu of optional pre- and post-conference training classes on a variety of topics taught by seasoned experts. Classes are not included in conference registration and must be added separately. Half-day classes are $200, and full-day classes are $400. Don’t miss this chance to maximize your experience at WUSS 2023!

Course Title (click for description) Instructor(s)
(click for bio)
Time
Monday, Oct. 30, 2023 – FULL DAY
Hands-on Workshop (HOW*) – Python Programming Masterclass with Comparison to SAS® Kirk Paul Lafler 8:00 AM – 5:00 PM
Introduction to Regression Analysis Theresa Ngo 8:00 AM – 5:00 PM
Tuesday, Oct. 31, 2023 – HALF DAY MORNING
SAS Arrays To Save the Day! Troy Hughes 7:30 AM – 11:30 AM
Flip It, Slice It, Merge It, Splice It: Manipulating and Transforming Data Using SAS Josh Horstman 7:30 AM – 11:30 AM
Statistics for Programmers Jim Box 7:30 AM – 11:30 AM
Hands-on Workshop (HOW*) – Getting Started with SAS Studio: A Point-And-Click Approach to Preparing and Exploring Data Tom Grant 7:30 AM – 11:30 AM
Thursday, Nov. 2, 2023 – HALF DAY AFTERNOON
Hands-on Workshop (HOW*) – Commit early, commit often! A gentle introduction to the joy of Git and GitHub! Isaiah Lankham
& Matthew Slaughter
1:30 PM – 5:30 PM
Automate your Business Processes with SAS Macros Michael Aleman 1:30 PM – 5:30 PM
Mastering the Machine Learning Toolkit to Power Your Classification and Regression Needs Ryan Lafler 1:30 PM – 5:30 PM

*HOW = Hands-on Workshop. These courses will feature instructor-led exercises that can be completed in class.




Course Descriptions

Hands-on Workshop (HOW) – Python Programming Masterclass with Comparison to SAS®
Kirk Paul Lafler
Monday, October 30, 2023
8:00 AM – 5:00 PM PDT

As a general-purpose programming language used by millions of users and developers around the world, Python offers clear syntax, scalability, versatility, and powerful libraries that add tremendous value for anyone to incorporate into their skill sets. Python’s use cases include programming, analytics, data science, web development, web scraping, text processing, image recognition, game development, artificial intelligence, machine learning, and Internet of Things. What seems to be propelling Python’s dominance is that it is relatively easy to learn, consists of a large and growing user community, and is freely available as open source. This course is designed for beginners who have never used Python and/or SAS software before, as well as programmers of other languages who want to enhance their skill set and career opportunities by learning Python and/or SAS software programming techniques.

Intended Audience: Programmers, Data Analysts, Data Scientists, Statisticians, and Others wanting to learn Python and/or SAS software
Prerequisites: No previous Python / SAS programming experience required
Delivery Method: Instructor-led with code examples
Course Material: e-Course Notes (PDF format) and Python / SAS code are provided to Attendees.

Back to top


Introduction to Regression Analysis
Theresa Ngo
Monday, October 30, 2023
8:00 AM – 5:00 PM PDT

This course provides an introduction to regression analysis from model building and variable screening to residual analysis. Some regression pitfalls are highlighted with solutions to resolve or minimize errors and improve model fitting and accuracy. We will begin with Multiple Linear Regression, predicting a quantitative response variable based on two or more independent variables. Then we will build a Logistic Regression model for a binary qualitative response variable. At the end of this course, students will have a good understanding of regression analysis to model quantitative and binary qualitative response variables using SAS/STAT and SAS/GRAPH. 

Back to top


SAS Arrays To Save the Day!
Troy Hughes
Tuesday, October 31, 2023
7:30 AM – 11:30 AM PDT

Attend and receive a FREE copy of the author’s 550-page book, SAS® Data-Driven Development: From Abstract Design to Dynamic Functionality, Second Edition, released in 2022! Students will receive the physical book at the training.
SAS arrays containerize linear, tabular, and multidimensional data into same-type data structures. Arrays can be iterated using various DO loops, and can greatly simplify software by reducing code redundancy and complexity, which in turn maximizes software readability and maintainability. In many cases, an array can be declared and used in lieu of a series of variables, such as when the OF operator is leveraged by built-in functions and subroutines. Moreover, arrays represent the only method to pass multi-element arguments to and from user-defined functions and subroutines created using PROC FCMP (aka, the SAS Function Compiler). This course summarily demonstrates array syntax and advantages in both the DATA step and PROC FCMP.
DATA step array functionality includes:
• declaration and usage (e.g., DO loop) of explicitly indexed arrays
• declaration and usage (e.g., DO OVER loop) of implicitly indexed arrays
• usage of _I_ automatic variable to increment loops
• usage of the DIM, LBOUND, and HBOUND functions to evaluate array dimensions
• declaration of _TEMPORARY_ arrays and their benefits
• use of the OF operator to reference arrays within built-in functions and subroutines
• advanced array syntax to merge data sets and perform lookup operations
PROC FCMP array functionality additionally includes:
• passing a dynamic array to a user-defined function or subroutine
• returning a dynamic array from a user-defined function or subroutine (using the OUTARGS statement)
• declaring a static array inside a user-defined function or subroutine
• usage of the READ_ARRAY function to ingest arrays into FCMP, and the WRITE_ARRAY function to convert arrays into data sets
• usage of the DYNAMIC_ARRAY built-in subroutine (and NOSYMBOLS option) to declare a dynamic array
• conversion between arrays and hash objects
• limitations (and workarounds) of arrays within PROC FCMP

Back to top


Flip It, Slice It, Merge It, Splice It: Manipulating and Transforming Data Using SAS
Josh Horstman
Tuesday, October 31, 2023
7:30 AM – 11:30 AM PDT

Data are rarely received in a format and structure convenient for analysis and reporting. Data frequently requires extensive preparation to rearrange and transform it. Fortunately, SAS provides a comprehensive array of tools for manipulating data. Understanding how these tools work and when to use them is essential for getting the most value out of your data. This half-day course will provide an overview of various SAS programming techniques for combining, rearranging, transposing, and summarizing data. Topics will include DATA step merges, PROC SQL joins, PROC TRANSPOSE, PROC MEANS/SUMMARY, and more. Through real-life examples, the course shows how these various tools can be combined to accomplish complex data manipulation. Upon completion of this course, students will have a better understanding how to get from “the data they have” to “the data they want”.

Back to top


Statistics for Programmers
Jim Box
Tuesday, October 31, 2023
7:30 AM – 11:30 AM PDT

Ever wonder about the how statistics behind some of the analyses you create work? Want to know what p-values really mean? Join this class for a look at the concepts of probability and statistcs used in research. This is a concepts class, so we won’t go too heavy on the math, but will cover topics like hypothesis testing and other inferential statistics

Back to top


Hands-on Workshop (HOW) – Getting Started with SAS Studio: A Point-And-Click Approach to Preparing and Exploring Data
Tom Grant
Tuesday, October 31, 2023
7:30 AM – 11:30 AM PDT

This hands-on workshop shows how one can use the menu driven tasks and SAS code in SAS Studio to perform common reporting and research tasks, including querying, reporting, and analyzing data. SAS Studio provides a point-and-click, graphical user interface, as well as predefined code that helps you exploit the power of SAS. In this workshop you will learn to access your data, combine tables, compute new variables, and explore data with simple statistics and graphs.

Back to top


Hands-on Workshop (HOW) – Commit early, commit often! A gentle introduction to the joy of Git and GitHub!
Isaiah Lankham, Matthew Slaughter
Thursday, November 2, 2023
1:30 PM – 5:30 PM PDT

In this hands-on workshop, we’ll introduce you to the joy of Git and GitHub for managing codebases of any size, whether working alone or as part of a team.

In recent years, the social coding platform GitHub has become synonymous with open-source software development, with many developers also publishing their code as a form of résumé. Behind the scenes, GitHub uses software called Git, which was originally developed as a distributed version control system for managing contributions of thousands of developers to the Linux kernel.

Collaborating together, we’ll practice using the GitHub website and Git from the command line. Topics will include basic Git/GitHub concepts like forking, cloning, and branching, as well as best practices for maintaining a well-organized history of code changes. We’ll also use the GitHub web interface for pull requests, which are the standard mechanism for contributing to open-source projects, and we’ll ensure every participant leaves this workshop with (a) a fully setup GitHub account and (b) at least one open-source contribution.

No knowledge of Git or GitHub will be assumed, and no software will need to be installed. In order to work through interactive examples, accounts will be needed for GitHub and Google. Complete setup steps will be provided at https://github.com/saspy-bffs/wuss-2023-class

Back to top


Automate your Business Processes with SAS Macros
Michael Aleman
Thursday, November 2, 2023
1:30 PM – 5:30 PM PDT

The course will be presented to leverage SAS to enhance your existing data ecosystem to automate key reports and processes to stakeholders, automate alerts when processes are run and completed, and creating automated programs that require low to no maintenance using SAS Macros. The course will go over aspects regarding the identification of processes and/or reports that are candidates for automation and benefits of maximizing your outputs. Inputs of data, internal or external (e.g. Oracle, IBM, etc.) that can be transformed into macros such as lists, table transformations, and parameters in the where clause will also be reviewed

Back to top


Mastering the Machine Learning Toolkit to Power Your Classification and Regression Needs
Ryan Lafler
Thursday, November 2, 2023
1:30 PM – 5:30 PM PDT

The rise of Big Data has led to the rapid development of new statistical and machine learning models introduced in Python and SAS. Suitable for all data scientists interested in developing models using Python and SAS, this course empowers them to choose, fine-tune, optimize, and deploy powerful models that are tailored to their organization’s needs. Topics include minimizing the bias-variance tradeoff associated with choosing the right model, statistical inference vs. predictive power, generalizing models to predict beyond their training dataset, balancing model complexity with interpretability, and providing examples of each model programmed in Python and SAS. Generalized linear models, ensemble learning methods, gradient boosting, support vector machines, and multi-layered neural networks are fully developed and showcased for classification and regression applications. Code examples for each model are shown using SAS/STAT High-Performance (HP) Procedures and popular Python packages including scikit-learn, Statsmodels, and TensorFlow.

Back to top


Meet the Instructors

Michael Aleman is a SAS Programmer with 10 years experience in the financial (loan originations and asset recovery) and health care sectors.  Michael’s areas of expertise include automating daily patient identification processes, targeting populations for communications using APIs, dashboard reporting, and data management /monitoring.  He received a Bachelors’ of Science from National University and is currently employed at MedImpact Healthcare Systems, Inc.

Back to top

Jim Box is a Data Scientist with the Life Sciences Customer Advisory group at SAS. Prior to that, he worked as a statistician for clinical trials for over 20 years.

Back to top

Tom Grant holds a Master of Science degree in operations research from Virginia Commonwealth University. Prior to joining SAS in 2000, Tom worked in the accounting department of a national furniture retailer and developed statistical models to predict inventory losses. Tom was also manager of the Marketing Analytics Group for a retail clothing catalog company, in charge of building predictive response and life-time value models. Since joining SAS, Tom has worked as a consultant, assisting SAS customers with implementation of analytic projects in various industries, including banking, retail services, insurance, manufacturing, and web-site analysis. Tom also teaches Intro to Statistics at a small liberal arts college. Tom is currently a principal analytical training consultant in the SAS Global Academic Program, and he assists higher education institutions in the effective use of SAS.

Back to top

Josh Horstman is an independent statistical programmer based in Indianapolis with over 20 years’ experience using SAS in the life sciences industry. He specializes in analyzing clinical trial data, and his clients have included major pharmaceutical corporations, biotech companies, and research organizations. A SAS Certified Advanced Programmer, Josh loves coding and is a frequent presenter and trainer at SAS user conferences. Josh holds a bachelor’s degree in mathematics and computer science, and a master’s degree in statistics from Colorado State University.

Back to top

Troy Martin Hughes has been a SAS practitioner for more than 20 years, has managed SAS projects in support of federal, state, and local government initiatives, and is a SAS Certified Advanced Programmer, SAS Certified Base Programmer, SAS Certified Clinical Trials Programmer, and SAS Professional V8. He has authored three groundbreaking books that model software design and development best practices: PROC FCMP User-Defined Functions: An Introduction to the SAS® Function Compiler (2023); SAS® Data-Driven Development: From Abstract Design to Dynamic Functionality, Second Edition (2022); and SAS® Data Analytic Development: Dimensions of Software Quality (2016).

Troy has an MBA in information systems management as well as other credentials, including: PMP, PMI-RMP, PMI-PBA, PMI-ACP, SSCP, CISSP, CSSLP, Network+, Security+, CySA+, CASP+, Cloud+, CISA, CGEIT, CISM, CRISC, ITIL Foundation, CSM, CSD, A-CSD, CSPO, CSP, CSP-SM, CSP-PO, and SAFe Government Practitioner (SGF). He is a US Navy veteran with two tours of duty in Afghanistan.

Back to top

Kirk Paul Lafler is a developer, programmer, consultant, educator, and data analyst; and works as a lecturer and adjunct professor at San Diego State University and the University of California San Diego Extension; and teaches SAS, SQL, Python, Excel, and cloud-based technology courses, workshops, and webinars to users around the world. Kirk has nearly 50 years of programming experience and specializes in SAS software, SQL, RDBMS technologies (Oracle, SQL-Server, Teradata, DB2), Python, and other languages and productivity tools. As an author, Kirk’s books include PROC SQL: Beyond the Basics Using SAS, Third Edition (SAS Press. 2019), along with numerous papers and articles on a variety of topics. Kirk is actively involved with international, regional, and local SAS, SQL, and Python user groups and conferences serving as an Invited speaker, educator, keynote, and section leader; and is the recipient of 27 “Best” contributed paper, hands-on workshop (HOW), and poster awards.

Back to top

Ryan Paul Lafler, M.Sc., is the founder, lead consultant, and data scientist at Premier Analytics Consulting, LLC and specializes in cleaning, visualizing, storytelling, and modeling structured and unstructured big data using statistical models and machine learning algorithms with Python, SAS, R, and SQL. He received his Master of Science in Big Data Analytics from San Diego State University following the publication of his Thesis (2023) and holds a Bachelor of Science in Statistics from San Diego State University. Ryan’s a lecturer at San Diego State University for the Department of Mathematics and Statistics, receiving highly positive feedback from colleagues and students enrolled in his statistics and data science courses. He’s presented at several SAS Conferences including the WUSS, SCSUG, PharmaSUG, and SAS Global Forum.

Back to top

Isaiah Lankham specializes in data analysis using Tableau, SAS, and Python, currently serving as a research analyst for the Kaiser Permanente Center for Health Research in Portland, Oregon. Initially trained as a mathematician and educator, Isaiah is also an adjunct faculty member for the Statistics Department at California State University, East Bay, and enjoys regularly teaching graduate SAS programming courses.

Back to top

Theresa Ngo is a Director of Advancement Analytics at Caltech. Theresa and her team apply a wide range of analytics to generate insights on donors’ affinity, giving capacity, and likelihood of giving and to support Caltech’s operations, strategies, and divisions. Prior to Caltech, Theresa was at Accenture, advising  clients on analytics vision, strategy, and transformation. She was a Systems Engineer at SAS® Institute, recommending appropriate SAS solutions to clients to address various business needs and use cases. Theresa holds a Master of Science in Applied Statistics from the University of California, Riverside. 

Back to top

Matthew T. Slaughter, MS is an Advanced SAS Certified Programmer and a Data Scientist at the Kaiser Permanente Center for Health Research in Portland, Oregon. With a focus on clinical prediction modeling, Matthew provides data management, programming, and analytical support to research projects in various topic areas.

Back to top