Posts

Some Basic SQL Joins

A non-technical friend recently asked me for help with a merge problem. They had two separate data pulls of electronic medical records based on specific study parameters. The set of people in the database who fit the study parameters changed in between the data pulls, for example by having people age into our out of a study, or by having new diagnoses added to their records that cause them to either be newly included or excluded.

Univariate Missing Data with PROC MI

In Chapter 3 of van Buuren’s Flexible Imputation of Missing Data a variety of methods for imputing univariate missing data are presented. This post will summarize these techniques and show how to implement them in SAS.

SUC - a Slack Clone for Modern Unix

I love simple CLI tools and am a big fan of the Unix philosophy. Recently I came across The Dam, a public Unix server that implements a clever tool they termed suc - the Simple Unix Chat.

Explore C Code With GNU Tools

This post will introduce three GNU tools to help you explore your C code: ctags, cscope, and cflow. The first two can help you navigate your code as you work on it and can be used directly within Vim.

Sampling Regression Lines

Last week we saw how to generate posterior samples using PROC MCMC for simple linear and logistic regression models. This week, I want to show how to sample regression lines from the data set returned by MCMC by plotting several sample regression linse on top of a scatter plot of the source data.

Simple Regression With PROC MCMC

In this post I’ll show how to fit simple linear and logistic regression models using the MCMC procedure in SAS. Note that the point of this post is to show how the mathematical model is translated into PROC MCMC syntax and not to discuss the method itself.

Loading Several XPT Files From a URL

The SAS Transport File Format (XPORT) is an open file format maintained by SAS for exchanging datasets. Its use is mandated by the FDA for data set submission for new drug or device applications and the CDC uses this format to distribute public data.

Are the Rich Paying Their Fair Share?

Today is tax day in the US. In celebration we’re going to take a look at some of the data available on the IRS Statistics page.

Takeaways from 'Deep Work'

I have recently read Cal Newport’s book “Deep Work” (2016). Overall, it is a short but engaging read discussing his tips for how to spend more time doing intellectually focused and engaging work in a society whose attention and focus is ever more divided.

Statistical Tests as Linear Models

Andrew Gelman’s statmodeling blog recently contained a link to an interesting document by Jonas Lindeløv. It tries to explain various statistical tests in terms of linear models.