Posts

Statistical Tests as Linear Models

Andrew Gelman’s statmodeling blog recently contained a link to an interesting document by Jonas Lindeløv. It tries to explain various statistical tests in terms of linear models.

Sharing SSH Keys With a Devcontainer

VS Code devcontainers are a great resource for creating reusable containers to share between developers on the same project. When properly setup, it automatically passes your SSH credentials to the container.

Takeaways - 'Big Data Is Dead'

I recently read a great blog post by Jordan Tigani about Big Data. While Jordan’s post focuses on enterprise needs, I believe it contains relevant insights to individual researchers as well.

PROC MI Added to SASPy

I’m excited to announce that the new SAPy v4.6.0 release includes a pull request of mine that adds PROC MI to the SAS/STAT procedures directly exposed in SASPy. This procedure allows you to analyze missing data patterns and create imputations for missing data.

Why I'm Not Worried About ChatGPT

ChatGPT has been all over my newsfeed lately, with a considerable amount of hype. In particular, many are wondering or even worrying whether the emergence of this technology will threaten jobs with moderate to high education requirments.

Dec22 SAS ODA Update - Impact on SASPy Users

During December 2022, SAS ODA received substantial updates - see the upgrade page for details. It’s really nice to see that ODA is now using SAS 9.4M7. If you are a SASPy user, you may now bump into an error while logging in with your existing configuration.

Setting up a Virtual Lab Computer

Dealing with computer resources in a modern lab can be tricky. Even if all participating researchers have laptops, a central location for storage or to host licensed software is desirable. While a physical computer can be setup for such a use, that is not always the most desirable solution.

Missing Data Mechanisms

Understanding whether a variable’s missingness from a dataset is related to the underlying value of the data is a key concept in the field of missing data analysis. We distinguish three broad categories: missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR).

Some CLI Tools

A few convenience CLI tools I find myself installing on new systems regularly.

CSV2DS

CSV2DS is a new program I wrote in Go to help me create minimum working examples for SAS that can be shared as a single SAS script.