This document is meant to offer guidance to instructors wishing to augment their statistics courses with computational elements, by way of an extended example due to Horton (2013): I Hear, I Forget. I Do, I Understand: A Modified Moor-Method Mathematical Statistics Course. The American Statistician 67:4 219-228. This document describes in detail the challenges students may face when implementing a basic simulation of a non-trivial experiment, and the potential benefits in understanding that accompany working through these challenges. Also described is the analytical solution to the same problem, and how computation can be used by students to check their work and develop confidence in their solutions.
The example here is presented in two parts. A “basic” part uses mostly base R
functionality, and is suitable for instructors who want to use computation alongside analytical problem solving in their course. This may be most useful for introductory courses like STA220/221/257/261, and for upper-year theory-based courses, where computation can be used to aid student understanding and engagement, but isn’t part of the main learning objectives. The basic simulation code is extended throughout with an “advanced” part, which improves the base R
code using modern packages and techniques. This may be more suitable for more applied courses such as STA302/303/314/414, where part of the learning objectives of the course are to have students learn to compute with data.
This exercise introduces a medium-difficulty introductory probability problem, translates it into an algorithm, implements this algorithm in R
, uses the result in simulations solves the original problem analytically, and critically evaluates the solution by comparing the analytical and empirical versions. As such, it aims to cover many learning outcomes. Below is a perspective on this, categorized by the relevant Statistics Undergraduate Program Learning Outomes:
R
experience, and where the learning outcome is less on coding skills themselves and more on properly utilizing statistical computation as a means for approaching theoretical problems. The latter is intended for students who have introductory R
experience, and for cases where the learning objective actually focusses on good programming practice..Rmd
file that can be executed in its entirety to reproduce the analysis, as is modern best practice