51短视频

What to Consider When We Consider Data

Andee Rubin
Rubin, A., What to consider when we consider data, Teaching Statistics 43 (2021), S23鈥 S33.

Abstract

The data sets used in statistics education have changed over time, from mathematically 鈥渨ell-behaved鈥 ones that facilitated computation, to more context-rich sources and now, with the increasing influence of data science practices, to 鈥渇ound鈥 data, often from open data sites. As data sources change, it is important for educators to take a fresh look at the ways we engage students in thinking about the processes that generated the data they encounter. The use of already collected data requires particular attention because many of the decisions that went into the processes of obtaining the data are hidden. Students need to learn to ask 鈥淲ho, When, How, Where, and Why?鈥 data were collected and to wonder if the data really measure what needs to be measured. Our advocacy in this paper is to deepen the educational treatment of data production to better reflect the current and future practice of statistics and data science.