Search CORE

8 research outputs found

Tools and Recommendations for Reproducible Teaching

Author: Dogucu Mine
Çetinkaya-Rundel Mine
Publication venue: 'Informa UK Limited'
Publication date: 25/10/2022
Field of study

It is recommended that teacher-scholars of data science adopt reproducible workflows in their research as scholars and teach reproducible workflows to their students. In this article, we propose a third dimension to reproducibility practices and recommend that regardless of whether they teach reproducibility in their courses or not, data science instructors adopt reproducible workflows for their own teaching. We consider computational reproducibility, documentation, and openness as three pillars of reproducible teaching framework. We share tools, examples, and recommendations for the three pillars

UCL Discovery

An Educator’s Perspective of the Tidyverse

Author: Baumer Benjamin
Hardin Johanna
Horton Nicholas J.
McNamara Amelia
Rundel Colin W.
Çetinkaya-Rundel Mine
Publication venue: Smith ScholarWorks
Publication date: 23/04/2022
Field of study

Computing makes up a large and growing component of data science and statistics courses. Many of those courses, especially when taught by faculty who are statisticians by training, teach R as the programming language. A number of instructors have opted to build much of their teaching around use of the tidyverse. The tidyverse, in the words of its developers, “is a collection of R packages that share a high-level design philosophy and low-level grammar and data structures, so that learning one package makes it easier to learn the next” (Wickham et al. 2019). These shared principles have led to the widespread adoption of the tidyverse ecosystem. A large part of this usage is because the tidyverse tools have been intentionally designed to ease the learning process and make it easier for users to learn new functions as they engage with additional pieces of the larger ecosystem. Moreover, the functionality offered by the packages within the tidyverse spans the entire data science cycle, which includes data import, visualisation, wrangling, modeling, and communication. We believe the tidyverse provides an effective and efficient pathway for undergraduate students at all levels and majors to gain computational skills and thinking needed throughout the data science cycle. In this paper, we introduce the tidyverse from an educator’s perspective. We provide a brief introduction to the tidyverse, demonstrate how foundational statistics and data science tasks are accomplished with the tidyverse, and discuss the strengths of the tidyverse, particularly in the context of teaching and learning

Smith College: Smith ScholarWorks

Infrastructure and Tools for Teaching Computing Throughout the Statistical Curriculum

Author: Colin Rundel (4544227)
Mine Çetinkaya-Rundel (4544224)
Publication venue
Publication date
Field of study

<p>Modern statistics is fundamentally a computational discipline, but too often this fact is not reflected in our statistics curricula. With the rise of big data and data science, it has become increasingly clear that students want, expect, and need explicit training in this area of the discipline. Additionally, recent curricular guidelines clearly state that working with data requires extensive computing skills and that statistics students should be fluent in accessing, manipulating, analyzing, and modeling with professional statistical analysis software. Much has been written in the statistics education literature about pedagogical tools and approaches to provide a practical computational foundation for students. This article discusses the computational infrastructure and toolkit choices to allow for these pedagogical innovations while minimizing frustration and improving adoption for both our students and instructors. Supplementary materials for this article are available online.</p

FigShare

Teaching Statistics in the Health Sciences: Teaching to, and Learning from, the Masses

Author: Anderson Lorin W.
Mine Çetinkaya-Rundel
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

An educator's perspective of the tidyverse

Author: Baumer Benjamin S.
Hardin Johanna
Horton Nicholas J.
McNamara Amelia
Rundel Colin
Çetinkaya-Rundel Mine
Publication venue
Publication date: 01/01/2022
Field of study

Computing makes up a large and growing component of data science and statistics courses. Many of those courses, especially when taught by faculty who are statisticians by training, teach R as the programming language. A number of instructors have opted to build much of their teaching around use of the tidyverse. The tidyverse, in the words of its developers, "is a collection of R packages that share a high-level design philosophy and low-level grammar and data structures, so that learning one package makes it easier to learn the next". These shared principles have led to the widespread adoption of the tidyverse ecosystem. A large part of this usage is because the tidyverse tools have been intentionally designed to ease the learning process and make it easier for users to learn new functions as they engage with additional pieces of the larger ecosystem. Moreover, the functionality offered by the packages within the tidyverse spans the entire data science cycle, which includes data import, visualisation, wrangling, modeling, and communication. We believe the tidyverse provides an effective and efficient pathway for undergraduate students at all levels and majors to gain computational skills and thinking needed throughout the data science cycle. In this paper, we introduce the tidyverse from an educator's perspective. We provide a brief introduction to the tidyverse, demonstrate how foundational statistics and data science tasks are accomplished with the tidyverse, and discuss the strengths of the tidyverse, particularly in the context of teaching and learning

arXiv.org e-Print Archive

eScholarship - University of California

Smith College: Smith ScholarWorks

Taking a Chance in the Classroom: Five Concrete Reasons Your Students Should Be Learning to Analyze Data in the Reproducible Paradigm

Author: Allaire J. J.
Andrew Bray
Announcement
Baumer Ben
Dalene Stangl
Mine Çetinkaya-Rundel
Stodden V.
Xie Y.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Infrastructure and Tools for Teaching Computing Throughout the Statistical Curriculum

Author: Allaire J.
Baumer B.
Colin Rundel
Edwards S. H.
Finzer W.
Horton N.
Kaplan D.
Loeliger J.
Mine Çetinkaya-Rundel
R Core Team
RStudio Team
Stallman R. M.
Waller L. A.
Xie Y.
———
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref