Exploring student struggle in introductory data science courses

Abstract Number:

3569 

Submission Type:

Contributed Abstract 

Contributed Abstract Type:

Speed 

Participants:

Aimee Schwab-McCoy (1), Pamela Fellers (2)

Institutions:

(1) zyBooks, N/A, (2) Wiley/zyBooks, N/A

Co-Author:

Pamela Fellers  
Wiley/zyBooks

First Author:

Aimee Schwab-McCoy  
zyBooks

Presenting Author:

Aimee Schwab-McCoy  
zyBooks

Abstract Text:

Introductory data science classes cover a range of topics, including data gathering, exploration, modeling, and visualization. However, data science is still a young discipline, which means little is known about which topics students particularly struggle with.

This paper analyzes student data from three interactive, online data science textbooks. Activity metrics like average number of attempts, proportion of students giving up, and average time to completion, will be used to quantify student struggle. Struggle data from conceptual and programming-based activities will be aggregated from over 50 institutions to identify challenging topics in a first data science course. Data will also be compared between book versions to determine if certain tasks are more difficult in Python or R, or if programming language does not affect performance. Although specific activities are limited to a single course platform, challenging topics and lessons learned will apply broadly.

Keywords:

data science|student struggle|online learning|interactive textbooks| |

Sponsors:

Section on Statistics and Data Science Education

Tracks:

Miscellaneous

Can this be considered for alternate subtype?

Yes

Are you interested in volunteering to serve as a session chair?

Yes

I have read and understand that JSM participants must abide by the Participant Guidelines.

Yes

I understand that JSM participants must register and pay the appropriate registration fee by June 1, 2024. The registration fee is non-refundable.

I understand