Authors
Michael Muller, Ingrid Lange, Dakuo Wang, David Piorkowski, Jason Tsay, Q Vera Liao, Casey Dugan, Thomas Erickson
Publication date
2019/5/2
Book
Proceedings of the 2019 CHI conference on human factors in computing systems
Pages
1-15
Description
With the rise of big data, there has been an increasing need for practitioners in this space and an increasing opportunity for researchers to understand their workflows and design new tools to improve it. Data science is often described as data-driven, comprising unambiguous data and proceeding through regularized steps of analysis. However, this view focuses more on abstract processes, pipelines, and workflows, and less on how data science workers engage with the data. In this paper, we build on the work of other CSCW and HCI researchers in describing the ways that scientists, scholars, engineers, and others work with their data, through analyses of interviews with 21 data science professionals. We set five approaches to data along a dimension of interventions: Data as given; as captured; as curated; as designed; and as created. Data science workers develop an intuitive sense of their data and processes …
Total citations
20192020202120222023202484149787626
Scholar articles
M Muller, I Lange, D Wang, D Piorkowski, J Tsay… - Proceedings of the 2019 CHI conference on human …, 2019