B2: Bridging code and interactive visualization in computational notebooks

© 2020 Owner/Author. Data scientists have embraced computational notebooks to author analysis code and accompanying visualizations within a single document. Currently, although these media may be interleaved, they remain siloed: interactive visualizations must be manually specified as they are divor...

Full description

Bibliographic Details
Main Authors: Wu, Y (Author), Hellerstein, JM (Author), Satyanarayan, A (Author)
Format: Article
Language:English
Published: ACM, 2021-11-05T14:57:08Z.
Subjects:
Online Access:Get fulltext
LEADER 02027 am a22001813u 4500
001 137497
042 |a dc 
100 1 0 |a Wu, Y  |e author 
700 1 0 |a Hellerstein, JM  |e author 
700 1 0 |a Satyanarayan, A  |e author 
245 0 0 |a B2: Bridging code and interactive visualization in computational notebooks 
260 |b ACM,   |c 2021-11-05T14:57:08Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/137497 
520 |a © 2020 Owner/Author. Data scientists have embraced computational notebooks to author analysis code and accompanying visualizations within a single document. Currently, although these media may be interleaved, they remain siloed: interactive visualizations must be manually specified as they are divorced from the analysis provenance expressed via dataframes, while code cells have no access to users' interactions with visualizations, and hence no way to operate on the results of interaction. To bridge this divide, we present B2, a set of techniques grounded in treating data queries as a shared representation between the code and interactive visualizations. B2 instruments data frames to track the queries expressed in code and synthesize corresponding visualizations. These visualizations are displayed in a dashboard to facilitate interactive analysis. When an interaction occurs, B2 reifies it as a data query and generates a history log in a new code cell. Subsequent cells can use this log to further analyze interaction results and, when marked as reactive, to ensure that code is automatically recomputed when new interaction occurs. In an evaluative study with data scientists, we find that B2 promotes a tighter feedback loop between coding and interacting with visualizations. All participants frequently moved from code to visualization and vice-versa, which facilitated their exploratory data analysis in the notebook. 
546 |a en 
655 7 |a Article 
773 |t 10.1145/3379337.3415851 
773 |t UIST 2020 - Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology