Provenance Visualization as an Entry Point to the History and Curation of Information Collections

Main Article Content

Tomas Vancisin
https://orcid.org/0000-0003-1283-5673
Mary Orr
Loraine Clarke
https://orcid.org/0000-0001-9213-1013
Uta Hinrichs

Abstract

Provenance—the origin and production methods of an artifact or piece of information—is an essential part across all
fields of knowledge production. Its disclosure ensures authenticity, reproducibility, and transparency. While digital tools can automate provenance tracking and disclosure, the amount and complexity of provenance information presents a challenge, particularly within the context of cultural collections, where physically-born artifacts are transformed into digital space. This process introduces a number of methodological and curatorial decisions that, in turn, can have a grave influence on how the—once physical—collection is represented and how it will be interpreted. In previous work, we have started to address this issue by introducing provenance-driven visualization as an approach to provenance disclosure that (1) traces and categorizes both the physical and digital provenance of information collections (e.g., transcriptions, modifications of content and structure, ex/inclusion of information items) and (2) utilizes visualization to disclose and make provenance explorable in interactive ways. While this approach has shown potential, there are challenges to designing provenance-driven visualizations which can be perceived as complex and abstract and, ultimately, a distraction from the information collections’ content. How can visualization design navigate tensions between making visible provenance information and underlying curatorial decisions in a holistic and compact way, while enabling easy entry points to and promoting the critical interpretation of the collection’s content? In this paper, we present a novel design approach to provenance-driven visualization that combines abstract visualization, textual descriptions, and representations of artifactual form with storytelling techniques to introduce provenance information. Our findings from a qualitative study demonstrate the success of this approach in (1) providing a visual entry point into the collection’s provenance, (2) promoting an in-depth understanding of the transitions and underlying curatorial decisions the
collection has gone through, all the while (3) positively influencing the collections’ content exploration and its critical interpretation. Our work contributes new perspectives on how visualization can be applied to add transparency and to raise awareness of the constructed and situated nature of data, in particular, in the context of cultural collections, but also beyond.

Article Details

How to Cite
[1]
Vancisin, T. et al. 2025. Provenance Visualization as an Entry Point to the History and Curation of Information Collections. Journal of Visualization and Interaction. 1, 1 (Apr. 2025). DOI:https://doi.org/10.54337/jovi.v1i1.8436.
Section
Articles

References

E. W. Anderson, J. P. Ahrens, K. Heitmann, S. Habib, and C. T. Silva. Provenance in comparative analysis: A study in cosmology. Computing in Science & Engineering, 10(3):30–37, 2008. 2

E. W. Anderson, G. A. Preston, and C. T. Silva. Towards Development of a Circuit Based Treatment for Impaired Memory: A Multidisciplinary Approach. In 2007 3rd International IEEE/EMBS Conference on Neural Engineering, pp. 302–305. IEEE, 2007. 2

S. Bateman, R. L. Mandryk, C. Gutwin, A. Genest, D. McDine, and C. Brooks. Useful junk? the effects of visual embellishment on comprehension and memorability of charts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’10, 10 pages, p. 2573–2582. Association for Computing Machinery, New York, NY, USA, 2010. doi: 10.1145/1753326.1753716 5

O. Biton, S. Cohen-Boulakia, S. B. Davidson, and C. S. Hara. Querying and managing provenance through user views in scientific workflows. In 2008 IEEE 24th International Conference on Data Engineering, pp. 1072–1081. IEEE, 2008. 2

G.-P. Bonneau, H.-C. Hege, C. R. Johnson, M. M. Oliveira, K. Potter, P. Rheingans, and T. Schultz. Overview and state-of-the-art of uncertainty visualization. Scientific visualization: Uncertainty, multifield, biomedical, and scalable visualization, pp. 3–27, 2014. 14

M. A. Borkin, C. S. Yeh, M. Boyd, P. Macko, K. Z. Gajos, M. Seltzer, and H. Pfister. Evaluation of filesystem provenance visualization tools. IEEE transactions on visualization and computer graphics, 19(12):2476–2485, 2013. 2

R. E. Boyatzis. Transforming Qualitative Information: Thematic Analysis and Code Development. Sage Publications, 1998. 8

C. Chabot, C. Stotle, A. Beers, and P. Hanrahan. Tableau prep, 2018. 1

M. Chakhchoukh, N. Boukhelifa, and A. Bezerianos. Understanding how in-visualization provenance can support trade-off analysis. IEEE Transactions on Visualization and Computer Graphics, 29(9):3758–3774, 2022. 2

M. Chakhchoukh, N. Boukhelifa, and A. Bezerianos. Understanding how in-visualization provenance can support trade-off analysis. IEEE Transactions on Visualization and Computer Graphics, 29(9):3758–3774, 2023. doi: 10.1109/TVCG.2022.3171074 1

M. Correll. Ethical Dimensions of Visualization Research. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 13 pages, p. 1–13, 2019. 2

C. D’Ignazio and L. F. Klein. Feminist Data Visualization. In Workshop on Visualization for the Digital Humanities (VIS4DH), IEEE, 2016. 2

C. D’Ignazio and L. F. Klein. Data Feminism. MIT press, 2023. 2

M. Dörk, P. Feng, C. Collins, and S. Carpendale. Critical infovis: Exploring the politics of visualization. In CHI’13 Extended Abstracts on Human Factors in Computing Systems, pp. 2189–2198. 2013. 2

J. Drucker. Humanities Approaches to Graphical Display. Digital Humanities Quarterly, 5(1):1–21, 2011. 13

C. Dunne, N. Henry Riche, B. Lee, R. Metoyer, and G. Robertson. Graphtrail: Analyzing large multivariate, heterogeneous networks while supporting exploration history. In Proceedings of the SIGCHI conference on human factors in computing systems, pp. 1663–1672, 2012. 2

G. Feigenbaum, I. Reist, and I. J. Reist. Provenance: An Alternate History of Art. Getty Publications, 2012. 1, 2

J.-D. Fekete and J. Freire. Exploring reproducibility in visualization. IEEE Computer Graphics and Applications, 40(5):108–119, 2020. 2

M. H. Freedman, J. Gukelberger, M. B. Hastings, S. Trebst, M. Troyer, and Z. Wang. Galois conjugates of topological phases. Physical Review B, 85(4):045414, 2012. 2

J. Freire, D. Koop, E. Santos, and C. T. Silva. Provenance for Computational Tasks: A Survey. Computing in Science & Engineering, 10(3):11–21, 2008. 1, 2

C. Gonzalez. Does animation in user interfaces improve decision making? In Proceedings of the SIGCHI conference on human factors in computing systems, pp. 27–34, 1996. 5

D. P. Groth and K. Streefkerk. Provenance and annotation for visual exploration systems. IEEE transactions on visualization and computer graphics, 12(6):1500–1510, 2006. 2

G. Guest, K. M. MacQueen, and E. E. Namey. Introduction to Applied Thematic Analysis. Sage Publications, 2012. 8

Z. Hensley, J. Sanyal, and J. New. Provenance in sensor data management. Communications of the ACM, 57(2):55–62, 2014. 2

U. Hinrichs, B. Alex, J. Clifford, A. Watson, A. Quigley, E. Klein, and C. M. Coates. Trading Consequences: A Case Study of Combining Text Mining and Visualization to Facilitate Document Exploration. Digital Scholarship in the Humanities, 30(suppl_1):i50–i75, 2015. 1

U. Hinrichs, S. Forlini, and B. Moynihan. In defense of sandcastles: Research thinking through visualization in digital humanities. Digital Scholarship in the Humanities (DSH), 34(Issue Supplement_1):i80––i99, 2019. 14

J. Hullman and N. Diakopoulos. Visualization Rhetoric: Framing Effects in Narrative Visualization. IEEE Transactions on Visualization and Computer Graphics, 17(12):2231–2240, 2011. 2

D. Huynh, D. R. Karger, D. Quan, et al. Haystack: A platform for creating, organizing and visualizing information using rdf. In Semantic Web Workshop, vol. 52, 2002. 2

W. Javed and N. Elmqvist. Explates: spatializing interactive analysis to scaffold visual exploration. In Computer Graphics Forum, vol. 32, pp. 441–450. Wiley Online Library, 2013. 2

M. P. Kouroupas. U.s. efforts to protect cultural property: Implementation of the 1970 unesco convention. African Arts, 28(4):32–41, 1995. 2

H. Lamqaddam, A. V. Moere, V. V. Abeele, K. Brosens, and K. Verbert. Introducing layers of meaning (lom): A framework to reduce semantic distance of visualization in humanistic research. IEEE Transactions on Visualization and Computer Graphics, 27(2):1084–1094, 2021. 2

A. M. MacEachren, R. E. Roth, J. O’Brien, B. Li, D. Swingley, and M. Gahegan. Visual semiotics & uncertainty visualization: An empirical study. IEEE transactions on visualization and computer graphics, 18(12):2496–2505, 2012. 14

E. Maguire, P. Rocca-Serra, S.-A. Sansone, J. Davies, and M. Chen. Visual compression of workflow visualizations with automated detection of macro motifs. IEEE transactions on visualization and computer graphics, 19(12):2576–2585, 2013. 2

J. Matejka, T. Grossman, and G. Fitzmaurice. Patina: Dynamic heatmaps for visualizing application usage. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3227–3236, 2013. 2

Merriam-Webster. Provenance. 2

Microsoft. Power query, 2010. 1

P. Missier, K. Belhajjame, and J. Cheney. The w3c prov family of specifications for modelling provenance metadata. In Proceedings of the 16th International Conference on Extending Database Technology, EDBT ’13, 4 pages, p. 773–776. Association for Computing Machinery, 2013. 2

J. T. Morisette, C. S. Jarnevich, T. R. Holcombe, C. B. Talbert, D. Ignizio, M. K. Talbert, C. Silva, D. Koop, A. Swanson, and N. E. Young. VisTrails SAHM: Visualization and workflow management for species habitat modeling. Ecography, 36(2):129–135, 2013. 2

G. Panagiotidou, H. Lamqaddam, J. Poblome, K. Brosens, K. Verbert, and A. V. Moere. Communicating uncertainty in digital humanities visualization research. IEEE Transactions on Visualization and Computer Graphics, 29(1):635–645, 2022. 14

E. D. Ragan, A. Endert, J. Sanyal, and J. Chen. Characterizing provenance in visualization and data analysis: An organizational framework of provenance types and purposes. IEEE Transactions on Visualization and Computer Graphics, 22(1):31–40, 2016. doi: 10.1109/TVCG.2015.2467551 2

C. Schulz, A. Nocaj, J. Goertler, O. Deussen, U. Brandes, and D. Weiskopf. Probabilistic graph layout for uncertain network visualization. IEEE transactions on visualization and computer graphics, 23(1):531–540, 2016. 14

E. Segel and J. Heer. Narrative visualization: Telling stories with data. IEEE Transactions on Visualization and Computer Graphics, 16(6):1139–1148, 2010.doi: 10.1109/TVCG.2010.179 5

R. N. Smart. Literate Ladies - A Fifty Year Experiment. St Andrews University Alumnus Chronicle, 59:21–31, 1968. 4

H. Stitz, S. Luger, M. Streit, and N. Gehlenborg. Avocado: Visualization of workflow-derived data provenance for reproducible biomedical research. Computer Graphics Forum (EuroVis ’16), 35(3):481–490, jun 2016. 2

D. Stoecker, O. Duane Adams, and N. Harding. Alteryx, 1997. 1

R. Straughn-Navarro. Provenance in situ: Documenting information of origins across glam contexts. The International Information & Library Review, 48(4):287–293, 2016. doi: 10.1080/10572317.2016.1243964 2

T. Vancisin, L. Clarke, M. Orr, and U. Hinrichs. Provenance visualization: Tracing people, processes, and practices through a data-driven approach to provenance. Digital Scholarship in the Humanities, 38(3):1322–1339, 2023. 1, 2, 3, 4, 6, 7, 8, 12, 13, 14

T. Vancisin, A. Crawford, M. Orr, and U. Hinrichs. From people to pixels: Visualizing historical university records. In Proceedings of the 5th Biennial Transdisciplinary Imaging Conference 2018 (Transimage 2018), pp. 41–57, 2018. 3, 4, 6

T. Vancisin, M. Orr, and U. Hinrichs. Externalizing transformations of historical documents: Opportunities for provenance-driven visualization. In Proceedings of the 5th Workshop on Visualization for the Digital Humanities (VIS4DH2020), 2020. 4, 13