Acta Informatica Pragensia 2023, 12(2), 327-341 | DOI: 10.18267/j.aip.2193243
Digital Archives as Research Infrastructure of the Future
- Department of Information and Library Studies, Faculty of Arts, Masaryk University, Brno, Czech Republic
While a new paradigm of scientific research based on data centres and research infrastructures is gaining ground in science, and convergence between infrastructures and scientific domains is growing in cyberspace, epistemic cultures, particularly conservative in some fields, play a significant role in the dynamics of knowledge production in general and the adoption of data-intensive scientific practices in particular. In the present study, we focus on the transformations of scholarly communication through the perspective of digital curation of research data in the humanities, which certainly belong to these conservative epistemic cultures. The aim of this paper is to explore perspectives on the evolution of data curation in the context of the transformation of scholarly communication and research infrastructure in the humanities, specifically static archives, into living, continuously enriched data archives supported by artificial intelligence tools. To explore this perspective, we have chosen to compare scholarly communication in the humanities and in high-energy physics, in addition to analysing the practices of data curation itself. We further thematize the identified differences in terms of virtual research environments that can help humanities scholars exploit the potential of data-intensive research infrastructures.
Keywords: Artificial intelligence; Digital curation; Humanities; Knowledge; Scientific research.
Received: January 29, 2023; Revised: June 2, 2023; Accepted: June 8, 2023; Prepublished online: June 17, 2023; Published: October 10, 2023 Show citation
References
- Abdallah, S., Benetos, E., Gold, N., Hargreaves, S., Weyde, T. & Wolff, W. (2017). The digital music lab: A big data infrastructure for digital musicology. Journal on Computing and Cultural Heritage, 10(1), 2. https://doi.org/10.1145/2983918
Go to original source...
- Allan, R. (2009). Virtual Research Environments: From portals to science gateways. Chandos Publishing.
- Ball, A. & Duke, M. (2015). How to Cite Datasets and Link to Publications. DCC How-to Guides. https://www.dcc.ac.uk/guidance/how-guides/cite-datasets
- Barrios, C., Flores, E., Martínez, M. A., & Ruiz-Martínez, M. (2019). Is there convergence in international research collaboration? An exploration at the country level in the basic and applied science fields. Scientrometrics, 120, 631-659. https://doi.org/10.1007/s11192-019-03133-9
Go to original source...
- Bates, J. (2018). Data cultures, power and the city. In Kitchin, R., Lauriault, T. P. & McArdle, G. (Eds.) Data and the City, (pp. 189-200). Routledge.
Go to original source...
- Beagrie, N. (2008). Digital Curation for Science, Digital Libraries, and Individuals. International Journal of Digital Curation, 1, 3-16. https://doi.org/10.2218/ijdc.v1i1.2
Go to original source...
- Birkbeck, G., Nagle, T., & Sammon, D. (2022). Challenges in research data management practices: a literature analysis. Journal of Decision Systems, 31(sup1), 153-167. https://doi.org/10.1080/12460125.2022.2074653
Go to original source...
- Bowker, G. C. (2005). Memory Practices in the Sciences. MIT Press.
- Borgman, Ch. L. (2015). Big Data, Little Data, No Data: Scholarship in the Networked World. The MIT Press.
Go to original source...
- Candela, L., Castelli, D., & Pagano, P. (2013). Virtual Research Environments: An Overview and Research Agenda. Data Science Journal, 12, GRDI75-G RDI81. http://doi.org/10.2481/dsj.GRDI-013
Go to original source...
- Cetina, K. K. (1999). Epistemic Cultures: How the Sciences Make Knowledge. Harvard University Press.
- Coccia, M. & Wang, L. (2016). Evolution and convergence of the patterns of international scientific collaboration. Proceedings of the National Academy of Sciences,113 (8) 2057-2061. https://doi.org/10.1073/pnas.1510820113
Go to original source...
- Cronin, B. (2003). Scholarly communication and epistemic cultures. New Review of Academic Librarianship, 9(1), 1-24. https://doi.org/10.1080/13614530410001692004
Go to original source...
- Dalrymple, P. (2016). What's in a Name? A Brief History of Informatics Education. In Seadle, M., Chu, C. M., Stöckel, U. & Crumpton, B., Educating the Profession: 40 years of the IFLA Section on Education and Training (pp.149-164). De Gruyter Saur. https://doi.org/10.1515/9783110375398-015
Go to original source...
- Data Management Plans. (2022). Digital Curation Centre. https://www.dcc.ac.uk/dmps
- De Roure, R., Goble, C. & Stevens, R. (2009). The design and realisation of the myExperiment Virtual Research Environment for social sharing of workflows. Future Generation Computer Systems, 25(5), 561-567. https://doi.org/10.1016/j.future.2008.06.010
Go to original source...
- Discover the world of Scientific Literature. (2022, Dec 21). Litmaps. https://www.litmaps.com/
- Duff, W., Carter, J., Cherry, J. M., MacNeil, H. & Howarth, L. C. (2013). From coexistence to convergence: studying partnerships and collaboration among libraries, archives and museums. Information Research, 18(3), paper 585. http://informationr.net/ir/18-3/paper585.html
- Eitan, A. T., Smolyansky, E., Harpaz, I. K. & Perets, S. (2022, October 20). Find and explore academic papers. Connected Papers. https://www.connectedpapers.com/
- Finholt, T. A. (2002). Collaboratories. Annual Review of Information Science and Technology, 36(1), 73-107. https://doi.org/10.1002/aris.1440360103
Go to original source...
- Fourman, M. P. (2003). Informatics. In Feather,J., & Sturges, P., International Encyclopedia of Information and Library Science. Routledge.
- Garfield, E. (1979). Citation Indexing - Its Theory and Application in Science, Technology, and Humanities. ISI Press.
- Giere, R. N. (2002). Distributed Cognition in Epistemic Cultures. Philosophy of Science, 69(4), 637-644. https://doi.org/10.1086/344627
Go to original source...
- Ginsparg, P. (2021). Lessons from arXiv's 30 years of information sharing. Nature Reviews Physics, 3, 602-603. https://doi.org/10.1038/s42254-021-00360-z
Go to original source...
- Hall, S. (2001). Constituting an Archive. Third text, 15(54), 89-92. https://doi.org/10.1080/09528820108576903
Go to original source...
- Henrich, A., & Gradl, T. (2013). DARIAH(-DE): Digital Research Infrastructure for the Arts and Humanities - Concepts and Perspectives. International Journal of Humanities and Arts Computing, 7 (supplement), 47-58. https://doi.org/10.3366/ijhac.2013.0059
Go to original source...
- Hey, T., Tansley, S., & Tolle, K. (Eds.). (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Corporation.
- Hung, Y. N., Yang, C. H. H., Chen, P. Y., & Lerch, A. (2022). Low-Resource Music Genre Classification with Advanced Neural Model Reprogramming. arXiv preprint arXiv:2211.01317. https://doi.org/10.48550/arXiv.2211.01317
Go to original source...
- Kitchin, R., & Lauriault, T. P. (2018). Towards critical data studies: Charting and unpacking data assemblages and their work. In Thatcher, J., Eckert, J. & Shears, A. (Eds.). Thinking big data in geography: new regimes, new research. University of Nebraska Press.
- Koltay, T. (2019). Research Data Management and Data Literacy as We See Them Today. In Lichnerová, L. & Steinerová, J. (Eds.), Library and Information Science XXVIII, (pp. 7-16). Comenius University.
- Lefebvre, A. E. J., Schermerhorn, E. & Spruit, M. R. (2018). How Research Data Management Can Contribute to Efficient and Reliable Science. In 26th European Conference on Information Systems (paper no. 35). AIS. https://aisel.aisnet.org/ecis2018_rp/35/
- Lewis, S., Shepherd, K., Latt, Y. Y., Schweer, A. & Field, A. (2012). Repository as a service (RaaS). Journal of Digital Information, 13(1). http://journals.tdl.org/jodi/article/view/5872
- Lorenz, M. (2022). Digitální muzikologie a otázka smysluplného zpřístupnění zvukových digitalizátů. In Lorenz, M. et al. Záchrana zvukového kulturního dědictví: aktuální situace, problémy, možnosti. Littera.
- Murugesan, P. & Moravcsik, M. J. (1978). Variation of the nature of citation measures with journal and scientific specialties. Journal of the American Society for Information Science and Technology, 29(3), 141-147. https://doi.org/10.1002/asi.4630290307
Go to original source...
- Mushi, G. E. (2021). Research data management and services: Resources for different data practitioners. IASSIST Quarterly, 45(3-4). https://doi.org/10.29173/iq995
Go to original source...
- Neuroth, H., Lohmeier, F. & Smith, K. M. (2011). TextGrid - Virtual Research Environment for the Humanities. The International Journal of Digital Curation, 2(6), 222-231. https://doi.org/10.2218/ijdc.v6i2.198
Go to original source...
- Plesser, H. E. (2018). Reproducibility vs. Replicability: A Brief History of a Confused Terminology. Frontiers in Neuroinformatics, 11, 76. https://doi.org/10.3389/fninf.2017.00076
Go to original source...
- Plotkin, D. (2021). Data stewardship: an actionable guide to effective data management and data governance. Academic Press.
Go to original source...
- Pollock, D., Yan, A., Parker, M. & Allard, S. (2022). The Role of Data in an Emerging Research Community: Environmental Health Research as an Exemplar. International Journal of Digital Curation, 16(1), 1-15. https://doi.org/10.2218/ijdc.v16i1.653
Go to original source...
- Poole, A. H. (2015). How has your science data grown? Digital curation and the human factor: a critical literature review. Archival Science, 15(2), 101-139. https://doi.org/10.1007/s10502-014-9236-y
Go to original source...
- Research Data Management at Monash. (2022, July 14). Library. Monash University. https://www.monash.edu/library/researchers/data-collection-management/about
- Rudy, S. (2010, October 24). The State of Knowledge about "Living Archives, New Media Archives". Sustaining Digital Scholarship for Sustainable Culture. http://sustainableknowledgeproject.blogspot.com/2010/10/state-of-knowledge-about-living.html
- Schreibman, S., Siemens, R. & Unsworth, J. (2004). A New Companion to Digital Humanities. Blackwell Publishing.
Go to original source...
- Smale, N., Unsworth, K., Denyer, G. & Barr, D. (2018). The History, Advocacy and Efficacy of Data Management Plans. bioRxiv, 443499. https://doi.org/10.1101/443499
Go to original source...
- Snow, C. P., (2012). The Two Cultures. Cambridge University Press.
Go to original source...
- Straka, M. & Straková, J. (2019). NameTag (version 2.0). [Web Application]. http://hdl.handle.net/11858/00-097C-0000-0023-43CE-E
- The Turing Way Community, Arnold, B., Bowler, L., Gibson, S., Herterich, P., Higman, R. … Whitaker, K. (2022, July 27). The Turing Way: A Handbook for Reproducible Data Science (Version v1.0.2). Zenodo. https://doi.org/10.5281/zenodo.3233986
Go to original source...
- Van Gorp, P. & Mazanek, S. (2011). SHARE: a web portal for creating and sharing executable research papers. Procedia Computer Science, 4, 589-597. https://doi.org/10.1016/j.procs.2011.04.062
Go to original source...
- Výzkumná data. (2022, July 20). Centrum pro podporu open science. Univerzita Karlova. https://openscience.cuni.cz/OSCI-61.html
- Why do research data management? (2022, July 29). Research Data Service. The University of Edinburgh. https://www.ed.ac.uk/information-services/research-support/research-data-service/research-data-management
- Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J., Da Silva Santos, L. O. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T. W., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., … Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3(1), 160018. https://doi.org/10.1038/sdata.2016.18
Go to original source...
- Wolfram, S. (2023, Jan 9). Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT. https://writings.stephenwolfram.com/2023/01/wolframalpha-as-the-way-to-bring-computational-knowledge-superpowers-to-chatgpt/
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.