Acta Informatica Pragensia 2023, 12(2), 327-341 | DOI: 10.18267/j.aip.2193243

Digital Archives as Research Infrastructure of the Future

Michal Lorenz ORCID..., Michal Konečný
Department of Information and Library Studies, Faculty of Arts, Masaryk University, Brno, Czech Republic

While a new paradigm of scientific research based on data centres and research infrastructures is gaining ground in science, and convergence between infrastructures and scientific domains is growing in cyberspace, epistemic cultures, particularly conservative in some fields, play a significant role in the dynamics of knowledge production in general and the adoption of data-intensive scientific practices in particular. In the present study, we focus on the transformations of scholarly communication through the perspective of digital curation of research data in the humanities, which certainly belong to these conservative epistemic cultures. The aim of this paper is to explore perspectives on the evolution of data curation in the context of the transformation of scholarly communication and research infrastructure in the humanities, specifically static archives, into living, continuously enriched data archives supported by artificial intelligence tools. To explore this perspective, we have chosen to compare scholarly communication in the humanities and in high-energy physics, in addition to analysing the practices of data curation itself. We further thematize the identified differences in terms of virtual research environments that can help humanities scholars exploit the potential of data-intensive research infrastructures.

Keywords: Artificial intelligence; Digital curation; Humanities; Knowledge; Scientific research.

Received: January 29, 2023; Revised: June 2, 2023; Accepted: June 8, 2023; Prepublished online: June 17, 2023; Published: October 10, 2023  Show citation

ACS AIP APA ASA Harvard Chicago Chicago Notes IEEE ISO690 MLA NLM Turabian Vancouver
Lorenz, M., & Konečný, M. (2023). Digital Archives as Research Infrastructure of the Future. Acta Informatica Pragensia12(2), 327-341. doi: 10.18267/j.aip.219
Download citation

References

  1. Abdallah, S., Benetos, E., Gold, N., Hargreaves, S., Weyde, T. & Wolff, W. (2017). The digital music lab: A big data infrastructure for digital musicology. Journal on Computing and Cultural Heritage, 10(1), 2. https://doi.org/10.1145/2983918 Go to original source...
  2. Allan, R. (2009). Virtual Research Environments: From portals to science gateways. Chandos Publishing.
  3. Ball, A. & Duke, M. (2015). How to Cite Datasets and Link to Publications. DCC How-to Guides. https://www.dcc.ac.uk/guidance/how-guides/cite-datasets
  4. Barrios, C., Flores, E., Martínez, M. A., & Ruiz-Martínez, M. (2019). Is there convergence in international research collaboration? An exploration at the country level in the basic and applied science fields. Scientrometrics, 120, 631-659. https://doi.org/10.1007/s11192-019-03133-9 Go to original source...
  5. Bates, J. (2018). Data cultures, power and the city. In Kitchin, R., Lauriault, T. P. & McArdle, G. (Eds.) Data and the City, (pp. 189-200). Routledge. Go to original source...
  6. Beagrie, N. (2008). Digital Curation for Science, Digital Libraries, and Individuals. International Journal of Digital Curation, 1, 3-16. https://doi.org/10.2218/ijdc.v1i1.2 Go to original source...
  7. Birkbeck, G., Nagle, T., & Sammon, D. (2022). Challenges in research data management practices: a literature analysis. Journal of Decision Systems, 31(sup1), 153-167. https://doi.org/10.1080/12460125.2022.2074653 Go to original source...
  8. Bowker, G. C. (2005). Memory Practices in the Sciences. MIT Press.
  9. Borgman, Ch. L. (2015). Big Data, Little Data, No Data: Scholarship in the Networked World. The MIT Press. Go to original source...
  10. Candela, L., Castelli, D., & Pagano, P. (2013). Virtual Research Environments: An Overview and Research Agenda. Data Science Journal, 12, GRDI75-G RDI81. http://doi.org/10.2481/dsj.GRDI-013 Go to original source...
  11. Cetina, K. K. (1999). Epistemic Cultures: How the Sciences Make Knowledge. Harvard University Press.
  12. Coccia, M. & Wang, L. (2016). Evolution and convergence of the patterns of international scientific collaboration. Proceedings of the National Academy of Sciences,113 (8) 2057-2061. https://doi.org/10.1073/pnas.1510820113 Go to original source...
  13. Cronin, B. (2003). Scholarly communication and epistemic cultures. New Review of Academic Librarianship, 9(1), 1-24. https://doi.org/10.1080/13614530410001692004 Go to original source...
  14. Dalrymple, P. (2016). What's in a Name? A Brief History of Informatics Education. In Seadle, M., Chu, C. M., Stöckel, U. & Crumpton, B., Educating the Profession: 40 years of the IFLA Section on Education and Training (pp.149-164). De Gruyter Saur. https://doi.org/10.1515/9783110375398-015 Go to original source...
  15. Data Management Plans. (2022). Digital Curation Centre. https://www.dcc.ac.uk/dmps
  16. De Roure, R., Goble, C. & Stevens, R. (2009). The design and realisation of the myExperiment Virtual Research Environment for social sharing of workflows. Future Generation Computer Systems, 25(5), 561-567. https://doi.org/10.1016/j.future.2008.06.010 Go to original source...
  17. Discover the world of Scientific Literature. (2022, Dec 21). Litmaps. https://www.litmaps.com/
  18. Duff, W., Carter, J., Cherry, J. M., MacNeil, H. & Howarth, L. C. (2013). From coexistence to convergence: studying partnerships and collaboration among libraries, archives and museums. Information Research, 18(3), paper 585. http://informationr.net/ir/18-3/paper585.html
  19. Eitan, A. T., Smolyansky, E., Harpaz, I. K. & Perets, S. (2022, October 20). Find and explore academic papers. Connected Papers. https://www.connectedpapers.com/
  20. Finholt, T. A. (2002). Collaboratories. Annual Review of Information Science and Technology, 36(1), 73-107. https://doi.org/10.1002/aris.1440360103 Go to original source...
  21. Fourman, M. P. (2003). Informatics. In Feather,J., & Sturges, P., International Encyclopedia of Information and Library Science. Routledge.
  22. Garfield, E. (1979). Citation Indexing - Its Theory and Application in Science, Technology, and Humanities. ISI Press.
  23. Giere, R. N. (2002). Distributed Cognition in Epistemic Cultures. Philosophy of Science, 69(4), 637-644. https://doi.org/10.1086/344627 Go to original source...
  24. Ginsparg, P. (2021). Lessons from arXiv's 30 years of information sharing. Nature Reviews Physics, 3, 602-603. https://doi.org/10.1038/s42254-021-00360-z Go to original source...
  25. Hall, S. (2001). Constituting an Archive. Third text, 15(54), 89-92. https://doi.org/10.1080/09528820108576903 Go to original source...
  26. Henrich, A., & Gradl, T. (2013). DARIAH(-DE): Digital Research Infrastructure for the Arts and Humanities - Concepts and Perspectives. International Journal of Humanities and Arts Computing, 7 (supplement), 47-58. https://doi.org/10.3366/ijhac.2013.0059 Go to original source...
  27. Hey, T., Tansley, S., & Tolle, K. (Eds.). (2009). The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Corporation.
  28. Hung, Y. N., Yang, C. H. H., Chen, P. Y., & Lerch, A. (2022). Low-Resource Music Genre Classification with Advanced Neural Model Reprogramming. arXiv preprint arXiv:2211.01317. https://doi.org/10.48550/arXiv.2211.01317 Go to original source...
  29. Kitchin, R., & Lauriault, T. P. (2018). Towards critical data studies: Charting and unpacking data assemblages and their work. In Thatcher, J., Eckert, J. & Shears, A. (Eds.). Thinking big data in geography: new regimes, new research. University of Nebraska Press.
  30. Koltay, T. (2019). Research Data Management and Data Literacy as We See Them Today. In Lichnerová, L. & Steinerová, J. (Eds.), Library and Information Science XXVIII, (pp. 7-16). Comenius University.
  31. Lefebvre, A. E. J., Schermerhorn, E. & Spruit, M. R. (2018). How Research Data Management Can Contribute to Efficient and Reliable Science. In 26th European Conference on Information Systems (paper no. 35). AIS. https://aisel.aisnet.org/ecis2018_rp/35/
  32. Lewis, S., Shepherd, K., Latt, Y. Y., Schweer, A. & Field, A. (2012). Repository as a service (RaaS). Journal of Digital Information, 13(1). http://journals.tdl.org/jodi/article/view/5872
  33. Lorenz, M. (2022). Digitální muzikologie a otázka smysluplného zpřístupnění zvukových digitalizátů. In Lorenz, M. et al. Záchrana zvukového kulturního dědictví: aktuální situace, problémy, možnosti. Littera.
  34. Murugesan, P. & Moravcsik, M. J. (1978). Variation of the nature of citation measures with journal and scientific specialties. Journal of the American Society for Information Science and Technology, 29(3), 141-147. https://doi.org/10.1002/asi.4630290307 Go to original source...
  35. Mushi, G. E. (2021). Research data management and services: Resources for different data practitioners. IASSIST Quarterly, 45(3-4). https://doi.org/10.29173/iq995 Go to original source...
  36. Neuroth, H., Lohmeier, F. & Smith, K. M. (2011). TextGrid - Virtual Research Environment for the Humanities. The International Journal of Digital Curation, 2(6), 222-231. https://doi.org/10.2218/ijdc.v6i2.198 Go to original source...
  37. Plesser, H. E. (2018). Reproducibility vs. Replicability: A Brief History of a Confused Terminology. Frontiers in Neuroinformatics, 11, 76. https://doi.org/10.3389/fninf.2017.00076 Go to original source...
  38. Plotkin, D. (2021). Data stewardship: an actionable guide to effective data management and data governance. Academic Press. Go to original source...
  39. Pollock, D., Yan, A., Parker, M. & Allard, S. (2022). The Role of Data in an Emerging Research Community: Environmental Health Research as an Exemplar. International Journal of Digital Curation, 16(1), 1-15. https://doi.org/10.2218/ijdc.v16i1.653 Go to original source...
  40. Poole, A. H. (2015). How has your science data grown? Digital curation and the human factor: a critical literature review. Archival Science, 15(2), 101-139. https://doi.org/10.1007/s10502-014-9236-y Go to original source...
  41. Research Data Management at Monash. (2022, July 14). Library. Monash University. https://www.monash.edu/library/researchers/data-collection-management/about
  42. Rudy, S. (2010, October 24). The State of Knowledge about "Living Archives, New Media Archives". Sustaining Digital Scholarship for Sustainable Culture. http://sustainableknowledgeproject.blogspot.com/2010/10/state-of-knowledge-about-living.html
  43. Schreibman, S., Siemens, R. & Unsworth, J. (2004). A New Companion to Digital Humanities. Blackwell Publishing. Go to original source...
  44. Smale, N., Unsworth, K., Denyer, G. & Barr, D. (2018). The History, Advocacy and Efficacy of Data Management Plans. bioRxiv, 443499. https://doi.org/10.1101/443499 Go to original source...
  45. Snow, C. P., (2012). The Two Cultures. Cambridge University Press. Go to original source...
  46. Straka, M. & Straková, J. (2019). NameTag (version 2.0). [Web Application]. http://hdl.handle.net/11858/00-097C-0000-0023-43CE-E
  47. The Turing Way Community, Arnold, B., Bowler, L., Gibson, S., Herterich, P., Higman, R. … Whitaker, K. (2022, July 27). The Turing Way: A Handbook for Reproducible Data Science (Version v1.0.2). Zenodo. https://doi.org/10.5281/zenodo.3233986 Go to original source...
  48. Van Gorp, P. & Mazanek, S. (2011). SHARE: a web portal for creating and sharing executable research papers. Procedia Computer Science, 4, 589-597. https://doi.org/10.1016/j.procs.2011.04.062 Go to original source...
  49. Výzkumná data. (2022, July 20). Centrum pro podporu open science. Univerzita Karlova. https://openscience.cuni.cz/OSCI-61.html
  50. Why do research data management? (2022, July 29). Research Data Service. The University of Edinburgh. https://www.ed.ac.uk/information-services/research-support/research-data-service/research-data-management
  51. Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J., Da Silva Santos, L. O. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T. W., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., … Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3(1), 160018. https://doi.org/10.1038/sdata.2016.18 Go to original source...
  52. Wolfram, S. (2023, Jan 9). Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT. https://writings.stephenwolfram.com/2023/01/wolframalpha-as-the-way-to-bring-computational-knowledge-superpowers-to-chatgpt/

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.