Acta Informatica Pragensia X:X | DOI: 10.18267/j.aip.281248
Analysis of Benford’s Law Conformity with Web of Science Citations of Documents
- Institute of Information Studies and Librarianship, Faculty of Arts, Charles University, Prague, Czech Republic
- Library of the Czech Academy of Sciences, Prague, Czech Republic
Background: Benford’s law is a statistical phenomenon that predicts the probability of a particular digit at a particular position in a number. This law has been successfully applied in a number of areas, such as accounting. In the area of scientometrics, research has been devoted mostly to journal data.
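For reference, the first-digit form of Benford's law can be written as below. The symbol $D_1$ for the first significant digit is notation chosen here for illustration, not taken from the paper.

```latex
P(D_1 = d) = \log_{10}\!\left(1 + \frac{1}{d}\right), \qquad d \in \{1, 2, \dots, 9\}
```

Under this formula a leading 1 is expected in about 30.1% of values and a leading 9 in about 4.6%.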
Objective: This paper investigates how closely the citation counts of records retrieved from the Web of Science database conform to Benford’s law. We evaluate the level of conformity in the complete dataset and determine how document type (article, proceedings paper and review), year of publication (2014–2018) and Web of Science category (254 categories) affect the conformity of the citation counts with Benford’s law.
Methods: The dataset of this research contains over 8.47 million records. All available records from the Web of Science were downloaded, so this set is the entire population of data available at the time of download. The distributions of the first significant digits in the citation counts of these records are compared with Benford’s law. The mean absolute deviation (MAD), recommended by Nigrini (2012), and the sum of squared deviations (SSD), recommended by Kossovsky (2015), are used to categorize the similarity of the citation counts to Benford’s law.
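The comparison described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' code: the function names are hypothetical, records with zero citations are assumed to be excluded because they have no first significant digit, and the SSD is assumed to be computed on percentages rather than proportions.

```python
import math
from collections import Counter


def benford_expected():
    """Expected first-digit proportions under Benford's law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(n):
    """First significant digit of a positive integer."""
    return int(str(n)[0])


def first_digit_proportions(citation_counts):
    """Observed first-digit proportions; zero counts are skipped (assumption)."""
    digits = [first_digit(c) for c in citation_counts if c > 0]
    if not digits:
        raise ValueError("no positive citation counts to analyse")
    tally = Counter(digits)
    return {d: tally.get(d, 0) / len(digits) for d in range(1, 10)}


def mad(observed, expected):
    """Mean absolute deviation over the nine first digits (proportions)."""
    return sum(abs(observed[d] - expected[d]) for d in range(1, 10)) / 9


def ssd(observed, expected):
    """Sum of squared deviations, computed on percentages (assumption)."""
    return sum((100 * observed[d] - 100 * expected[d]) ** 2 for d in range(1, 10))


if __name__ == "__main__":
    # Toy citation counts, purely illustrative and not data from the study.
    sample = [1, 12, 25, 0, 130, 7, 19, 3, 41, 102]
    observed = first_digit_proportions(sample)
    expected = benford_expected()
    print("MAD:", mad(observed, expected))
    print("SSD:", ssd(observed, expected))
```

For both measures, a lower value indicates closer agreement with Benford's law.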
Results: The entire dataset of this study shows marginal conformity according to both MAD and SSD intervals (with a MAD value of 0.1257 and an SSD value of 29.9; a lower value indicates better agreement). The review document type shows a high level of conformity, while the proceedings paper type shows a lower level. We found significant differences in conformity between Web of Science categories.
Conclusion: This study mapped the level of conformity of the citation counts with Benford’s law in data from the Web of Science database. Further directions for possible research are suggested.
Keywords: Benford’s law; Citations; Scientometrics; Bibliometrics; Web of Science.
Received: March 8, 2025; Revised: June 30, 2025; Accepted: July 6, 2025; Prepublished online: August 11, 2025
References
- Alves, A.D., Yanasse, H. H., & Soma, N. Y. (2014). Benford's Law and articles of scientific journals: Comparison of JCR® and Scopus data. Scientometrics, 98(1), 173-184. https://doi.org/10.1007/s11192-013-1030-8
- Alves, A.D., Yanasse, H. H., & Soma, N. Y. (2016). An analysis of bibliometric indicators to JCR according to Benford's law. Scientometrics, 107(3), 1489-1499. https://doi.org/10.1007/s11192-016-1908-3
- Aksnes, D. W., Langfeldt, L., & Wouters, P. (2019). Citations, Citation Indicators, and Research Quality: An Overview of Basic Concepts and Theories. Sage Open, 9(1), 1-17. https://doi.org/10.1177/2158244019829575
- Bantange, Ch., Burgett, D., Haws, L., & Nelson, S. P. (2023). The "Benfordness" of Bach Music. Journal of Humanistic Mathematics, 13(2), 389-397. https://doi.org/10.5642/jhummath.SGFV8169
- Benford, F. (1938). The Law of Anomalous Numbers. Proceedings of the American Philosophical Society, 78(4), 551-572.
- Berger, A., Hill, T. P., & Rogers, E. (2009). Benford Online Bibliography. http://www.benfordonline.net
- Berger, A., & Hill, T. P. (2015). An introduction to Benford's law. Princeton University Press.
- Bertin, M., & Lafouge, T. (2025). Categorization of scientometric data in a Benfordian context. Quantitative Science Studies, 6, 524-545. https://doi.org/10.1162/qss_a_00361
- Bornmann, L., & Daniel, H. (2008). What do citation counts measure? A review of studies on citing behavior. Journal of Documentation, 64(1), 45-80. https://doi.org/10.1108/00220410810844150
- Campanario, J. M., & Coslado, M. A. (2011). Benford's law and citations, articles and impact factors of scientific journals. Scientometrics, 88(2), 421-432. https://doi.org/10.1007/s11192-011-0387-9
- Cerqueti, R., & Maggi, M. (2021). Data validity and statistical conformity with Benford's Law. Chaos, Solitons & Fractals, 144, 110740. https://doi.org/10.1016/j.chaos.2021.110740
- Cleary, R., & Thibodeau, J. C. (2005). Applying Digital Analysis Using Benford's Law to Detect Fraud: The Dangers of Type I Errors. Auditing: A Journal of Practice & Theory, 24(1), 77-81. https://doi.org/10.2308/aud.2005.24.1.77
- Crespo, J. A., Li, Y., Ruiz-Castillo, J., & Bornmann, L. (2013). The Measurement of the Effect on Citation Inequality of Differences in Citation Practices across Scientific Fields. PLoS ONE, 8(3), e58727. https://doi.org/10.1371/journal.pone.0058727
- Druica, E., Oancea, B., & Valsan, C. (2018). Benford's law and the limits of digit analysis. International Journal of Accounting Information Systems, 31, 75-82. https://doi.org/10.1016/j.accinf.2018.09.004
- Egghe, L. (2005). Power Laws in the Information Production Process: Lotkaian Informetrics. Elsevier Academic Press.
- Egghe, L. (2011). Benford's law is a simple consequence of Zipf's Law. ISSI Newsletter, 7(3), 55-56.
- Egghe, L., & Guns, R. (2012). Applications of the generalized law of Benford to informetric data. Journal of the American Society for Information Science and Technology, 63(8), 1662-1665. https://doi.org/10.1002/asi.22690
- Gupta, S., Singh, V.K., & Banshal, S.K. (2024). Altmetric data quality analysis using Benford's law. Scientometrics, 129, 4597-4621. https://doi.org/10.1007/s11192-024-05061-9
- Kennedy, A.P., & Yam, S.C.P. (2020). On the authenticity of COVID-19 case figures. PLoS ONE, 15(12), e0243123. https://doi.org/10.1371/journal.pone.0243123
- Kossovsky, A.E. (2015). Benford's Law: theory, the general Law of relative quantities, and forensic fraud detection applications. World Scientific.
- Kossovsky, A.E. (2021). On the Mistaken Use of the Chi-Square Test in Benford's Law. Stats, 4(2), 419-453. https://doi.org/10.3390/stats4020027
- Latour, B., & Woolgar, S. (1979). Laboratory Life: The Social Construction of Scientific Facts. Sage.
- Lotka, A.J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317-324.
- Mandelbrot, B. (1965). Information Theory and Psycholinguistics. In B.B. Wolman & E. Nagel (Eds.), Scientific psychology, (pp. 550-562). Basic Books.
- Merton, R.K. (1973). The Sociology of Science: Theoretical and Empirical Investigations. University of Chicago Press.
- Miranda, R., & Garcia-Carpintero, E. (2018). Overcitation and overrepresentation of review papers in the most cited papers. Journal of Informetrics, 12(4), 1015-1030. https://doi.org/10.1016/j.joi.2018.08.006
- Newcomb, S. (1881). Note on the Frequency of Use of the Different Digits in Natural Numbers. American Journal of Mathematics, 4, 39-40.
- Nigrini, M. J. (2012). Benford's Law: applications for forensic accounting, auditing, and fraud detection. Wiley.
- Ochsner, M. (2021). Bibliometrics in the Humanities, Arts and Social Sciences. In Handbook Bibliometrics, (pp. 117-124). Walter de Gruyter.
- Redner, S. (2005). Citation statistics from 110 years of Physical Review. Physics Today, 58, 49-54.
- Ruiz-Castillo, J., & Costas, R. (2014). The skewness of scientific productivity. Journal of Informetrics, 8(4), 917-934. https://doi.org/10.1016/j.joi.2014.09.006
- Solla Price, D. (1965). Networks of scientific papers. Science, 149, 510-515.
- Sorour, M. A., Marey, Y. A., Halim, I. T. A., & Kasem, M. M. (2024). Statistical Investigation of Scientific Journals Impact Factors in Relation to Benford's Law. In 2024 6th Novel Intelligent and Leading Emerging Sciences Conference (NILES), (pp. 521-524). IEEE. https://doi.org/10.1109/NILES63360.2024.10753145
- Šlosar, D. J. (2025). Dataset of first significant digits citation counts of documents from WoS Categories 2014-2018. Zenodo.org. https://doi.org/10.5281/zenodo.16935993
- Tošić, A., & Vičič, J. (2021). Use of Benford's law on academic publishing networks. Journal of Informetrics, 15(3), 101163. https://doi.org/10.1016/j.joi.2021.101163
- Zipf, G.K. (1949). Human Behavior and the Principle of Least Effort. Addison-Wesley.
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.