This study investigates document-type (DT) classification errors—such as misclassifications of research articles, reviews, conference proceedings, editorials, etc.—in the bibliometric databases Scopus and Web of Science, focusing on the field of engineering. Such misclassifications can adversely affect academic research and research quality assessments, with potential repercussions on researchers’ careers and the allocation of funding in academic institutions. By analysing a corpus of approximately 10,000 publications, a semiautomated approach is employed to identify misclassified documents and attribute errors to the responsible database. Additionally, the role of publishers (e.g., “Elsevier”, “Springer Nature”, “Taylor & Francis”, etc.) is investigated, based on the hypothesis that certain publishers may contribute more significantly than others to DT-classification errors, due to specific editorial practices or inconsistencies in metadata. The results reveal that classification errors are non-negligible (i.e., they occur at rates of a few percentage points) and that the extent of publishers’ contributions varies significantly in both Scopus and WoS. However, the most problematic publishers for each database appear to be uncorrelated. Integrating various statistical tests, this study provides insights that may be valuable to researchers, research evaluators, database operators, and publishers, raising awareness of the issue and offering preliminary indications for identifying possible remedies to mitigate such errors and enhance DT-classification accuracy. Future research will further investigate the reasons behind the concentration of errors among certain publishers.
Errors in document‑type classification: a focus on engineering publications and their publishers / Maisano, Domenico A.; Ferrara, Lucrezia; Franceschini, Fiorenzo. - In: SCIENTOMETRICS. - ISSN 0138-9130. - STAMPA. - 130:11(2025), pp. 6613-6641. [10.1007/s11192-025-05444-6]
Errors in document‑type classification: a focus on engineering publications and their publishers
Maisano, Domenico A.;Ferrara, Lucrezia;Franceschini, Fiorenzo
2025
Abstract
This study investigates document-type (DT) classification errors—such as misclassifications of research articles, reviews, conference proceedings, editorials, etc.—in the bibliometric databases Scopus and Web of Science, focusing on the field of engineering. Such misclassifications can adversely affect academic research and research quality assessments, with potential repercussions on researchers’ careers and the allocation of funding in academic institutions. By analysing a corpus of approximately 10,000 publications, a semiautomated approach is employed to identify misclassified documents and attribute errors to the responsible database. Additionally, the role of publishers (e.g., “Elsevier”, “Springer Nature”, “Taylor & Francis”, etc.) is investigated, based on the hypothesis that certain publishers may contribute more significantly than others to DT-classification errors, due to specific editorial practices or inconsistencies in metadata. The results reveal that classification errors are non-negligible (i.e., they occur at rates of a few percentage points) and that the extent of publishers’ contributions varies significantly in both Scopus and WoS. However, the most problematic publishers for each database appear to be uncorrelated. Integrating various statistical tests, this study provides insights that may be valuable to researchers, research evaluators, database operators, and publishers, raising awareness of the issue and offering preliminary indications for identifying possible remedies to mitigate such errors and enhance DT-classification accuracy. Future research will further investigate the reasons behind the concentration of errors among certain publishers.| File | Dimensione | Formato | |
|---|---|---|---|
|
Scientometrics v.130, n.11, 2025 pp.6613-6641 Errors in DT classification Focus on engineering publications and publishers DM LF FF.pdf
accesso aperto
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
1.61 MB
Formato
Adobe PDF
|
1.61 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3005635
