Do Scopus and WoS correct “old” omitted citations? / Franceschini, Fiorenzo; Maisano, Domenico Augusto Francesco; Mastrogiacomo, Luca. - In: SCIENTOMETRICS. - ISSN 0138-9130. - Print. - 107:2(2016), pp. 321-335. [10.1007/s11192-016-1867-8]
Do Scopus and WoS correct “old” omitted citations?
Franceschini, Fiorenzo; Maisano, Domenico Augusto Francesco; Mastrogiacomo, Luca
2016
Abstract
Omitted citations (i.e., missing links between a cited paper and the corresponding citing papers) are a consequence of several bibliometric-database errors. To reduce these errors, databases may take two actions: (1) improving the control of the new papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (2) detecting and correcting errors in the papers already indexed by the database, i.e., cleaning “old” dirty data. The latter action is probably more complicated, as it requires applying suitable error-detection procedures to a huge amount of data. Based on an extensive sample of scientific papers in the Engineering-Manufacturing field, this study focuses on old dirty data in the Scopus and WoS databases. To this end, a recent automated algorithm for estimating the omitted-citation rate of databases is applied to the same sample of papers in three sessions held at different times. A database's ability to clean old dirty data is evaluated by considering the variations in the omitted-citation rate from session to session. The main findings of this study are that (1) both databases correct old omitted citations slowly, and (2) surprisingly, a small portion of the initially corrected citations can later disappear from the databases.

File | Size | Format
---|---|---
SCIENTOMETRICS Revised_SCIM-D-15-00388R1 (Accepted version DM) no yellow.doc | 4.56 MB | Microsoft Word

Open Access since 02/02/2017
Description: SCIENTOMETRICS Revised_SCIM-D-15-00388R1 (Accepted version DM)
Type: 2. Post-print / Author's Accepted Manuscript
License: Public - All rights reserved
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/2640161