Generative AI and intellectual property: a systematic literature review of possible issues and mitigations

Arnaudo, Anna; Coppola, Riccardo; Morisio, Maurizio; Vetro, Antonio; Borghi, Maurizio; Raso, Riccardo; Khan, Bryan

doi:10.7717/peerj-cs.3928

The voracious appetite of Generative AI (GenAI) for data has raised a number of copyright issues. We conducted a Systematic Literature Review (SLR) focusing on the types of copyright-protected data involved, the main challenges for compliance, and the ongoing mitigation efforts. This study, which covers the literature published between 2019 and the first half of 2025, examines the entire GenAI pipeline in light of EU copyright legislation. At the input stage, recurring issues include opacity surrounding the use of training data, misalignment between content licences and their usage, and the lack of mechanisms to regulate web scrapers collecting training material. Concerns also arise from fine-tuning aimed at stylistic emulation, as well as from inadvertent memorisation and verbatim reproduction of training material. Various mitigation strategies have emerged: watermarking, adversarial perturbation, training data attribution with possible royalty distribution, synthetic datasets, and machine-readable Text and Data Mining (TDM) opt-out mechanisms. Output-related vulnerabilities are similarly relevant, as adversarial attacks may extract sensitive data or generate style-imitating content. Retrieval-Augmented Generation (RAG) offers improved transparency, but remains prone to misattribution. The review highlights ongoing research dedicated to enhancing transparency in GenAI output, including provenance-tracking protocols, blockchain-based integrity systems, watermarking techniques, and inference-time defences. Given the heterogeneity of the field, this review prioritised breadth over depth, leaving room for future studies while serving as a foundational guide for readers with prior exposure to, or a developing understanding of, the field.
The findings underscore the need for enhanced copyright safeguards, potentially structured as multi-layered protective systems, and for cross-sectoral collaboration to align rapid generative AI development with robust intellectual property protection.

Generative AI and intellectual property: a systematic literature review of possible issues and mitigations / Arnaudo, A., Coppola, R., Morisio, M., Vetro, A., Borghi, M., Raso, R., Khan, B. - In: PEERJ. COMPUTER SCIENCE. - ISSN 2376-5992. - Electronic. - 12:(2026). [10.7717/peerj-cs.3928]


Year of publication: 2026
DOI: https://dx.doi.org/10.7717/peerj-cs.3928
Journal title: PEERJ. COMPUTER SCIENCE.
Appears in categories: 1.1 Journal article

Files in this record:

File	Size	Format
peerj-cs-3928.pdf (open access; type: 2a Post-print, publisher's version / Version of Record; licence: Creative Commons)	2.37 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/3010554

PORTO @ Archivio Istituzionale della Ricerca
