Benchmarking Large Language Models in Evaluating Workforce Risk of Robotization: Insights from Agriculture / Benos, Lefteris; Marinoudi, Vasso; Busato, Patrizia; Kateris, Dimitrios; Pearson, Simon; Bochtis, Dionysis. - In: AGRIENGINEERING. - ISSN 2624-7402. - 7:4(2025). [10.3390/agriengineering7040102]
Benchmarking Large Language Models in Evaluating Workforce Risk of Robotization: Insights from Agriculture
Busato, Patrizia
2025
Abstract
Understanding the impact of robotization on workforce dynamics has become increasingly urgent. While expert assessments provide valuable insights, they are often time-consuming and resource-intensive. Large language models (LLMs) offer a scalable alternative; however, their accuracy and reliability in evaluating workforce robotization potential remain uncertain. This study systematically compares general-purpose LLM-generated assessments with expert evaluations, treating human judgments as the ground truth, to assess their effectiveness in the agricultural sector. Using ChatGPT, Copilot, and Gemini, the LLMs followed a three-step evaluation process covering (a) task importance, (b) potential for task robotization, and (c) task attribute indexing across 15 agricultural occupations, mirroring the methodology used by the human assessors. The findings indicate a significant tendency for LLMs to overestimate robotization potential, with most errors falling within the range 0.229 ± 0.174. This can be attributed primarily to LLM reliance on grey literature and idealized technological scenarios, as well as their limited capacity to account for the complexities of agricultural work. Future research should focus on integrating expert knowledge into LLM training, improving bias detection and mitigation in agricultural datasets, and expanding the range of LLMs studied to enhance assessment reliability.
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/3000489
Warning
The data displayed have not been validated by the university.