Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach / Brigante, Giulia; Lazzaretti, Clara; Paradiso, Elia; Nuzzo, Federico; Sitti, Martina; Tüttelmann, Frank; Moretti, Gabriele; Silvestri, Roberto; Gemignani, Federica; Försti, Asta; Hemminki, Kari; Elisei, Rossella; Romei, Cristina; Zizzi, Eric Adriano; Deriu, Marco Agostino; Simoni, Manuela; Landi, Stefano; Casarini, Livio. - In: EUROPEAN THYROID JOURNAL. - ISSN 2235-0640. - ELECTRONIC. - 11:5(2022). [10.1530/ETJ-22-0058]

Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach

Zizzi, Eric Adriano;Deriu, Marco Agostino;
2022

Abstract

To identify a specific genetic combination predisposing to differentiated thyroid carcinoma (DTC), we selected a set of single-nucleotide polymorphisms (SNPs) associated with DTC risk, using a polygenic risk score (PRS), Bayesian statistics, and a machine learning (ML) classifier to describe cases and controls in three different datasets. Dataset 1 (649 DTC, 431 controls) had previously been genotyped in a genome-wide association study (GWAS) on Italian DTC. Dataset 2 (234 DTC, 101 controls) and dataset 3 (404 DTC, 392 controls) were genotyped. Associations of 171 SNPs reported to predispose to DTC in candidate studies were extracted from the GWAS of dataset 1, and the SNPs associated with DTC risk (P<0.05) were then tested for replication in dataset 2. The reliability of the identified SNPs was confirmed by PRS and Bayesian statistics after merging the three datasets. The SNPs were then used by an ML classifier to describe the case/control state of individuals. Of the 171 candidate SNPs, 15 were associated with DTC risk in both datasets 1 and 2. Using these markers, the PRS showed that individuals in the fifth quintile had a 7-fold higher risk of DTC than those in the first. Bayesian inference confirmed that the selected 15 SNPs differentiate cases from controls. The results were corroborated by ML, which reached a maximum AUC of about 0.7. A restricted selection of only 15 DTC-associated SNPs is able to describe the inner genetic structure of Italian individuals, and ML allows a fair prediction of case or control status based solely on the individual genetic background.
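
As an illustration of the quintile comparison mentioned in the abstract, the following is a minimal Python sketch of how a PRS can be built from 15 SNPs and how the odds ratio between the fifth and first quintile can be computed. It is not the authors' pipeline: the abstract does not say whether the score was weighted, so equal weights are assumed here, and the genotypes, allele frequencies, sample sizes, and function names are all hypothetical and generated only for the example.

```python
import random

def polygenic_risk_score(genotype, weights):
    """Weighted sum of risk-allele counts (0, 1, or 2 per SNP)."""
    return sum(w * g for g, w in zip(genotype, weights))

def quintile_edges(scores):
    """Four cut points splitting the scores into five roughly equal groups
    (tied scores can make the groups uneven)."""
    s = sorted(scores)
    return [s[len(s) * k // 5] for k in range(1, 5)]

def quintile_of(score, edges):
    """Quintile index 1..5 for a score, given the four cut points."""
    return 1 + sum(score >= e for e in edges)

def odds_ratio_top_vs_bottom(scores, is_case):
    """Odds of being a case in quintile 5 divided by the odds in quintile 1."""
    edges = quintile_edges(scores)
    counts = {q: [0, 0] for q in range(1, 6)}  # quintile -> [controls, cases]
    for s, c in zip(scores, is_case):
        counts[quintile_of(s, edges)][int(c)] += 1
    (ctrl1, case1), (ctrl5, case5) = counts[1], counts[5]
    if case1 == 0 or ctrl5 == 0:
        raise ValueError("empty cell: odds ratio is undefined")
    return (case5 * ctrl1) / (case1 * ctrl5)

# --- Synthetic data (assumption: NOT the study's genotypes) ---
random.seed(0)
N_SNPS = 15                 # number of SNPs, as in the abstract
WEIGHTS = [1.0] * N_SNPS    # assumption: unweighted score

def simulate_genotype(risk_allele_freq):
    """Draw 0/1/2 risk alleles per SNP from two independent allele draws."""
    return [
        (random.random() < risk_allele_freq) + (random.random() < risk_allele_freq)
        for _ in range(N_SNPS)
    ]

# Hypothetical frequencies: cases carry risk alleles slightly more often.
controls = [simulate_genotype(0.30) for _ in range(1000)]
cases = [simulate_genotype(0.36) for _ in range(1000)]

scores = [polygenic_risk_score(g, WEIGHTS) for g in controls + cases]
is_case = [False] * len(controls) + [True] * len(cases)

or_5_vs_1 = odds_ratio_top_vs_bottom(scores, is_case)
print(f"Odds ratio, quintile 5 vs quintile 1: {or_5_vs_1:.2f}")
```

In this sketch the quintile cut points are taken from the pooled sample; deriving them from controls only, or weighting each SNP by its per-allele log odds ratio, are common alternatives that would change the numbers but not the overall logic.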
Files in this record:

ACCEPTED_[22350802 - European Thyroid Journal] Genetic signature of differentiated thyroid carcinoma susceptibility a machine learning approach.pdf
Open access
Description: Accepted manuscript
Type: 2. Post-print / Author's Accepted Manuscript
License: Creative Commons
Size: 2.63 MB
Format: Adobe PDF

2235-0802-ETJ-22-0058.pdf
Open access
Type: 2a. Post-print, publisher's version / Version of Record
License: Creative Commons
Size: 1.65 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2970706