Identification and validation of potential common biomarkers for papillary thyroid carcinoma and Hashimoto’s thyroiditis … – Nature.com
Identify shared differential genes
When conducting PCA analysis on the expression matrices of GSE33570 (Fig.2a) and GSE29315 (Fig.2d), we observed a clear two-sided distribution of samples in both the disease group and the control group. In the analysis of the GSE35570 dataset, a total of 1572 distinct genes were detected as being differentially expressed. These DEGs were categorized into 824 up-regulated genes and 748 down-regulated genes (Fig.2b). Similarly, we observed 423 DEGs in the GSE29315 dataset, including 271 up-regulated DEGs and 152 down-regulated DEGs (Fig.2e). Next, the GEGs of the two datasets are displayed heatmaps for both datasets (Fig.2c,f). Furthermore, we employed a Venn diagram to identify the overlapping genes with the same directional trend, resulting in 64 genes being up-regulated (Fig.2g) and 37 genes being down-regulated (Fig.2h).
Differential expression gene analysis, function enrichment analysis and pathway enrichment analysis. (a) The PCA plot of GSE35570. (b, c) The Volcano plot and heatmap of DEGs in GSE33570. (d) The PCA plot of GSE29315. (e, f) The Volcano plot and heatmap of DEGs in GSE29315. (g) Venn plot of the up-regulated DEGs. (h) Venn plot of the down-regulated DEGs. (i) The KEGG enrichment analyses of DEGs. (j) The GO enrichment analyses of DEGs.
In order to enhance our comprehension of the fundamental biological functions linked to the 101 DEGs, an assessment of GO and KEGG enrichment was conducted using the clusterProfiler software package in R. An analysis of GO highlighted that these shared genes were mainly enriched in leukocyte mediated immunity, myeloid leukocyte activation, and antigen processing and presentation (Fig.2j). Additionally, the DEGs exhibited significant enrichment across the top five KEGG pathways, including Tuberculosis, Phagosome, Viral myocarditis, Inflammatory bowel disease, and Th1 and Th2 cell differentiation (Fig.2i). Apparently, the functions of differentially expressed genes are closely associated with the immune function of the body. The core genes primarily serve the purpose of activating immune cells.
To carry out the PPI analysis, we utilized the STRING online tool and visualized the outcomes using the Cytoscape software (Supplementary Fig. S1a). The PPI network showed 68 nodes and 498 edges. The DC value of each node was calculated, with a median value of 11. Based on this, we identified 17 hub genes of PPI network: TYROBP, ITGB2, STAT1, HLA-DRA, C1QB, MMP9, FCER1G, IL10RA, LCP2, LY86, CD53, CD14, CD163, HCK, MNDA, HLA-DPA1, and ALOX5AP. Subsequently, we employed the MCODE plug-in to identify six modules (Supplementary Fig. S1b,c), which included a total of 29 common DEGs. These DEGs were LCP2, TYROBP, CD53, LY86, ITGB2, FCER1G, MNDA, C1QB, HCK, IL10RA, HLA-DRA, ALOX5AP, MT1G, MT1F, MT1E, MT1X, ISG15, IFIT3, PSMB9, GBP2, CD14, CD163, VSIG4, CAV1, TIMP1, S100A4, SDC2, FGFR2, and STAT1. The most important module comprises 12 genes (LCP2, TYROBP, CD53, LY86, ITGB2, FCER1G, MNDA, C1QB, HCK, IL10RA, HLA-DRA, ALOX5AP), which were further analyzed using the ClueGO plug-in in Cytoscape software. The investigation revealed that these genes primarily function in activating neutrophils to participate in the immune response and activating innate immunity (Supplementary Fig. S1d).
In this study, we analyzed a total of 26 genes from six modules extracted from MCODE. To determine the importance of each gene, we employed the RF algorithm in two datasets, namely GSE35570 (Fig.3a) and GSE29315 (Fig.3b). By comparing the rankings of gene importance in both datasets, we identified the top eight genes that were consistently ranked highly. To visualize this overlap, we created a Venn diagram (Fig.3c), which revealed three genes (CD53, FCER1G and TYROBP) that were shared between the two datasets. Remarkably, these three genes overlap with the hub genes identified through the PPI analysis based on DC values, as well as the genes found in the most significant module. These three genes showed promising diagnostic potential for HT and PTC. To evaluate the diagnostic value of the common hub genes, we computed the Cutoff Value, sensitivity, specificity, AUC and 95% CI for each gene in the four datasets (Table 1). In the GSE35570 dataset (Fig.3d), the AUC values were as follows: CD53 (AUC 0.71, 95% CI 0.610.82), FCER1G (AUC 0.81, 95% CI 0.730.89), and TYROBP (AUC 0.79, 95% CI 0.710.88). In the GSE29315 dataset (Fig.3e), the AUC values were as follows: CD53 (AUC 1.00, 95% CI 1.001.00), FCER1G (AUC 1.00, 95% CI 1.001.00) and TYROBP (AUC 1.00, 95% CI 1.001.00). In the TCGA dataset (Fig.3f), we validated the diagnostic value of the common hub genes for PTC. The AUC values were as follows: CD53 (AUC 0.71 95% CI 0.610.82), FCER1G (AUC 0.74, 95% CI 0.640.89) and TYROBP (AUC 0.80, 95% CI 0.700.89). To further evaluate the diagnostic value of the common hub genes for PTC in HT, we computed the AUC and 95% CI for each gene using GSE1398198. In the GSE138198 dataset (Fig.3g), the AUC values were as follows: CD53 (AUC 0.83, 95%CI 0.571.00), FCER1G (AUC 0.92, 95% CI 0.721.00) and TYROBP (AUC 1.00, 95% CI 1.001.00). We also analyzed the difference box plots between the two groups in the four datasets (Supplementary Fig. S2). Our analysis using box plots revealed a noteworthy disparity in gene expression between the HT group and the control group in GSE29315. This disparity serves as an explanation for the AUC values of the three hub genes in GSE29315, all of which were observed to be 1.
Screening of hub genes and the diagnostic value of hub genes. (a) The rankings of gene importance in GSE35570. (b) The rankings of gene importance in GSE29315. (c) Venn plot of the top eight genes in GSE35570 and GSE29315. (d) Diagnostic value of hub genes in the GSE35570. (e) Diagnostic value of hub genes in the GSE29315, (f) Diagnostic value of hub genes in the TCGA. (g) Diagnostic value of hub genes in the GSE138198.
By using the GSE35570 dataset, we developed three diagnostic model specifically for PTC, incorporating these pivotal genes that were identified through our analysis. The ANN model (Fig.4a) had 4 hidden units, a penalty of 0.0108, and was trained for 537 epochs. The ANN model achieved an AUC of 0.94 (95% CI 0.910.98) in the training set, while in the test set, the AUC was 0.94 (95% CI 0.831.00) (Fig.4b). The XGBoost model had 8 mtry, 6 min_n, 3 max_depth, 0.001 learn_rate, and 0.07 loss_reduction and 0.97 sample_size. The XGBoost model achieved an AUC of 0.84 (95% CI 0.750.93) in the training set, while in the test set, the AUC was 0.62 (95% CI 0.420.83) (Supplementary Fig. S3a). The DT model had 0.0003 cost_complexity, 5 tree_depth and 6 min_n. The DT model achieved an AUC of 0.93 (95% CI 0.900.97) in the training set, while in the test set, the AUC was 0.83 (95% CI 0.651.00) (Supplementary Fig. S3b). Supplementary Table S1 displays the predictive performance of three machine learning models. The results indicate that the ANN model outperformed the other models, leading us to choose the ANN model for further analysis. TCGA dataset as external validation dataset was utilized to assess the diagnostic performance of the ANN model for PTC, yielding an AUC value of 0.77 (95% CI 0.660.87) (Fig.4c). The GSE138198 dataset was used to evaluate the ANN models diagnostic efficacy for PTC in HT. In the GSE138198 dataset (Fig.4d), the ANN model demonstrated a perfect AUC of 1.00 (95% CI 1.001.00). To provide clinicians with a better understanding of variable contributions, we utilized the SHAP algorithm to interpret the ANN prediction results. Figure4e, f, g illustrated how the attributed importance of features changed as their values varied. Our findings reveal that CD53 had the most significant impact on the output of the ANN model. Initially, it was positively associated with the risk of PTC and then became negatively correlated after a turning point of approximately 6. TYROBP and FCER1G showed a positive correlation with the occurrence of PTC.
ANN model construction and feature importance analysis. (a) The ANN was constructed based on the shared hub genes. (b) Diagnostic value of the ANN model in the GSE35570. (c) Diagnostic value of the ANN model in the TCGA. (d) Diagnostic value of the ANN model in the GSE138198. (e) A score calculated by SHAP was used for each input feature. (f, g) Distribution of the impact of each feature on the full model output estimated using the SHAP values.
We analyzed the protein expression of the hub genes based on the HPA database (Supplementary Fig. S4). CD53 was highly expressed in both tumor and normal tissues, while FCER1G and TYROBP showed higher expression in tumors compared to normal tissues. Furthermore, IF staining was performed to measure the expressions of CD53, FCER1G, and TYROBP in our clinical samples, including 10 HT-related PTC tissues and 6 NAT. By performing IF analysis (Fig.5), we obtained semi-quantitative results indicating significantly elevated fluorescence signal intensities for CD53, FCER1G, and TYROBP in the HT-related PTC group, as compared to the NAT group (P<0.05).
Microscopy scan of IF staining showed the distribution of CD53(green), FCER1G(green), and TYROBP(green), in HT-related PTC tissues and normal tissues adjacent to the tumour (NAT); as well as diagnostic value of CD53, FCER1G and TYROBP. MFI: Mean Fluorescence Intensity.
Considering the important roles of immune and inflammatory responses in the development of HT and PTC, we analyzed the differences in immune cell infiltration patterns between PTC, HT and normal samples using the CIBERSORT algorithm. By utilizing the GSE35570 dataset, we identified 12 immune subgroups that exhibited significant variations between PTC and normal samples (Supplementary Fig. S5a). Additionally, the analysis of the GSE29315 dataset revealed 5 immune subgroups that were significantly different between HT and normal samples (Supplementary Fig. S5b). Among these, 4 common immune subpopulations were found to be significantly higher in both PTC and HT samples compared to normal samples. These subpopulations included T cells CD8, T cells CD4 memory resting, macrophages M1 and mast cells resting. Additionally, we conducted spearman correlation analysis between hub genes and immune cells (Supplementary Fig. S5c,d). The results suggested that immune responses could potentially contribute to the involvement of hub genes in PTC and HT progression. IF staining was utilized to identify immune cell infiltration in 5 cases of PTC in HT tissues and 5 cases of NAT (Fig.6). The expression levels of CD4+T-cell marker Cd4, CD8+T-cell marker Cd8, and macrophage marker Cd86 were found to be significantly higher in the PTC in HT group compared to the NAT group. The IF staining results provided some extent of verification for the accuracy of the immune infiltration analysis results.
Microscopy scan of IF staining showed the distribution of Cd4(green), Cd8(green), and Cd86(green), in HT-related PTC tissues and normal tissues adjacent to the tumour (NAT). MFI: Mean Fluorescence Intensity.
Based on the three core genes screened in the RF algorithm, we conducted a search in the DGIdb database for relevant potential drugs. The results showed that only FCER1G had relevant drugs, while no relevant drugs were found for CD53 and TYROBP. FCER1G was predicted to have two potential drugs: benzylpenicilloyl polylysine and aspirin. Among these, benzylpenicilloyl polylysine had the highest score of 29.49, while aspirin had a score of only 1.26. We hypothesise that benzylpenicilloyl polylysine and aspirin may be effective in the treatment of HT and PTC and may prevent HT carcinogenesis.
See original here:
Identification and validation of potential common biomarkers for papillary thyroid carcinoma and Hashimoto's thyroiditis ... - Nature.com
- Machine learning at the edge in retail: constraints and gains - IoT News - March 26th, 2026 [March 26th, 2026]
- AI agents are flashy, but machine learning still pays the bills - TechRadar - March 26th, 2026 [March 26th, 2026]
- Single-cell imaging and machine learning reveal hidden coordination in algae's response to light stress - Phys.org - March 26th, 2026 [March 26th, 2026]
- Machine learning analysis of CT scans - National Institutes of Health (.gov) - March 22nd, 2026 [March 22nd, 2026]
- TransUnion Machine Learning Fraud Tools Tested Against Weak Share Price Momentum - simplywall.st - March 22nd, 2026 [March 22nd, 2026]
- Machine learning could help predict how people with depression respond to treatment - Medical Xpress - March 22nd, 2026 [March 22nd, 2026]
- KR approves machine learning-based fuel reduction methodology - Smart Maritime Network - March 22nd, 2026 [March 22nd, 2026]
- Available solar energy in Andalusia will increase through the end of the century, machine learning model finds - Tech Xplore - March 22nd, 2026 [March 22nd, 2026]
- How Machine Learning Is Reshaping Environmental Policy and Water Governance - Devdiscourse - March 22nd, 2026 [March 22nd, 2026]
- Chemistry student uses machine learning to transform gene therapy production - The University of North Carolina at Chapel Hill - March 13th, 2026 [March 13th, 2026]
- AI and Machine Learning - City of Brownsville to build smart city safety solution - Smart Cities World - March 13th, 2026 [March 13th, 2026]
- AI and Machine Learning - London borough overhauls public safety infrastructure - Smart Cities World - March 13th, 2026 [March 13th, 2026]
- Titan Technology Corp. Responds to Alberta Innovates RFP AI, Machine Learning and Automation Services - TradingView - March 13th, 2026 [March 13th, 2026]
- Vietnam FPT's AI automation solution secures new machine learning patent on overseas market - VnExpress International - March 13th, 2026 [March 13th, 2026]
- AI Healthcare Technology: The Power of Machine Learning Diagnosis in Modern Medicine - Tech Times - March 13th, 2026 [March 13th, 2026]
- Future Perspectives: Key Trends Shaping the Machine Learning Market in Financial Services Until 2030 - openPR.com - March 13th, 2026 [March 13th, 2026]
- How to Build an Autonomous Machine Learning Research Loop in Google Colab Using Andrej Karpathys AutoResearch Framework for Hyperparameter Discovery... - March 13th, 2026 [March 13th, 2026]
- The Arc in Arc Raiders have multiple "brains," and they all love pursuing you because Embark gives them "rewards" in real-time via... - March 13th, 2026 [March 13th, 2026]
- OnPoint AI to Present its Augmented Reality and Machine Learning Surgical Platform at the 2026 Canaccord Genuity Musculoskeletal Conference - Yahoo... - February 27th, 2026 [February 27th, 2026]
- TD Bank continues to develop AI, machine learning tools - Auto Finance News - February 27th, 2026 [February 27th, 2026]
- AI and Machine Learning - Tech companies team to scale private 5G and physical AI - Smart Cities World - February 27th, 2026 [February 27th, 2026]
- AI and Machine Learning in Dating Apps: Smarter Matchmaking Algorithms - Programming Insider - February 27th, 2026 [February 27th, 2026]
- Machine-Learning App Helps Anesthesiologists Navigate Critical Surgical Equipment in Real Time - Carle Illinois College of Medicine - February 24th, 2026 [February 24th, 2026]
- Fractal Launches PiEvolve, an Evolutionary Agentic Engine for Autonomous Machine Learning and Scientific Discovery - Yahoo Finance - February 24th, 2026 [February 24th, 2026]
- How Brain Data and Machine Learning Could Transform the Aging Industry - gritdaily.com - February 24th, 2026 [February 24th, 2026]
- AI and machine learning trends for Arizona leaders to watch in healthcare delivery and traveler services - AZ Big Media - February 24th, 2026 [February 24th, 2026]
- AI and machine learning are the future of Wi-Fi management: WBA report - Telecompetitor - February 22nd, 2026 [February 22nd, 2026]
- Machine learning streamlines the complexities of making better proteins - Science News - February 20th, 2026 [February 20th, 2026]
- WBA Publishes Guidance on Artificial Intelligence and Machine Learning for Intelligent Wi-Fi - ARC Advisory Group - February 20th, 2026 [February 20th, 2026]
- Machine learning-predicted insulin resistance is a risk factor for 12 types of cancer - Nature - February 20th, 2026 [February 20th, 2026]
- Exploring Machine Learning at the DOF - University of the Philippines Diliman - February 20th, 2026 [February 20th, 2026]
- AI and Machine Learning - Where US agencies are finding measurable value from AI - Smart Cities World - February 20th, 2026 [February 20th, 2026]
- Modeling visual perception of Chinese classical private gardens with image parsing and interpretable machine learning - Nature - February 16th, 2026 [February 16th, 2026]
- Analysis of Market Segments and Major Growth Areas in the Machine Learning (ML) Feature Lineage Tools Market - openPR.com - February 16th, 2026 [February 16th, 2026]
- Apple Makes One Of Its Largest Ever Acquisitions, Buys The Israeli Machine Learning Firm, Q.ai - Wccftech - February 1st, 2026 [February 1st, 2026]
- Keysights Machine Learning Toolkit to Speed Device Modeling and PDK Dev - All About Circuits - February 1st, 2026 [February 1st, 2026]
- University of Missouri Study: AI/Machine Learning Improves Cardiac Risk Prediction Accuracy - Quantum Zeitgeist - February 1st, 2026 [February 1st, 2026]
- How AI and Machine Learning Are Transforming Mobile Banking Apps - vocal.media - February 1st, 2026 [February 1st, 2026]
- Machine Learning in Production? What This Really Means - Towards Data Science - January 28th, 2026 [January 28th, 2026]
- Best Machine Learning Stocks of 2026 and How to Invest in Them - The Motley Fool - January 28th, 2026 [January 28th, 2026]
- Machine learning-based prediction of mortality risk from air pollution-induced acute coronary syndrome in the Western Pacific region - Nature - January 28th, 2026 [January 28th, 2026]
- Machine Learning Predicts the Strength of Carbonated Recycled Concrete - AZoBuild - January 28th, 2026 [January 28th, 2026]
- Vertiv Next Predict is a new AI-powered, managed service that combines field expertise and advanced machine learning algorithms to anticipate issues... - January 28th, 2026 [January 28th, 2026]
- Machine Learning in Network Security: The 2026 Firewall Shift - openPR.com - January 28th, 2026 [January 28th, 2026]
- Why IBMs New Machine-Learning Model Is a Big Deal for Next-Generation Chips - TipRanks - January 24th, 2026 [January 24th, 2026]
- A no-compromise amplifier solution: Synergy teams up with Wampler and Friedman to launch its machine-learning power amp and promises to change the... - January 24th, 2026 [January 24th, 2026]
- Our amplifier learns your cabinets impedance through controlled sweeps and continues to monitor it in real-time: Synergys Power Amp Machine-Learning... - January 24th, 2026 [January 24th, 2026]
- Machine Learning Studied to Predict Response to Advanced Overactive Bladder Therapies - Sandip Vasavada - UroToday - January 24th, 2026 [January 24th, 2026]
- Blending Education, Machine Learning to Detect IV Fluid Contaminated CBCs, With Carly Maucione, MD - HCPLive - January 24th, 2026 [January 24th, 2026]
- Why its critical to move beyond overly aggregated machine-learning metrics - MIT News - January 24th, 2026 [January 24th, 2026]
- Machine Learning Lends a Helping Hand to Prosthetics - AIP Publishing LLC - January 24th, 2026 [January 24th, 2026]
- Hassan Taher Explains the Fundamentals of Machine Learning and Its Relationship to AI - mitechnews.com - January 24th, 2026 [January 24th, 2026]
- Keysight targets faster PDK development with machine learning toolkit - eeNews Europe - January 24th, 2026 [January 24th, 2026]
- Training and external validation of machine learning supervised prognostic models of upper tract urothelial cancer (UTUC) after nephroureterectomy -... - January 24th, 2026 [January 24th, 2026]
- Age matters: a narrative review and machine learning analysis on shared and separate multidimensional risk domains for early and late onset suicidal... - January 24th, 2026 [January 24th, 2026]
- Uncovering Hidden IV Fluid Contamination Through Machine Learning, With Carly Maucione, MD - HCPLive - January 24th, 2026 [January 24th, 2026]
- Machine learning identifies factors that may determine the age of onset of Huntington's disease - Medical Xpress - January 24th, 2026 [January 24th, 2026]
- AI and Machine Learning - WEF expands Fourth Industrial Revolution Network - Smart Cities World - January 24th, 2026 [January 24th, 2026]
- Machine-learning analysis reclassifies armed conflicts into three new archetypes - The Brighter Side of News - January 24th, 2026 [January 24th, 2026]
- Machine learning and AI the future of drought monitoring in Canada - sasktoday.ca - January 24th, 2026 [January 24th, 2026]
- Machine learning revolutionises the development of nanocomposite membranes for CO capture - European Coatings - January 24th, 2026 [January 24th, 2026]
- AI and Machine Learning - Leading data infrastructure is helping power better lives in Sunderland - Smart Cities World - January 24th, 2026 [January 24th, 2026]
- How banks are responsibly embedding machine learning and GenAI into AML surveillance - Compliance Week - January 20th, 2026 [January 20th, 2026]
- Enhancing Teaching and Learning of Vocational Skills through Machine Learning and Cognitive Training (MCT) - Amrita Vishwa Vidyapeetham - January 20th, 2026 [January 20th, 2026]
- New Research in Annals of Oncology Shows Machine Learning Revelation of Global Cancer Trend Drivers - Oncodaily - January 20th, 2026 [January 20th, 2026]
- Machine learning-assisted mapping of VT ablation targets: progress and potential - Hospital Healthcare Europe - January 20th, 2026 [January 20th, 2026]
- Machine Learning Achieves Runtime Optimisation for GEMM with Dynamic Thread Selection - Quantum Zeitgeist - January 20th, 2026 [January 20th, 2026]
- Machine learning algorithm predicts Bitcoin price on January 31, 2026 - Finbold - January 20th, 2026 [January 20th, 2026]
- AI and Machine Learning Transform Baldness Detection and Management - Bioengineer.org - January 20th, 2026 [January 20th, 2026]
- A longitudinal machine-learning approach to predicting nursing home closures in the U.S. - Nature - January 11th, 2026 [January 11th, 2026]
- Occams Razor in Machine Learning. The Power of Simplicity in a Complex World - DataDrivenInvestor - January 11th, 2026 [January 11th, 2026]
- Study Explores Use of Automated Machine Learning to Compare Frailty Indices in Predicting Spinal Surgery Outcomes - geneonline.com - January 11th, 2026 [January 11th, 2026]
- Hunting for "Oddballs" With Machine Learning: Detecting Anomalous Exoplanets Using a Deep-Learned Low-Dimensional Representation of Transit... - January 9th, 2026 [January 9th, 2026]
- A Machine Learning-Driven Electrophysiological Platform for Real-Time Tumor-Neural Interaction Analysis and Modulation - Nature - January 9th, 2026 [January 9th, 2026]
- Machine learning elucidates associations between oral microbiota and the decline of sweet taste perception during aging - Nature - January 9th, 2026 [January 9th, 2026]
- Prognostic model for pancreatic cancer based on machine learning of routine slides and transcriptomic tumor analysis - Nature - January 9th, 2026 [January 9th, 2026]
- Bidgely Redefines Energy AI in 2025: From Machine Learning to Agentic AI - galvnews.com - January 9th, 2026 [January 9th, 2026]
- Machine Learning in Pharmaceutical Industry Market Size Reach USD 26.2 Billion by 2031 - openPR.com - January 9th, 2026 [January 9th, 2026]
- Noise-resistant Qubit Control With Machine Learning Delivers Over 90% Fidelity - Quantum Zeitgeist - January 9th, 2026 [January 9th, 2026]
- Machine Learning Models Forecast Parshwanath Corporation Limited Uptick - Real-Time Stock Alerts & High Return Trading Ideas -... - January 9th, 2026 [January 9th, 2026]