Machine Learning Clarifies Stress-Based Degradation of Biosimilars – The Center for Biosimilars
Machine learning shows promise as a complementary approach to chromatographic (mixture separation) techniques for assessing biosimilarity and stability, according to a recent study.
Investigators evaluated machine learning vs chromatographic analysis in the study of 3 trastuzumab biosimilars and their reference product (Herceptin) under control and stress conditions. They concluded the machine learning results correlated with the chromatographic data and revealed patterns elucidating the effects of pH and thermal stress conditions.
Trastuzumab, a monoclonal antibody to human epidermal growth factor receptor 2 (HER2), is approved as a treatment for metastatic breast cancer, early breast cancer, and metastatic gastric cancer. The investigators found that the biosimilars showed high similarity under control conditions, but differences in degradation patterns were detected under forced degradation conditions in the study.
First, physicochemical characteristics of the reference product and biosimilar trastuzumab products (approved for use in Egypt and referred to as B1, B2, and B3 in the study) were determined by size exclusion chromatography, cation exchange chromatography, and peptide mapping. The biologics were evaluated under control conditions and under pH and thermal stress. The investigators then used unsupervised machine learning techniques to find patterns in the chromatographic data.
Chromatographic Analysis
The authors said primary structure and size and charge variants are quality attributes expected to affect the quality, safety, and efficacy of biologic drugs including trastuzumab. These attributes were similar in the biosimilars and reference product under control conditions, the authors found.
Thermal and pH stress, the authors noted, are among the most studied stress conditions in forced degradation studies because of their direct effect on the size and charge variant profiles of monoclonal antibodies (mAbs) through deamidation and oxidation. Under thermal and pH stress, the investigators did find differences in the degradation of the different products.
Size variants
Based on size exclusion chromatography, B2 and B3 showed a tendency to form high- and low-molecular weight variants under acidic and basic stress, and B2 showed 83% degradation by the 2-week time point under acidic stress. Under thermal stress, B3 showed the greatest degradation, 39% after 2 weeks.
Charge variants
Under acidic stress, degradation of the main variant at 2 weeks ranged from 19.9% for the reference product to 93% for B2. Under basic stress, all samples showed a comparable increase in abundance of acidic variants. Under thermal stress, the charge variant distributions of B2 and B3 were similar to that of the reference product, while B1 showed a greater abundance of acidic variants.
Principal Component Analysis
The investigators used unsupervised machine learning techniques, which find patterns in data with no prior training or predefined subcategories. Principal component analysis (PCA) is a method for reducing complexity in high-dimensional data to a small number of components that explain the greatest percentage of the variance in the data set.
The authors plotted size exclusion chromatography and cation exchange chromatography data on 2-dimensional coordinates representing the 2 components (PC1 and PC2) that explained the most variance to identify patterns in the data. Principal component analysis of chromatographic and peptide mapping data of the control samples showed no outliers, which the authors said supports biosimilarity of the products.
The plot of control and acidic-stressed samples showed that the control samples were separated along the principal component 1 (PC1) axis, while the stressed samples were distributed along the PC2 axis. Samples of the same product clustered relatively close to each other, the authors said, and their PCA results on control and acidic-stressed samples suggested 41% of the variance in the data was due to the applied stress, and 25% was due to inherent differences in the chromatographic profiles of the products.
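As an illustration of the general technique (not the authors' actual workflow), the following minimal sketch projects a small table of samples onto its first 2 principal components and reports the share of variance each one explains. The feature columns and every numeric value are hypothetical, and the study does not say whether features were scaled before PCA, so this sketch only centers them.

```python
import numpy as np

def pca_scores(X, n_components=2):
    """Project the rows of X onto the top principal components.

    Returns (scores, explained_variance_ratio) for the first n_components.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each feature (no scaling: an assumption)
    # SVD of the centered matrix; the rows of Vt are the principal axes.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = S**2 / np.sum(S**2)    # fraction of total variance per component
    scores = Xc @ Vt[:n_components].T  # sample coordinates on PC1, PC2, ...
    return scores, explained[:n_components]

# Hypothetical data, invented for illustration only.
# Columns: % main peak, % acidic variants, % basic variants, % high-MW variants.
X = np.array([
    [70.0, 20.0, 10.0, 1.0],  # product A, control
    [69.5, 20.5, 10.0, 1.1],  # product A, control
    [66.0, 23.0, 11.0, 1.3],  # product B, control
    [65.5, 23.5, 11.0, 1.2],  # product B, control
    [50.0, 40.0, 10.0, 4.0],  # product A, stressed
    [48.0, 41.0, 11.0, 4.5],  # product A, stressed
    [30.0, 58.0, 12.0, 9.0],  # product B, stressed
    [28.0, 60.0, 12.0, 9.5],  # product B, stressed
])

scores, explained = pca_scores(X)
print("Explained variance ratio (PC1, PC2):", np.round(explained, 3))
print("PC scores per sample:")
print(np.round(scores, 2))
```

Plotting the 2 columns of `scores` against each other gives the kind of 2-dimensional map described above, and the explained variance ratios correspond to percentages such as the 41% and 25% the authors reported.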
Clustering Analysis
The investigators also used 2 clustering techniques, k-means and density-based spatial clustering of applications with noise (DBSCAN), on the data from the top 2 PCs from their principal component analysis. According to the authors, cluster analysis is an unsupervised exploratory technique aiming to find natural grouping in data so that items in the same cluster are more similar to each other than to those from different clusters.
Due to the inherent variability and large number of possible structural variants of monoclonal antibodies, the authors said, machine learning-aided approaches have great value for assessing their critical quality attributes. They cited previous research using PCA to reveal patterns in the data on biosimilarity and stability of other biologics, recombinant human growth hormone and infliximab.
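To make the idea concrete, here is a minimal k-means sketch on 2-dimensional points standing in for PC1 and PC2 scores. The points are hypothetical, and the deterministic farthest-first initialization is an assumption made so the example is reproducible; library implementations typically use randomized seeding, and the article does not describe the authors' settings.

```python
import numpy as np

def farthest_first_init(points, k):
    """Pick k starting centers: the first point, then repeatedly the point
    farthest from all centers chosen so far (deterministic, for illustration)."""
    centers = [points[0]]
    for _ in range(k - 1):
        d = np.linalg.norm(points[:, None, :] - np.array(centers)[None, :, :], axis=2)
        centers.append(points[np.argmax(d.min(axis=1))])
    return np.array(centers)

def assign(points, centers):
    """Return the index of the nearest center for every point."""
    d = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
    return d.argmin(axis=1)

def kmeans(points, k, n_iter=100):
    """Lloyd's algorithm: alternate nearest-center assignment and mean update."""
    points = np.asarray(points, dtype=float)
    centers = farthest_first_init(points, k)
    for _ in range(n_iter):
        labels = assign(points, centers)
        # Move each center to the mean of its members (keep it if it has none).
        new_centers = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(points, centers), centers

# Hypothetical PC1/PC2 scores for 3 well-separated groups of samples.
pc_scores = np.array([
    [0.0, 0.0], [0.1, 0.2], [-0.1, 0.1],
    [5.0, 5.0], [5.2, 4.9], [4.9, 5.1],
    [10.0, 0.0], [10.1, 0.2], [9.9, -0.1],
])

labels, centers = kmeans(pc_scores, k=3)
print("Cluster label per sample:", labels)
```

Note that k-means requires the number of clusters to be chosen in advance, which is one reason it can group products differently from a density-based method.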
K-means clustering of the unstressed samples segregated the products into 3 clusters, with the reference product and B2 each forming their own cluster, and B1 and B3 allocated to the same cluster. DBSCAN segregated each product to its own cluster.
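Unlike k-means, DBSCAN does not take the number of clusters as an input; it grows clusters from densely packed points and marks isolated points as noise. The sketch below is a compact, simplified implementation run on hypothetical 2-dimensional points; the `eps` and `min_samples` values are illustrative assumptions, not the settings used in the study.

```python
import numpy as np

NOISE = -1

def dbscan(points, eps, min_samples):
    """Minimal DBSCAN: returns one label per point, with -1 meaning noise.

    A point is a core point if at least min_samples points (itself included)
    lie within distance eps of it.
    """
    points = np.asarray(points, dtype=float)
    n = len(points)
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    neighbors = [np.flatnonzero(dists[i] <= eps) for i in range(n)]
    is_core = np.array([len(nb) >= min_samples for nb in neighbors])

    labels = np.full(n, NOISE)
    cluster = 0
    for i in range(n):
        if labels[i] != NOISE or not is_core[i]:
            continue
        # Start a new cluster at core point i and expand it through core points.
        labels[i] = cluster
        frontier = [i]
        while frontier:
            p = frontier.pop()
            for q in neighbors[p]:
                if labels[q] == NOISE:
                    labels[q] = cluster
                    if is_core[q]:  # border points join but do not expand
                        frontier.append(q)
        cluster += 1
    return labels

# Hypothetical points: 4 tight groups plus 1 isolated sample.
pts = np.array([
    [0.0, 0.0], [0.1, 0.2], [-0.1, 0.1],
    [5.0, 5.0], [5.2, 4.9], [4.9, 5.1],
    [10.0, 0.0], [10.1, 0.2], [9.9, -0.1],
    [5.0, -5.0], [5.1, -4.8], [4.8, -5.1],
    [20.0, 20.0],
])

labels = dbscan(pts, eps=0.5, min_samples=2)
print("Cluster label per sample (-1 = noise):", labels)
```

Because the number of clusters emerges from the density of the data, DBSCAN can separate groups that k-means merges when asked for fewer clusters.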
K-means clustering was able to separate control and pH-stressed samples into different clusters, although B2 control samples were clustered with the stressed reference product and B3 samples. Cluster analysis suggested B3 was most similar to the reference product under acidic stress, while B2 was most similar under thermal stress, and all products had a similar response to basic pH stress. The greatest variability between control samples was between the reference product and B2.
Finally, application of principal component and clustering analyses to the collective data set from all the applied chromatographic techniques supported biosimilarity of the products, the authors said. This principal component analysis identified no samples that were significantly different from the others; k-means identified 3 clusters (reference product, B1 + B3, and B2), and DBSCAN identified 4 clusters, one containing each product.
The authors concluded their results supported the biosimilarity of the products analyzed and highlighted that, in the charge and size profiles of the studied products, B2 showed higher variability than B1 and B3 relative to the reference product under both control and stress conditions. They said that the chromatographic fingerprints and machine learning results were correlated and were able to reveal patterns related to the effect of different stress conditions on the different investigated products. They recommended future studies explore other machine learning tools to interpret physicochemical data on biologic products.
For Further Reading
The European Medicines Agency reports on a pilot experiment in tailoring development of biosimilars, or eliminating unnecessary testing, and the World Health Organization develops guidelines to support the tailoring concept.
Reference
Shatat SM, Al-Ghobashy MA, Fathalla FA, Abbas SS, Eltanany BM. Coupling of trastuzumab chromatographic profiling with machine learning tools: a complementary approach for biosimilarity and stability assessment. J Chromatogr B Analyt Technol Biomed Life Sci. 2021;1184:122976. doi:10.1016/j.jchromb.2021.122976