Machine learning results: pay attention to what you don’t see – STAT
Even as machine learning and artificial intelligence are drawing substantial attention in health care, overzealousness for these technologies has created an environment in which other critical aspects of the research are often overlooked.
Theres no question that the increasing availability of large data sources and off-the-shelf machine learning tools offer tremendous resources to researchers. Yet a lack of understanding about the limitations of both the data and the algorithms can lead to erroneous or unsupported conclusions.
Given that machine learning in the health domain can have a direct impact on peoples lives, broad claims emerging from this kind of research should not be embraced without serious vetting. Whether conducting health care research or reading about it, make sure to consider what you dont see in the data and analyses.
advertisement
One key question to ask is: Whose information is in the data and what do these data reflect?
Common forms of electronic health data, such as billing claims and clinical records, contain information only on individuals who have encounters with the health care system. But many individuals who are sick dont or cant see a doctor or other health care provider and so are invisible in these databases. This may be true for individuals with lower incomes or those who live in rural communities with rising hospital closures. As University of Toronto machine learning professor Marzyeh Ghassemi said earlier this year:
Even among patients who do visit their doctors, health conditions are not consistently recorded. Health data also reflect structural racism, which has devastating consequences.
Data from randomized trials are not immune to these issues. As a ProPublica report demonstrated, black and Native American patients are drastically underrepresented in cancer clinical trials. This is important to underscore given that randomized trials are frequently highlighted as superior in discussions about machine learning work that leverages nonrandomized electronic health data.
In interpreting results from machine learning research, its important to be aware that the patients in a study often do not depict the population we wish to make conclusions about and that the information collected is far from complete.
It has become commonplace to evaluate machine learning algorithms based on overall measures like accuracy or area under the curve. However, one evaluation metric cannot capture the complexity of performance. Be wary of research that claims to be ready for translation into clinical practice but only presents a leader board of tools that are ranked based on a single metric.
As an extreme illustration, an algorithm designed to predict a rare condition found in only 1% of the population can be extremely accurate by labeling all individuals as not having the condition. This tool is 99% accurate, but completely useless. Yet, it may outperform other algorithms if accuracy is considered in isolation.
Whats more, algorithms are frequently not evaluated based on multiple hold-out samples in cross-validation. Using only a single hold-out sample, which is done in many published papers, often leads to higher variance and misleading metric performance.
Beyond examining multiple overall metrics of performance for machine learning, we should also assess how tools perform in subgroups as a step toward avoiding bias and discrimination. For example, artificial intelligence-based facial recognition software performed poorly when analyzing darker-skinned women. Many measures of algorithmic fairness center on performance in subgroups.
Bias in algorithms has largely not been a focus in health care research. That needs to change. A new study found substantial racial bias against black patients in a commercial algorithm used by many hospitals and other health care systems. Other work developed algorithms to improve fairness for subgroups in health care spending formulas.
Subjective decision-making pervades research. Who decides what the research question will be, which methods will be applied to answering it, and how the techniques will be assessed all matter. Diverse teams are needed not just because they yield better results. As Rediet Abebe, a junior fellow of Harvards Society of Fellows, has written, In both private enterprise and the public sector, research must be reflective of the society were serving.
The influx of so-called digital data thats available through search engines and social media may be one resource for understanding the health of individuals who do not have encounters with the health care system. There have, however, been notable failures with these data. But there are also promising advances using online search queries at scale where traditional approaches like conducting surveys would be infeasible.
Increasingly granular data are now becoming available thanks to wearable technologies such as Fitbit trackers and Apple Watches. Researchers are actively developing and applying techniques to summarize the information gleaned from these devices for prevention efforts.
Much of the published clinical machine learning research, however, focuses on predicting outcomes or discovering patterns. Although machine learning for causal questions in health and biomedicine is a rapidly growing area, we dont see a lot of this work yet because it is new. Recent examples of it include the comparative effectiveness of feeding interventions in a pediatric intensive care unit and the effectiveness of different types of drug-eluting coronary artery stents.
Understanding how the data were collected and using appropriate evaluation metrics will also be crucial for studies that incorporate novel data sources and those attempting to establish causality.
In our drive to improve health with (and without) machine learning, we must not forget to look for what is missing: What information do we not have about the underlying health care system? Why might an individual or a code be unobserved? What subgroups have not been prioritized? Who is on the research team?
Giving these questions a place at the table will be the only way to see the whole picture.
Sherri Rose, Ph.D., is associate professor of health care policy at Harvard Medical School and co-author of the first book on machine learning for causal inference, Targeted Learning (Springer, 2011).
See the article here:
Machine learning results: pay attention to what you don't see - STAT
- Prefix-RFT: A Unified Machine Learning Framework to blend Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT) - MarkTechPost - August 24th, 2025 [August 24th, 2025]
- What machine learning models say about Iterum Therapeutics plc - Weekly Risk Report & Fast Exit Strategy with Risk Control - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Putnam Municipal Opportunities Trust recovery - Insider Selling & Weekly Return Optimization Plans - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Viking Therapeutics Inc. recovery - Quarterly Profit Report & Fast Entry and Exit Trade Plans - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Tectonic Financial Inc. recovery - 2025 Historical Comparison & Risk Adjusted Buy and Sell Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Cowen Inc. Preferred Security - 2025 Performance Recap & Reliable Volume Spike Trade Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Milestone Pharmaceuticals Inc. recovery - July 2025 Movers & Breakout Confirmation Trade Signals - Newser - August 24th, 2025 [August 24th, 2025]
- What machine learning models say about FIGS - Weekly Trend Recap & Expert Curated Trade Setup Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Daxor Corporation - July 2025 Sentiment & Fast Exit Strategy with Risk Control - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Willis Towers Watson Public Limited Company - 2025 Macro Impact & Free Safe Capital Growth Stock Tips -... - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Sanmina Corporation - Trade Exit Summary & AI Based Buy and Sell Signals - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Runway Growth Finance Corp. - Quarterly Market Summary & Expert Approved Momentum Ideas - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Maywood Acquisition Corp. Debt Equity Composite Units recovery - Market Growth Summary & Weekly Breakout Watchlists... - August 24th, 2025 [August 24th, 2025]
- The Role of AI and Machine Learning in Personalizing Short Video Content - Vocal - August 22nd, 2025 [August 22nd, 2025]
- Optimization and predictive performance of fly ash-based sustainable concrete using integrated multitask deep learning framework with interpretable... - August 22nd, 2025 [August 22nd, 2025]
- Balancing ethics and statistics: machine learning facilitates highly accurate classification of mice according to their trait anxiety with reduced... - August 22nd, 2025 [August 22nd, 2025]
- Researchers use machine learning to predict dengue fever with 80% accuracy - Northeastern Global News - August 22nd, 2025 [August 22nd, 2025]
- Supervised machine learning algorithms for the classification of obesity levels using anthropometric indices derived from bioelectrical impedance... - August 22nd, 2025 [August 22nd, 2025]
- Machine learning aided optoelectric characterization modelling and prediction of the IV parameters of perovskite solar cells with > 90% accuracy -... - August 22nd, 2025 [August 22nd, 2025]
- Improvement of robot learning with combination of decision making and machine learning for water analysis - EurekAlert! - August 22nd, 2025 [August 22nd, 2025]
- Machine learning and SHAP values explain the association between social determinants of health and post-stroke depression - BMC Public Health - August 22nd, 2025 [August 22nd, 2025]
- Systematic selection of best performing mathematical models for in vitro gas production using machine learning across diverse feeds - Nature - August 22nd, 2025 [August 22nd, 2025]
- YouTubes Using Machine Learning to Improve the Look of Your Shorts Clips - Social Media Today - August 20th, 2025 [August 20th, 2025]
- Machine learning based on pangenome-wide association studies reveals the impact of host source on the zoonotic potential of closely related bacterial... - August 20th, 2025 [August 20th, 2025]
- Machine learning model for early diagnosis of breast cancer based on PiRNA expression with CA153 - Nature - August 20th, 2025 [August 20th, 2025]
- Automatic detection of cognitive events using machine learning and understanding models interpretations of human cognition - Nature - August 20th, 2025 [August 20th, 2025]
- Damon Evolves I/O Platform with Advanced Machine Learning for Adaptive Rider Performance - Motor Sports Newswire - August 20th, 2025 [August 20th, 2025]
- Predictive modeling of asthma drug properties using machine learning and topological indices in a MATLAB based QSPR study - Nature - August 20th, 2025 [August 20th, 2025]
- Saturday Citations: A new category of supernovas; neurons beat machine learning; depression and vitiligo - Phys.org - August 18th, 2025 [August 18th, 2025]
- Agentic AI Is The New Vaporware - Machine Learning Week 2025 - August 18th, 2025 [August 18th, 2025]
- ReactorNet based on machine learning framework to identify control rod position for real time monitoring in PWRs - Nature - August 18th, 2025 [August 18th, 2025]
- Low-cost fabrication and comparative evaluation of machine learning algorithms for flexible PDMS-based hexagonal patch antenna - Nature - August 18th, 2025 [August 18th, 2025]
- Digital biomarkers for interstitial glucose prediction in healthy individuals using wearables and machine learning - Nature - August 18th, 2025 [August 18th, 2025]
- Integrative machine learning models predict prostate cancer diagnosis and biochemical recurrence risk: Advancing precision oncology - Nature - August 18th, 2025 [August 18th, 2025]
- Predicting onset of myopic refractive error in children using machine learning on routine pediatric eye examinations only - Nature - August 18th, 2025 [August 18th, 2025]
- Advanced machine learning framework for thyroid cancer epidemiology in Iran through integration of environmental socioeconomic and health system... - August 18th, 2025 [August 18th, 2025]
- Year-round daily wildfire prediction and key factor analysis using machine learning: a case study of Gangwon State, South Korea - Nature - August 18th, 2025 [August 18th, 2025]
- Comparing the effect of pre-anesthesia clonidine and tranexamic acid on intraoperative bleeding volume in rhinoplasty: a machine learning approach -... - August 18th, 2025 [August 18th, 2025]
- Exploring the role of lipid metabolism related genes and immune microenvironment in periodontitis by integrating machine learning and bioinformatics... - August 18th, 2025 [August 18th, 2025]
- From Data to Delivery: Leveraging AI and Machine Learning in Network Planning - Tech Times - August 18th, 2025 [August 18th, 2025]
- Association between the nutritional inflammation index and mortality among patients with sepsis: insights from traditional methods and machine... - August 18th, 2025 [August 18th, 2025]
- C3 AI Selected for Constellation ShortList for Artificial Intelligence and Machine Learning Best-of-Breed Platforms for Q3 2025 - Yahoo Finance - August 14th, 2025 [August 14th, 2025]
- A physicist tackles machine learning black box - The University of Utah - August 14th, 2025 [August 14th, 2025]
- Morgan State University Collaborates with Amazon-Machine Learning University to Bring AI and Machine Learning Education to the Classroom - Morgan... - August 14th, 2025 [August 14th, 2025]
- BEAST-GB model combines machine learning and behavioral science to predict people's decisions - Tech Xplore - August 14th, 2025 [August 14th, 2025]
- Balancing Regulation and Risk of AI and Machine Learning Software in Medical Devices - Infection Control Today - August 14th, 2025 [August 14th, 2025]
- A deep learning model with machine vision system for recognizing type of the food during the food consumption - Nature - August 14th, 2025 [August 14th, 2025]
- Machine learning reveals the mysteries of amorphous alumina thin films at atomic scale - Phys.org - August 14th, 2025 [August 14th, 2025]
- Correction: Machine learning based prediction of cognitive metrics using major biomarkers in SuperAgers - Nature - August 14th, 2025 [August 14th, 2025]
- Transforming Cancer Biomarker Discovery with Machine Learning - the-scientist.com - August 14th, 2025 [August 14th, 2025]
- AI in Precision Agriculture Market Accelerates Adoption of Predictive Analytics and Machine Learning - openPR.com - August 14th, 2025 [August 14th, 2025]
- Improvements from incorporating machine learning algorithms into near real-time operational post-processing - Nature - August 14th, 2025 [August 14th, 2025]
- Data Quality Tools Market Expected to Surge to USD 8.0 Billion by 2033, Driven by AI and Machine Learning Adoption - Vocal - August 12th, 2025 [August 12th, 2025]
- Predicting female football outcomes by machine learning: behavioural analysis of goals as high stress events - Nature - August 12th, 2025 [August 12th, 2025]
- Harnessing Machine Learning and Weak AI to do Smart Things on the Production Floor - AdvancedManufacturing.org - August 12th, 2025 [August 12th, 2025]
- The Role of AI in Predicting Customer Churn Beyond Traditional Metrics - Machine Learning Week 2025 - August 12th, 2025 [August 12th, 2025]
- Towards better earthquake risk assessment with machine learning and geological survey data - Tech Xplore - August 12th, 2025 [August 12th, 2025]
- AI and Machine Learning - Philadelphia calls for climate resilience partners - Smart Cities World - August 12th, 2025 [August 12th, 2025]
- Exploring the Potential of Machine Learning in Optimizing Respiratory Failure Treatment - AJMC - August 9th, 2025 [August 9th, 2025]
- Decoding macrophage immune responses with gene editing and machine learning - News-Medical - August 9th, 2025 [August 9th, 2025]
- Application of causal forest double machine learning (DML) approach to assess tuberculosis preventive therapys impact on ART adherence - Nature - August 9th, 2025 [August 9th, 2025]
- Serum peptide biomarkers by MALDI-TOF MS coupled with machine learning for diagnosis and classification of hepato-pancreato-biliary cancers - Nature - August 9th, 2025 [August 9th, 2025]
- Machine learning based analysis of leucocyte cell population data by Sysmex XN series hematology analyzer for the diagnosis of bacteremia - Nature - August 9th, 2025 [August 9th, 2025]
- Predicting COVID-19 severity in pediatric patients using machine learning: a comparative analysis of algorithms and ensemble methods - Nature - August 9th, 2025 [August 9th, 2025]
- Impact of massive open online courses in higher education using machine learning and decision based fuzzy frank power aggregation operators models -... - August 9th, 2025 [August 9th, 2025]
- Machine learning improves earthquake risk assessment and foundation planning - Open Access Government - August 9th, 2025 [August 9th, 2025]
- How machine learning can tell who with schizophrenia will respond to treatment. - Psychology Today - August 7th, 2025 [August 7th, 2025]
- City Colleges of Chicago and Amazon-MLU bring enhanced Artificial Intelligence and Machine Learning to the colleges faculty - colleges.ccc.edu - August 7th, 2025 [August 7th, 2025]
- Machine learning derived development and validation of extracellular matrix related signature for predicting prognosis in adolescents and young adults... - August 7th, 2025 [August 7th, 2025]
- Alzheimers disease risk prediction using machine learning for survival analysis with a comorbidity-based approach - Nature - August 7th, 2025 [August 7th, 2025]
- Machine learning models highlight environmental and genetic factors associated with the Arabidopsis circadian clock - Nature - August 7th, 2025 [August 7th, 2025]
- AI-derived CT biomarker score for robust COVID-19 mortality prediction across multiple waves and regions using machine learning - Nature - August 7th, 2025 [August 7th, 2025]
- Alcorn State partners with AWS-Machine Learning University to integrate AI in classrooms - WJTV - August 7th, 2025 [August 7th, 2025]
- Why Machine Learning is the Next Big Thing in Diabetes Care and CGM - AZoRobotics - August 7th, 2025 [August 7th, 2025]
- D-Wave launches open-source quantum AI toolkit to accelerate machine learning innovation - Mugglehead Magazine - August 7th, 2025 [August 7th, 2025]
- Machine learning algorithms to predict the risk of admission to intensive care units in HIV-infected individuals: a single-centre study - Virology... - August 6th, 2025 [August 6th, 2025]
- Novel machine learning algorithm could boost detection of familial hypercholesterolemia - Healio - August 6th, 2025 [August 6th, 2025]
- Introducing the Signal and Image Processing and Machine Learning (SIPML) Certificate - University of Michigan - August 6th, 2025 [August 6th, 2025]
- AI to Predict Suicide: The Case for Interpretable Machine Learning - Think Global Health - August 6th, 2025 [August 6th, 2025]
- Machine learning based optimization of titanium electropolishing using artificial neural networks and Taguchi design in eco-friendly electrolytes -... - August 6th, 2025 [August 6th, 2025]