Overview of causal inference in machine learning – Ericsson
In a major operators network control center complaints are flooding in. The network is down across a large US city; calls are getting dropped and critical infrastructure is slow to respond. Pulling up the systems event history, the manager sees that new 5G towers were installed in the affected area today.
Did installing those towers cause the outage, or was it merely a coincidence? In circumstances such as these, being able to answer this question accurately is crucial for Ericsson.
Most machine learning-based data science focuses on predicting outcomes, not understanding causality. However, some of the biggest names in the field agree its important to start incorporating causality into our AI and machine learning systems.
Yoshua Bengio, one of the worlds most highly recognized AI experts, explained in a recent Wired interview: Its a big thing to integrate [causality] into AI. Current approaches to machine learning assume that the trained AI system will be applied on the same kind of data as the training data. In real life it is often not the case.
Yann LeCun, a recent Turing Award winner, shares the same view, tweeting: Lots of people in ML/DL [deep learning] know that causal inference is an important way to improve generalization.
Causal inference and machine learning can address one of the biggest problems facing machine learning today that a lot of real-world data is not generated in the same way as the data that we use to train AI models. This means that machine learning models often arent robust enough to handle changes in the input data type, and cant always generalize well. By contrast, causal inference explicitly overcomes this problem by considering what might have happened when faced with a lack of information. Ultimately, this means we can utilize causal inference to make our ML models more robust and generalizable.
When humans rationalize the world, we often think in terms of cause and effect if we understand why something happened, we can change our behavior to improve future outcomes. Causal inference is a statistical tool that enables our AI and machine learning algorithms to reason in similar ways.
Lets say were looking at data from a network of servers. Were interested in understanding how changes in our network settings affect latency, so we use causal inference to proactively choose our settings based on this knowledge.
The gold standard for inferring causal effects is randomized controlled trials (RCTs) or A/B tests. In RCTs, we can split a population of individuals into two groups: treatment and control, administering treatment to one group and nothing (or a placebo) to the other and measuring the outcome of both groups. Assuming that the treatment and control groups arent too dissimilar, we can infer whether the treatment was effective based on the difference in outcome between the two groups.
However, we can't always run such experiments. Flooding half of our servers with lots of requests might be a great way to find out how response time is affected, but if theyre mission-critical servers, we cant go around performing DDOS attacks on them. Instead, we rely on observational datastudying the differences between servers that naturally get a lot of requests and those with very few requests.
There are many ways of answering this question. One of the most popular approaches is Judea Pearl's technique for using to statistics to make causal inferences. In this approach, wed take a model or graph that includes measurable variables that can affect one another, as shown below.
To use this graph, we must assume the Causal Markov Condition. Formally, it says that subject to the set of all its direct causes, a node is independent of all the variables which are not direct causes or direct effects of that node. Simply put, it is the assumption that this graph captures all the real relationships between the variables.
Another popular method for inferring causes from observational data is Donald Rubin's potential outcomes framework. This method does not explicitly rely on a causal graph, but still assumes a lot about the data, for example, that there are no additional causes besides the ones we are considering.
For simplicity, our data contains three variables: a treatment , an outcome , and a covariate . We want to know if having a high number of server requests affects the response time of a server.
In our example, the number of server requests is determined by the memory value: a higher memory usage means the server is less likely to get fed requests. More precisely, the probability of having a high number of requests is equal to 1 minus the memory value (i.e. P(x=1)=1-z , where P(x=1) is the probability that x is equal to 1). The response time of our system is determined by the equation (or hypothetical model):
y=1x+5z+
Where is the error, that is, the deviation from the expected value of given values of and depends on other factors not included in the model. Our goal is to understand the effect of on via observations of the memory value, number of requests, and response times of a number of servers with no access to this equation.
There are two possible assignments (treatment and control) and an outcome. Given a random group of subjects and a treatment, each subject has a pair of potential outcomes: and , the outcomes Y_i (0) and Y_i (1) under control and treatment respectively. However, only one outcome is observed for each subject, the outcome under the actual treatment received: Y_i=xY_i (1)+(1-x)Y_i (0). The opposite potential outcome is unobserved for each subject and is therefore referred to as a counterfactual.
For each subject, the effect of treatment is defined to be Y_i (1)-Y_i (0) . The average treatment effect (ATE) is defined as the average difference in outcomes between the treatment and control groups:
E[Y_i (1)-Y_i (0)]
Here, denotes an expectation over values of Y_i (1)-Y_i (0)for each subject , which is the average value across all subjects. In our network example, a correct estimate of the average treatment effect would lead us to the coefficient in front of x in equation (1) .
If we try to estimate this by directly subtracting the average response time of servers with x=0 from the average response time of our hypothetical servers with x=1, we get an estimate of the ATE as 0.177 . This happens because our treatment and control groups are not inherently directly comparable. In an RTC, we know that the two groups are similar because we chose them ourselves. When we have only observational data, the other variables (such as the memory value in our case) may affect whether or not one unit is placed in the treatment or control group. We need to account for this difference in the memory value between the treatment and control groups before estimating the ATE.
One way to correct this bias is to compare individual units in the treatment and control groups with similar covariates. In other words, we want to match subjects that are equally likely to receive treatment.
The propensity score ei for subject is defined as:
e_i=P(x=1z=z_i ),z_i[0,1]
or the probability that x is equal to 1the unit receives treatmentgiven that we know its covariate is equal to the value z_i. Creating matches based on the probability that a subject will receive treatment is called propensity score matching. To find the propensity score of a subject, we need to predict how likely the subject is to receive treatment based on their covariates.
The most common way to calculate propensity scores is through logistic regression:
Now that we have calculated propensity scores for each subject, we can do basic matching on the propensity score and calculate the ATE exactly as before. Running propensity score matching on the example network data gets us an estimate of 1.008 !
We were interested in understanding the causal effect of binary treatment x variable on outcome y . If we find that the ATE is positive, this means an increase in x results in an increase in y. Similarly, a negative ATE says that an increase in x will result in a decrease in y .
This could help us understand the root cause of an issue or build more robust machine learning models. Causal inference gives us tools to understand what it means for some variables to affect others. In the future, we could use causal inference models to address a wider scope of problems both in and out of telecommunications so that our models of the world become more intelligent.
Special thanks to the other team members of GAIA working on causality analysis: Wenting Sun, Nikita Butakov, Paul Mclachlan, Fuyu Zou, Chenhua Shi, Lule Yu and Sheyda Kiani Mehr.
If youre interested in advancing this field with us, join our worldwide team of data scientists and AI specialists at GAIA.
In this Wired article, Turing Award winner Yoshua Bengio shares why deep learning must begin to understand the why before it can replicate true human intelligence.
In this technical overview of causal inference in statistics, find out whats needed to evolve AI from traditional statistical analysis to causal analysis of multivariate data.
This journal essay from 1999 offers an introduction to the Causal Markov Condition.
Go here to read the rest:
Overview of causal inference in machine learning - Ericsson
- Prefix-RFT: A Unified Machine Learning Framework to blend Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT) - MarkTechPost - August 24th, 2025 [August 24th, 2025]
- What machine learning models say about Iterum Therapeutics plc - Weekly Risk Report & Fast Exit Strategy with Risk Control - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Putnam Municipal Opportunities Trust recovery - Insider Selling & Weekly Return Optimization Plans - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Viking Therapeutics Inc. recovery - Quarterly Profit Report & Fast Entry and Exit Trade Plans - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Tectonic Financial Inc. recovery - 2025 Historical Comparison & Risk Adjusted Buy and Sell Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Cowen Inc. Preferred Security - 2025 Performance Recap & Reliable Volume Spike Trade Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Milestone Pharmaceuticals Inc. recovery - July 2025 Movers & Breakout Confirmation Trade Signals - Newser - August 24th, 2025 [August 24th, 2025]
- What machine learning models say about FIGS - Weekly Trend Recap & Expert Curated Trade Setup Alerts - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Daxor Corporation - July 2025 Sentiment & Fast Exit Strategy with Risk Control - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Willis Towers Watson Public Limited Company - 2025 Macro Impact & Free Safe Capital Growth Stock Tips -... - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Sanmina Corporation - Trade Exit Summary & AI Based Buy and Sell Signals - Newser - August 24th, 2025 [August 24th, 2025]
- Combining machine learning predictions for Runway Growth Finance Corp. - Quarterly Market Summary & Expert Approved Momentum Ideas - Newser - August 24th, 2025 [August 24th, 2025]
- Can machine learning forecast Maywood Acquisition Corp. Debt Equity Composite Units recovery - Market Growth Summary & Weekly Breakout Watchlists... - August 24th, 2025 [August 24th, 2025]
- The Role of AI and Machine Learning in Personalizing Short Video Content - Vocal - August 22nd, 2025 [August 22nd, 2025]
- Optimization and predictive performance of fly ash-based sustainable concrete using integrated multitask deep learning framework with interpretable... - August 22nd, 2025 [August 22nd, 2025]
- Balancing ethics and statistics: machine learning facilitates highly accurate classification of mice according to their trait anxiety with reduced... - August 22nd, 2025 [August 22nd, 2025]
- Researchers use machine learning to predict dengue fever with 80% accuracy - Northeastern Global News - August 22nd, 2025 [August 22nd, 2025]
- Supervised machine learning algorithms for the classification of obesity levels using anthropometric indices derived from bioelectrical impedance... - August 22nd, 2025 [August 22nd, 2025]
- Machine learning aided optoelectric characterization modelling and prediction of the IV parameters of perovskite solar cells with > 90% accuracy -... - August 22nd, 2025 [August 22nd, 2025]
- Improvement of robot learning with combination of decision making and machine learning for water analysis - EurekAlert! - August 22nd, 2025 [August 22nd, 2025]
- Machine learning and SHAP values explain the association between social determinants of health and post-stroke depression - BMC Public Health - August 22nd, 2025 [August 22nd, 2025]
- Systematic selection of best performing mathematical models for in vitro gas production using machine learning across diverse feeds - Nature - August 22nd, 2025 [August 22nd, 2025]
- YouTubes Using Machine Learning to Improve the Look of Your Shorts Clips - Social Media Today - August 20th, 2025 [August 20th, 2025]
- Machine learning based on pangenome-wide association studies reveals the impact of host source on the zoonotic potential of closely related bacterial... - August 20th, 2025 [August 20th, 2025]
- Machine learning model for early diagnosis of breast cancer based on PiRNA expression with CA153 - Nature - August 20th, 2025 [August 20th, 2025]
- Automatic detection of cognitive events using machine learning and understanding models interpretations of human cognition - Nature - August 20th, 2025 [August 20th, 2025]
- Damon Evolves I/O Platform with Advanced Machine Learning for Adaptive Rider Performance - Motor Sports Newswire - August 20th, 2025 [August 20th, 2025]
- Predictive modeling of asthma drug properties using machine learning and topological indices in a MATLAB based QSPR study - Nature - August 20th, 2025 [August 20th, 2025]
- Saturday Citations: A new category of supernovas; neurons beat machine learning; depression and vitiligo - Phys.org - August 18th, 2025 [August 18th, 2025]
- Agentic AI Is The New Vaporware - Machine Learning Week 2025 - August 18th, 2025 [August 18th, 2025]
- ReactorNet based on machine learning framework to identify control rod position for real time monitoring in PWRs - Nature - August 18th, 2025 [August 18th, 2025]
- Low-cost fabrication and comparative evaluation of machine learning algorithms for flexible PDMS-based hexagonal patch antenna - Nature - August 18th, 2025 [August 18th, 2025]
- Digital biomarkers for interstitial glucose prediction in healthy individuals using wearables and machine learning - Nature - August 18th, 2025 [August 18th, 2025]
- Integrative machine learning models predict prostate cancer diagnosis and biochemical recurrence risk: Advancing precision oncology - Nature - August 18th, 2025 [August 18th, 2025]
- Predicting onset of myopic refractive error in children using machine learning on routine pediatric eye examinations only - Nature - August 18th, 2025 [August 18th, 2025]
- Advanced machine learning framework for thyroid cancer epidemiology in Iran through integration of environmental socioeconomic and health system... - August 18th, 2025 [August 18th, 2025]
- Year-round daily wildfire prediction and key factor analysis using machine learning: a case study of Gangwon State, South Korea - Nature - August 18th, 2025 [August 18th, 2025]
- Comparing the effect of pre-anesthesia clonidine and tranexamic acid on intraoperative bleeding volume in rhinoplasty: a machine learning approach -... - August 18th, 2025 [August 18th, 2025]
- Exploring the role of lipid metabolism related genes and immune microenvironment in periodontitis by integrating machine learning and bioinformatics... - August 18th, 2025 [August 18th, 2025]
- From Data to Delivery: Leveraging AI and Machine Learning in Network Planning - Tech Times - August 18th, 2025 [August 18th, 2025]
- Association between the nutritional inflammation index and mortality among patients with sepsis: insights from traditional methods and machine... - August 18th, 2025 [August 18th, 2025]
- C3 AI Selected for Constellation ShortList for Artificial Intelligence and Machine Learning Best-of-Breed Platforms for Q3 2025 - Yahoo Finance - August 14th, 2025 [August 14th, 2025]
- A physicist tackles machine learning black box - The University of Utah - August 14th, 2025 [August 14th, 2025]
- Morgan State University Collaborates with Amazon-Machine Learning University to Bring AI and Machine Learning Education to the Classroom - Morgan... - August 14th, 2025 [August 14th, 2025]
- BEAST-GB model combines machine learning and behavioral science to predict people's decisions - Tech Xplore - August 14th, 2025 [August 14th, 2025]
- Balancing Regulation and Risk of AI and Machine Learning Software in Medical Devices - Infection Control Today - August 14th, 2025 [August 14th, 2025]
- A deep learning model with machine vision system for recognizing type of the food during the food consumption - Nature - August 14th, 2025 [August 14th, 2025]
- Machine learning reveals the mysteries of amorphous alumina thin films at atomic scale - Phys.org - August 14th, 2025 [August 14th, 2025]
- Correction: Machine learning based prediction of cognitive metrics using major biomarkers in SuperAgers - Nature - August 14th, 2025 [August 14th, 2025]
- Transforming Cancer Biomarker Discovery with Machine Learning - the-scientist.com - August 14th, 2025 [August 14th, 2025]
- AI in Precision Agriculture Market Accelerates Adoption of Predictive Analytics and Machine Learning - openPR.com - August 14th, 2025 [August 14th, 2025]
- Improvements from incorporating machine learning algorithms into near real-time operational post-processing - Nature - August 14th, 2025 [August 14th, 2025]
- Data Quality Tools Market Expected to Surge to USD 8.0 Billion by 2033, Driven by AI and Machine Learning Adoption - Vocal - August 12th, 2025 [August 12th, 2025]
- Predicting female football outcomes by machine learning: behavioural analysis of goals as high stress events - Nature - August 12th, 2025 [August 12th, 2025]
- Harnessing Machine Learning and Weak AI to do Smart Things on the Production Floor - AdvancedManufacturing.org - August 12th, 2025 [August 12th, 2025]
- The Role of AI in Predicting Customer Churn Beyond Traditional Metrics - Machine Learning Week 2025 - August 12th, 2025 [August 12th, 2025]
- Towards better earthquake risk assessment with machine learning and geological survey data - Tech Xplore - August 12th, 2025 [August 12th, 2025]
- AI and Machine Learning - Philadelphia calls for climate resilience partners - Smart Cities World - August 12th, 2025 [August 12th, 2025]
- Exploring the Potential of Machine Learning in Optimizing Respiratory Failure Treatment - AJMC - August 9th, 2025 [August 9th, 2025]
- Decoding macrophage immune responses with gene editing and machine learning - News-Medical - August 9th, 2025 [August 9th, 2025]
- Application of causal forest double machine learning (DML) approach to assess tuberculosis preventive therapys impact on ART adherence - Nature - August 9th, 2025 [August 9th, 2025]
- Serum peptide biomarkers by MALDI-TOF MS coupled with machine learning for diagnosis and classification of hepato-pancreato-biliary cancers - Nature - August 9th, 2025 [August 9th, 2025]
- Machine learning based analysis of leucocyte cell population data by Sysmex XN series hematology analyzer for the diagnosis of bacteremia - Nature - August 9th, 2025 [August 9th, 2025]
- Predicting COVID-19 severity in pediatric patients using machine learning: a comparative analysis of algorithms and ensemble methods - Nature - August 9th, 2025 [August 9th, 2025]
- Impact of massive open online courses in higher education using machine learning and decision based fuzzy frank power aggregation operators models -... - August 9th, 2025 [August 9th, 2025]
- Machine learning improves earthquake risk assessment and foundation planning - Open Access Government - August 9th, 2025 [August 9th, 2025]
- How machine learning can tell who with schizophrenia will respond to treatment. - Psychology Today - August 7th, 2025 [August 7th, 2025]
- City Colleges of Chicago and Amazon-MLU bring enhanced Artificial Intelligence and Machine Learning to the colleges faculty - colleges.ccc.edu - August 7th, 2025 [August 7th, 2025]
- Machine learning derived development and validation of extracellular matrix related signature for predicting prognosis in adolescents and young adults... - August 7th, 2025 [August 7th, 2025]
- Alzheimers disease risk prediction using machine learning for survival analysis with a comorbidity-based approach - Nature - August 7th, 2025 [August 7th, 2025]
- Machine learning models highlight environmental and genetic factors associated with the Arabidopsis circadian clock - Nature - August 7th, 2025 [August 7th, 2025]
- AI-derived CT biomarker score for robust COVID-19 mortality prediction across multiple waves and regions using machine learning - Nature - August 7th, 2025 [August 7th, 2025]
- Alcorn State partners with AWS-Machine Learning University to integrate AI in classrooms - WJTV - August 7th, 2025 [August 7th, 2025]
- Why Machine Learning is the Next Big Thing in Diabetes Care and CGM - AZoRobotics - August 7th, 2025 [August 7th, 2025]
- D-Wave launches open-source quantum AI toolkit to accelerate machine learning innovation - Mugglehead Magazine - August 7th, 2025 [August 7th, 2025]
- Machine learning algorithms to predict the risk of admission to intensive care units in HIV-infected individuals: a single-centre study - Virology... - August 6th, 2025 [August 6th, 2025]
- Novel machine learning algorithm could boost detection of familial hypercholesterolemia - Healio - August 6th, 2025 [August 6th, 2025]
- Introducing the Signal and Image Processing and Machine Learning (SIPML) Certificate - University of Michigan - August 6th, 2025 [August 6th, 2025]
- AI to Predict Suicide: The Case for Interpretable Machine Learning - Think Global Health - August 6th, 2025 [August 6th, 2025]
- Machine learning based optimization of titanium electropolishing using artificial neural networks and Taguchi design in eco-friendly electrolytes -... - August 6th, 2025 [August 6th, 2025]