Machine learningbased observation-constrained projections reveal elevated global socioeconomic risks from wildfire – Nature.com
Applying traditional EC for global fire carbon emissions
The recently developed emergent constraint (EC) approach has demonstrated robust capability in reducing the uncertainty in characterizing or projecting Earth system variables simulated by a multimodel ensemble25,26. The basic concept of EC is that, despite the distinct model structures and parameters, there exists various across-model relationships (emergent constraints) between pairs of quantities when we analyze outputs from multiple models27. Therefore, the EC concept is especially useful to derive the relationship between a variable that is difficult or impossible to measure (e.g., future wildfires) and a second, measurable variable (e.g., historical wildfires), across multiple ESMs. We start with global total values and find significant linear relationship between historical and future global total fire carbon emission across 38 ensemble members of 13 ESMs (Supplementary Fig.2a). Because we are particularly interested in the spatial distribution of future wildfires, which are critical for quantifying future socioeconomic risks from wildfires, we further apply the EC concept to every grid cell of the globe, using either a single constraint variable (historical fire carbon emissions) or multiple constraint variables (the atmospheric and terrestrial variables in Supplementary Table2), with the latter being shown in Supplementary Fig.2b. We find insignificant linear relationships between these historical fire-relevant variables and future wildfires in the historically fire-prone regions across the analyzed 38 members of 13 ESMs. The failure of the traditional EC concept in constraining fire carbon emissions at local scales could be attributed to the highly nonlinear interactions between fire and its cross-section drivers, which is likely inadequately captured by the linear relationship under the EC assumption. Therefore, we further develop an MLT-based constraint to deal with the complex response of wildfires to environmental and socioeconomic drivers.
MLT provide powerful tools for capturing the nonlinear and interactive roles among regulators of an Earth system feature, thereby facilitating effective, multivariate constraint on wildfire activity, which represents an integrated function of climate, terrestrial ecosystem, and socioeconomic conditions. MLT have been widely applied for identifying empirical regulators32 and building prediction systems for global and regional fire activity35. To constrain the projected fire carbon emissions simulated by 13 ESMs using observational data, the current study establishes an MLT-based emergent relationship between the future fire carbon emissions and historical fire carbon emissions, climate, terrestrial ecosystem, and socioeconomic drivers.
Here, we use MLT to examine the empirical relationships between historical, observed influencing factors of wildfires and future fire carbon emissions from ESMs and then feed observational data into the trained machine learning models (Supplementary Fig.3). To train the MLT to use historical states for the prediction of future fire carbon emission, the historical and future simulations from the SSP (Shared Socioeconomic Pathway) 5-8536, a high-emission scenario, are analyzed for the currently available 13 ESMs in CMIP6 (Supplementary Table1). A subset of these ESMs (i.e., nine ESMs that provide simulation in a lower-emission scenario, SSP2-45) is also analyzed to examine the dependence of fire regimes on socioeconomic pathway. The training is conducted using the spatial sample of decadal mean predictors and target variable, both individually from each ESM and from their aggregation, with the later referred to as multimodel mean and subsequently analyzed for projecting fire carbon emission and its socioeconomic risks. Corresponding to the spatial resolution of the observational products of fire carbon emission, all model outputs are bilinearly interpolated to a 0.250.25 grid, resulting in a spatial sample of 11,325 points per model for the training. To perform the observational constraint, the historical observed predictors are then fed into the trained machine learning models. The historical predictors are listed in Supplementary Table2 with their observational data sources, temporal coverages, and spatial resolutions. For the atmospheric and terrestrial variables, the annual mean value and climatology in each of 12 calendar months are included as predictors. This training and observational constraining is performed for target decades (20112020, 20212030, 20912100), and the historical period is always 20012010. Future changes in fire carbon emission are quantified and expressed as the relative trend (% decade1) (i.e., the ratio between the absolute trend and the mean value during the 2010s), for both the default and observation-constrained ensembles.
The current spatial sample training approach establishes a history-future relationship for each pixel using the entire global sample. To minimize local prediction errors for a certain pixel, MLT search all pixels, regardless of their geographical location, to optimize the prediction model of future fires at the target pixel. In this way, a physically robust history-future relationship is established based on the global sample of locations, whereas influences of localized features, such as socioeconomic development, on wildfire trends are naturally damped in our approach (Supplementary Figs.10 and 11). The reliability of MLT is degraded when the actual observational data space is insufficiently covered by the training (historical CMIP6 simulation) data space, namely the extrapolation uncertainty. Here, we further evaluate the data space of both observation and historical simulation of the climate and fire variables (Supplementary Fig.14), and we find all these assessed variables are largely overlapped, indicating minimal extrapolation error involved in the current MLT application.
To minimize the projection uncertainty associated with the selected machine learning algorithms, this study examines three MLTrandom forest (rf), support vector machine with Radial Basis Function Kernel (svmRadialCost), and gradient boosting machine (gbm). These three algorithms differ substantially in their function. The average among these algorithms is thus believed to better capture the complex interrelation between the historical predictors and future fire carbon emissions than any single algorithm. The MLT analysis is performed using the caret, dplyr, randomForest, kernlab, and gbm packages in the R statistical software. The prediction model is fitted for each MLT using the training data set that targets each future decade, with parameters optimized for the minimum RMSE via 10-fold cross-validationin other words, using a randomly chosen nine-tenth of the entire spatial sample (n=10,193) for model fitting and the remaining one-tenth of the entire spatial sample (n=1,132) for validation, and repeating the process 10 times. For svmRadialCost, the optimal pair of cost parameter (C) and kernel parameter sigma (sigma) is searched from 30 (tuneLength=30) C candidates and their individually associated optimal sigma. For gbm, we set the complexity of trees (interaction.depth) to 3, and learning rate (shrinkage) to 0.2, and let the train function search for the optimal number of trees from 10 to 200 with an increment of 5 (10, 15, 20, , 200). For rf, the number of variables available for splitting at each tree node (mtry) is allowed to search between 5 and 50 with an increment of 1 (5, 6, 7, , 50); the number of trees is determined by the algorithm provided by randomForest package and the train function by the caret package. The cross-validation R2s exceed 0.8 (n=1,132) for all optimized MLT and all future periods. The currently examined ESMs, MLT, and hundreds of observational data set combinations constitute a multimodel, multidata set ensemble of projected fire carbon emissions for the twenty-first century. This multimodel, multidata set ensemble allows natural quantification of uncertainty in the future projection derived from observational sources and MLT, compared with a previous single-MLT, single-observation approach67.
This MLT-based observational constraining approach is validated for a historical period using the emergent relation between the fire-climate-ecosystem-socioeconomics during 19972006 and fire carbon emission during 20072016. The spatial correlation and RMSE with the observed decadal mean fire carbon emission (n=11,325) is evaluated and compared for the constrained and unconstrained ensemble, reported in the main text (Figs.1 and 2). The RMSE and R2 produced by the traditional EC approach that constrains fire carbon emissions during 20072016 with fire carbon emissions during 19972006 are reported along with the MLT-based observational constraint in Fig.1e, f. The MLT-based observational constraining approach is also applied to six ESMs that report burned area fraction, and validation is also conducted and reported in Supplementary Fig.6.
Because the MLT are trained using the global spatial sample, we expect the performance of MLT to be sensitive to the spatial resolution of the training data set. This assumption is tested by varying the interpolation grids (1, 2.5, 5, and 10 latitude by longitude) of the ESMs and fitting MLT using this specific-resolution training data for the validation period (Supplementary Fig.7). Observational data sets at 0.25 resolution are subsequently fed into the fitted MLT models, regardless of the input model data resolution. This sensitive test sheds light on the importance of spatial resolution to our observational constraining and thereby implies potential accuracy improvement of our MLT-based observation constraint with the development of higher-resolution ESMs.
Here, we define the socioeconomic exposure to wildfires as a product of decadal mean fire carbon emission and number of people, amount of GDP, and agricultural area exposed to the burning in each grid cell, following previous definition for extreme heat68. These exposure metrics measure the amount of population, GDP, and agricultural area affected by wildfires, whose severity is represented by the amount of fire carbon emission. The projected population at 1/81/8 resolution under SSP5-85 is obtained from the National Center for Atmospheric Researchs Integrated Assessment Modeling Group and the City University of New York Institute for Demographic Research69. The projected GDP at 1km resolution under SSP5 is disaggregated from national GDP projections using nighttime light and population70. The agricultural area projection at 0.050.05 resolution under SSP5-85 is obtained from the Global Change Analysis Model and a geospatial downscaling model (Demeter)71. All the projected socioeconomic variables are resampled to 0.250.25 resolution before the calculation of exposure to fire carbon emission fraction. Future changes in socioeconomic exposure to wildfires are quantified as the relative trend (% decade1) (i.e., the ratio between the absolute trend and the mean value during the 2010s) for the default and observation-constrained ensembles. These relative changes provide direct implications on what the future would be like compared with the current state, regardless of the potential biases simulated by the default ESMs.
The mechanisms underlying the projected evolution in fire carbon emissions are explored in two tasks, addressing the importance of drivers in the historical and dynamical perspectives. The first task assesses the relative contribution of each environmental and socioeconomic drivers historical distribution to the projected future wildfire distribution, for directly understanding how the current observational constraint works (Supplementary Fig.8). The second task examines the relative contribution of each drivers projected trend to the projected wildfires trends in a specific region, for disentangling the dynamical mechanisms underlying future evolution of regional wildfires (Supplementary Fig.9). These tasks benefit from the importance score as an output of MLT. Although the calculation of importance scores varies substantially by MLT, all the importance scores qualitatively reflect relative importance of each predictor when making a prediction. For each tree in both rf and gbm, the prediction accuracy on the out-of-bag portion of the data is recorded. Then, the same is done after permuting each predictor variable. For rf, the differences are averaged for each tree and normalized by the standard error. For gbm, the importance order is first calculated for each tree and then summed up over each boosting iteration. For svm, we estimate the contribution of a single variable by training the model on all variables except that specific variable. The difference in performance between that model and the one with all variables is then considered the marginal contribution of that particular variable; such marginal contribution of each variable is standardized to derive the variables relative importance. Because we apply multiple MLT in this study, the average importance scores from these MLT are reported in the corresponding figures for robustness.
In the first task, the importance of each historical driver to future global wildfire distributions is examined in three MLT models (random forest, support vector machine, and gradient boosting machine) that are trained for projecting future fire carbon emissions (Supplementary Fig.8). For the atmospheric and terrestrial variables that include annual mean and monthly climatology as predictors, to account for the overall importance of a particular variable while considering the possible information overlapping contained in each month and annual mean, the importance of each variable is represented by the highest importance score among these 13 predictors (annual mean, January, February, , December). The importance score of each historical driver reflects the relative weight of each historical, environmental driver in determining the spatial pattern of fire carbon emissions in each future decade.
In the second task, the dynamical importance of each environmental drivers future evolution is assessed for targeted tropical regions (i.e., Amazon and Congo) and major land cover types (tropical forests, other forest, shrubland, savannas, grasslands, and croplands) in both default and constrained ensembles through the importance of each drivers trend to the projected wildfire trend. For the default ensemble, the three MLT models (random forest, support vector machine, and gradient boosting machine) are used to predict the spatial distribution of simulated trends in fire carbon emission using the simulated trends in the socioeconomic, atmospheric, and terrestrial variables that are considered in our observational constraint for wildfires, for each ESM and their multimodel mean. This analysis excludes flash rate, another predictor in constraining future wildfires, because it is not dynamically simulated by most ESMs. For the observation-constrained ensemble, we first constrain the projected atmospheric and terrestrial variables in each future decade, using a similar approach as we constrain future fire carbon emissions, for each individual ESM and their multimodel aggregation. In this constraint for environmental drivers, all the variables in Supplementary Table2 are considered as predictors, thereby achieving self-consistency of the constrained future evolution of all these fire-relevant variables. Noticing that the socioeconomic trends are determined by the SSPs, future socioeconomic developments are therefore not constrained in the current approach. Then, the same three MLT models are used to predict the spatial distribution of constrained trends in fire carbon emissions using the constrained trends in those environmental and socioeconomic drivers. For computational efficiency, only the annual mean trends in the environmental drivers are constrained and analyzed in this task. The importance scores of projected trends in socioeconomic and environmental drivers reflect their dynamic role in future evolution of wildfires in the target tropical regions. Here, the Amazon and Congo regions are shown as examples of how this analysis is applied to understand regional wildfire evolutions, though the mechanism underlying the future evolution of wildfires in other regions could be similarly explored.
Read the rest here:
Machine learningbased observation-constrained projections reveal elevated global socioeconomic risks from wildfire - Nature.com
- RIT researchers use machine learning to better understand the pathways of disease - Rochester Institute of Technology - October 7th, 2025 [October 7th, 2025]
- Leveraging machine learning to predict mosquito bed net utilization among women of reproductive age in sub-Saharan Africa - Malaria Journal - October 7th, 2025 [October 7th, 2025]
- Machine learning-based radiomics using magnetic resonance images for prediction of clinical complete response to neoadjuvant chemotherapy in patients... - October 7th, 2025 [October 7th, 2025]
- Machine Learning Self Driving Cars: The Technology Driving the Future of Mobility - SpeedwayMedia.com - October 7th, 2025 [October 7th, 2025]
- Investigating the relationship between blood factors and HDL-C levels in the bloodstream using machine learning methods - Journal of Health,... - October 7th, 2025 [October 7th, 2025]
- AI in the fast lane: F1 teams Alpine, Audi use machine learning as force multiplier - The Business Times - October 7th, 2025 [October 7th, 2025]
- Future Scope of Machine Learning in Healthcare Market Set to Witness Significant Growth by 2025-2032 - openPR.com - October 7th, 2025 [October 7th, 2025]
- AI and Machine Learning - AI readiness and adoption toolkit launched - Smart Cities World - October 4th, 2025 [October 4th, 2025]
- Machine Learning Model UmamiPredict Developed to Forecast Savory Taste of Molecules and Peptides - geneonline.com - October 4th, 2025 [October 4th, 2025]
- Machine Learning Boosts Crop Yield Predictions in Senegal - Bioengineer.org - October 4th, 2025 [October 4th, 2025]
- Machine learning-driven stability analysis of eco-friendly superhydrophobic graphene-based coatings on copper substrate - Nature - October 4th, 2025 [October 4th, 2025]
- Integrated machine learning analysis of proteomic and transcriptomic data identifies healing associated targets in diabetic wound repair - Nature - October 4th, 2025 [October 4th, 2025]
- Development and evaluation of a machine learning prediction model for short-term mortality in patients with diabetes or hyperglycemia at emergency... - October 4th, 2025 [October 4th, 2025]
- Fast and robust mixed gas identification and recognition using tree-based machine learning and sensor array response - Nature - October 4th, 2025 [October 4th, 2025]
- Estimation of sexual dimorphism of adult human mandibles of South Indian origin using non-metric parameters and machine learning classification... - October 4th, 2025 [October 4th, 2025]
- Cloud-Based Machine Learning Platforms Technologies Market Growth and Future Prospects - Precedence Research - October 4th, 2025 [October 4th, 2025]
- Machine Learning Framework Developed to Optimize Phosphorus Recovery in Hydrothermal Treatment of Livestock Manure - geneonline.com - October 4th, 2025 [October 4th, 2025]
- Unifying machine learning and interpolation theory via interpolating neural networks - Nature - October 2nd, 2025 [October 2nd, 2025]
- Anna: an open-source platform for real-time integration of machine learning classifiers with veterinary electronic health records - BMC Veterinary... - October 2nd, 2025 [October 2nd, 2025]
- The Future of Liver Health: Can Human Models and Machine Learning Reduce Disease Rates? - Technology Networks - October 2nd, 2025 [October 2nd, 2025]
- Machine Learning Radiomics Predicts Pancreatic Cancer Invasion - Bioengineer.org - October 2nd, 2025 [October 2nd, 2025]
- Next-generation COVID-19 detection using a metasurface biosensor with machine learning-enhanced refractive index sensing - Nature - October 2nd, 2025 [October 2nd, 2025]
- Machine learning-based models for screening of anemia and leukemia using features of complete blood count reports - Nature - October 2nd, 2025 [October 2nd, 2025]
- Estimating the peak age of chess players through statistical and machine learning techniques - Nature - October 2nd, 2025 [October 2nd, 2025]
- Optimizing water quality index using machine learning: a six-year comparative study in riverine and reservoir systems - Nature - October 2nd, 2025 [October 2nd, 2025]
- Physics-informed machine learning-based real-time long-horizon temperature fields prediction in metallic additive manufacturing - Nature - October 2nd, 2025 [October 2nd, 2025]
- The Silicon Revolution: How AI and Machine Learning Are Forging the Future of Semiconductor Manufacturing - FinancialContent - October 2nd, 2025 [October 2nd, 2025]
- Machine learning model for differentiating Pneumocystis jirovecii pneumonia from colonization and analyzing mortality risk in non-HIV patients using... - October 2nd, 2025 [October 2nd, 2025]
- Radiomics and Machine Learning Applied to CECT Scans Show Potential in Predicting Perineural Invasion in Pancreatic Cancer - geneonline.com - October 2nd, 2025 [October 2nd, 2025]
- Machine learning and response surface optimization to enhance diesel engine performance using milk scum biodiesel with alumina nanoparticles - Nature - October 2nd, 2025 [October 2nd, 2025]
- Landmark Patent Appeal Decision Strengthens Protection for AI and Machine Learning Innovations - The National Law Review - October 2nd, 2025 [October 2nd, 2025]
- Machine learning researchers and industry leaders gathering at Santa Clara University - Stories - News & Events - Santa Clara University - September 30th, 2025 [September 30th, 2025]
- Building better batteries with amorphous materials and machine learning - Tech Xplore - September 30th, 2025 [September 30th, 2025]
- Machine Learning-Supported Fragment Hit Expansion in Absence of X-Ray Structures - Evotec - September 30th, 2025 [September 30th, 2025]
- Machine learning model predicts which radiotherapy patients are most vulnerable to adverse side effects - Health Imaging - September 30th, 2025 [September 30th, 2025]
- How AI and Machine Learning Are Revolutionizing Laser Welding - Downbeach - September 30th, 2025 [September 30th, 2025]
- What if A.I. Doesnt Get Much Better Than This? - Machine Learning Week 2025 - September 30th, 2025 [September 30th, 2025]
- Sex estimation from the sternum in Turkish population using various machine learning methods and deep neural networks - SpringerOpen - September 30th, 2025 [September 30th, 2025]
- Predictive AI Must Be Valuated But Rarely Is. Heres How To Do It - Machine Learning Week 2025 - September 30th, 2025 [September 30th, 2025]
- Interpretable machine learning incorporating major lithology for regional landslide warning in northern and eastern Guangdong - Nature - September 28th, 2025 [September 28th, 2025]
- Building Machine Learning Application with Django - KDnuggets - September 28th, 2025 [September 28th, 2025]
- Evaluating the use of body mass index change as a proxy for anorexia nervosa recovery: a machine learning perspective - Journal of Eating Disorders - September 28th, 2025 [September 28th, 2025]
- Prediction of cutting parameters and reduction of output parameters using machine learning in milling of Inconel 718 alloy - Nature - September 28th, 2025 [September 28th, 2025]
- How AI and machine learning are changing both retail and online casino experiences - Retail Technology Innovation Hub - September 28th, 2025 [September 28th, 2025]
- Machine learning and cell imaging combine to predict effectiveness of multiple sclerosis medication - Medical Xpress - September 25th, 2025 [September 25th, 2025]
- IC combines machine learning and analogue inferencing - Electronics Weekly - September 25th, 2025 [September 25th, 2025]
- ODU Awarded $2.3M NIH Grant to Improve Detection of Brain Tumor Recurrence with AI and Machine Learning - Old Dominion University - September 25th, 2025 [September 25th, 2025]
- Development of a machine learning-based depression risk identification tool for older adults with asthma - BMC Psychiatry - September 25th, 2025 [September 25th, 2025]
- AI and Machine Learning Uses in Neuroscience Drug Discovery, Upcoming Webinar Hosted by Xtalks - PR Newswire - September 25th, 2025 [September 25th, 2025]
- Error-controlled non-additive interaction discovery in machine learning models - Nature - September 23rd, 2025 [September 23rd, 2025]
- AI, Machine Learning Will Drive Market Data Consumption - Markets Media - September 23rd, 2025 [September 23rd, 2025]
- Machine Learning Model May Optimize Treatment Selection and Survival in HCC - Targeted Oncology - September 23rd, 2025 [September 23rd, 2025]
- From pixels to pumps: Machine learning and satellite imagery help map irrigation - Phys.org - September 23rd, 2025 [September 23rd, 2025]
- CMU physicist challenges what we know about particle physics with machine learning - The Tartan - September 23rd, 2025 [September 23rd, 2025]
- Hire Python Developers to Leverage the Power of Machine Learning & AI - WebWire - September 23rd, 2025 [September 23rd, 2025]
- AI-Powered Biology Careers in 2025: Opportunities with Machine Learning Skills - BioTecNika - September 23rd, 2025 [September 23rd, 2025]
- Machine learning and predictingstock price movements on NGX - Businessamlive - September 23rd, 2025 [September 23rd, 2025]
- Building a Hybrid Rule-Based and Machine Learning Framework to Detect and Defend Against Jailbreak Prompts in LLM Systems - MarkTechPost - September 21st, 2025 [September 21st, 2025]
- Development of a novel machine learning-based adaptive resampling algorithm for nuclear data processing - Nature - September 19th, 2025 [September 19th, 2025]
- Autobot platform uses machine learning to rapidly find best ways to make advanced materials - Tech Xplore - September 19th, 2025 [September 19th, 2025]
- 5 Key Takeaways | The Law of the Machine (Learning): Solving Complex AI Challenges - JD Supra - September 17th, 2025 [September 17th, 2025]
- Spectral and Machine Learning Approach Enhances Efficiency of Grape Embryo Rescue | Newswise - Newswise - September 17th, 2025 [September 17th, 2025]
- Helpful Reminders for Patent Eligibility of AI, Machine Learning, and Other Software-Related Inventions - JD Supra - September 17th, 2025 [September 17th, 2025]
- Opening the black box of machine learning-controlled plasma treatments - AIP.ORG - September 17th, 2025 [September 17th, 2025]
- Post-compilation Circuit Scaling for Quantum Machine Learning Models Reveals Resource Trends and Topology Impacts - Quantum Zeitgeist - September 17th, 2025 [September 17th, 2025]
- Machine-learning tool gives doctors a more detailed 3D picture of fetal health - Medical Xpress - September 17th, 2025 [September 17th, 2025]
- Portable Electronic Nose with Machine Learning Enhances VOC Detection in Forensic Science - Chromatography Online - September 15th, 2025 [September 15th, 2025]
- Developing a predictive model for breast cancer detection using radiomics-based mammography and machine learning - SpringerOpen - September 13th, 2025 [September 13th, 2025]
- and correlation of drug solubility via hybrid machine learning and gradient based optimization - Nature - September 11th, 2025 [September 11th, 2025]
- Rice-Houston Methodist partnership uses machine learning to reveal hidden patient groups in common heart valve disease - Rice University - September 11th, 2025 [September 11th, 2025]
- Amazon Uses Machine Learning to Tell Sellers if FBA Is a Good Fit - EcommerceBytes - September 11th, 2025 [September 11th, 2025]
- Eli Lilly Launches AI, Machine Learning Platform Called TuneLab For Biotech Companies - Stocktwits - September 11th, 2025 [September 11th, 2025]
- How AI and Machine Learning are Shaping the Future of Mobile Apps - indiatechnologynews.in - September 11th, 2025 [September 11th, 2025]
- Hybrid AI and semiconductor approaches for power quality improvement - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- The Predictive Turn | Preparing to Outthink Adversaries Through Predictive Analytics - Machine Learning Week 2025 - September 9th, 2025 [September 9th, 2025]
- NFL player props, odds and bets: Week 1, 2025 NFL picks, SportsLine Machine Learning Model AI predictions, SGP - CBS Sports - September 9th, 2025 [September 9th, 2025]
- Can machine learning forecast Lobo EV Technologies Ltd. recovery - Bear Alert & Daily Price Action Insights - Newser - September 6th, 2025 [September 6th, 2025]
- Generalised Machine Learning Models Outperform Personalised Models For Cognitive Load Classification In Real-Life Settings - Frontiers - September 6th, 2025 [September 6th, 2025]
- Machine learning for the prediction of blood transfusion risk during or after mitral valve surgery: a multicenter retrospective cohort study - Nature - September 6th, 2025 [September 6th, 2025]
- Machine Learning-Driven Exploration of Composition- and Temperature-Dependent Transport and Thermodynamic Properties in LiF-NaF-KF Molten Salts for... - September 6th, 2025 [September 6th, 2025]