New Machine Learning Model Can Accurately Predict Events Like Tornadoes and Hail Eight Days in Advance

Earth Weather Climate Change

Researchers have developed a machine learning model, the CSU-MLP, capable of accurately predicting hazardous weather events such as tornadoes and hail four to eight days in advance. The team has collaborated with the national Storm Prediction Center to test and refine the model, enhancing forecasters’ confidence in their predictions and potentially saving lives.

Precise forecasting of occurrences such as tornadoes and hail up four to eight days in advance.

As severe weather approaches with potential life-threatening hazards such as heavy rain, hail, or tornadoes, early warnings, and precise predictions are crucial. Weather researchers at Colorado State University have provided storm forecasters with a powerful new tool to enhance the reliability of their predictions, potentially saving lives in the process.

In recent years, Russ Schumacher, a professor in the Department of Atmospheric Science and Colorado State Climatologist, has spearheaded a team in creating an advanced machine learning model to enhance hazardous weather prediction across the continental United States. Initially trained on historical excessive rainfall data, the model, known as CSU-MLP (Colorado State University-Machine Learning Probabilities), has evolved to accurately predict events such as tornadoes and hail four to eight days ahead of time, a critical window for forecasters to disseminate information to the public for preparation.

Led by research scientist Aaron Hill, who has worked on refining the model for the last two-plus years, the team recently published their medium-range (four to eight days) forecasting ability in the American Meteorological Society journal Weather and Forecasting.

Aaron Hill

Research scientist Aaron Hill presents the CSU-MLP to forecasters at the Storm Prediction Center. Credit: Provided/Aaron Hill

Working with Storm Prediction Center forecasters

The researchers have now teamed with forecasters at the national Storm Prediction Center in Norman, Oklahoma, to test the model and refine it based on practical considerations from actual weather forecasters. The tool is not a stand-in for the invaluable skill of human forecasters but rather provides an agnostic, confidence-boosting measure to help forecasters decide whether to issue public warnings about potential weather.

“Our statistical models can benefit operational forecasters as a guidance product, not as a replacement,” Hill said.

Israel Jirak, M.S. ’02, Ph.D. ’05, is a science and operations officer at the Storm Prediction Center and co-author of the paper. He called the collaboration with the CSU team “a very successful research-to-operations project.”

“They have developed probabilistic machine learning-based severe weather guidance that is statistically reliable and skillful while also being practically useful for forecasters,” Jirak said. The forecasters in Oklahoma are using the CSU guidance product daily, particularly when they need to issue medium-range severe weather outlooks.

Allie Mazurek

CSU Ph.D. student Allie Mazurek discusses the CSU-MLP with forecaster Andrew Moore. Credit: Provided/Allie Mazurek

Nine years of historical weather data

The model is trained on a very large dataset containing about nine years of detailed historical weather observations over the continental U.S. These data are combined with meteorological retrospective forecasts, which are model “re-forecasts” created from outcomes of past weather events. The CSU researchers pulled the environmental factors from those model forecasts and associated them with past events of severe weather like tornadoes and hail. The result is a model that can run in real-time with current weather events and produce a probability of those types of hazards with a four- to eight-day lead time, based on current environmental factors like temperature and wind.

Ph.D. student Allie Mazurek is working on the project and is seeking to understand which atmospheric data inputs are the most important to the model’s predictive capabilities. “If we can better decompose how the model is making its predictions, we can hopefully better diagnose why the model’s predictions are good or bad during certain weather setups,” she said.

Hill and Mazurek are working to make the model not only more accurate but also more understandable and transparent for the forecasters using it.

For Hill, it’s most gratifying to know that years of work refining the machine learning tool are now making a difference in a public, operational setting.

“I love fundamental research. I love understanding new things about our atmosphere. But having a system that is providing improved warnings and improved messaging around the threat of severe weather is extremely rewarding,” Hill said.

Reference: “A New Paradigm for Medium-Range Severe Weather Forecasts: Probabilistic Random Forest–Based Predictions” by Aaron J. Hill, Russ S. Schumacher and Israel L. Jirak, 2 February 2023, Weather and Forecasting.
DOI: 10.1175/WAF-D-22-0143.1