ClimaEmpact is a framework designed to enhance the understanding of extreme weather events using large language models (LLMs). It bridges the data gap in underrepresented regions by analyzing news articles and other text sources to assess event severity, extract key themes, and evaluate public sentiment. This enables faster, more accurate decision-making when structured data is limited.
We use news articles as the primary data source. The ExtremeWeatherNews dataset comprises articles collected for 60 distinct extreme weather events, selected based on ClimaMeter. These events include heatwaves, cold spells, extreme wind, and extreme precipitation.
Our reports are generated using Domain-Aligned Small Language Models (SLMs), fine-tuned with climate-specific reasoning paths and real-world news data.