Hi energy nerds,
Catalyst has just put out a new PUDL data release with updated US energy system data through the end of 2022 from EIA, FERC, EPA, and elsewhere. It’s a stand-alone data only release with everything in SQLite or Parquet, and no need to install any specialized software to do further data processing / access analytical outputs (unlike our previous software + data releases).
It’s all CC-BY-4.0 licensed, and includes hundreds of tables with data from 1994-2023. In all it’s about 10GB when compressed. A longer release announcement is up in this GitHub Discussion.
- If you just want to go download the data, it’s on Zenodo here: PUDL Data Release v2023.12.01
- There are also more detailed release notes on ReadTheDocs as well as the PUDL data dictionary
- You don’t need to install it, but the MIT licensed software that generated this data is catalystcoop.pudl v2023.12.1 on PyPI
- If you want to play with the data without doing any setup, you can also find it on Kaggle
Let us know what you think of the data, or if you have any issues getting up and running with it! We’re trying to centralize feedback and comments in the release announcement.
Data Liberation Engineer @ Catalyst Cooperative