Publish all raw FERC SQLite DBs produced from XBRL
See original GitHub issueOur FERC XBRL Extractor works with FERC Froms 1, 2, 6, 60, and 714. Form 1 is the most well integrated with PUDL, but even then many tables have not yet been integrated into the ETL. So, the ability to publish the raw SQLite DB’s will be very useful. The XBRL data is also much better structured than the historical data, but very hard to work with in the XBRL format, so the raw SQLite versions of all of these forms could provide a lot of value.
Tasks
Enabling tasks
- Create sandbox archives of forms (2, 6, 60, 714)
- catalyst-cooperative/ferc-xbrl-extractor#18
Ingest Metadata generated by extraction tool
Enable publication
- #1861
- Integrate new sources with
datasette_metadata_to_yml
- Update datasette publication bash script
Generate documentation
Issue Analytics
- State:
- Created a year ago
- Comments:5 (5 by maintainers)
Top Results From Across the Web
PACKAGE_NAME Release Notes - FERC XBRL Extractor 0.7 ...
The FERC XBRL extractor supports extracting data from XBRL filings to produce a SQLite database. It provides a CLI that takes a link...
Read more >Catalyst Cooperative - GitHub
An Intake driver that provides access to SQLite databases identified by ... A tool for converting FERC filings published in XBRL into SQLite...
Read more >catalystcoop.ferc-xbrl-extractor - PyPI
A tool for extracting data from FERC XBRL Filings. ... and write that data to a SQLite database whose structure is generated from...
Read more >ferc714 Archives - Catalyst Cooperative
We are attempting to obtain a bulk download of all this data so we can archive it alongside our other raw data sources...
Read more >OAI-PMH Interface - Zenodo
Converted from a large collection of annual Microsoft Visual FoxPro databases into a single combined SQLite database. See the README.md file for details....
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
PUDL is now able to construct SQLite DB’s from all FERC XBRL forms, and ingest/convert the accompanying datapackage descriptors.We have not yet published these DB’s on datasette, but the infrastructure is all in place on the
xbrl_integration
branch.The only forms we’ve said we would integrate historical data for are 1, 2, and 714
The Form 2 is analogous to Form 1 but for interstate natural gas utilities, so mostly transmission pipeline companies. We’d hoped there would be more state level has utilities in there as there are for electric utilities, but it seems like that’s not the case.
The old Form 714 is partially integrated, and provides a bunch of data about balancing and planning areas, including hourly demand. The old data is bunch of CSVs dumped from DBF, all years in one partition.
So I think those are the highest priority, and the old 714 data will be easier to work with.
IIRC, Form 6 is like Forms 1 & 2, but for petroleum, and the old data is DBF. I think that would be the next priority. Form 60 seemed like a mysterious “other entities” category, and would be the lowest priority.
I imagine having the XBRL databases will make it easier to interpret the old data.