question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Publish all raw FERC SQLite DBs produced from XBRL

See original GitHub issue

Our FERC XBRL Extractor works with FERC Froms 1, 2, 6, 60, and 714. Form 1 is the most well integrated with PUDL, but even then many tables have not yet been integrated into the ETL. So, the ability to publish the raw SQLite DB’s will be very useful. The XBRL data is also much better structured than the historical data, but very hard to work with in the XBRL format, so the raw SQLite versions of all of these forms could provide a lot of value.

Tasks

Enabling tasks

  • Create sandbox archives of forms (2, 6, 60, 714)
  • catalyst-cooperative/ferc-xbrl-extractor#18

Ingest Metadata generated by extraction tool

  • catalyst-cooperative/ferc-xbrl-extractor#17
  • #1860

Enable publication

  • #1861
  • Integrate new sources with datasette_metadata_to_yml
  • Update datasette publication bash script

Generate documentation

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
zschiracommented, Sep 8, 2022

PUDL is now able to construct SQLite DB’s from all FERC XBRL forms, and ingest/convert the accompanying datapackage descriptors.We have not yet published these DB’s on datasette, but the infrastructure is all in place on the xbrl_integration branch.

0reactions
zaneselvanscommented, Aug 8, 2022

The only forms we’ve said we would integrate historical data for are 1, 2, and 714

The Form 2 is analogous to Form 1 but for interstate natural gas utilities, so mostly transmission pipeline companies. We’d hoped there would be more state level has utilities in there as there are for electric utilities, but it seems like that’s not the case.

The old Form 714 is partially integrated, and provides a bunch of data about balancing and planning areas, including hourly demand. The old data is bunch of CSVs dumped from DBF, all years in one partition.

So I think those are the highest priority, and the old 714 data will be easier to work with.

IIRC, Form 6 is like Forms 1 & 2, but for petroleum, and the old data is DBF. I think that would be the next priority. Form 60 seemed like a mysterious “other entities” category, and would be the lowest priority.

I imagine having the XBRL databases will make it easier to interpret the old data.

Read more comments on GitHub >

github_iconTop Results From Across the Web

PACKAGE_NAME Release Notes - FERC XBRL Extractor 0.7 ...
The FERC XBRL extractor supports extracting data from XBRL filings to produce a SQLite database. It provides a CLI that takes a link...
Read more >
Catalyst Cooperative - GitHub
An Intake driver that provides access to SQLite databases identified by ... A tool for converting FERC filings published in XBRL into SQLite...
Read more >
catalystcoop.ferc-xbrl-extractor - PyPI
A tool for extracting data from FERC XBRL Filings. ... and write that data to a SQLite database whose structure is generated from...
Read more >
ferc714 Archives - Catalyst Cooperative
We are attempting to obtain a bulk download of all this data so we can archive it alongside our other raw data sources...
Read more >
OAI-PMH Interface - Zenodo
Converted from a large collection of annual Microsoft Visual FoxPro databases into a single combined SQLite database. See the README.md file for details....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found