Postgres support for shredded events
Sorry, I couldn't find any existing tickets regarding this (except #2475), but I'm pretty sure they exist.
I just wanted to clarify what we need to do to make this happen, since we are now considering removing Postgres support from StorageLoader/RDB Loader.
From my point of view, the main missing components are:
- Spark job that takes events and JSON paths and produces TSV. Rough Scala-ish pseudo-code:
    def convertJsonToTsv(jsonPaths: JsonPath, json: Hierarchy): TSV = ???
    def produceTsv(igluResolver: Resolver, json: Hierarchy): TSV = {
      val jsonPaths = igluResolver.fetchJsonPaths(json.schema)
      convertJsonToTsv(jsonPaths, json)
    }
- Notion of JSON Paths in Iglu.
- Actual RDB Postgres loading code.
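To make the first item a bit more concrete, here is a minimal sketch of what convertJsonToTsv could look like. Everything in it is an assumption for illustration: the tiny Json model stands in for a real JSON library, paths are plain dot notation such as "$.data.id" (bracket notation is not handled), and rendering missing or null values as empty cells is just one possible choice. It is not the actual shredder or loader code.

```scala
// Hypothetical minimal JSON model so the sketch needs no external library.
// A real Spark job would use a proper JSON library instead.
sealed trait Json
case class JObj(fields: Map[String, Json]) extends Json
case class JStr(value: String) extends Json
case class JNum(value: BigDecimal) extends Json
case class JBool(value: Boolean) extends Json
case object JNull extends Json

object JsonToTsv {
  // Split a dot-notation path such as "$.data.id" into its keys: List("data", "id").
  def parsePath(path: String): List[String] =
    path.stripPrefix("$").split('.').filter(_.nonEmpty).toList

  // Walk the keys into nested objects; None if a key is missing
  // or a non-object is reached before the path is exhausted.
  def lookup(json: Json, keys: List[String]): Option[Json] =
    keys match {
      case Nil => Some(json)
      case key :: rest =>
        json match {
          case JObj(fields) => fields.get(key).flatMap(lookup(_, rest))
          case _            => None
        }
    }

  // Render a single TSV cell. Tabs and line breaks inside strings are
  // replaced with spaces so they cannot break the row layout.
  def render(value: Option[Json]): String = value match {
    case Some(JStr(s))  => s.replaceAll("[\\t\\n\\r]", " ")
    case Some(JNum(n))  => n.toString
    case Some(JBool(b)) => b.toString
    case _              => "" // assumption: missing, null and nested values become empty cells
  }

  // One TSV row: one cell per JSON path, in the order the paths are given.
  def convertJsonToTsv(jsonPaths: List[String], json: Json): String =
    jsonPaths.map(path => render(lookup(json, parsePath(path)))).mkString("\t")
}
```

With a JSON object whose data.id is 42 and whose schema.name is "link_click", calling JsonToTsv.convertJsonToTsv(List("$.schema.name", "$.data.id"), json) yields the two values separated by a tab. The column order comes entirely from the order of the paths, which is why a notion of JSON Paths in Iglu (the second item) would be needed to keep rows aligned with the table definition.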
@alexanderdean did I miss something?
Issue Analytics
- State:
- Created 6 years ago
- Comments: 5 (4 by maintainers)
Top GitHub Comments
+1 for Postgres shredding - we use Postgres extensively for all our needs, and it would help not to have to run another ETL to export data out of Redshift into Postgres.
Moved to https://github.com/snowplow/snowplow-rdb-loader/issues/47, closing this one.