StorageLoader: add support for Spark-based Shredder's directory structure
Due to the changes introduced by #3034, the directory structure will now look like:
shredded/
    good/
        run=2016-11-26-21-48-42/
            atomic-events/
                part-00000
                part-00001
                ...
            shredded-types/
                vendor=com.acme/
                    name=event/
                        format=jsonschema/
                            version=1-0-0/
                                part-00001-00010
                    name=context/
                        format=jsonschema/
                            version=1-0-0/
                                part-00001-00010
as opposed to the previous structure:
shredded/
    good/
        run=2016-11-26-21-48-42/
            atomic-events/
                part-00000
                part-00001
                ...
            com.acme/
                event/
                    jsonschema/
                        1-0-0/
                            part-00001-00010
                context/
                    jsonschema/
                        1-0-0/
                            part-00001-00010
There are effectively two changes:
- The field names of SchemaKey (vendor, name, format, version) are now part of the path: what was com.acme/event/jsonschema/1-0-0 is now vendor=com.acme/name=event/format=jsonschema/version=1-0-0
- A shredded-types layer has been added
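
To make the two changes concrete, here is a minimal Python sketch of how a loader might recognise the new layout and map an old-style path fragment onto it. This is not StorageLoader's actual code: the SchemaKey tuple, the regular expression, and the helper names parse_new_style, to_new_style and old_to_new are all hypothetical, introduced only for illustration.

```python
import re
from typing import NamedTuple, Optional


class SchemaKey(NamedTuple):
    """Illustrative stand-in for the four SchemaKey fields named in the issue."""
    vendor: str
    name: str
    format: str
    version: str


# Hypothetical pattern for the Spark-style layout: a shredded-types layer
# followed by key=value segments for each SchemaKey field.
NEW_STYLE = re.compile(
    r"shredded-types/"
    r"vendor=(?P<vendor>[^/]+)/"
    r"name=(?P<name>[^/]+)/"
    r"format=(?P<format>[^/]+)/"
    r"version=(?P<version>[^/]+)/"
)


def parse_new_style(path: str) -> Optional[SchemaKey]:
    """Extract the SchemaKey from a new-style path, or None if it does not match."""
    match = NEW_STYLE.search(path)
    return SchemaKey(**match.groupdict()) if match else None


def to_new_style(key: SchemaKey) -> str:
    """Build the new-style fragment (relative to the run folder) for a key."""
    return (
        "shredded-types/"
        f"vendor={key.vendor}/name={key.name}/"
        f"format={key.format}/version={key.version}"
    )


def old_to_new(old_fragment: str) -> str:
    """Convert an old-style fragment such as com.acme/event/jsonschema/1-0-0."""
    vendor, name, fmt, version = old_fragment.strip("/").split("/")
    return to_new_style(SchemaKey(vendor, name, fmt, version))
```

With these helpers, old_to_new("com.acme/event/jsonschema/1-0-0") yields the new-style fragment quoted in the first bullet, prefixed with the shredded-types layer from the second. Paths under atomic-events/ do not match the pattern, so parse_new_style returns None for them.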
Issue Analytics
- State:
- Created 7 years ago
- Comments: 6 (6 by maintainers)
Top GitHub Comments
Updated title, please update commit!
StorageLoader no space