
RDB Loader: Inconsistent resolution of Snowplow-hosted jsonpaths


https://github.com/snowplow/snowplow/blob/8e5e1901af4c9073d0dceda2e2450ea6e6b486bb/4-storage/rdb-loader/src/main/scala/com/snowplowanalytics/snowplow/rdbloader/ShreddedType.scala#L158

The RDB Loader is hard-coded to use s3://snowplow-hosted-assets[-region] to retrieve jsonpaths for Snowplow-authored jsonschemas.

This seems inconsistent with other Snowplow components which use the configurable value in config.yaml (aws/s3/buckets/assets) and with the Iglu model in which each repo has its own jsonpaths store.

Is there a reason for this? If not I am happy to write a PR.

Issue Analytics

  • State: closed
  • Created: 6 years ago
  • Comments: 5 (5 by maintainers)

Top GitHub Comments

chuwy commented, Aug 16, 2017

Sorry, @acgray, I’m still not following. You can still use the aws.s3.buckets.jsonpath_assets setting in config.yml to point RDB Loader to your mirror. If your mirror is identical to snowplow-hosted-assets, then the Loader won’t ever reach our bucket, because the custom assets bucket (jsonpath_assets) is always prioritized. This is how StorageLoader worked, and I see it as quite consistent with other parts of the pipeline. Please correct me if I’m missing something. If what I described is not possible now, then it’s quite a serious issue.

You’re totally right on this point. We’re working on providing a more consistent (and format-independent) approach to working with Iglu registries, but so far there’s no clear direction for this problem. I can’t give you any clues or an ETA yet, so as not to make false promises, but you can be sure that we agree and are working on it.
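The lookup order described in this comment (custom jsonpath_assets bucket first, then the Snowplow-hosted bucket) can be sketched as follows. This is a minimal illustration in Python, not RDB Loader's actual Scala implementation: `hosted_bucket`, `resolve_jsonpath`, the `exists` callback, and the example mirror URI are all hypothetical names introduced here, and the key layout inside each bucket is an assumption.

```python
from typing import Callable, Optional

# Bucket name taken from the issue text: s3://snowplow-hosted-assets[-region]
HOSTED_ASSETS = "snowplow-hosted-assets"


def hosted_bucket(region: Optional[str]) -> str:
    """Build the hosted-assets bucket URI, adding the region suffix if given."""
    suffix = f"-{region}" if region else ""
    return f"s3://{HOSTED_ASSETS}{suffix}"


def resolve_jsonpath(
    key: str,
    custom_bucket: Optional[str],
    region: Optional[str],
    exists: Callable[[str], bool],
) -> Optional[str]:
    """Return the first URI that exists, checking the custom bucket first.

    `exists` is a hypothetical stand-in for an S3 existence check, so the
    sketch runs without any AWS access.
    """
    candidates = []
    if custom_bucket:
        # Assumption: the custom bucket (jsonpath_assets) is always tried first.
        candidates.append(f"{custom_bucket.rstrip('/')}/{key}")
    # Fallback: the Snowplow-hosted bucket for the given region.
    candidates.append(f"{hosted_bucket(region)}/{key}")
    for uri in candidates:
        if exists(uri):
            return uri
    return None
```

With a mirror that holds the same files as the hosted bucket, the first candidate always matches, so the hosted bucket is never reached. The fallback only matters when the mirror lacks a file.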

chuwy commented, Aug 17, 2017

Hey @acgray. I’m moving this into the new repository, let’s continue the discussion there: https://github.com/snowplow/snowplow-rdb-loader/issues/4


Top Results From Across the Web

RDB Loader: fix JSONPath cache resolution bug #3 - GitHub
Can confirm this is a bug affecting JSONPaths from Snowplow Hosted Assets.
RDB Loader is failing to find a JSONPath file - Troubleshooting
RDB Loader is raising the following error: Data discovery error with following issues: JSONPath file [com.snowplowanalytics.snowplow/parent_event_1.json] ...
