EmrEtlRunner: add backend for a `generate` command
See original GitHub issue
Two sub-command options:
- `generate emr-cluster`, which produces a Dataflow Runner-compatible EMR cluster config in Avro format
- `generate playbook`, which produces a Dataflow Runner-compatible playbook for running the job
Let’s take these in turn:
generate emr-cluster
This command will use the cluster specification in `config.yml` to generate a Dataflow Runner-compatible EMR cluster config.
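A minimal sketch of what this mapping could look like, written in Ruby to match the rest of the codebase. Everything here is an assumption for illustration: the `generate_emr_cluster` function name, the `config.yml` keys under `aws.emr`, the output field names, and the schema URI and version should all be checked against the actual EmrEtlRunner and Dataflow Runner documentation. The output is emitted as a JSON document that references an Avro schema; the schema itself is not reproduced.

```ruby
require 'yaml'
require 'json'

# Assumed schema URI; verify vendor, name and version against the Dataflow Runner docs.
CLUSTER_SCHEMA = 'iglu:com.snowplowanalytics.dataflowrunner/ClusterConfig/avro/1-0-0'

# Hypothetical helper: maps the (assumed) aws.emr section of config.yml
# onto a cluster config hash. Hash#fetch raises KeyError on missing keys,
# so an incomplete config fails loudly instead of producing a partial cluster.
def generate_emr_cluster(config)
  emr = config.fetch('aws').fetch('emr')
  jobflow = emr.fetch('jobflow')

  {
    'schema' => CLUSTER_SCHEMA,
    'data' => {
      'name' => jobflow.fetch('job_name', 'Snowplow ETL'),
      'region' => emr.fetch('region'),
      'ec2' => {
        'amiVersion' => emr.fetch('ami_version'),
        'instances' => {
          'master' => { 'type' => jobflow.fetch('master_instance_type') },
          'core' => {
            'type' => jobflow.fetch('core_instance_type'),
            'count' => jobflow.fetch('core_instance_count')
          }
        }
      }
    }
  }
end

# Illustrative config only; the key names are assumptions.
SAMPLE_CONFIG = YAML.safe_load(<<~YML)
  aws:
    emr:
      ami_version: "5.9.0"
      region: eu-west-1
      jobflow:
        job_name: Example job
        master_instance_type: m4.large
        core_instance_type: m4.large
        core_instance_count: 2
YML

puts JSON.pretty_generate(generate_emr_cluster(SAMPLE_CONFIG))
```

Keeping the mapping as a pure function from a parsed config hash to an output hash makes it easy to test without touching AWS.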
generate playbook
This command will use `config.yml`, the contents of the `--enrichments` folder, and any relevant command-line arguments (such as `--skip staging`) to generate a Dataflow Runner-compatible playbook for running the identical job via Dataflow Runner.
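The playbook generation could be sketched along the following lines, again in Ruby. This is not the actual implementation: the step list, the jar locations, the `generate_playbook` and `load_enrichments` names, the output field names, and the schema URI are all hypothetical and only illustrate how skipped steps and enrichment files might feed into the generated playbook.

```ruby
require 'json'

# Assumed schema URI; verify against the Dataflow Runner docs.
PLAYBOOK_SCHEMA = 'iglu:com.snowplowanalytics.dataflowrunner/PlaybookConfig/avro/1-0-0'

# Hypothetical, simplified step list; the real job has its own set of steps.
ALL_STEPS = %w[staging enrich shred archive_raw].freeze

# Reads every JSON file in the enrichments folder, in a stable order.
def load_enrichments(dir)
  Dir.glob(File.join(dir, '*.json')).sort.map { |path| JSON.parse(File.read(path)) }
end

# Builds a playbook hash, leaving out any step named in `skip`.
def generate_playbook(region:, enrichments:, skip: [])
  unknown = skip - ALL_STEPS
  raise ArgumentError, "Unknown steps: #{unknown.join(', ')}" unless unknown.empty?

  steps = (ALL_STEPS - skip).map do |name|
    step = {
      'type' => 'CUSTOM_JAR',
      'name' => name,
      'actionOnFailure' => 'CANCEL_AND_WAIT',
      'jar' => "s3://example-bucket/#{name}.jar", # placeholder location
      'arguments' => []
    }
    # Only the enrich step receives the enrichment configurations.
    step['arguments'] = ['--enrichments', JSON.generate(enrichments)] if name == 'enrich'
    step
  end

  { 'schema' => PLAYBOOK_SCHEMA, 'data' => { 'region' => region, 'steps' => steps } }
end

# Equivalent in spirit to passing `--skip staging` on the command line.
playbook = generate_playbook(
  region: 'eu-west-1',
  enrichments: [{ 'name' => 'ip_lookups', 'enabled' => true }],
  skip: ['staging']
)
puts JSON.pretty_generate(playbook)
```

In a real run the `enrichments:` argument would come from `load_enrichments` pointed at the `--enrichments` folder. Rejecting unknown step names up front avoids silently generating a playbook that differs from the job the user asked for.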
Issue Analytics
- State:
- Created 7 years ago
- Comments: 13 (13 by maintainers)
Top GitHub Comments
Okay cool - assigned you to the 0.2.0 milestone per our convo a little earlier…
Hi @BenFradet - you raise good points. I think it’s okay then to write this in Ruby, same as the rest of the codebase…