EmrEtlRunner: switch AMI bootstrap scripts to HTTPS

Hello!

Since today (15-01-2020) we have been unable to download commons-codec from Maven, which means we cannot bootstrap our EMR cluster.

The error logs, both from our application and from Amazon, show that the bootstrap fails while downloading that library because it tries to connect over plain HTTP:

--2020-01-15 18:12:38-- http://central.maven.org/maven2/commons-codec/commons-codec/1.5/commons-codec-1.5.jar
Resolving central.maven.org (central.maven.org)... 151.101.4.209
Connecting to central.maven.org (central.maven.org)|151.101.4.209|:80... connected.
HTTP request sent, awaiting response... 501 HTTPS Required
2020-01-15 18:12:38 ERROR 501: HTTPS Required.

After a quick search on the web, we found out that as of today (15-01-2020) Maven no longer supports plain HTTP, so we tried to connect via HTTPS. However, that failed too, with the message below:

--2020-01-15 19:18:23-- https://central.maven.org/maven2/commons-codec/commons-codec/1.5/commons-codec-1.5.jar
Resolving central.maven.org (central.maven.org)... 151.101.204.209
Connecting to central.maven.org (central.maven.org)|151.101.204.209|:443... connected.
ERROR: no certificate subject alternative name matches requested host name central.maven.org.
To connect to central.maven.org insecurely, use `--no-check-certificate'.
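
For readers unfamiliar with this wget error: it means that none of the names listed in the server certificate's subject alternative name (SAN) extension covered the host name that was requested. The following is only a minimal Python sketch of that kind of check. The SAN entries are hypothetical placeholders (the issue does not show the real certificate), and the matching is simplified to exact names plus a wildcard in the leftmost label.

```python
from typing import Sequence


def san_matches(hostname: str, san: str) -> bool:
    """Simplified SAN check: exact match, or a '*.' wildcard covering one leftmost label."""
    hostname = hostname.lower().rstrip(".")
    san = san.lower().rstrip(".")
    if san.startswith("*."):
        # A wildcard stands for exactly one label: drop the first label and compare the rest.
        _, _, rest = hostname.partition(".")
        return rest != "" and rest == san[2:]
    return hostname == san


def host_covered(hostname: str, sans: Sequence[str]) -> bool:
    """Return True if any SAN entry covers the requested host name."""
    return any(san_matches(hostname, san) for san in sans)


# Hypothetical SAN entries, for illustration only (NOT taken from the real certificate).
example_sans = ["repo.example.org", "*.example.net"]

print(host_covered("central.maven.org", example_sans))
print(host_covered("cdn.example.net", example_sans))
```

When no entry covers the requested name, a client such as wget refuses the connection unless certificate checking is explicitly disabled, which is what the log above reports.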

Next, we created our own script and added it as a custom bootstrap action. The script was picked up, but it failed again, this time reporting that it could not remove commons-codec-1.4.jar.

Right now, we are trying to pass 1.5 as a parameter to see whether that works. We will post an update here as soon as possible, whether or not it fixes the problem.

Also, I’d like to point out that we are on sa-east-1.

Project:
Stream Enrich

Version:

Expected behavior:

EMR bootstrapping completes and the cluster is built.

Actual behavior: EMR fails at Bootstrap.

Steps to reproduce:

  1. Start a new EMR cluster using snowplow-ami4-bootstrap-0.2.0.sh as the bootstrap script.

  2. Change http:// to https:// in snowplow-ami4-bootstrap-0.2.0.sh and run it as you normally would.

  3. Change https://central to https://repo1 in snowplow-ami4-bootstrap-0.2.0.sh and run it as you normally would.

  4. Create a new self-hosted bootstrap script and add it to your EMR cluster's bootstrap actions. Make sure the new script contains the same changes as above.
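
The URL changes in steps 2 and 3 can be sketched as a small text rewrite. This is an illustration only, not the maintainers' actual hotfix: the function name `switch_to_https` and the sample line are hypothetical, since the contents of snowplow-ami4-bootstrap-0.2.0.sh are not shown in this issue, and the issue does not say whether this change alone resolved the failure.

```python
import re

# Assumption: the script refers to Maven Central through this legacy host, as in the logs above.
LEGACY_URL = re.compile(r"https?://central\.maven\.org/")
# Host named in step 3 of the issue.
REPLACEMENT = "https://repo1.maven.org/"


def switch_to_https(script_text: str) -> str:
    """Return script_text with legacy Maven Central URLs pointed at repo1 over HTTPS."""
    return LEGACY_URL.sub(REPLACEMENT, script_text)


# Hypothetical line resembling a download step; the real script may look different.
sample = "wget http://central.maven.org/maven2/commons-codec/commons-codec/1.5/commons-codec-1.5.jar"
print(switch_to_https(sample))
```

Because the pattern only matches the legacy host, running the rewrite a second time leaves already-updated URLs untouched.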

Cheers!

Issue Analytics

  • State: closed
  • Created 4 years ago
  • Reactions: 2
  • Comments: 6 (2 by maintainers)

Top GitHub Comments

hizumisen commented, Jan 16, 2020 (1 reaction)

@chuwy thanks, our process has finished correctly.

chuwy commented, Jan 16, 2020 (1 reaction)

Hi @rcapp. We hotfixed the AMI4 script (the AMI5 script, which has been used for releases since ~2018, was updated yesterday), so your pipeline should start properly now.
