question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Setup dr-elephant with EMR

See original GitHub issue

I have compiled and deployed dr-elephant in EMR. It started successfully and i could tunnel to 8080 in my local.

compiled Version : default dr.elephant configuration EMR spark Version: EMR 5.0.0, Spark- 2.0.0

But i do not find any job information in the dr-elephant. I tried to run simple wordcount program and no information in the dr-elephant.

kerberos disabled in my EMR.

Elephant.conf port=8080 #Database configuration db_url=localhost db_name=drelephant db_user=hadoop db_password=hadoop

jvm_args=“-Devolutionplugin=enabled -DapplyEvolutions.default=true -Djava.net.preferIPv4Stack=true -mem 1024 -J-Xloggc:$project_root…/logs/elephant/dr-gc.date +'%Y%m%d%H%M' -J-XX:+PrintGCDetails”

metrics=true

Dr.Elephant Dashboard shows below information

Hello there, I’ve been busy! I looked through 0 jobs today. About 0 of them could use some tuning. About 0 of them need some serious attention!

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Comments:16

github_iconTop GitHub Comments

2reactions
krishnadgcommented, Aug 24, 2018

My spark fetcher looks something like this… our logs are being stored on the maternode at /var/log/… It seems that the fetcher is able to read the logs once the job has finished, but now I am having the issue that spark 2.x is not directly supported https://github.com/linkedin/dr-elephant/issues/389 <fetcher> <applicationtype>spark</applicationtype> <classname>com.linkedin.drelephant.spark.fetchers.SparkFetcher</classname> <params> <use_rest_for_eventlogs>true</use_rest_for_eventlogs> <should_process_logs_locally>false</should_process_logs_locally> </params> </fetcher> I think you can try true or false for the process locally, that didnt seem to change anything for me

1reaction
Sushant20commented, Aug 24, 2018

Hi @dmateusp , I got my dr.elephant up and running.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Tune Hadoop and Spark performance with Dr. Elephant and ...
This post demonstrates how to install Dr. Elephant and Sparklens on an Amazon EMR cluster and run workloads to demonstrate these tools' ...
Read more >
Tuning EMR Performance with Dr. Elephant and Sparklens
This post demonstrates how to install Dr. Elephant and Sparklens on an Amazon EMR cluster and run workloads to demonstrate these tools' ...
Read more >
Using Dr Elephant on Ephemeral EMR clusters - Google Groups
The only way i have been able to use it currently is to manually install dr.elephant on a EMR cluster, then run the...
Read more >
linkedin/dr-elephant - Gitter
@akshayrai : I want to install Dr. elephant EMR through bootstrap script. Now I should have Play installed before Dr. Elephant. when I...
Read more >
Spark Optimization Using Dr.Elephant - YouTube
Dr. Elephant is a performance monitoring and tuning tool for Hadoop and Spark. It automatically gathers all the metrics, runs analysis on ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found