Monitoring with Prometheus and Grafana

I’m looking into adding monitoring for Pangeo, and would like to add a few metrics for Dask Gateway. I’m very new to Prometheus and Grafana, so apologies for any mistakes. Here are some rambling thoughts as I work through things. https://github.com/TomAugspurger/pangeo-cloud-federation/blob/metrics/gke/values.yaml has some Helm configuration for the Pangeo deployment I’m testing on.

A couple of questions come up immediately:

  1. Dask exposes Prometheus metrics at scheduler-address/metrics, but Dask Gateway’s schedulers are ephemeral. How do we dynamically notify Grafana / Prometheus that it should add a datasource at a given address? Should Dask Gateway act as a broker here?
  2. In my setup I’m using JupyterHub auth. Will all Prometheus requests need to go through JupyterHub to access the dashboard?

I’ll continue to post here as I work through things.

cc @droctothorpe (who mentioned prometheus in https://github.com/dask/dask-gateway/pull/258) and @consideRatio

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Comments: 15 (14 by maintainers)

Top GitHub Comments

TomAugspurger commented, Jun 24, 2020 (2 reactions)

Hey, cool, that did it! Thanks!

[Screenshot, 2020-06-24: the resulting chart]

Not the most impressive chart but it’s something 😃

For reference, I added the annotation manually with:

kubectl -n metrics annotate service dask-gateway-9175cdffbe5c4c48bd469214522ca714 prometheus.io/scrape='true' prometheus.io/path='/metrics'
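
Because Dask Gateway's scheduler services are ephemeral, the same annotation would have to be repeated for each new service. The following is only a rough sketch of how that might be scripted, not what was done in this thread. It assumes that scheduler services can be recognized by a dask-gateway- name prefix (as in the command above), and the helper name annotate_cmds is hypothetical. It prints the commands rather than running them, so they can be reviewed first.

```shell
#!/usr/bin/env bash
# Sketch only. Assumptions not taken from the thread: scheduler services are
# identified by the "dask-gateway-" name prefix, and annotate_cmds is a
# hypothetical helper name.

annotate_cmds() {
  # Reads service names on stdin, one per line, in the "service/<name>" form
  # printed by `kubectl get services -o name`, and prints one
  # `kubectl annotate` command for each service whose name matches the prefix.
  local namespace="$1"
  local name
  while IFS= read -r name; do
    name="${name#service/}"   # drop the "service/" resource prefix if present
    case "$name" in
      dask-gateway-*)
        # --overwrite keeps the command from failing if the annotation exists.
        printf "kubectl -n %s annotate --overwrite service %s prometheus.io/scrape='true' prometheus.io/path='/metrics'\n" \
          "$namespace" "$name"
        ;;
    esac
  done
}

# Preview the commands for the "metrics" namespace:
#   kubectl -n metrics get services -o name | annotate_cmds metrics
# Run them once they look right:
#   kubectl -n metrics get services -o name | annotate_cmds metrics | sh
```

This only covers services that exist when the script runs. New clusters would still need the annotation applied at creation time, for example through the deployment's configuration.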

TomAugspurger commented, Jul 15, 2020 (1 reaction)

For reference, this was implemented for pangeo in https://github.com/pangeo-data/pangeo-cloud-federation/pull/626/ and https://github.com/pangeo-data/pangeo-cloud-federation/pull/648. The key bit was setting the prometheus.io/scrape: 'true' annotation on all the pods.
