Tracking adoption of AzureMLClass
Hello,
We’d like to add a minimal logging capability to the AzureMLCluster. This would only report the following name-value pair: {"AzureMLCluster-Dask": "0.1"} (or whatever the current version is) to the AzureML Service and nothing else. Once the cluster is instantiated, NO information about what users do on the Dask cluster, or anything else, would be logged.
The only use of this signal would be to account for the adoption of the AzureMLClass: how many jobs were submitted to AzureML using the AzureMLCluster, how many users used it, and how many core-hours it generated. All of this information would only be used in aggregated form.
We have a solution ready to go; we are just looking for your agreement that the above is OK.
Issue Analytics
- State:
- Created 3 years ago
- Comments: 6 (6 by maintainers)
Top GitHub Comments
@martindurant that is correct – this is part of the #67 PR. @TomAugspurger @jrbourbeau Totally fine to make this opt-out. Will update the documentation and provide an opt-out mechanism.
Thank you all!
Thanks for raising this issue @drabastomek. In general, logging the version of dask-cloudprovider used seems reasonable and would be a valuable piece of information to have.
IIUC we don’t log this type of information with other cluster managers (e.g. FargateCluster doesn’t log this information with AWS). Would you be okay adding some documentation so users are aware of what is being logged and some way for users to opt out if they don’t want to send this information (e.g. adding a log_version option, or some other name, which can be set to False)?
EDIT: Ah, I missed @TomAugspurger’s comment right before mine. Looks like we’re all in favor of an opt-out option and some documentation describing what’s logged.
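For illustration only, the opt-out discussed above could look roughly like the sketch below. It is not the code from the PR: the function name build_adoption_tags and the constant TAG_NAME are made up for this example, and log_version is just the option name suggested in the comment above. The sketch assumes the only thing ever reported is the single name-value pair from the issue description.

```python
from typing import Dict

# Hypothetical names: TAG_NAME and build_adoption_tags are illustrative only
# and are not part of the dask-cloudprovider API.
TAG_NAME = "AzureMLCluster-Dask"


def build_adoption_tags(version: str, log_version: bool = True) -> Dict[str, str]:
    """Return the tags that would be attached to the submitted job.

    With log_version=True (the default, i.e. opt-out) the result is the
    single name-value pair described in the issue. With log_version=False
    nothing is reported at all.
    """
    if not log_version:
        return {}
    return {TAG_NAME: version}


if __name__ == "__main__":
    # Default behaviour: only the version tag is produced.
    print(build_adoption_tags("0.1"))
    # Opted out: no tags are produced.
    print(build_adoption_tags("0.1", log_version=False))
```

Keeping the payload construction in one small function would make it easy to document exactly what is sent and to verify that opting out sends nothing.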