[FEATURE] Integration with SageMaker
See original GitHub issue🚨🚨 Feature Request
- Related to an existing Issue
- A new implementation (Improvement, Extension)
Is your feature request related to a problem?
Include functionality that allows SageMaker to ingest data from Hub.
If your feature will improve HUB
SageMaker can easily fetch data from local filesystems and s3. By building an interface for SageMaker, we tie Hub to one of the most popular ML tools.
Description of the possible solution
We would have to duplicate a Hub dataset onto s3, which is then accessible to SageMaker.
For example, this is how SageMaker handles local files:
input_mode (str): The input mode that the algorithm supports
(default: 'File'). Valid modes: 'File' - Amazon SageMaker copies
the training dataset from the S3 location to a local directory.
'Pipe' - Amazon SageMaker streams data directly from S3 to the
container via a Unix-named pipe. This argument can be overriden
on a per-channel basis using
Issue Analytics
- State:
- Created 3 years ago
- Comments:11 (11 by maintainers)
Top Results From Across the Web
Create, Store, and Share Features with Amazon SageMaker ...
You can query, explore, and visualize features using Data Wrangler from Amazon SageMaker Studio. Feature Store supports combining data to produce, train, ...
Read more >Getting Started With Amazon SageMaker & ... - Tecton
Learn how to put ML features in production & obtain training data using Tecton's feature platform in an AWS SageMaker environment.
Read more >Getting started with Amazon SageMaker Feature Store
For this post, we focus on the integration of Feature Store with other Amazon SageMaker features to help you get started quickly.
Read more >AWS SageMaker Integration - Hopsworks Documentation
Connecting to the Feature Store from SageMaker requires setting up a Feature Store API key for SageMaker and installing the HSFS on SageMaker....
Read more >Amazon SageMaker integration | New Relic Documentation
Monitor your data and model in Amazon SageMaker, and send the metrics to CloudWatch · Example notebook: Monitoring bias drift and feature attribution...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi @mynameisvinn . I have couple of doubts regarding this feature.
Firstly, where does the scope of Hub end. Like using boto3 to move the dataset from Hub to S3 and then to SageMaker or are you thinking something else?
Secondly, this is a noob question. Are we trying to use SageMaker in training model as well? I think this doesn’t make sense as it doesnt work for all the end users right?
Lastly, since we can build data pipelines in Hub, can we try if we can integrate this with Kinesis Firehose for streaming data? I haven’t worked on Kinesis much, but I think this will be a promising feature
Sure @mynameisvinn