Add dataset loading capabilities
See original GitHub issueFrom what I gather, there’s no go-to utility to load a matrix / dataset from disk without resorting to the fromJSON
function of DenseMatrix
and SparseMatrix
. It would be great if we had a clean dataset loading interface supporting .csv
s, the MatrixMarket
format (very popular in R), and others alike.
Thanks! I would be happy to discuss this further and/or lend y’all a hand
Issue Analytics
- State:
- Created 4 years ago
- Comments:7 (3 by maintainers)
Top Results From Across the Web
Create a dataset loading script - Hugging Face
The first step is to add some information, or attributes, about your dataset in DatasetBuilder._info() . The most important attributes you should specify...
Read more >About loading data into existing feature classes and tables
You can load data into existing feature classes and tables using either the Object Loader or the Simple Data Loader. This topic compares...
Read more >17.1 Creating Applications with Data Loading Capability
Create applications with data loading capability to enable end users to dynamically import data into a table within any schema to which the...
Read more >Writing custom datasets - TensorFlow
Add an entry for your dataset into DATASET_EXTRAS in setup.py . This makes it so that users can do, for example, pip install...
Read more >7. Dataset loading utilities — scikit-learn 1.2.0 documentation
The sklearn.datasets package embeds some small toy datasets as introduced in the Getting Started section. This package also features helpers to fetch larger ......
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Thanks for your offer 👍
Before implementing anything I think it will be good to write out the various use cases we see. That way we can see what kind of API makes sense. What pops in my mind:
fromCSV
andtoCSV
functionality.fromMTX
andtoMTX
?We should be careful not to reinvent the wheel here, and embrace API’s and libraries that are already out there. The idea for a “simple”
fromCSV(src)
may turn out not to be so simple: you need to built in support for nodejs and the browser, people will need to be able to send authentication headers and CORS headers etc etc.It would be nice if we can allow people to use their rest client of choice and embrace that instead of creating a full blown rest client ourselves. So usage can look like:
@danielruss , I actually never got around it due to my year getting a little crazy. If you want to give it a stab, I’d say go ahead 😃