separator argument ImageClassificationData.from_data_frame for multi label dataset
See original GitHub issue🚀 Feature
for plant pathology kaggle dataset, after loading the csv/df with ImageClassificationData.from_data_frame I am getting dm.num_classes 11 but it should be 6.
There should be a way to pass how labels are separated in the df/csv - by space, comma or any other character.
Motivation
Pitch
Alternatives
Additional context
Issue Analytics
- State:
- Created 2 years ago
- Comments:6 (5 by maintainers)
Top Results From Across the Web
ImageClassificationData — Flash documentation
Load the ImageClassificationData from CSV files containing image file paths ... data_module_kwargs ( Any ) – Additional keyword arguments to provide to the ......
Read more >Multi-label image classification Tutorial with Keras ...
Tutorial on using Keras for Multi-label image classification using flow_from_dataframe both with and without Multi-output model. Source: MIML Dataset ...
Read more >Keras: multi-label classification with ImageDataGenerator
In this part I'll walk you through a multi-class classification problem step by step. The example will use the VOC2012 dataset which consist...
Read more >Keras Multi Label Classification Issue - Stack Overflow
I've implemented a multi-label MLP in Keras with tensorflow and I'm ... can only take single arrays as a parameter, which I have...
Read more >Multi-Label Classification | TheAILearner
To prepare the dataset, we need images and corresponding genre information. For this, we need to extract the genre information from the ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found

Hi @aniketmaurya great suggestion! It’s worth noting that we don’t currently support this format at all. For multi-label, our CSV loading code assumes that each additional label is in a new column rather than a delimited string in a single column. Support for this, along with the ability to specify the delimiter, would be much appreciated. Is this something you would be willing to work on? The main class to be updated is here: https://github.com/PyTorchLightning/lightning-flash/blob/8f306d49b24aed09d37d083883f6c5636b26a45e/flash/core/data/data_source.py#L504
Closing this as plant pathology can now be loaded succesfully. We don’t support passing a seperator but it will be inferred and can be either a space or a comma 😃