Train your own data

Hi,

I want to use ADAPET on my own data, so I tried following your recommendation with a toy dataset. I took a small subset of data/BoolQ/train.jsonl and removed the “idx” key so that it resembles my own data.

Each line of the file looks like this:

{"question": "is ghost in the shell based on the anime", "passage": "Ghost in the Shell -- Animation studio Production I.G has produced ....", "label": false}
...
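For anyone reproducing this setup, the subset step described above could be sketched roughly as follows. This is a minimal illustration, not part of ADAPET: the helper name `make_toy_subset`, its parameters, and the default of 32 examples are all hypothetical, and it only assumes the input is a JSON Lines file such as data/BoolQ/train.jsonl.

```python
import json


def make_toy_subset(src_path, dst_path, num_examples=32, drop_keys=("idx",)):
    """Copy the first `num_examples` JSONL records, dropping `drop_keys` from each.

    Hypothetical helper for building a toy dataset; not part of ADAPET.
    Returns the number of records written.
    """
    written = 0
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            if written >= num_examples:
                break
            line = line.strip()
            if not line:
                continue  # skip blank lines in the source file
            record = json.loads(line)
            for key in drop_keys:
                record.pop(key, None)  # ignore records that lack the key
            dst.write(json.dumps(record, ensure_ascii=False) + "\n")
            written += 1
    return written
```

Calling `make_toy_subset("data/BoolQ/train.jsonl", "toy/train.jsonl")` would write records with only the remaining keys (here "question", "passage" and "label"); the output path is just an example.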

I used the command you provided in the README.md as follows:

python cli.py --data_dir $data_dir --pattern '"[TEXT1]" has an answer in "[TEXT2]"? "[LBL]"' --dict_verbalizer '{"true": "yes", "false": "no"}'

and that command throws this error:

Traceback (most recent call last):
  File "cli.py", line 52, in <module>
    train(config)
  File "~/ADAPET/src/train.py", line 59, in train
    batcher = Batcher(config, tokenizer, config.dataset)
  File "~/ADAPET/src/data/Batcher.py", line 21, in __init__
    self.dataset_reader = DatasetReader(config, tokenizer, dataset)
  File "~/ADAPET/src/data/DatasetReader.py", line 44, in __init__
    self.dataset_reader = GenericReader(self.config, tokenizer)
  File "~/ADAPET/src/data/GenericReader.py", line 24, in __init__
    self.check_pattern(self.config.pattern)
  File "~/ADAPET/src/data/GenericReader.py", line 45, in check_pattern
    raise ValueError("Need at least one text ")
ValueError: Need at least one text
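When debugging an error like this, one option is to inspect the pattern string on its own, outside ADAPET, to confirm which placeholders it contains. The sketch below does only that. It does not reproduce what `GenericReader.check_pattern` actually does, and the regular expression and the helper name `find_placeholders` are assumptions based solely on the `[TEXT1]`, `[TEXT2]` and `[LBL]` placeholders in the command above.

```python
import re

# Assumed placeholder syntax, inferred from the command above:
# [TEXT<number>] for text slots and [LBL] for the label slot.
PLACEHOLDER_RE = re.compile(r"\[(TEXT\d+|LBL)\]")


def find_placeholders(pattern):
    """Return the placeholder names found in `pattern`, in order of appearance."""
    return PLACEHOLDER_RE.findall(pattern)


pattern = '"[TEXT1]" has an answer in "[TEXT2]"? "[LBL]"'
found = find_placeholders(pattern)
text_slots = [name for name in found if name.startswith("TEXT")]

print(found)       # ['TEXT1', 'TEXT2', 'LBL']
print(text_slots)  # ['TEXT1', 'TEXT2']
```

Here the pattern string does contain two text slots and one label slot, so a check like this can help separate a malformed pattern from a problem elsewhere.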

I would greatly appreciate any guidance on what I am doing wrong.

Thank you!

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 9 (5 by maintainers)

Top GitHub Comments

1 reaction
rrmenon10 commented, Jan 19, 2022

Whoops, my mistake. Thought it was fixed. I’ll reopen it and attend to it soon.

1 reaction
rrmenon10 commented, Jan 19, 2022

Hi @Afnan-Sultan, I thought I had replied to the previous issue with the same fix you mentioned 😅. Glad to know you were able to figure it out!
