Trouble running "Quick Start"-scripts
See original GitHub issueHi! Firstly, thanks for publishing your research and models! 😃
I have trouble evaluating the NER model with the given command CUDA_VISIBLE_DEVICES=0 python3 run_acener.py --model_type bertspanmarker ...
The output is a json-file with only one line: {"dev_best_f1": 0}
The last 3 lines of the log-output are:
02/04/2022 17:37:16 - INFO - __main__ - Training/evaluation parameters Namespace(adam_epsilon=1e-08, alpha=1, cache_dir='', config_name='', data_dir='../scierc/raw_data', dev_file='dev.json', device=device(type='cuda'), do_eval=True, do_lower_case=True, do_test=False, do_train=False, eval_all_checkpoints=True, evaluate_during_training=True, fp16=True, fp16_opt_level='O1', gradient_accumulation_steps=1, group_axis=-1, group_edge=False, group_sort=False, learning_rate=2e-05, lminit=True, local_rank=-1, logging_steps=5, max_grad_norm=1.0, max_mention_ori_length=8, max_pair_length=256, max_seq_length=512, max_steps=-1, model_name_or_path='../bert_models/scibert_scivocab_uncased', model_type='bertspanmarker', n_gpu=1, no_cuda=False, no_test=False, norm_emb=False, num_train_epochs=50.0, onedropout=True, output_dir='../sciner_models/sciner-scibert', output_results=True, overwrite_cache=False, overwrite_output_dir=True, per_gpu_eval_batch_size=16, per_gpu_train_batch_size=8, save_steps=2000, save_total_limit=1, seed=42, server_ip='', server_port='', shuffle=False, test_file='test.json', tokenizer_name='', train_file='train.json', use_full_layer=-1, warmup_steps=-1, weight_decay=0.0)
02/04/2022 17:37:16 - INFO - __main__ - Evaluate on test set
02/04/2022 17:37:16 - INFO - __main__ - Evaluate the following checkpoints: []
As you can see in the first line, I changed the original command in the following way:
--model_name_or_path ../bert_models/scibert_scivocab_uncased
I couldn’t find a folder “scibert-uncased”, so I downloaded the 4th model from huggingface as described in the “Training Script”-section (AllenAI) - is this maybe the wrong model?--data_dir ../scierc/raw_data
I downloaded the SciERC raw_data from their website to execute the evaluation on - is this the wrong dataset?
Issue Analytics
- State:
- Created 2 years ago
- Comments:8
Top Results From Across the Web
java - Gradle error with lack of mainClassName property in ...
* What went wrong: A problem was found with the configuration of task ':core:startScripts'. > No value has been specified for property 'mainClassName'....
Read more >Gradle - Multiple start script examples - Mkyong.com
gradle examples to show you how to create multiple start scripts or executable Java application. 1. Single Start Script. 1.1 In Gradle, you...
Read more >The Application Plugin - Gradle User Manual
Starts the application. startScripts — CreateStartScripts. Depends on: jar. Creates OS specific scripts to run the project as a JVM application.
Read more >Auto-start scripts on MicroPython - YouTube
Auto- start scripts on MicroPython ... Writing fast and efficient MicroPython ... PicoMite: Running BASIC on a Raspberry Pi Pico.
Read more >4 Solutions To Run Multiple Node.js or NPM Commands ...
json files with their own dependencies and individual NPM start scripts, but by simply chaining the two together with the & in between...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Thanks for the help, I got it running now 😃
For future reference (or you might add it to the repo-description): The problem was the wrong data path: you need SCIERC_processed and then refer to the json-folder as data_dir. Note that the data-directory-path needs to contain “scierc” in lowercase. Also, the model-path and the output-path have to be the same.
Same folder issues for evaluating the relation-extraction model:
cp ../sciner-scibert/ent_pred_test.json ../scierc/processed_data/json
Also, this was the setup it required:
You can try:
And then replace
downloaded_model_path
to the downloaded trained model path.