question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

processFulltextAssetDocument graphic url incorrectly including temporary (sub-)directory

See original GitHub issue

Using processFulltextAssetDocument we can download a zip with the XML and related image resources.

The url in the XML seem to include a temporary sub-directory, e.g. QwmZmbXzY9.lxml_data/image-1.png instead of image-1.png

Example document 003525v1 (bioRxiv 10k training), generates the following partial TEI XML.

<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Network-based analysis of omic data to model the processes connecting genetic variation to disease.</figDesc><graphic url="QwmZmbXzY9.lxml_data/image-1.png" coords="12,72.00,178.57,319.30,504.00" type="bitmap" /></figure>

(The zip file doesn’t contain any subdirectories)

Example command:

curl -v \
  --output "003525v1.zip" \
  --form input=@003525v1.pdf \
  https://<host>:<port>/api/processFulltextAssetDocument

(I have only tried that with the cloud.science-miner.com instance)

BTW the FAQ seem to be listing the API as deprecated. Not sure if that is still correct?

Issue Analytics

  • State:open
  • Created 2 years ago
  • Reactions:1
  • Comments:10 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
suyogrichacommented, Mar 10, 2022

Patrice is it possible for you to update this on https://hub.docker.com/r/grobid/grobid/

1reaction
suyogrichacommented, Feb 17, 2022

Thanks a lot, Patrice. You are amazing. Any chances of updating this to https://hub.docker.com/r/grobid/ ?

Actually, I wish to use https://grobid.readthedocs.io/en/latest/Deep-Learning-models/ too and I am on M1 Macbook Max (Apple Silicon). It would be too complex for me to install everything.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Moving WordPress from Sub-Directory to Root Directory
Under Tools > Update URLs, type the website address containing the subdirectory in the Old URL field and the website without the subdirectory...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found