mets:file URL handling: keep remote links


Currently, with workspaces, we can either keep images on the remote side by using HTTP URLs in mets:file/mets:FLocat/@xlink:href (which means they have to be downloaded again and again during processing), or get local filesystem copies with relative paths by cloning with download=True or by bagging and spilling (but then the source information is lost forever).

When processing is finished and I want to make my workspace public, I now have to upload my shiny new results in addition to the original images, which I might not even have the rights to publish myself. It would be much better if the original remote URLs could be used again for that, even if I used local copies in between.

METS-XML allows that: mets:FLocat is declared with maxOccurs="unbounded" within mets:file, with the following documented semantics:

The file element provides access to content files for a METS object. A file element may contain one or more FLocat elements, which provide pointers to a content file, and/or an FContent element, which wraps an encoded version of the file. Note that ALL FLocat and FContent elements underneath a single file element should identify/contain identical copies of a single file.

So why don’t we keep two FLocat elements in that case: one relative path for local processing and one remote URL for provenance/bookkeeping? When making results public, the local copies could be disposed of again, e.g. when bagging with --manifestation-depth=partial.
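To make the proposal concrete, here is a minimal sketch of a mets:file that keeps its remote FLocat and gains a second one for the local copy. It uses only Python's standard xml.etree.ElementTree, not the OCR-D API. The helper name add_local_flocat, the file ID, the example.org URL, and the choice of LOCTYPE="OTHER" with OTHERLOCTYPE="FILE" for the local entry are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)

HREF = f"{{{XLINK_NS}}}href"
FLOCAT = f"{{{METS_NS}}}FLocat"


def add_local_flocat(file_el, local_path):
    """Append an extra mets:FLocat for a local copy (hypothetical helper).

    Existing FLocat children, such as the remote URL, are left untouched.
    """
    flocat = ET.SubElement(file_el, FLOCAT)
    # Assumption: local copies are marked LOCTYPE="OTHER" / OTHERLOCTYPE="FILE".
    flocat.set("LOCTYPE", "OTHER")
    flocat.set("OTHERLOCTYPE", "FILE")
    flocat.set(HREF, local_path)
    return flocat


# A mets:file that initially only knows its remote source (made-up values).
file_el = ET.Element(
    f"{{{METS_NS}}}file", {"ID": "OCR-D-IMG_0001", "MIMETYPE": "image/tiff"}
)
ET.SubElement(
    file_el, FLOCAT,
    {"LOCTYPE": "URL", HREF: "https://example.org/images/0001.tif"},
)

# After downloading, record the local path without discarding the remote URL.
add_local_flocat(file_el, "OCR-D-IMG/0001.tif")

hrefs = [fl.get(HREF) for fl in file_el.findall(FLOCAT)]
print(ET.tostring(file_el, encoding="unicode"))
```

With both locations recorded, a tool could prefer the relative path while processing and fall back to the URL when the local copy is gone.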

Issue Analytics

  • State: open
  • Created 4 years ago
  • Reactions: 1
  • Comments: 6 (3 by maintainers)

Top GitHub Comments

1 reaction
kba commented, Oct 9, 2019

We could also implement the local_filename stuff as additional FLocat as you propose and have a processor that strips the METS down to ZVDD requirements.
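A stripping step of that kind could look roughly like the sketch below. It is not an existing OCR-D processor, and the actual ZVDD requirements are not spelled out in this thread; the sketch simply assumes the goal is to drop local FLocat entries wherever a remote http(s) one exists for the same file. The function names and the sample METS are hypothetical.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
HREF = f"{{{XLINK_NS}}}href"
FLOCAT = f"{{{METS_NS}}}FLocat"
FILE = f"{{{METS_NS}}}file"


def is_remote(flocat):
    """Treat an FLocat as remote if its href is an http(s) URL."""
    return (flocat.get(HREF) or "").startswith(("http://", "https://"))


def strip_local_flocats(mets_root):
    """Remove local FLocat entries from files that also have a remote one.

    Files with only local locations are left alone so they stay resolvable.
    Returns the number of removed FLocat elements.
    """
    removed = 0
    # Materialize the iterator first, since the tree is modified in the loop.
    for file_el in list(mets_root.iter(FILE)):
        flocats = file_el.findall(FLOCAT)
        if not any(is_remote(fl) for fl in flocats):
            continue
        for fl in flocats:
            if not is_remote(fl):
                file_el.remove(fl)
                removed += 1
    return removed


# Made-up sample: one file with both locations, one with a local copy only.
SAMPLE = """<mets:mets xmlns:mets="http://www.loc.gov/METS/"
                       xmlns:xlink="http://www.w3.org/1999/xlink">
  <mets:fileSec>
    <mets:fileGrp USE="OCR-D-IMG">
      <mets:file ID="IMG_0001">
        <mets:FLocat LOCTYPE="URL" xlink:href="https://example.org/images/0001.tif"/>
        <mets:FLocat LOCTYPE="OTHER" OTHERLOCTYPE="FILE" xlink:href="OCR-D-IMG/0001.tif"/>
      </mets:file>
      <mets:file ID="OCR_0001">
        <mets:FLocat LOCTYPE="OTHER" OTHERLOCTYPE="FILE" xlink:href="OCR-D-OCR/0001.xml"/>
      </mets:file>
    </mets:fileGrp>
  </mets:fileSec>
</mets:mets>"""

root = ET.fromstring(SAMPLE)
removed = strip_local_flocats(root)
remaining = [fl.get(HREF) for fl in root.iter(FLOCAT)]
print(removed, remaining)
```

Keeping files that only have a local location is a deliberate choice here: newly generated results have no remote source yet, so removing their only FLocat would orphan them.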

0 reactions
bertsky commented, Oct 9, 2020

Should be revisited now that the OLA-HD client has arrived.


