question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Cross site indexing of content

See original GitHub issue

Feature Request

In response to this discussion https://twitter.com/DockstoreOrg/status/937730314746712064

It would be great to have a community developed, cross index of Dockerstore (and other container resource) content. I can add some detail about Singularity Hub / Registry, is that helps.

  • Singularity Registry and Hub both serve image manifests to download images. Singularity Hub comes from Google Cloud Storage, and Singularity Registry comes from a served filesystem.
  • The images themselves serve a json manifest with inspect that conforms to http://jsonapi.org/format/
  • The manifests match the models in the database, not any kind of standard.

Minimally, given that a manifest is associated with some kind of container content, we would want to be able to search them. The general container naming convention that Singularity Hub uses:

<registry>/<namespace>/<container>:<digest>

Is also a nice search strategy in that it maps well to any kind of storage.

I don’t have specific suggestions or ideas, but just want to open up the conversation. I am an advocate for a strategy that is easy, maybe even fun, and can be flexible to allow for many different resources (e.g., a formal registry, or a Github repository)

┆Issue is synchronized with this Jira Story ┆fixVersions: Dockstore 2.X ┆friendlyId: DOCK-299 ┆sprint: Backlog ┆taskType: Story

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Reactions:1
  • Comments:10 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
denis-yuencommented, Dec 4, 2017

Ok, so if I understand correctly. Individual institutions can spin up a Singularity Registry to provide Singularity containers in addition to using those from Singularity Hub. Currently, you have a strategy of recording what sites have Singularity Manifests by listing them in GitHub and generating a page using GitHub Pages based on that information.

We have a similar idea in that we were thinking of creating a Badge system to validate and check that systems that implement the Tool Registry Schema are valid and then creating list of them.

1reaction
vsochcommented, Dec 4, 2017

Singularity Hub is a cloud service that provides building, from Github repos. Singularity Registry is optimized for a local institution to deploy, and then push images to it. Both serve the same manifests for downloading, but the registry comes from a filesystem, and Singularity Hub is Google Cloud Storage.

Additional functions to parse the manifests that the registries serve (to then make them searchable from one box) would fit the model I was describing. If we extend that model to manifests of different kinds, then I think we have a good start!

But note the strategy I came up with is as simple as I possible could do - the entire page I linked is served statically (rendered automatically) via Github pages, and other registries are added by adding a markdown file and doing a PR. Any method we do to collect manifests should be that simple - if we have a server, then it could be just an interactive web interface to do it.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Handling legitimate cross-domain content duplication
For some sites, there are legitimate reasons to duplicate content across different websites—for instance, to migrate to a new domain name using a...
Read more >
Cross Site Scripting (XSS) - OWASP Foundation
The data is included in dynamic content that is sent to a web user without being validated for malicious content. The malicious content...
Read more >
Page Indexing report - Search Console Help - Google Help
See which pages Google can find and index on your site, and learn about any indexing problems encountered. Open Page Indexing report.
Read more >
The 5 Most Common Google Indexing Issues by Website Size
Crawled, currently not indexed (quality or crawl budget issue). Duplicate content. Crawl budget issue. Soft 404. Crawl issue. Medium Websites.
Read more >
How Website Indexing Works (And How To Make It Work Better)
In particular, they've got their eyes on new URLs, sites that have undergone changes and dead links. As the web crawlers come across...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found