question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Add the tracked value which can show the dimension collapse problem

See original GitHub issue

We may follow the idea from LeCun’s paper and use the percentage of “big” singular values in SVD Technically we may put it inside the EmbeddingMetric

Another approach is to track how many principal components in PCA we need to keep 95% of accuracy. (What is the policy for the case when PCA improves accuracy?)

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
dapladoccommented, Dec 6, 2022

I like the proposed design. Just want to double-check: is it right that you want to call calc_topological_metrics somewhere nearby calc_retrieval_metrics in EmbeddingsMetrics class? If so, it’s okay.

Yes, calc_topolotical_metrics will be called near the calc_retrieval_metrics in EmbeddingMetrics.

The only thing that I would change is the current confusing signature. What about this? def count_meaningful_components(embeddings, examplained_variance_to_keep: Tuple[int, ...]) -> n_components Or you can suggest other ideas.

I’m ok with this naming.

0reactions
AlekseyShcommented, Dec 6, 2022

@dapladoc I like the proposed design. Just want to double-check: is it right that you want to call calc_topological_metrics somewhere nearby calc_retrieval_metrics in EmbeddingsMetrics class? If so, it’s okay.

The only thing that I would change is the current confusing signature. What about this? def count_meaningful_components(embeddings, examplained_variance_to_keep: Tuple[int, ...]) -> n_components Or you can suggest other ideas.

As for the issue for DirectCLR, then we need to add similar issues for the hundreds of other architectures, so, I would not do it.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Understanding Dimensional Collapse in Contrastive Self ...
Here, we show that dimensional collapse also happens in contrastive learning. In this paper, we shed light on the dynamics at play in ......
Read more >
UNDERSTANDING DIMENSIONAL COLLAPSE IN CON
We empirically show that contrastive self-supervised learning suffers from dimensional collapse whereby all the embedding vectors fall into a lower-dimensional ...
Read more >
[GA4] Analytics dimensions and metrics - Google Support
This article describes the dimensions and metrics that appear in your reports and explorations and in the audience and segment builders. You can...
Read more >
Dimensions in Drawings - 2019 - SolidWorks Web Help
You can use the Autodimension tool to insert reference dimensions into drawing views as baseline, chain, and ordinate dimensions. The Autodimension tool works ......
Read more >
Exploring data in Looker | Google Cloud
Explores can be expanded or collapsed within the Explore menu to display their views. Within each Explore are views, which are groupings of...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found