Add the tracked value which can show the dimension collapse problem
See original GitHub issueWe may follow the idea from LeCun’s paper and use the percentage of “big” singular values in SVD
Technically we may put it inside the EmbeddingMetric
Another approach is to track how many principal components in PCA we need to keep 95% of accuracy. (What is the policy for the case when PCA improves accuracy?)
Issue Analytics
- State:
- Created a year ago
- Comments:5 (5 by maintainers)
Top Results From Across the Web
Understanding Dimensional Collapse in Contrastive Self ...
Here, we show that dimensional collapse also happens in contrastive learning. In this paper, we shed light on the dynamics at play in ......
Read more >UNDERSTANDING DIMENSIONAL COLLAPSE IN CON
We empirically show that contrastive self-supervised learning suffers from dimensional collapse whereby all the embedding vectors fall into a lower-dimensional ...
Read more >[GA4] Analytics dimensions and metrics - Google Support
This article describes the dimensions and metrics that appear in your reports and explorations and in the audience and segment builders. You can...
Read more >Dimensions in Drawings - 2019 - SolidWorks Web Help
You can use the Autodimension tool to insert reference dimensions into drawing views as baseline, chain, and ordinate dimensions. The Autodimension tool works ......
Read more >Exploring data in Looker | Google Cloud
Explores can be expanded or collapsed within the Explore menu to display their views. Within each Explore are views, which are groupings of...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Yes,
calc_topolotical_metrics
will be called near thecalc_retrieval_metrics
inEmbeddingMetrics
.I’m ok with this naming.
@dapladoc I like the proposed design. Just want to double-check: is it right that you want to call
calc_topological_metrics
somewhere nearbycalc_retrieval_metrics
inEmbeddingsMetrics
class? If so, it’s okay.The only thing that I would change is the current confusing signature. What about this?
def count_meaningful_components(embeddings, examplained_variance_to_keep: Tuple[int, ...]) -> n_components
Or you can suggest other ideas.As for the issue for DirectCLR, then we need to add similar issues for the hundreds of other architectures, so, I would not do it.