Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Support Clusters for Internal Structure?

See original GitHub issue

I don’t know if this is the same as #19 but I thought it might be useful to explain my use case.

Given a large codebase, some 350k LOC, (dis)organized into piles of packages, sometimes preventing a graph from being generated, since the dot file is too complex, I guess? I ended up trying to use max-bacon=1 and tried excluding some paths in the codebase, however that ends up removing a significant amount of nodes imported into the excluded files.

I saw the new cluster feature, and wondered if this might be a viable option to use clustering for internal packages as well - so that I can “collapse” significant chunks of the codebase if there are more than N nodes inside that package.

Does that make sense?

Issue Analytics

State:
Created 4 years ago
Reactions:1
Comments:8 (6 by maintainers)

Top GitHub Comments

2reactions

thebjorncommented, Nov 24, 2019

If I’m misusing the term,

No, you’re using it 100% correctly.

Pydeps started as a tool for helping us move from a repo/package containing all our code, to many smaller repos/packages (it’s an improvement, but brings its own set of challenges). I don’t think there is any easy solution to representing relationships in a 300+ Kloc code base in 2d., but manually reducing the graph can still give you useful insights…

I usually follow a workflow that looks something like this (using django-cms - ~50KLOC) as an example: pydeps cms --cluster (had to scale it down due to size):

cms1

then I look at the top and the bottom of the graph and remove nodes with high in/out degrees (since they’re significantly constraining the graph display algorithm that dot uses), e.g. a first pass gives: pydeps cms -Tpng --show-dot --cluster -x django* sekizai* south* cms.test*

cms2

…and repeating until I come to a graph that’s readable and interesting: pydeps cms --show-dot --cluster -x django* sekizai* south* cms.test* cms.exceptions* cms.constants* cms.publisher.query* cms.utils.conf* cms.utils.i18n* cms.admin.pageadmin* cms.api* -xx cms.admin

cms

We were having increasing issues with developers unintentionally introducing circular imports, so we look for arrows going the “wrong” way (i.e. up). During the extract-to-individual projects phase we’d use nodes that clustered together in the graph as an indication that they should probably move as a unit.

1reaction

NeilGirdharcommented, Jun 4, 2020

This would be really useful for me. In order to prevent circular imports, and to keep a logical program flow, it’s nice to ensure that internal packages import each other as a DAG. This is despite only requiring module imports to form a DAG. So, I’d like a way to collect modules into their respective packages and to make a graph of their dependencies.

@thebjorn Your last graph looks really nice and close to what I’d like to have, I think.

Top Results From Across the Web

What are Topic Clusters? (How They Improve SEO) - Terakeet

Content clusters provide contextual support for all pages within a group. They also lay the foundation for a strong internal linking structure.

What Are Topic Clusters and How Do They Work? - SEMrush

Topic clusters are multiple pieces of content that are grouped under one overarching topic. We'll teach you about how to cluster topics and ......

How Do Content Clusters Strengthen SEO? - SiteCare

Content clusters use topic modeling and internal linking to improve your content's user experience and boost your search performance. Here's how it's done....

What is and how to use the Topic Cluster strategy? - Hotmart

The clusters themselves may be interconnected by the internal links. Thus, you build a web of content that helps support the subject as...

Topic Clustering Guide to Level Up Your Content Strategy

Topic clustering is what helps you optimize your content strategy and improve ... Sufficient internal linking, improved website authority, ...