Better node identification
See original GitHub issueDescribe the feature
As proven by the necessity to add a hash to a test node’s unique_id
, the current model of node identification could use some tweaking.
This was discussed in previous ticket comments: https://github.com/fishtown-analytics/dbt/pull/3335#discussion_r631213752 https://github.com/fishtown-analytics/dbt/pull/3335#discussion_r629693671
Describe alternatives you’ve considered
This isn’t well fleshed out yet, but the ideas that come immediately to mind are:
- re-work the existing human readable unique_id so the naming scheme is actually unique
- replace the unique_id with a full hash and include enough tooling that we don’t have to manage hashes by hand in tests, etc.
Who will this benefit?
This will make development simpler and eliminate the need for post-parsing checks like this.
Are you interested in contributing this feature?
Issue Analytics
- State:
- Created 2 years ago
- Reactions:1
- Comments:6 (4 by maintainers)
Top Results From Across the Web
An influential node identification method considering multi ...
To improve the accuracy of identifying influential nodes, many improved centrality methods have been proposed. The idea of multi-attribute ...
Read more >Identifying Important Nodes in Complex Networks ... - NCBI
Several importance measures have been proposed from diverse perspectives to identify crucial nodes more accurately.
Read more >Identifying Important Nodes in Complex Networks ... - Hindawi
Abstract. Assessing and measuring the importance of nodes in a complex network are of great theoretical and practical significance to improve the robustness ......
Read more >Node Identification in Wireless Network Based on ...
Aiming at the problem of node identification in wireless networks, a method of node identification based on deep learning is proposed, which starts...
Read more >The risk of node re-identification in labeled social graphs
In particular, we ask: Given a graph topology, how much better does a node re-identification attack perform when the node attributes are ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
@nathaniel-may Thank you again for this excellent comment. While thinking about this on the train, I realized that the code to fix this one is super straightforward and self-contained, so I gave it a go in https://github.com/dbt-labs/dbt-core/pull/4898
@jtcohen6 I see where you’re coming from, but it’s actually more about the
unique_id
than thename
. For that reason I’m thinking we should keep all nodes in-play here.Right now we have three+ ways to ID a node:
unique_id
unique_id
+ hash, andname
. I’d love it if we could get that down to one human readable and unique identifier, but I rather doubt it will stay human readable* as we add more to the node content.If that is the case, uniform logic to handle
unique_id
s as a hash andname
as a human readable label seems like a likely solution and would include all nodes.*unless you want a Donaudampfschifffahrtselektrizitätenhauptbetriebswerkbauunterbeamtengesellschaft