Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Better node identification

See original GitHub issue

Describe the feature

As proven by the necessity to add a hash to a test node’s unique_id, the current model of node identification could use some tweaking.

This was discussed in previous ticket comments: https://github.com/fishtown-analytics/dbt/pull/3335#discussion_r631213752 https://github.com/fishtown-analytics/dbt/pull/3335#discussion_r629693671

Describe alternatives you’ve considered

This isn’t well fleshed out yet, but the ideas that come immediately to mind are:

re-work the existing human readable unique_id so the naming scheme is actually unique
replace the unique_id with a full hash and include enough tooling that we don’t have to manage hashes by hand in tests, etc.

Who will this benefit?

This will make development simpler and eliminate the need for post-parsing checks like this.

Are you interested in contributing this feature?

Issue Analytics

State:
Created 2 years ago
Reactions:1
Comments:6 (4 by maintainers)

Top GitHub Comments

1reaction

jtcohen6commented, Mar 18, 2022

@nathaniel-may Thank you again for this excellent comment. While thinking about this on the train, I realized that the code to fix this one is super straightforward and self-contained, so I gave it a go in https://github.com/dbt-labs/dbt-core/pull/4898

1reaction

iknox-facommented, May 12, 2021

@jtcohen6 I see where you’re coming from, but it’s actually more about the unique_id than the name. For that reason I’m thinking we should keep all nodes in-play here.

Right now we have three+ ways to ID a node: unique_id unique_id + hash, and name. I’d love it if we could get that down to one human readable and unique identifier, but I rather doubt it will stay human readable* as we add more to the node content.

If that is the case, uniform logic to handle unique_ids as a hash and name as a human readable label seems like a likely solution and would include all nodes.

*unless you want a Donaudampfschifffahrtselektrizitätenhauptbetriebswerkbauunterbeamtengesellschaft

Top Results From Across the Web

An influential node identification method considering multi ...

To improve the accuracy of identifying influential nodes, many improved centrality methods have been proposed. The idea of multi-attribute ...

Identifying Important Nodes in Complex Networks ... - NCBI

Several importance measures have been proposed from diverse perspectives to identify crucial nodes more accurately.

Identifying Important Nodes in Complex Networks ... - Hindawi

Abstract. Assessing and measuring the importance of nodes in a complex network are of great theoretical and practical significance to improve the robustness ......

Node Identification in Wireless Network Based on ...

Aiming at the problem of node identification in wireless networks, a method of node identification based on deep learning is proposed, which starts...

The risk of node re-identification in labeled social graphs

In particular, we ask: Given a graph topology, how much better does a node re-identification attack perform when the node attributes are ...