Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Duplicate builds due to transitions of targets that don't depend on build settings

See original GitHub issue

Description of the problem:

Targets that don’t depend on a user defined build setting are effectively duplicated when depended upon by targets that depend on a build setting and use a configuration transition.

Bugs: what’s the simplest, easiest way to reproduce this bug? Please provide a minimal example if possible.

Check out https://github.com/aherrmann/bazel-configuration-rebuild/commit/2be2a622a57b7428fe8741505ba27a07a9c6a3a4

Follow the steps in the README: The target //:cat is independent of the user defined build flag //:flag. The target //:flag_cat on the other hand depends on the value of the flag and on //:cat as well.

The target //:cat is only built once, and cached under the same action cache key, if the configuration is set via command-line flag.

$ bazel clean --expunge; bazel build //:flag_cat --disk_cache=.cache/default --build_event_publish_all_actions --build_event_json_file=default.json

$ bazel clean --expunge; bazel build //:flag_cat --//:flag=different_flag --disk_cache=.cache/different --build_event_publish_all_actions --build_event_json_file=different.json

$ jq 'select(.id.actionCompleted.label == "//:cat")' <default.json | jq -s length
1

$ jq 'select(.id.actionCompleted.label == "//:cat")' <different.json | jq -s length
1

$ rg 'bin/cat.txt' .cache/*/ac
.cache/default/ac/b9/b966d571c54c4aa37fc5ae7629f95caaa01ad79474fdc406c9e9a79341c950f6
2:"bazel-out/k8-fastbuild/bin/cat.txtD

.cache/different/ac/b9/b966d571c54c4aa37fc5ae7629f95caaa01ad79474fdc406c9e9a79341c950f6
2:"bazel-out/k8-fastbuild/bin/cat.txtD

The targets //:transition_default and //:transition_different use a configuration transition to set the value of the build setting //:flag to "default_flag" and "different_flag" respectively.

The target //:cat is built twice and cached under different action cache keys when building these transition targets, even though //:cat does not depend on the build setting.

$ bazel clean --expunge; bazel build //:transition_default //:transition_different --disk_cache=.cache/transition --build_event_publish_all_actions --build_event_json_file=transition.json

$ jq 'select(.id.actionCompleted.label == "//:cat")' <transition.json | jq -s length
2

$ rg 'bin/cat.txt' .cache/transition/ac
.cache/transition/ac/b9/b966d571c54c4aa37fc5ae7629f95caaa01ad79474fdc406c9e9a79341c950f6
2:"bazel-out/k8-fastbuild/bin/cat.txtD

.cache/transition/ac/8a/8a979615135482bd13afbdf6004d5c87c0a5f3427bcedc299f496eeff4946e29
2:2bazel-out/k8-fastbuild-ST-1d73fa6ab729/bin/cat.txtD

The CAS key is of course the same, as the contents of the output are the same.

Preferably, //:cat would only be built once in the transition case.

What operating system are you running Bazel on?

Ubuntu 20.10

What’s the output of `bazel info release`?

release 4.0.0

Have you found anything relevant by searching the web?

This seems to be the same issue as reported on the following mailing list thread: https://groups.google.com/g/bazel-discuss/c/zVEc7gzbyu0/m/5UcZ8aXOBQAJ . This seems to fall under the following umbrella issue: https://github.com/bazelbuild/bazel/issues/6526 . I decided to create a dedicated issue nonetheless, since I didn’t see the specific issue of dependencies that don’t depend on the build setting being rebuilt and cached separately on the issue tracker. As I understand, https://github.com/bazelbuild/bazel/issues/12568 suggests to address this issue through the setting’s visibility attribute. However, that seems like a coarse approximation in general. A setting with public visibility might still only affect few targets.

Any other information, logs, or outputs that you want to share?

The use-case that prompted this issue is using user defined build settings to define feature flags. We then want to test different settings of feature flags in the same bazel test //... run, so we define transitions to generate targets that pin the feature flags. This causes unrelated dependencies to be rebuilt, e.g. core dependencies like com_google_protobuf. Ideally, only targets that directly depend on the feature flag, or transitively depend on a target that depends on the feature flag would be rebuilt.

Issue Analytics

State:
Created 2 years ago
Reactions:5
Comments:5 (5 by maintainers)

Top GitHub Comments

1reaction

gregestrencommented, Feb 2, 2022

I switched this to a feature request, although I understand the experience you’re expressing. I did that because this is a Bazel design challenge, which would take focused design improvements to resolve. It’s not a bug in the current design: the current behavior is expected for the given design.

What you’re describing falls into some form of “trimming”. If you’re specifically concerned about actions running on remote executors, the work I’m doing at https://github.com/bazelbuild/bazel/issues/6526 may address that and I’d encourage you to follow that issue.

0reactions

aherrmanncommented, Mar 2, 2022

What’s your primary concern about this behavior? I.e. build slowness? Memory usage?

Build slowness induced by unnecessary rebuilds of the same target. E.g. external dependencies such as com_google_protobuf are built multiple times even though they are unaffected by the feature flag. (See “Any other information, logs, or outputs that you want to share?” in the issue description)

Are there any alternative configurations where the part of the graph that needs the flag is its own distinct subgraph? i.e. isolate the part of the graph to propagate the flag by flag structure? I think that opens up more options.

I’m not sure I understand what this means. Do you have an example in mind of what this would look like? Looking at the minimal example, I’d say //:cat is independent of the flag and everything else forms the sub-graph that depends on the flag. Nonetheless, also //:cat is rebuilt under transitions to different flag values.

As an aside, I’m surprised to read about collision at https://github.com/aherrmann/bazel-transitions-demo#gotchas. I didn’t look super-thoroughly, but that should be hard to actually do in practice (i.e. collisions shouldn’t really happen unless someone is egregiously abusing a rule). Is that specific to pkg_tar?

The issue occurs when one combines the same target under different configurations in rules or other mechanisms that operate on paths relative to the package or $(BINDIR) rather than execRoot. I.e. Bazel might distinguish the outputs as bazel-out/k8-fastbuild/bin/app/bin and bazel-out/k8-fastbuild-ST-391c2674bb99/bin/app/bin, but pkg_tar will map them both to /bin. Similarly, paths in the runfiles tree are based on paths relative to $(BINDIR). https://github.com/aherrmann/bazel-transitions-demo/commit/0344cb1cbd42099a2fd56ca3beab1ff6c79f6ce0 illustrates this issue. Running //app:runfiles produces

$ bazel run //app:runfiles
bin-ce
Using community edition
Data file: ENTERPRISE

bin-ee
Using enterprise edition
Data file: ENTERPRISE

bin
Using enterprise edition
Data file: ENTERPRISE

(i.e. there is a collision on both bin and data.txt)

This seems to be sensitive to the order in which dependencies are declared: If I swap the deps

 sh_binary(
     name = "runfiles",
     srcs = ["runfiles.sh"],
-    data = ["bin-ce", "bin-ee"],
+    data = ["bin-ee", "bin-ce"],
     deps = ["@bazel_tools//tools/bash/runfiles"],
 )

then the output becomes

bin-ce
Using community edition
Data file: COMMUNITY

bin-ee
Using enterprise edition
Data file: COMMUNITY

bin
Using community edition
Data file: COMMUNITY

I guess whether this counts as egregious abuse depends on the use-case. We abandoned the transitions based approach because of this issue, so I can’t provide a production example. But the following made-up example doesn’t seem too absurd: Following the community vs. enterprise version example above, we may want to write an integration test that both enterprise and community clients can participate in the same network. In that case both versions would be runfile dependencies of the integration test target. At that point we need to be careful not to trigger the issue above.

Top Results From Across the Web

Avoid different transition output directories when build settings ...

To avoid duplicated actions on dependencies when using build-settings it's necessary to reset build-settings to it's default value. This is ...

Transitions and unwanted identical build of multiple targets

The main concern is the duplicated work, so if one transition is used in one test and another transition in another test no...

Xcode building for iOS Simulator, but linking in an object file ...

Steps: On top of project files, Select target > build setting > architecture > excluded architecture. now add select "any iOS simulator SDK"...

Transition | Android Developers

Constructs a Transition object with no target objects. ... the transition may need to set up initial target values and construct an appropriate...

transition-timing-function - CSS: Cascading Style Sheets | MDN

Denotes a right-continuous function, so that the last jump happens when the animation ends;. jump-none. There is no jump on either end. Instead, ......