Issue in TFX Transform component after upgrading from 0.22 to 0.24.1

Hello folks,

I recently tried to upgrade TFX from v0.22 to the latest version (v0.24.1). I didn’t change anything in the pipeline code; it was a pure library upgrade. After running the pipeline, I hit the following error inside the Transform component:

[Screenshot, 2020-10-16: error raised inside the Transform component]

Any idea why this happens? It seems the Transform component tries to iterate over some items, finds none, and so throws StopIteration. The same pipeline works fine with v0.22. Thanks in advance!
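For anyone unfamiliar with how a bare StopIteration can surface like this, here is a minimal, generic Python sketch. It is not TFX's actual code; the helper names `first_item` and `first_item_or_default` are made up for illustration. It only shows that calling `next()` on an empty iterator raises StopIteration unless a default is supplied.

```python
def first_item(items):
    """Return the first element of an iterable using a bare next() call."""
    # next() raises StopIteration when the iterator has nothing left.
    return next(iter(items))


def first_item_or_default(items, default=None):
    """Same as first_item, but fall back to `default` when nothing is found."""
    # Passing a second argument to next() suppresses StopIteration.
    return next(iter(items), default)


try:
    first_item([])  # empty input, so there is nothing to iterate over
except StopIteration:
    print("StopIteration: nothing to iterate over")

print(first_item_or_default([], default="no entries found"))
```

If the traceback in the screenshot ends in a `next(...)` call, this is the pattern to look for: the code expected at least one item and got none.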

Issue Analytics

  • State: closed
  • Created 3 years ago
  • Reactions: 1
  • Comments: 19 (6 by maintainers)

Top GitHub Comments

8 reactions
zoyahav commented, Oct 20, 2020

Thanks everyone, I’ve confirmed on the first colab in https://github.com/tensorflow/tfx/issues/2668#issuecomment-712288375 that disabling cache prevents this issue and Transform succeeds (as a temporary workaround):

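import os
from tfx.components import Transform

# example_gen and schema_gen are the upstream components already defined in the pipeline.
# Temporary workaround: disable the analyzer cache so Transform does not hit StopIteration.
transform = Transform(
    examples=example_gen.outputs['examples'],
    schema=schema_gen.outputs['schema'],
    module_file=os.path.abspath("transform.py"),
    disable_analyzer_cache=True)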

I’ll look for the root cause and hopefully make a patch release with a fix.

3 reactions
zoyahav commented, Nov 3, 2020

Thank you both for verifying. I believe we can close this issue once 0.25 is released.
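If the same pipeline code has to run across several TFX versions, one option is to apply the workaround only where it is needed. The sketch below is an assumption-laden illustration, not an official recipe: the helpers `parse_version` and `needs_cache_workaround` are hypothetical, and the affected range is assumed to be 0.24.x only, since the thread just reports that 0.22 works, 0.24.1 fails, and the fix is expected in 0.25.

```python
def parse_version(version):
    """Turn a string like '0.24.1' into a tuple of ints such as (0, 24, 1)."""
    parts = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break  # ignore suffixes such as 'rc1'
            digits += ch
        parts.append(int(digits or 0))
    return tuple(parts)


def needs_cache_workaround(tfx_version):
    """Return True for versions assumed to be affected by the cache issue."""
    # Assumption: only the 0.24.x releases are affected (see the lead-in).
    return (0, 24) <= parse_version(tfx_version)[:2] < (0, 25)


print(needs_cache_workaround("0.22.0"))
print(needs_cache_workaround("0.24.1"))
print(needs_cache_workaround("0.25.0"))
```

The result could then be passed as `disable_analyzer_cache=needs_cache_workaround(...)` when constructing Transform, assuming the installed package exposes its version string (for example through `tfx.__version__`).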
