Avoid creating Configuration copies in Hudi


We observed that creating Configuration copies is consuming a lot of CPU.

We changed Presto to use a wrapper instead of creating Configuration copies, but we were told that the Presto change breaks Hudi.

So one suggestion is to avoid creating copies in places like this: https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/common/config/SerializableConfiguration.java#L37

We can instead simply do:

  public SerializableConfiguration(Configuration configuration) {
    this.configuration = configuration;
  }
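
To illustrate the trade-off behind this suggestion, here is a minimal sketch. It assumes, as the issue implies, that the current constructor copies its argument. `Config`, `CopyingHolder`, and `SharingHolder` are hypothetical stand-ins written for this example (so it runs without Hadoop on the classpath); they are not Hudi or Hadoop classes.

```java
import java.util.HashMap;
import java.util.Map;

public class ConfigCopyDemo {
    /** Hypothetical stand-in for a Hadoop-style Configuration. */
    static class Config {
        private final Map<String, String> props = new HashMap<>();

        Config() {}

        /** Copy constructor: duplicates every entry, so cost grows with config size. */
        Config(Config other) {
            props.putAll(other.props);
        }

        void set(String key, String value) { props.put(key, value); }

        String get(String key) { return props.get(key); }
    }

    /** Assumed current behaviour: take a defensive copy on construction. */
    static class CopyingHolder {
        final Config configuration;

        CopyingHolder(Config configuration) {
            this.configuration = new Config(configuration);
        }
    }

    /** Suggested behaviour: keep the caller's reference, no copy. */
    static class SharingHolder {
        final Config configuration;

        SharingHolder(Config configuration) {
            this.configuration = configuration;
        }
    }

    public static void main(String[] args) {
        Config base = new Config();
        base.set("example.key", "before");

        CopyingHolder copying = new CopyingHolder(base);
        SharingHolder sharing = new SharingHolder(base);

        // Mutate the original after both holders were built.
        base.set("example.key", "after");

        System.out.println("copying holder sees: " + copying.configuration.get("example.key"));
        System.out.println("sharing holder sees: " + sharing.configuration.get("example.key"));
    }
}
```

The sharing variant skips the per-construction copy, which is the CPU cost described above, but later changes to the caller's object become visible through the holder. Whether that aliasing is acceptable depends on how callers treat the configuration after passing it in.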

The breaking Presto PR: https://github.com/prestodb/presto/pull/18115 (related issue: https://github.com/prestodb/presto/issues/17736)

@pratyakshsharma @7c00 do you think the code suggestion above makes sense?

Issue Analytics

  • State: closed
  • Created a year ago
  • Comments: 7 (7 by maintainers)

Top GitHub Comments

1 reaction
pratyakshsharma commented, Sep 30, 2022

@codope Let us connect sometime next week to discuss this. This has been pending for some time.

0 reactions
codope commented, Oct 3, 2022

Synced up with @pratyakshsharma regarding this issue. First of all, the issue affects queries on Hudi tables made through the presto-hive connector. We need to see whether we can use the config provided by the engine itself when instantiating the meta client. Created HUDI-4974 to track that. For now, we have a mitigation: we will unwrap the wrapper config object and pass the underlying config instead.

Closing the issue as it has been triaged and we have an interim solution.
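
The interim mitigation described above (unwrap the wrapper, then pass the underlying config) could look roughly like the sketch below. The thread does not spell out why copying the wrapper fails; the sketch assumes one plausible mechanism, namely that a copy constructor reads the wrapper's own (empty) internal state rather than the delegate's. `Config`, `WrapperConfig`, and `unwrapIfNeeded` are hypothetical names for illustration, not Presto or Hudi APIs.

```java
import java.util.HashMap;
import java.util.Map;

public class UnwrapConfigDemo {
    /** Hypothetical stand-in for a Hadoop-style Configuration. */
    static class Config {
        private final Map<String, String> props = new HashMap<>();

        Config() {}

        /** Copies the other object's internal map directly (the assumed failure point). */
        Config(Config other) {
            props.putAll(other.props);
        }

        void set(String key, String value) { props.put(key, value); }

        String get(String key) { return props.get(key); }
    }

    /** Hypothetical engine-side wrapper that delegates instead of copying. */
    static class WrapperConfig extends Config {
        private final Config delegate;

        WrapperConfig(Config delegate) {
            this.delegate = delegate;
        }

        @Override
        void set(String key, String value) { delegate.set(key, value); }

        @Override
        String get(String key) { return delegate.get(key); }

        Config unwrap() { return delegate; }
    }

    /** Peels off any wrapper layers and returns the underlying config. */
    static Config unwrapIfNeeded(Config config) {
        Config current = config;
        while (current instanceof WrapperConfig) {
            current = ((WrapperConfig) current).unwrap();
        }
        return current;
    }

    public static void main(String[] args) {
        Config engineConf = new Config();
        engineConf.set("example.key", "value");
        Config wrapped = new WrapperConfig(engineConf);

        // Copying the wrapper directly misses the delegate's entries.
        Config copyOfWrapper = new Config(wrapped);
        // Unwrapping first hands the copy constructor the real config.
        Config copyOfUnwrapped = new Config(unwrapIfNeeded(wrapped));

        System.out.println("copy of wrapper:   " + copyOfWrapper.get("example.key"));
        System.out.println("copy of unwrapped: " + copyOfUnwrapped.get("example.key"));
    }
}
```

Under these assumptions, unwrapping keeps the downstream copy logic working without changing it, which matches the "interim" nature of the fix; the longer-term direction tracked in HUDI-4974 is to use the engine-provided config directly.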


