Avoid creating Configuration copies in Hudi
See original GitHub issueWe observed that creating Configuration copies is consuming a lot of CPU.
We made changes in Presto to use a wrapper instead of creating configuration copies. But we were told that the presto change is breaking Hudi.
So one suggestion is to avoid creating copies in places like this: https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/common/config/SerializableConfiguration.java#L37
we can instead simply do
public SerializableConfiguration(Configuration configuration) {
this.configuration = configuration;
}
The breaking Presto PR: https://github.com/prestodb/presto/pull/18115 https://github.com/prestodb/presto/issues/17736
@pratyakshsharma @7c00 do you think the code suggestion above makes sense ?
Issue Analytics
- State:
- Created a year ago
- Comments:7 (7 by maintainers)
Top Results From Across the Web
All Configurations | Apache Hudi
This page covers the different ways of configuring your job to write/read Hudi tables. At a high level, you can control behaviour at...
Read more >Employing correct configurations for Hudi's cleaner table service
In this blog, we will explain how to employ the right configurations to manage multiple file versions. Furthermore, we will discuss ...
Read more >Configurations - Apache Hudi
This page covers the different ways of configuring your job to write/read Hudi tables. At a high level, you can control behaviour at...
Read more >Configurations - Apache Hudi
This page covers the different ways of configuring your job to write/read Hudi tables.
Read more >Basic Configurations - Apache Hudi
This page covers the basic configurations you may use to write/read Hudi tables. This page only features a subset of the.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@codope Let us connect sometime next week to discuss this. This has been pending for some time.
Synced up with @pratyakshsharma regarding this issue. First of all, the issue affects hudi tables queries via presto-hive connector. We need to see if we can use the config provided by the engine itself while instantiating the meta client. Created HUDI-4974 to track that issue. For now, we have a mitigation. We will unwrap the wrapper config object and pass that.
Closing the issue as it has been triaged and we have an interim solution.