Workers not releasing idle S3 connections to the pool when reading Avro files
See original GitHub issuePresto 343 - 1 coordinator + 4 workers hive metastore+postgresql - avro files on s3 We’ve modified the following hive.properties
hive.s3.max-connections = 10000
hive.s3select-pushdown.max-connections = 10000
hive.s3.connect-timeout=3m
hive.s3.socket-timeout=3m
We see the connection pool progressively use all connections, never releasing them (after several days). Eventually the pool is depleted and no further work with s3 files is possible until the node service is restarted.
Issue Analytics
- State:
- Created 3 years ago
- Comments:16 (14 by maintainers)
Top Results From Across the Web
Reading in-memory Avro file from S3: 'AttributeError:'
I'm trying to read Avro files stored in S3 by a vendor and write to a DW. See code below. (Was roughly working...
Read more >Using Avro Data Files | CDP Private Cloud
If you load new data into an Avro table through Hive, either through a Hive LOAD DATA or INSERT statement, or by manually...
Read more >Kafka Broker Configurations for Confluent Platform
If not explicitly configured, the default value will be null and there will be no dedicated endpoints for controller connections.If explicitly configured ...
Read more >Configuration reference - Apache Druid
A recommended way of organizing Druid configuration files can be seen in the conf ... The timeout for idle connections in connection pool....
Read more >SageMaker — Boto3 Docs 1.26.26 documentation
SageMaker does not split the files any further for model training. ... Read input data from an S3 bucket; Write model artifacts to...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Btw I validated that cherry-picking
HIVE-22981
fixes the issue. So big thanks @rdsr for pointing towards that issue.In 341 we bumped Hive library from
3.0.6
to3.1.2
. I expect that is the reason for regression. I will see if we can backport the HIVE-22981 fix.