Spark : The iceberg-spark3-extensions module CI test failed
See original GitHub issueI found that ,the iceberg-spark3-extensions module CI test often failed.
the exception like this
org.apache.iceberg.spark.extensions.TestCopyOnWriteDelete > testDeleteNonExistingRecords[catalogName = spark_catalog, implementation = org.apache.iceberg.spark.SparkSessionCatalog, config = {type=hive, default-namespace=default, clients=1, parquet-enabled=false, cache-enabled=false}, format = avro, vectorized = false] FAILED
java.lang.RuntimeException: Failed to get table info from metastore default.table
Caused by:
org.apache.thrift.transport.TTransportException: java.net.SocketException: Broken pipe (Write failed)
Caused by:
java.net.SocketException: Broken pipe (Write failed)
Issue Analytics
- State:
- Created 3 years ago
- Comments:8 (3 by maintainers)
Top Results From Across the Web
Spark and Iceberg Quickstart
This guide will get you up and running with an Iceberg and Spark environment, including sample code to highlight some powerful features. You...
Read more >Re: test flakiness with SocketException of broken pipe in ...
It could be that there are two separate flaky test issues with not releasing connections in Flink and Spark. I don't think that...
Read more >Building a Data Lake with Spark and Iceberg at Home to over ...
csv , a test file that just contains all US postal abbreviation state codes, interactively in Jupyter . Reading and writing Data with...
Read more >Use a cluster with Iceberg installed - Amazon EMR
Alternatively, you can create an Amazon EMR cluster including the Spark application and include the file /usr/share/aws/iceberg/lib/iceberg-spark3-runtime.jar ...
Read more >Spark Shell not working after adding support for Iceberg
We are using the wrong Iceberg version, choose the spark 3.2 iceberg jar but running Spark 3.1. After using the correct dependency version ......
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
I started a thread titled as “test flakiness with SocketException of broken pipe in HiveMetaStoreClient” [1] in the dev mailing list two weeks ago. People have offered some insights there. Main suspicious is connection leak problem.
[1] https://www.mail-archive.com/dev@iceberg.apache.org/msg01789.html
Cool thank you @stevenzwu. Connection leak would potentially explain why I’m much less able to reproduce when I increase the catalog’s
clients
config value. Also, knowing that it is (was?) happening in the flink tests helps a ton with guidance for how much time to put into investigating from the POV of a potential test inheritance pattern issue vs connection leak issue.This is very helpful. I should be more active on the dev list, though I’m only able to contribute somewhat sporadically given my current work is not exactly iceberg related. I’ll follow up in that thread too with anything I find while investigating. 👍