question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Spark : The iceberg-spark3-extensions module CI test failed

See original GitHub issue

I found that ,the iceberg-spark3-extensions module CI test often failed.

the exception like this

org.apache.iceberg.spark.extensions.TestCopyOnWriteDelete > testDeleteNonExistingRecords[catalogName = spark_catalog, implementation = org.apache.iceberg.spark.SparkSessionCatalog, config = {type=hive, default-namespace=default, clients=1, parquet-enabled=false, cache-enabled=false}, format = avro, vectorized = false] FAILED
    java.lang.RuntimeException: Failed to get table info from metastore default.table

        Caused by:
        org.apache.thrift.transport.TTransportException: java.net.SocketException: Broken pipe (Write failed)

            Caused by:
            java.net.SocketException: Broken pipe (Write failed)

Issue Analytics

  • State:open
  • Created 3 years ago
  • Comments:8 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
stevenzwucommented, Jan 18, 2021

I started a thread titled as “test flakiness with SocketException of broken pipe in HiveMetaStoreClient” [1] in the dev mailing list two weeks ago. People have offered some insights there. Main suspicious is connection leak problem.

[1] https://www.mail-archive.com/dev@iceberg.apache.org/msg01789.html

0reactions
kbendickcommented, Jan 19, 2021

I started a thread titled as “test flakiness with SocketException of broken pipe in HiveMetaStoreClient” [1] in the dev mailing list two weeks ago. People have offered some insights there. Main suspicious is connection leak problem.

[1] https://www.mail-archive.com/dev@iceberg.apache.org/msg01789.html

Cool thank you @stevenzwu. Connection leak would potentially explain why I’m much less able to reproduce when I increase the catalog’s clients config value. Also, knowing that it is (was?) happening in the flink tests helps a ton with guidance for how much time to put into investigating from the POV of a potential test inheritance pattern issue vs connection leak issue.

This is very helpful. I should be more active on the dev list, though I’m only able to contribute somewhat sporadically given my current work is not exactly iceberg related. I’ll follow up in that thread too with anything I find while investigating. 👍

Read more comments on GitHub >

github_iconTop Results From Across the Web

Spark and Iceberg Quickstart
This guide will get you up and running with an Iceberg and Spark environment, including sample code to highlight some powerful features. You...
Read more >
Re: test flakiness with SocketException of broken pipe in ...
It could be that there are two separate flaky test issues with not releasing connections in Flink and Spark. I don't think that...
Read more >
Building a Data Lake with Spark and Iceberg at Home to over ...
csv , a test file that just contains all US postal abbreviation state codes, interactively in Jupyter . Reading and writing Data with...
Read more >
Use a cluster with Iceberg installed - Amazon EMR
Alternatively, you can create an Amazon EMR cluster including the Spark application and include the file /usr/share/aws/iceberg/lib/iceberg-spark3-runtime.jar ...
Read more >
Spark Shell not working after adding support for Iceberg
We are using the wrong Iceberg version, choose the spark 3.2 iceberg jar but running Spark 3.1. After using the correct dependency version ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found