spark sql MERGE INTO There is an error Error: Error running query: java.lang.IllegalArgumentException: Can't zip RDDs with unequal numbers of partitions: List(1, 0) (state=,code=0)
See original GitHub issueCREATE TABLE IF NOT EXISTS cdp.test_merge_001(offline_channel STRING COMMENT '_pk',unique_key STRING COMMENT '_ck',open_id STRING COMMENT '',mobile string COMMENT '_ck',hobby STRING COMMENT '',activity_time STRING COMMENT '' )
USING iceberg;
CREATE TABLE IF NOT EXISTS cdp.test_merge_002(offline_channel STRING COMMENT '_pk',unique_key STRING COMMENT '_ck',open_id STRING COMMENT '',mobile string COMMENT '_ck',hobby STRING COMMENT '',activity_time STRING COMMENT '' )
USING iceberg;
this is table test_merge_002:
this is table test_merge_001:
but run this sql,appear error
MERGE INTO cdp.test_merge_002
tt1 USING
(SELECT * FROM cdp.test_merge_001) tt2 ON ( tt1.unique_key = tt2.unique_key AND tt1.mobile = tt2.mobile) WHEN MATCHED THEN UPDATE SET tt1.offline_channel = tt2.offline_channel,
tt1.unique_key = tt2.unique_key,
tt1.open_id = tt2.open_id,
tt1.mobile = tt2.mobile,
tt1.hobby = tt2.hobby,
tt1.activity_time = tt2.activity_time WHEN NOT MATCHED THEN INSERT *
Issue Analytics
- State:
- Created 2 years ago
- Comments:21 (15 by maintainers)
Top Results From Across the Web
[GitHub] [iceberg] KarlManong commented on issue #2533: spark ...
... sql MERGE INTO There is an error Error: Error running query: java.lang.IllegalArgumentException: Can't zip RDDs with unequal numbers of partitions: ...
Read more >Can't zip RDDs with unequal numbers of partitions: List(2, 1 ...
It is a bug in AQE, clearly, for the version of Spark you are running. Set AQE out. zip works with RDD partitions...
Read more >Can't zip RDDs with unequal numbers of partitions ... - Re
(See it here - http://pastebin.dqd.cz/RAhm/) After I've increased spark.sql.autoBroadcastJoinThreshold to 300000 from 100000 it went through ...
Read more >Solving 5 Mysterious Spark Errors | by yhoztak - Medium
This error usually happens when two dataframes, and you apply udf on some columns to transfer, aggregate, rejoining to add as new fields...
Read more >MNIST example cannot run because of RDD.zip() #100 - GitHub
The problem is that the zip operation assumes that the number of partitions AND the number of elements within each partition will be...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
I think this rule is fixing it for Spark
https://github.com/apache/spark/blob/8f0fef18438aa8fb07f5ed885ffad1339992f102/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala#L128
I run the exception sql on spark-thriftserver, and it worked. May be the old application has some problem.