Error with latest jar file when trying to show dataframe
See original GitHub issue@davidrabinowitz: Any suggestions on what I may be doing wrong here? I am running
openjdk version "1.8.0_282"
OpenJDK Runtime Environment (build 1.8.0_282-b08)
OpenJDK 64-Bit Server VM (build 25.282-b08, mixed mode)
but any time I try to run a simple select query statement i am getting this error? when I call the
df.show()
or df.count()
on my dataframe in pyspark
: org.apache.spark.SparkException: Job aborted due to stage failure: Task serialization failed: java.lang.NoSuchMethodError: java.nio.ByteBuffer.flip()Ljava/nio/ByteBuffer;
java.lang.NoSuchMethodError: java.nio.ByteBuffer.flip()Ljava/nio/ByteBuffer;
at org.apache.spark.util.io.ChunkedByteBufferOutputStream.toChunkedByteBuffer(ChunkedByteBufferOutputStream.scala:115)
at org.apache.spark.broadcast.TorrentBroadcast$.blockifyObject(TorrentBroadcast.scala:323)
at org.apache.spark.broadcast.TorrentBroadcast.writeBlocks(TorrentBroadcast.scala:138)
at org.apache.spark.broadcast.TorrentBroadcast.<init>(TorrentBroadcast.scala:91)
at org.apache.spark.broadcast.TorrentBroadcastFactory.newBroadcast(TorrentBroadcastFactory.scala:35)
at org.apache.spark.broadcast.BroadcastManager.newBroadcast(BroadcastManager.scala:77)
at org.apache.spark.SparkContext.broadcast(SparkContext.scala:1509)
at org.apache.spark.scheduler.DAGScheduler.submitMissingTasks(DAGScheduler.scala:1383)
Issue Analytics
- State:
- Created 3 years ago
- Comments:14 (5 by maintainers)
Top Results From Across the Web
dataframe is not getting converted into CSV and parquet
Show activity on this post. I resolved the issue, i downloaded following files. And adding those in the jar folder of the Spark....
Read more >unable to load the dataframe from cosmos - Microsoft Q&A
I am trying to load the dataframe from cosmos and unable to do that. ... You need to build the repository into the...
Read more >Hive Tables - Spark 3.3.1 Documentation
The provided jars should be the same version as spark.sql.hive.metastore.version . These jars only need to be present on the driver, but if...
Read more >Snowpark Library for Scala and Java Release Notes
If you downloaded the .tar.gz / .zip file, the JAR files for the Jackson ... Fixed an issue in which the DataFrame.show() method...
Read more >Solved: Re: Spark/Scala Error: value toDF is not a member
I am trying to create a DataFrame of a text file which gives me error: "value ... I googled this new error but...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Hey David,
I was able to get around this issue by changing the Docker base image I was using.
I downloaded spark v3.0.1 w/ Hadoop 3.2 and later from spark.apache.org to build a docker image and that fixed my issue. Downloading directly from https://github.com/apache/spark and building an image from there seemed to cause issues.
Thanks, @davidrabinowitz, and @JordanYankovich! that fixed my issue as well
The base image was built with what @JordanYankovich recommended, we needed to customize the base image