Robin Moffatt 4/5/2023

Using Delta from pySpark - `java.lang.ClassNotFoundException: delta.DefaultSource`


This technical blog post describes a fix for a common error when using Delta Lake from PySpark. The author explains that incorrectly creating both a SparkSession and a SparkContext can cause the `java.lang.ClassNotFoundException: Failed to find data source: delta` error. The article gives working and non-working code examples for initializing Spark with Delta Lake configuration in a Jupyter Notebook.
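As a rough illustration of the kind of initialization the post is about, here is a minimal sketch that builds a single SparkSession with the Delta Lake settings applied up front and takes the SparkContext from that session instead of creating one separately. It is not the author's code: the helper names `build_delta_session` and `demo_roundtrip`, the app name, and the `delta-core` version are assumptions for illustration, and the package version in particular has to match your own Spark and Scala versions.

```python
# Delta Lake settings for the session. The package coordinates are an
# assumption; choose the delta-core release that matches your Spark/Scala build.
DELTA_CONF = {
    "spark.jars.packages": "io.delta:delta-core_2.12:2.3.0",
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}


def build_delta_session(app_name="delta-demo"):
    """Create one SparkSession with every Delta setting applied before startup."""
    # Imported lazily so the settings above can be inspected without PySpark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in DELTA_CONF.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


def demo_roundtrip(path):
    """Write a tiny Delta table to `path` and read it back (hypothetical helper)."""
    spark = build_delta_session()
    # Reuse the session's own context rather than constructing a second one.
    sc = spark.sparkContext
    spark.range(5).write.format("delta").mode("overwrite").save(path)
    return spark.read.format("delta").load(path).count()
```

In a notebook you would call `build_delta_session()` once, before any other Spark object exists. `getOrCreate()` reuses an already running context, so settings such as `spark.jars.packages` that are supplied after a context has started may not take effect, which is one plausible way to end up with the `ClassNotFoundException` described above.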

