About 50 results
Open links in new tab
  1. Python worker keeps on crashing in pyspark - Stack Overflow

    Oct 26, 2023 · I am using Python 3.12.0, my java version is 8, and my pyspark version is 3.5. I have set my environmental variables with JAVA_HOME, SPARK_HOME, and HADOOP_HOME and installed …

  2. apache spark - Python worker exited unexpectedly due to connection ...

    Jul 16, 2024 · Consider setting 'spark.sql.execution.pyspark.udf.faulthandler.enabled' or'spark.python.worker.faulthandler.enabled' configuration to 'true' for the better Python traceback.

  3. python - How do I add a new column to a Spark DataFrame (using …

    Spark documentation is "great" only in that it leaves great swaths of usage up to an exercise for the astute reader. Spark (and Pyspark) covers a veritable zoo of data structures, with little or no …

  4. python 3.x - How to read xlsx or xls files as spark dataframe - Stack ...

    Jun 3, 2019 · Can anyone let me know without converting xlsx or xls files how can we read them as a spark dataframe I have already tried to read with pandas and then tried to convert to spark dataframe …

  5. Apache Spark left join python databricks - Stack Overflow

    Apr 22, 2025 · I have three dataframes in databricks and try to run a join on them (apache spark functions). I mainly was used to pandas dataframe join. My current code is: joined_df = …

  6. What are compatible versions of pyspark and py4j packages in python

    Dec 25, 2023 · What are compatible versions of pyspark and py4j packages in python Asked 2 years, 1 month ago Modified 1 year, 8 months ago Viewed 5k times

  7. python - How to handle an AnalysisException on Spark SQL ... - Stack ...

    Oct 4, 2019 · python apache-spark pyspark apache-spark-sql databricks asked Oct 4, 2019 at 17:39 Kevin Gomez 343 1 4 7

  8. python - Load CSV file with PySpark - Stack Overflow

    When using spark.read.csv, I find that using the options escape='"' and multiLine=True provide the most consistent solution to the CSV standard, and in my experience works the best with CSV files …

  9. Apache Spark: How to use pyspark with Python 3 - Stack Overflow

    May 16, 2015 · I built Spark 1.4 from the GH development master, and the build went through fine. But when I do a bin/pyspark I get the Python 2.7.9 version. How can I change this?

  10. How to install Python libraries in Azure Synapse Notebook Spark …

    May 9, 2024 · Also, learn more about managing libraries for Apache Spark pools in Azure Synapse Analytics. Here is the Stack Overflow link to adding a custom Python library in Azure Synapse.