  1. python - Spark Equivalent of IF Then ELSE - Stack Overflow

  2. python - PySpark: "Exception: Java gateway process exited before ...

    I'm trying to run PySpark on my MacBook Air. When I try starting it up, I get the error: Exception: Java gateway process exited before sending the driver its port number when sc = …

  3. pyspark - How to use AND or OR condition in when in Spark

    pyspark.sql.functions.when takes a Boolean Column as its condition. When using PySpark, it's often useful to think "Column Expression" when you read "Column". Logical operations on …

  4. Comparison operator in PySpark (not equal/ !=) - Stack Overflow

    Aug 24, 2016 · Asked 9 years, 2 months ago. Modified 1 year, 8 months ago. Viewed 164k times.

  5. PySpark: multiple conditions in when clause - Stack Overflow

    Jun 8, 2016 · In PySpark, multiple conditions in when can be built using & (for and) and | (for or). Note: in PySpark it is important to enclose every expression within parentheses () that combine …

  6. Best way to get the max value in a Spark dataframe column

    Vyom Shrivastava (comment): Make sure you have the correct imports. You need to import the following: from pyspark.sql.functions import max. The max we use here is …

  7. Rename more than one column using withColumnRenamed

    Since pyspark 3.4.0, you can use the withColumnsRenamed() method to rename multiple columns at once. It takes as an input a map of existing column names and the corresponding …

  8. How to read xlsx or xls files as spark dataframe - Stack Overflow

    Jun 3, 2019 · Can anyone let me know how we can read xlsx or xls files as a Spark dataframe without converting them? I have already tried to read with pandas and then tried to convert to …

  9. Pyspark: display a spark data frame in a table format

    Asked 9 years, 2 months ago. Modified 2 years, 2 months ago. Viewed 411k times.

  10. python - Concatenate two PySpark dataframes - Stack Overflow

    May 20, 2016 · Use the simple unionByName method in PySpark, which concatenates two dataframes along axis 0, as the pandas concat method does. Now suppose you have df1 with columns id, …
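
The snippets for results 3 and 5 say that conditions passed to when are combined with & and |, and that each sub-expression should be wrapped in parentheses. The likely reason is ordinary Python operator precedence: & and | bind more tightly than comparisons such as > and ==. The sketch below illustrates this without needing Spark. The Expr class is a hypothetical stand-in invented for this example and is not PySpark's Column; the names age and city are also assumptions. Like a column expression, the stand-in refuses to collapse to a single True/False value.

```python
class Expr:
    """Hypothetical stand-in for a column expression (NOT PySpark's Column)."""

    def __init__(self, text):
        self.text = text

    @staticmethod
    def _text(value):
        # Render another Expr by its text, and a plain Python value by its repr.
        return value.text if isinstance(value, Expr) else repr(value)

    def __and__(self, other):
        return Expr(f"({self.text} AND {self._text(other)})")

    def __rand__(self, other):
        # Called for `30 & expr`, where the left operand is a plain int.
        return Expr(f"({self._text(other)} AND {self.text})")

    def __or__(self, other):
        return Expr(f"({self.text} OR {self._text(other)})")

    def __gt__(self, other):
        return Expr(f"({self.text} > {self._text(other)})")

    def __eq__(self, other):
        return Expr(f"({self.text} = {self._text(other)})")

    def __bool__(self):
        # Assumption: model an expression that has no single truth value.
        raise ValueError("expression has no single truth value")

    def __repr__(self):
        return self.text


age = Expr("age")
city = Expr("city")

# With parentheses, each comparison is built first and then combined.
parenthesized = (age > 30) & (city == "NYC")
print(parenthesized)  # ((age > 30) AND (city = 'NYC'))

# Without parentheses, Python evaluates `30 & city` before any comparison.
first_step = 30 & city
print(first_step)  # (30 AND city)

# The full unparenthesized form becomes a chained comparison,
# `age > (30 & city) == "NYC"`, which needs a truth value and fails.
try:
    age > 30 & city == "NYC"
    outcome = "no error"
except ValueError as exc:
    outcome = f"ValueError: {exc}"
print(outcome)
```

The same grouping rule applies to |, so writing (a > 1) | (b == 2) rather than a > 1 | b == 2 keeps each comparison intact before it is combined.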