Pyspark SQL and DataFrames
Pyspark SQL and DataFrames
1. Creating DataFrames
2. DataFrame Operations
3. DataFrame Joins
● Union: df1.union(df2)
● Union by name: df1.unionByName(df2)
● Intersect: df1.intersect(df2)
● Except: df1.except(df2)
● Subtract: df1.subtract(df2)
5. DataFrame Sorting