Spark transformations operate on RDDs and DataFrames to create new RDDs and DataFrames. Common transformations include map, filter, groupBy, join, and distinct. Spark actions return values to the driver program like collect, count, first, take, reduce, and saveAsTextFile. Transformations are lazy while actions trigger job execution.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
22 views4 pages
Spark Transformations and Actions
Spark transformations operate on RDDs and DataFrames to create new RDDs and DataFrames. Common transformations include map, filter, groupBy, join, and distinct. Spark actions return values to the driver program like collect, count, first, take, reduce, and saveAsTextFile. Transformations are lazy while actions trigger job execution.