Correct Answer : All of the above
Explanation : To decide how RDDs are stored, PySpark has different StorageLevels, such as the following :* DISK_ONLY* DISK_ONLY_2* MEMORY_AND_DISK
Correct Answer : pyspark.sql.DataFrame
Explanation : pyspark.SQL.DataFrame represents a set of named columns and distributed data.
Correct Answer : pyspark.sql.SparkSession
Explanation : DataFrame and SQL functionality are accessed through pyspark.sql.SparkSession.
Correct Answer : Column
Explaination : A UDF extends Spark SQL's DSL vocabulary for transforming DataFrames by defining a new column-based function.