PySpark - Interview Questions
What is the use of PySpark StorageLevel?
PySpark StorageLevel controls how and where an RDD is stored when it is persisted. It decides whether the RDD is kept in memory, on disk, or both, whether off-heap memory is used, whether the data is held in serialized or deserialized form, and how many copies of each partition are replicated across nodes. The class signature is:

class pyspark.StorageLevel(useDisk, useMemory, useOffHeap, deserialized, replication=1)