What are the characteristics of PySpark?
There are 4 characteristics of PySpark :
* Abstracted Nodes : This means that the individual worker nodes can not be addressed.
* Spark API : PySpark provides APIs for utilizing Spark features.
* Map-Reduce Model : PySpark is based on Hadoop’s Map-Reduce model this means that the programmer provides the map and the reduce functions.
* Abstracted Network : Networks are abstracted in PySpark which means that the only possible communication is implicit communication.