PySpark : PySpark is the Python API to use Spark. Spark is an open-source, cluster computing system which is used for big data solution. It is lightning fast technology that is designed for fast computation.
PySpark provides Py4j library, with the help of this library, Python can be easily integrated with Apache Spark.