Google News
logo
PySpark - Interview Questions
What is Parquet file in PySpark?
In PySpark, the Parquet file is a column-type format supported by several data processing systems. By using the Parquet file, Spark SQL can perform both read and write operations.

The Parquet file contains a column type format storage which provides the following advantages:

* It is small and consumes less space.
* It facilitates us to fetch specific columns for access.
* It follows type-specific encoding.
* It offers better-summarized data.
* It contains very limited I/O operations.
Advertisement