'Delete rows/columns from parquet file
I have some parquet files, is it possible to delete rows or data from the parquet files without corrupting the files? For example I have data in regular parquet format which contains data for different users stored in hdfs, is it possible to loop through those parquet files and delete data for a specific user? Also, I don't want to reprocess the data and filter the specific user, I want to delete the data directly from the parquet files. Will this corrupt the parquet files?
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
