'Error when putting partition column to variable
I'm getting an error when I'm passing string variable to partitionBy. I already did put double quote to my columns but I still get the same error. Is there any way I can put dynamic columns to partitionBy?
partition_columns='column1,column2,column3'
final_df.write.partitionBy(partition_columns).mode("append").parquet(location)
I'm using pyspark
Solution 1:[1]
The partitionBy function takes a varargs not a string.
Try using as below -
partition_columns=['column1','column2','column3']
final_df.write.partitionBy(*partition_columns).mode("append").parquet(location)
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | DKNY |
