Output Sorted KV PCollection to an SST file from Google Cloud Dataflow
Solution 1:[1]
Beam does not have built-in support for SST files.
You could use the FileIO module to write such files, but you would need to develop a sink for this file format yourself.
You could also write the files from a simple ParDo transform, but then you are responsible for making the writes correct and efficient (for example, making sure that you do not produce duplicate writes if a bundle fails and is retried by the runner).
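As a rough illustration of the retry concern in the ParDo approach, the sketch below shows one common way to make a per-shard write idempotent: derive the output path deterministically from the shard, write to a temporary file, and rename it into place only when the write has finished. The function name `write_shard_idempotently`, the `shard-%05d.kv` naming scheme, and the length-prefixed record layout are all assumptions made for this example. The layout is a placeholder and is not the real SST format, and the code targets a local filesystem rather than Cloud Storage.

```python
import os
import struct
import tempfile


def write_shard_idempotently(output_dir, shard_id, sorted_kvs):
    """Write one shard of sorted (key, value) byte pairs to a deterministic path.

    Hypothetical helper: a retried bundle that calls this again with the same
    shard_id replaces the same file instead of creating a second copy.
    """
    os.makedirs(output_dir, exist_ok=True)
    final_path = os.path.join(output_dir, "shard-%05d.kv" % shard_id)

    # Write to a temporary file in the same directory so the final rename
    # stays on one filesystem and readers never see a partial file.
    fd, tmp_path = tempfile.mkstemp(dir=output_dir, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as f:
            previous_key = None
            for key, value in sorted_kvs:
                if previous_key is not None and key < previous_key:
                    raise ValueError("input is not sorted by key")
                # Placeholder record layout (NOT the SST format):
                # 4-byte key length, 4-byte value length, key bytes, value bytes.
                f.write(struct.pack(">II", len(key), len(value)))
                f.write(key)
                f.write(value)
                previous_key = key
        # Replace any file left behind by an earlier attempt.
        os.replace(tmp_path, final_path)
    except BaseException:
        # Clean up the partial temporary file before propagating the error.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return final_path
```

In a pipeline, a helper like this would typically be called from a `DoFn` after the elements have been grouped by shard and sorted, with the real SST encoding substituted for the placeholder records. Writing to Cloud Storage would need Beam's filesystem abstractions instead of `os` and `tempfile`.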
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | chamikara |
