'Create a BigQuery Table everyday from a CSV file stored in Cloud Storage

On GCP I have a new csv added everyday at 2 am in a Cloud Storage folder (format is something like mydata_yyyymmdd.csv).

I am trying to schedule an upload of this file at 2:30 am everyday in a Big Query Table.

I succeeded in creating a dataflow that constantly screen my Cloud Storage folder and constantly update my BigQuery table if a new file is added but I don't find this optimal as :

  • my dataflow is running all day long while I only need to run it once a day (increasing costs)
  • this solution doesn't create a new Big Query table everyday, it just appends all csv to my Big Query table

Can you help me with the tools I should use in GCP to achieve this ?

Thanks a lot for your help, it is very much appreciated



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source