'How can I access Data in Google Storage via Presto?

I have requirement for the data lake where depending on the use case applications are using different type of storage. I have MySQL, Hive, Google Cloud Storage. I am using Presto as query engine. I want make it one stop solution where people can access data any type of the data storage? I have configured MySQL Connector & Hive Connector are working good. I tried to use TPCH connector to connect with Google Cloud Storage.

Is it right approach? Am I in correct direction?



Solution 1:[1]

I think that the answers here are out of date since this link shows a GCS connector is available now : https://docs.starburst.io/latest/connector/hive-gcs-tutorial.html

Solution 2:[2]

No, TPCH connector (https://docs.starburstdata.com/latest/connector/tpch.html) is not meant to connect to any real storage engine. Instead it generates at runtime data for tables needed to the TPC-H benchmark. Its main use is to test ANSI SQL syntax.

Solution 3:[3]

You can try to setup a Minio Server and use Hive Connector(S3)

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source
Solution 1 Simon Thompson
Solution 2 Kamil Bajda-Pawlikowski
Solution 3