'How can I access Data in Google Storage via Presto?
I have requirement for the data lake where depending on the use case applications are using different type of storage. I have MySQL, Hive, Google Cloud Storage. I am using Presto as query engine. I want make it one stop solution where people can access data any type of the data storage? I have configured MySQL Connector & Hive Connector are working good. I tried to use TPCH connector to connect with Google Cloud Storage.
Is it right approach? Am I in correct direction?
Solution 1:[1]
I think that the answers here are out of date since this link shows a GCS connector is available now : https://docs.starburst.io/latest/connector/hive-gcs-tutorial.html
Solution 2:[2]
No, TPCH connector (https://docs.starburstdata.com/latest/connector/tpch.html) is not meant to connect to any real storage engine. Instead it generates at runtime data for tables needed to the TPC-H benchmark. Its main use is to test ANSI SQL syntax.
Solution 3:[3]
You can try to setup a Minio Server and use Hive Connector(S3)
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | Simon Thompson |
| Solution 2 | Kamil Bajda-Pawlikowski |
| Solution 3 |
