Data storage backends
Hive does not store data, only metadata. It can store metadata about data stored in various places. The Stackable Operator currently supports S3 and HFS.
S3 support
Hive supports creating tables in S3 compatible object stores.
To use this feature you need to provide connection details for the object store using the S3Connection in the top level clusterConfig
.
An example usage can look like this:
clusterConfig:
s3:
inline:
host: minio
port: 9000
accessStyle: Path
credentials:
secretClass: simple-hive-s3-secret-class
Apache HDFS support
As well as S3, Hive also supports creating tables in HDFS.
You can add the HDFS connection in the top level clusterConfig
as follows:
clusterConfig:
hdfs:
configMap: my-hdfs-cluster # Name of the HdfsCluster
Read about the Stackable Operator for Apache HDFS to learn more about setting up HDFS.