DSS and Hadoop
- Setting up Hadoop integration
- Connecting to secure clusters
- Hadoop filesystems connections (HDFS, S3, EMRFS, WASB, ADLS, GS)
- Hive
  - Interaction with the Hive global metastore
  - Synchronisation to the Hive metastore
  - Importing from the Hive metastore
  - Hive execution engines
  - Support for Hive authentication modes
  - Support for Hive authorization modes
  - Supported file formats
  - Internal details
- Impala
- Spark
- Hive datasets
- Hadoop user isolation
- Distribution-specific notes
- Teradata Connector For Hadoop
- Multiple Hadoop clusters
- Dynamic AWS EMR clusters
- Dynamic Google Dataproc clusters