ERR_DATASET_HIVE_INCOMPATIBLE_SCHEMA: Dataset schema not compatible with Hive
This error can occur when trying to synchronize an HDFS dataset to the Hive metastore. Hive does not support all schemas, and has some limitations on column names, notably:
It does not preserve case, so column names that differ only by case can conflict
It does not support some characters, such as the comma (,)
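The two limitations above can be checked before synchronizing. The following is a minimal sketch, not part of any DSS or Hive API: `find_hive_name_issues` and `SUSPECT_CHARS` are hypothetical names, and the character set only covers the comma and period named on this page, so it should not be read as the full list of characters Hive rejects.

```python
from collections import defaultdict

# Assumption: only the characters named on this page are checked.
# Hive may reject others; extend this set as needed.
SUSPECT_CHARS = {",", "."}


def find_hive_name_issues(column_names, suspect_chars=SUSPECT_CHARS):
    """Return (case_conflicts, bad_char_columns) for a list of column names.

    case_conflicts maps a lowercased name to the original names that
    collapse onto it; bad_char_columns maps a column name to the sorted
    list of suspect characters it contains.
    """
    by_lower = defaultdict(list)
    for name in column_names:
        by_lower[name.lower()].append(name)

    # Names that become identical once case is ignored.
    case_conflicts = {k: v for k, v in by_lower.items() if len(v) > 1}

    # Names containing at least one suspect character.
    bad_char_columns = {}
    for name in column_names:
        found = sorted(c for c in set(name) if c in suspect_chars)
        if found:
            bad_char_columns[name] = found

    return case_conflicts, bad_char_columns


conflicts, bad = find_hive_name_issues(["Id", "id", "price,usd", "name"])
print(conflicts)  # columns that collide once case is ignored
print(bad)        # columns containing a suspect character
```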
Remediation
Try changing the schema of the dataset in the upstream recipe so that it is compatible
with Hive. When using Hadoop, a cautious practice is to use only lowercase column names
and to avoid commas (,) and periods (.).
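As an illustration of that practice, the sketch below derives lowercase names free of commas and periods from an existing list of column names. `to_hive_safe_names` is a hypothetical helper, not a DSS function. The underscore replacement and the numeric suffix used to resolve collisions are assumptions chosen for the example. The resulting names would still need to be applied in the upstream recipe.

```python
def to_hive_safe_names(column_names, replacement="_"):
    """Lowercase each name, replace ',' and '.', and keep names unique.

    Assumption: '_' is an acceptable replacement character, and a
    numeric suffix is an acceptable way to separate colliding names.
    """
    seen = set()
    result = []
    for name in column_names:
        candidate = name.lower().replace(",", replacement).replace(".", replacement)

        # Two names may collapse to the same value (for example "Id" and "id"),
        # so append a counter until the name is unused.
        unique = candidate
        suffix = 1
        while unique in seen:
            suffix += 1
            unique = f"{candidate}{replacement}{suffix}"

        seen.add(unique)
        result.append(unique)
    return result


print(to_hive_safe_names(["Id", "id", "price,usd", "a.b"]))
```

Renaming is positional here: the n-th returned name corresponds to the n-th input name, which makes it straightforward to build an old-to-new mapping with `dict(zip(old_names, new_names))`.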