Each supported Hadoop distribution makes different choices about packaging, the versions of the Hadoop stack components it ships, and the ecosystem tools it supports.
Each distribution also bundles its own libraries and backports specific bug fixes, which can change the behavior of the Hadoop ecosystem components.
As a result, support for each Hadoop distribution has its own specifics, covered in the following sections:
- Cloudera CDH
- Hortonworks HDP
- Amazon Elastic MapReduce
- Microsoft Azure HDInsight
  - Tested versions
  - Connecting Dataiku DSS to Azure HDInsight
  - Using Dataiku DSS on Azure HDInsight
- Google Cloud Dataproc