Datasets (introduction)

Note

There are two main classes related to datasets handling in Dataiku’s Python APIs:

  • dataiku.Dataset in the dataiku package, which deals primarily with reading and writing data. It has the most flexibility when it comes to reading and writing

  • dataikuapi.dss.dataset.DSSDataset in the dataikuapi package which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.

For more details on the two packages, please see Python APIs

Detailed samples about interacting with datasets can be found in:

Reference documentation for the classes supporting interaction with datasets can be found in Datasets (reference)