Datasets (introduction)
Note
There are two main classes related to dataset handling in Dataiku’s Python APIs:
dataiku.Dataset
in the dataiku package, which deals primarily with reading and writing data. It offers the most flexibility when it comes to reading and writing.
dataikuapi.dss.dataset.DSSDataset
in the dataikuapi package, which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.
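As a rough illustration of how the two classes divide the work, the sketch below reads data through dataiku.Dataset and obtains a management handle through dataikuapi. The helper names load_dataframe and get_dataset_handle are hypothetical, as are any project key and dataset name passed to them. The sketch assumes it runs inside a DSS Python environment, where the dataiku package is available.

```python
def load_dataframe(dataset_name):
    """Read a dataset's contents with dataiku.Dataset (hypothetical helper)."""
    # Imported lazily: the dataiku package is assumed to be available
    # only inside a DSS Python environment.
    import dataiku

    return dataiku.Dataset(dataset_name).get_dataframe()


def get_dataset_handle(project_key, dataset_name):
    """Get a DSSDataset management handle via dataikuapi (hypothetical helper)."""
    import dataiku

    # dataiku.api_client() returns a dataikuapi client bound to the
    # current DSS instance.
    client = dataiku.api_client()
    project = client.get_project(project_key)
    # The returned object is used for settings and other management
    # operations, not for bulk reads and writes.
    return project.get_dataset(dataset_name)
```

In this sketch, load_dataframe("orders") would be the path for working with the data itself, while get_dataset_handle("MYPROJECT", "orders") would be the starting point for inspecting or changing how the dataset is configured.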
For more details on the two packages, please see Python APIs.
Detailed samples about interacting with datasets can be found in:
Reference documentation for the classes supporting interaction with datasets can be found in Datasets (reference).