Exploring your data¶
Once you have defined a dataset, the first thing that DSS allows you to do is to view your data in the “Explore” component.
Explore shows you a sample of your dataset. It also provides:
Quick descriptive statistics, either on the sample on the entirety of the data
Meanings, i.e. rich types. See Schemas, storage types and meanings for more information
An overview of all columns
The ability to sort and filter the sample
The ability to choose which columns are displayed
All these abilities are also available while doing visual preparation of your data.
Note
For an introduction to the exploration component of Data Science Studio, we recommend that you first consult the Knowledge Base.
- Sampling
- Sampling in Explore
- Sampling methods
- First records
- Random sampling (fixed number of records)
- Random sampling (approximate ratio)
- Random sampling (approximate number of records)
- Column values subset
- Stratified (fixed number of records)
- Stratified (approximate ratio)
- Class rebalancing (approximate number of records)
- Class rebalancing (approximate ratio)
- Last records
- Sampling and partitioning
- Refreshing the sample
- Analyze