Documentation
Academy
Join the Academy
Benefit from guided learning opportunities →
Quick Starts
Learning Paths
New Features
Certifications
Academy Discussions
Community
Explore the Community
Discover, share, and contribute →
Learn About Us
Ask A Question
What's New?
Discuss Dataiku
Using Dataiku
Setup And Configuration
General Discussion
Plugins & Extending Dataiku
Product Ideas
Programs
Frontrunner Awards
Dataiku Neurons
Community Resources
Community Feedback
User Research
Documentation
Reference Documentation
Comprehensive specifications of Dataiku →
User's Guide
Specific Data Processing
Automation & Deployment
APIs
Installation & Administration
Other Topics
Knowledge
Knowledge Base
Articles and tutorials on Dataiku features →
User Guide
Admin Guide
Dataiku Solutions
Dataiku Cloud
Developer
Developer Guide
Tutorials and articles for developers and coder users →
Getting Started
Concepts and Examples
Tutorials
API Reference
User's Guide
DSS concepts
Connecting to data
Exploring your data
Schemas, storage types and meanings
Definitions
Basic usage
Schema for data preparation
Creating schemas of datasets
Handling of schemas by recipes
List of recognized meanings
User-defined meanings
Handling and display of dates
Data preparation
Charts
Interactive statistics
Machine learning
The Flow
Visual recipes
Recipes based on code
Code notebooks
MLOps
Webapps
Code Studios
Code reports
Dashboards
Workspaces
Data Catalog
Dataiku Applications
Working with partitions
DSS and SQL
DSS and Python
DSS and R
DSS and Spark
Code environments
Collaboration
Specific Data Processing
Time Series
Geographic data
Generative AI and LLM Mesh
Text & Natural Language Processing
Images
Audio
Video
Automation & Deployment
Metrics, checks and Data Quality
Automation scenarios
Production deployments and bundles
API Node & API Deployer: Real-time APIs
Governance
APIs
Python APIs
R API
Public REST API
Additional APIs
Installation & Administration
Installing and setting up
Elastic AI computation
DSS in the cloud
DSS and Hadoop
Metastore catalog
Operating DSS
Security
User Isolation
Other topics
Plugins
Streaming data
Formula language
Custom variables expansion
Sampling methods
Accessibility
Troubleshooting
Release notes
Other Documentation
Third-party acknowledgements
Dataiku DSS
You are viewing the documentation for version
13
of DSS.
»
Schemas, storage types and meanings
Open page in a new tab
Schemas, storage types and meanings
¶
Definitions
Storage types
Why use precise storage types ?
Meanings
Basic usage
Changing meaning and storage type
Changing the storage type
Changing the meaning
Editing advanced schema
Schema for data preparation
Schema in visual analysis
Schema in prepare recipe
Creating schemas of datasets
Schema of new external datasets
SQL and Cassandra datasets
Text-based files datasets
“Typed” files datasets
Schema of managed datasets
Modifying the schema
Handling of schemas by recipes
Sample, Filter, Group, Window, Join, Split, Stack
Sync
Prepare
Hive, Impala, SQL
Python, R, PySpark, SparkR
Machine Learning (scoring)
SparkSQL
Shell
List of recognized meanings
Basic meanings
Text
Decimal
Integer
Boolean
Date / Dates (needs parsing)
Object / Array
Natural language
Geospatial meanings
Latitude / Longitude
Geopoint
Geometry
Country
US State
Web-specific meanings
Other meanings
User-defined meanings
Kinds of user-defined meanings
Declarative
Values list
Values mapping
Pattern
Autodetecting user-defined meanings
Handling and display of dates
Displaying dates
Handling of dates in SQL