SequenceFile is a flat file format consisting of binary key/value pairs. It is extensively used in Hadoop MapReduce as input/output formats, since it is splittable.
Data Science Studio can read & write SequenceFiles using the Hive’s default serializer/deserializer (
Most Hive data types are supported, including complex types (object, map & array).
The following Hive types are not supported:
- The SequenceFile format can only be used on HDFS connections.