Hive RCFile

RCFiles, short of Record Columnar File, are flat files consisting of binary key/value pairs, which shares much similarity with SequenceFile. RCFile stores columns of a table in a record columnar way.

Compatibility

Data Science Studio can read & write RCFiles using either ColumnarSerDe or LazyBinaryColumnarSerDe. Most Hive data types are supported, including complex types (object, map & array).

The following Hive types are not supported:

  • DATE
  • UNION

Limitations