| author | Gordon Tetlow <gordon@FreeBSD.org> | 2017-09-17 15:57:06 +0000 |
|---|---|---|
| committer | Gordon Tetlow <gordon@FreeBSD.org> | 2017-09-17 15:57:06 +0000 |
| commit | 388b470cae72cbb0264073137949c1a7dea8f4f3 (patch) | |
| tree | 662c8ad73dc04fd4e37fcea0bea3677a10b10b00 /magic/Magdir/apache | |
| parent | 46a30e2e108c3996f8c1ef41888011c2a5cedaf7 (diff) | |
Diffstat (limited to 'magic/Magdir/apache')
| -rwxr-xr-x | magic/Magdir/apache | 28 |
1 files changed, 28 insertions, 0 deletions
diff --git a/magic/Magdir/apache b/magic/Magdir/apache
new file mode 100755
index 000000000000..d896b5055127
--- /dev/null
+++ b/magic/Magdir/apache
@@ -0,0 +1,28 @@
+
+#------------------------------------------------------------------------------
+# $File: apache,v 1.1 2017/04/11 14:52:15 christos Exp $
+# apache: file(1) magic for Apache Big Data formats
+
+# Avro files
+0 string Obj Apache Avro
+>3 byte x version %d
+
+# ORC files
+# Important information is in file footer, which we can't index to :(
+0 string ORC Apache ORC
+
+# Parquet files
+0 string PAR1 Apache Parquet
+
+# Hive RC files
+0 string RCF Apache Hive RC file
+>3 byte x version %d
+
+# Sequence files (and the careless first version of RC file)
+
+0 string SEQ
+>3 byte <6 Apache Hadoop Sequence file version %d
+>3 byte >6 Apache Hadoop Sequence file version %d
+>3 byte =6
+>>5 string org.apache.hadoop.hive.ql.io.RCFile$KeyBuffer Apache Hive RC file version 0
+>>3 default x Apache Hadoop Sequence file version 6
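For readers unfamiliar with magic(5) syntax, the rules added by this commit can be approximated in Python. This is only an illustrative sketch and not part of the commit: the names `classify`, `_byte_at`, and `RCFILE_KEY_CLASS` are hypothetical, the checks run in file order rather than in the strength order file(1) uses, and `byte` is read as signed on the assumption that this matches the default for that magic type.

```python
from typing import Optional

# Key class name that marks the first (version 0) RC file layout,
# which reuses the "SEQ" + version 6 Sequence file header.
RCFILE_KEY_CLASS = b"org.apache.hadoop.hive.ql.io.RCFile$KeyBuffer"


def _byte_at(header: bytes, offset: int) -> Optional[int]:
    """Return the signed byte at offset, or None if the header is too short."""
    if offset >= len(header):
        return None
    return int.from_bytes(header[offset:offset + 1], "big", signed=True)


def classify(header: bytes) -> Optional[str]:
    """Approximate the magic rules above on the leading bytes of a file."""
    version = _byte_at(header, 3)

    # "0 string Obj" / "0 string RCF", each followed by ">3 byte x version %d".
    for magic, name in ((b"Obj", "Apache Avro"), (b"RCF", "Apache Hive RC file")):
        if header.startswith(magic):
            return name if version is None else f"{name} version {version}"

    # ORC and Parquet only test the leading magic string.
    if header.startswith(b"ORC"):
        return "Apache ORC"
    if header.startswith(b"PAR1"):
        return "Apache Parquet"

    # "0 string SEQ" prints nothing by itself; the version byte decides.
    if header.startswith(b"SEQ") and version is not None:
        if version != 6:
            return f"Apache Hadoop Sequence file version {version}"
        # Version 6: the key class name at offset 5 separates the old
        # RC file layout from an ordinary Sequence file (the "default" rule).
        if header[5:].startswith(RCFILE_KEY_CLASS):
            return "Apache Hive RC file version 0"
        return "Apache Hadoop Sequence file version 6"

    return None
```

Calling `classify` on the first 64 bytes or so of a file is enough for every rule here, since the longest test reads 45 bytes starting at offset 5.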
