Skip to content

kostya-sh/parquet-go

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

license Build Status

parquet-go

Library to work with Parquet file format in Go.

The current state

A low level API for reading parquet files provides methods to read everything that a parquet file consist of:

  • file metadata
  • page headers
  • schema
  • values along with definition and repetition levels

All encodings and all parquet types are supported.

GZIP and SNAPPY compression codecs are supported.

While I took considerable effort to test the implementation I am sure there are some bugs that are still lurking around. This library has never been used in any production system.

The library should be pretty fast but there are still quite a few known missing performance optimisations.

Writing files and assembling records are not implemented.

Usage

For examples how to use the library check out the source code of parqueteur command. csv.go is the best starting point.

Future

Currently I do not plan to work on adding new features. I will try to fix any reported bugs though.

About

Go library to work with PARQUET files

Resources

License

MIT, Apache-2.0 licenses found

Licenses found

MIT
LICENSE
Apache-2.0
LICENSE.ASF-2

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published