df_io

Python helpers for doing IO with Pandas DataFrames

Available methods

read_df

This method supports:

  • bzip2/gzip/zstandard compression
  • passing parameters to Pandas' readers
  • reading from anything smart_open supports (local files, AWS S3, etc.)
  • most of the formats Pandas supports
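
For example, a minimal sketch (mirroring the write_df examples below; compression is assumed to be picked up from the file extension, as in the write examples):

import df_io

df = df_io.read_df('s3://bucket/dir/mydata.csv.gz')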

write_df

This method supports:

  • streaming writes
  • chunked writes
  • bzip2/gzip/zstandard compression
  • passing parameters to Pandas' writers
  • writing to anything smart_open supports (local files, AWS S3, etc.)
  • most of the formats Pandas supports

Documentation

API doc

Examples

Write a Pandas DataFrame (df) to an S3 path in CSV format (the default):

import df_io

df_io.write_df(df, 's3://bucket/dir/mydata.csv')

The same with gzip compression:

df_io.write_df(df, 's3://bucket/dir/mydata.csv.gz')

With zstandard compression using pickle:

df_io.write_df(df, 's3://bucket/dir/mydata.pickle.zstd', fmt='pickle')

Using JSON lines:

df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json')

Passing writer parameters:

df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', writer_options={'lines': False})

Chunked write (splitting the df into equally sized parts and writing a separate output for each):

df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', chunksize=10000)
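
Reading data back works the same way; a minimal sketch, assuming reader_options is the read-side counterpart of writer_options for passing parameters to Pandas' readers:

df = df_io.read_df('s3://bucket/dir/mydata.json.gz', fmt='json', reader_options={'lines': False})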