Skip to content

DataBooth/duckdb-utils

 
 

Repository files navigation

duckdb-utils

PyPI

Python CLI utility and library for manipulating DuckDB databases.

Idea

In principle a "port" of the excellent sqlite-utils by Simon Willison. Not sure if the use case is there in DuckDB-land. Still thinking this through (or if there are similar offerings from others.)

Status

This is in pre-beta and not yet working or released. Please do not use.

Some potential features

Read more on my blog, in this series of posts on New features in duckdb-utils and other entries tagged duckdb-utils.

Installation

pip install duckdb-utils

Or if you use Homebrew for macOS:

brew install duckdb-utils

Using as a CLI tool

Now you can do things with the CLI utility like this:

$ duckdb-utils memory cats.csv "select * from t"
[{"id": 1, "age": 4, "name": "Emme"},
 {"id": 2, "age": 2, "name": "Pancakes"}]

$ duckdb-utils insert cats.duckdb cats cats.csv --csv
[####################################]  100%

$ duckdb-utils tables cats.duckdb --counts
[{"table": "cats", "count": 2}]

$ duckdb-utils cats.duckdb "select id, name from cats"
[{"id": 1, "name": "Emme"},
 {"id": 2, "name": "Pancakes"}]

$ duckdb-utils cats.duckdb "select * from cats" --csv
id,age,name
1,4,Emme
2,2,Pancakes

$ duckdb-utils cats.duckdb "select * from cats" --table
  id    age  name
----  -----  --------
   1      4  Emme
   2      2  Pancakes

You can import JSON data into a new database table like this:

$ curl https://api.github.com/repos/databooth/duckdb-utils/releases \
    | duckdb-utils insert releases.duckdb releases - --pk id

Or for data in a CSV file:

$ duckdb-utils insert cats.duckdb cats cats.csv --csv

duckdb-utils memory lets you import CSV or JSON data into an in-memory database and run SQL queries against it in a single command:

$ cat cats.csv | duckdb-utils memory - "select name, age from stdin"

See the full CLI documentation for comprehensive coverage of many more commands.

Using as a library

You can also import duckdb_utils and use it as a Python library like this:

import duckdb_utils
db = duckdb_utils.Database("demo_database.duckdb")
# This line creates a "cats" table if one does not already exist:
db["cats"].insert_all([
    {"id": 1, "age": 4, "name": "Emme"},
    {"id": 2, "age": 2, "name": "Pancakes"}
], pk="id")

Check out the full library documentation for everything else you can do with the Python library.

Related projects

About

Python CLI utility and library for manipulating DuckDB databases

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.6%
  • Other 0.4%