Simon Willison: TILs on duckdb

Atom feed for duckdb

duckdb Summing columns in remote Parquet files using DuckDB - 2023-11-14

vivym/midjourney-messages on Hugging Face is a large (~8GB) dataset consisting of 55,082,563 Midjourney images - each one with the prompt and a URL to the image hosted on Discord. …

duckdb Use DuckDB to convert parquet to JSON and then open it in Datasette Lite - 2023-03-21

pickapic.io is a new tool funded by stability.ai which asks people to generate and then vote on images in order to provide data to be used for fine tuning an open source image generation model. …

duckdb Using DuckDB in Python to access Parquet data - 2022-09-16

Did a quick experiment with DuckDB today, inspired by the bmschmidt/hathi-binary repo. …