The fastest way to query your Iceberg data lake

DuckDB-as-a-Service for Apache Iceberg. Connect to S3, R2, MinIO, or WarpStream Tableflow and run SQL instantly: no clusters, no ops, no setup.

Terminal
$ git clone https://github.com/gordonmurray/cloudfloe
$ cd cloudfloe && docker compose up -d
$ open http://localhost:3000
🌊 Cloudfloe running at http://localhost:3000

Data lakes are great. Iceberg is powerful.
But querying them? Not so much.

🏗️

Trino Setup Hell

Heavy, cluster-based, complex configuration. Hours of setup before your first query.

🔒

Vendor Lock-in

Athena ties you to AWS. BigQuery to Google. What about multi-cloud freedom?

💸

Hidden Costs

Surprise bills from scanning terabytes. No visibility into what you're actually paying for.

🐌

Slow Feedback

Wait for clusters to spin up. Wait for queries to run. Wait for results. Rinse, repeat.

Cloudfloe is different

DuckDB's blazing performance meets Iceberg's flexibility. Zero infrastructure, maximum speed.

Instant Queries

No cluster spin-up, no waiting. DuckDB runs in-process and queries your Iceberg tables directly from object storage.

🌍

Vendor Neutral

Works with AWS S3, Cloudflare R2, MinIO, WarpStream Tableflow, or any S3-compatible storage. Your data stays yours.

🐳

Docker Ready

One-command deployment. Runs anywhere Docker runs. No complex infrastructure required.

💰

Cost Transparent

See exactly how much data you're scanning and what it costs. No surprise bills.

🔓

Open Source

MIT licensed. Self-hosted. Modify it, extend it, contribute to it. Community-driven.

🚀

Modern UI

Beautiful SQL editor with syntax highlighting. Query history. Export capabilities. Built for data professionals.

Perfect for data teams who want to move fast

👨‍💻

Data Engineers

Need to peek inside Iceberg tables fast without setting up Trino clusters or wrestling with Athena permissions.

"I just want to see my Iceberg data now"

📊

BI Developers

Building dashboards with Superset, Rill, or Evidence and need a reliable query scratchpad for prototyping.

"Let me test this query before I wire it into production"

💼

FinOps Teams

Want visibility into data scan costs and query patterns before committing to expensive cloud analytics services.

"How much will this dashboard actually cost us?"

Get started in 30 seconds

1

Clone & Start

git clone https://github.com/gordonmurray/cloudfloe
cd cloudfloe
docker compose up -d

2

Connect Your Data

Point Cloudfloe at your S3 bucket with Iceberg tables. Works with AWS, Cloudflare R2, MinIO, WarpStream Tableflow, or any S3-compatible storage.
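For a sense of what this connection involves, here is a minimal sketch in plain DuckDB SQL, using DuckDB's `httpfs` and `iceberg` extensions. It is an illustration under stated assumptions, not necessarily how Cloudfloe configures the connection internally. The secret name, bucket, table path, endpoint, region, and credentials are all placeholders.

```sql
-- Sketch only: every name, path, and credential below is a placeholder.
INSTALL httpfs;
LOAD httpfs;
INSTALL iceberg;
LOAD iceberg;

-- Register S3-compatible credentials (here pointing at a local MinIO).
CREATE SECRET lake_credentials (
    TYPE S3,
    KEY_ID 'YOUR_ACCESS_KEY_ID',
    SECRET 'YOUR_SECRET_ACCESS_KEY',
    REGION 'us-east-1',
    ENDPOINT 'localhost:9000',  -- assumed MinIO address; omit for AWS S3
    URL_STYLE 'path',           -- path-style URLs, typical for MinIO
    USE_SSL false               -- assumed plain HTTP for a local endpoint
);

-- Read an Iceberg table by pointing at its root directory.
SELECT COUNT(*) AS row_count
FROM iceberg_scan('s3://example-bucket/warehouse/movies');
```

For AWS S3 or Cloudflare R2, the same pattern should apply with the provider's own endpoint and credentials.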

3

Start Querying

Write SQL, hit Ctrl+Enter, see results instantly. Export to CSV, save queries, track costs.

Example queries you can run

Real IMDb movie data in Iceberg format, queried instantly with DuckDB

What you can do:

  • ✅ Docker Compose startup in seconds
  • ✅ Connect to MinIO with sample data
  • ✅ Query 37,537 IMDb movies instantly
  • ✅ Filter by decade, genre, runtime
  • ✅ Export results to CSV
  • ✅ Track query costs and performance

Try these queries:

SELECT title, year, rating
FROM movies
WHERE year >= 2020
ORDER BY rating DESC
LIMIT 10;

SELECT genre,
       COUNT(*) AS count,
       AVG(rating) AS avg_rating
FROM movies
GROUP BY genre
ORDER BY count DESC;
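
Filtering by decade and genre, mentioned in the list above, could look like the sketch below. It reuses only the columns from the two queries above (`title`, `year`, `rating`, `genre`). The decade range and the genre value `'Drama'` are assumptions about the sample data, so adjust them to whatever your table contains.

```sql
-- Sketch: top-rated films of one decade in one genre.
-- The 1990s range and the 'Drama' value are illustrative assumptions.
SELECT title, year, rating
FROM movies
WHERE year BETWEEN 1990 AND 1999
  AND genre = 'Drama'
ORDER BY rating DESC
LIMIT 10;
```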

Built on proven technology

🦆 DuckDB

Lightning-fast OLAP engine. Processes gigabytes in seconds. Perfect for analytical workloads.

🧊 Apache Iceberg

Modern table format. ACID transactions, schema evolution, partition pruning. Industry standard.

⚡ FastAPI

High-performance Python web framework. Automatic OpenAPI docs. Type-safe and fast.

🐳 Docker

Containerized deployment. Runs anywhere. No dependency hell. One-command setup.

Ready to query your Iceberg lake?

Join the data teams already using Cloudfloe to explore their data lakes faster than ever.