Cloudfloe - The Fastest Way to Query Your Iceberg Data Lake

Data lakes are great. Iceberg is powerful.
But querying them? Not so much.

🏗️

Trino Setup Hell

Heavy, cluster-based, complex configuration. Hours of setup before your first query.

🔒

Vendor Lock-in

Athena ties you to AWS. BigQuery to Google. What about multi-cloud freedom?

💸

Hidden Costs

Surprise bills from scanning terabytes. No visibility into what you're actually paying for.

🐌

Slow Feedback

Wait for clusters to spin up. Wait for queries to run. Wait for results. Rinse, repeat.

Cloudfloe is different

DuckDB's blazing performance meets Iceberg's flexibility. Zero infrastructure, maximum speed.

⚡

Instant Queries

No cluster spin-up time. No waiting. DuckDB processes your Iceberg data at memory speed.

🌍

Vendor Neutral

Works with AWS S3, Cloudflare R2, MinIO, WarpStream Tableflow, or any S3-compatible storage. Your data stays yours.

🐳

Docker Ready

One command deployment. Runs anywhere Docker runs. No complex infrastructure required.

💰

Cost Transparent

See exactly how much data you're scanning and what it costs. No surprise bills.

🔓

Open Source

MIT licensed. Self-hosted. Modify it, extend it, contribute to it. Community driven.

🚀

Modern UI

Beautiful SQL editor with syntax highlighting. Query history. Export capabilities. Built for data professionals.

Perfect for data teams who want to move fast

👨‍💻

Data Engineers

Need to peek inside Iceberg tables fast without setting up Trino clusters or wrestling with Athena permissions.

"I just want to see my Iceberg data now"

📊

BI Developers

Building dashboards with Superset, Rill, or Evidence and need a reliable query scratchpad for prototyping.

"Let me test this query before I wire it into production"

💼

FinOps Teams

Want visibility into data scan costs and query patterns before committing to expensive cloud analytics services.

"How much will this dashboard actually cost us?"

Get started in 30 seconds

Clone & Start

                            git clone https://github.com/gordonmurray/cloudfloe

cd cloudfloe

docker compose up -d

Connect Your Data

Point Cloudfloe at your S3 bucket with Iceberg tables. Works with AWS, Cloudflare R2, MinIO, WarpStream Tableflow, or any S3-compatible storage.

Start Querying

Write SQL, hit Ctrl+Enter, see results instantly. Export to CSV, save queries, track costs.

Get Started on GitHub

Example queries you can run

Real IMDb movie data in Iceberg format, queried instantly with DuckDB

What you can do:

✅ Docker Compose startup in seconds
✅ Connect to MinIO with sample data
✅ Query 37,537 IMDb movies instantly
✅ Filter by decade, genre, runtime
✅ Export results to CSV
✅ Track query costs and performance

Try these queries:

SELECT title, year, rating

FROM movies

WHERE year >= 2020

ORDER BY rating DESC

LIMIT 10;

SELECT genre, COUNT(*) as count,

       AVG(rating) as avg_rating

FROM movies

GROUP BY genre

ORDER BY count DESC;

Built on proven technology

🦆 DuckDB

Lightning-fast OLAP engine. Processes gigabytes in seconds. Perfect for analytical workloads.

🧊 Apache Iceberg

Modern table format. ACID transactions, schema evolution, partition pruning. Industry standard.

⚡ FastAPI

High-performance Python web framework. Automatic OpenAPI docs. Type-safe and fast.

🐳 Docker

Containerized deployment. Runs anywhere. No dependency hell. One command setup.

The fastest way to query your Iceberg data lake

Data lakes are great. Iceberg is powerful.
But querying them? Not so much.

Trino Setup Hell

Vendor Lock-in

Hidden Costs

Slow Feedback

Cloudfloe is different

Instant Queries

Vendor Neutral

Docker Ready

Cost Transparent

Open Source

Modern UI

Perfect for data teams who want to move fast

Data Engineers

BI Developers

FinOps Teams

Get started in 30 seconds

Clone & Start

Connect Your Data

Start Querying

Example queries you can run

What you can do:

Try these queries:

Built on proven technology

🦆 DuckDB

🧊 Apache Iceberg

⚡ FastAPI

🐳 Docker

Ready to query your Iceberg lake?

Data lakes are great. Iceberg is powerful.But querying them? Not so much.

Trino Setup Hell

Vendor Lock-in

Hidden Costs

Slow Feedback

Cloudfloe is different

Instant Queries

Vendor Neutral

Docker Ready

Cost Transparent

Open Source

Modern UI

Perfect for data teams who want to move fast

Data Engineers

BI Developers

FinOps Teams

Get started in 30 seconds

Clone & Start

Connect Your Data

Start Querying

Example queries you can run

What you can do:

Try these queries:

Built on proven technology

🦆 DuckDB

🧊 Apache Iceberg

⚡ FastAPI

🐳 Docker

Ready to query your Iceberg lake?

Data lakes are great. Iceberg is powerful.
But querying them? Not so much.