Dune Analytics Bids Farewell to Its First‑Generation Infrastructure – A Technical Retrospective
By [Your Name] – March 4 2026
The team behind Dune Analytics has officially retired the architecture that powered its original platform, commonly referred to as “Dune v1.” In a recent blog entry, the engineers reflected on the design choices, operational hurdles, and lessons learned from running a high‑throughput blockchain analytics service on Amazon Web Services (AWS). The post offers a rare glimpse into the behind‑the‑scenes engineering that sustained Dune’s early growth and sets the stage for the upcoming v2 rewrite.
From Simple Databases to a Multi‑Tiered Stack
When Dune first launched, each supported blockchain was backed by its own PostgreSQL instance running on a dedicated EC2 server. The team deliberately avoided Amazon RDS because of the cost‑implications of storing massive, constantly updating datasets. Instead, they provisioned PostgreSQL manually via Ansible, giving them fine‑grained control over the underlying infrastructure.
As query volume surged—especially for Ethereum—the single‑instance model began to hit the limits of Elastic Block Store (EBS) I/O throughput. The engineers responded by introducing a primary‑replica topology: a write‑focused primary database persisted on EBS for durability, while multiple read‑only replicas were deployed on EC2 instances equipped with fast local NVMe storage (Instance Store). Data from the primary was streamed to the replicas using PostgreSQL’s physical replication and WAL‑shipping to S3 via the open‑source tool wal‑g. On the query side, pgbouncer and HAProxy were layered to distribute read traffic across the replica fleet.
The new configuration proved robust. At its peak, the Ethereum read‑replica pool sustained more than one million IOPS—a figure that dwarfs the 16 k IOPS ceiling of a standard EBS volume. “The system held up under a level of demand that would have crippled most cloud‑native databases,” the blog author noted.
Operational Friction Points
Despite its performance, the v1 stack exposed several operational weaknesses:
-
Fragile Instance Management – Each EC2 box was essentially a “snowflake,” and recovery often boiled down to re‑running Ansible and hoping the instance rebooted. While backups and EBS snapshots offered a safety net, restoring a lost replica required manual steps and could be time‑consuming.
-
Spot Instance Instability – To curb costs, the team experimented with AWS Spot instances for read replicas, attracted by a sub‑5 % reclamation risk according to the Spot Instance Advisor. In practice, spot interruptions occurred more frequently than anticipated, leading to three separate instance terminations within a span of a week. The manual two‑hour recovery process proved unsustainable, prompting the team to abandon spot instances altogether.
- Storage Limits for Binance Smart Chain – The BSC database eventually outgrew the 16 TB limit of a single EBS volume. A move to ZFS compression reduced on‑disk size but introduced unacceptable latency penalties. The solution was to migrate BSC data to Instance Store, accepting the trade‑off of higher risk (loss of data on instance failure) in exchange for storage capacity and performance.
These experiences underscored a broader theme in site‑reliability engineering (SRE): the tension between cost efficiency and operational reliability.
Transition to Dune v2
The retirement of v1 coincides with the rollout of Dune v2, a next‑generation platform built to address the shortcomings of its predecessor. While details of v2’s architecture remain under wraps, the engineering team has signaled a shift toward more automated provisioning, resilient storage abstractions, and a streamlined monitoring stack. The aim is to reduce “snowflake” instances, eliminate manual recovery steps, and provide a more predictable cost model.
Analysis & Takeaways
| Issue | How Dune v1 Handled It | Outcome | Implications for Future Deployments |
|---|---|---|---|
| High Query Latency on Ethereum | Primary‑replica design with NVMe read replicas | Sustained >1 M IOPS, no major outages | Demonstrates that hybrid storage (EBS + Instance Store) can meet intense read demands when properly orchestrated. |
| Instance Recovery | Manual Ansible re‑run; reliance on snapshots | Slow, error‑prone | Emphasizes need for automated instance replacement and immutable infrastructure. |
| Cost Management | Spot instances for read replicas | Frequent reclamations; manual fixes | Spot usage must be paired with robust auto‑recovery mechanisms to be viable at scale. |
| Data Volume Limits | ZFS compression trial (failed); migration to Instance Store | Resolved storage cap but increased risk | Future designs should incorporate scalable storage layers (e.g., distributed file systems or tiered object storage) to avoid hard caps. |
Key takeaways for the DeFi analytics community:
- Hybrid storage architectures can effectively balance cost and performance, but they require sophisticated replication and monitoring.
- Automation is essential; manual recovery processes quickly become bottlenecks as infrastructure scales.
- Spot instances are a double‑edged sword: while they reduce spend, they demand resilient orchestration to prevent service disruptions.
- Planning for data growth from the outset avoids costly retrofits—especially for blockchains that generate terabytes of historic data.
Looking Ahead
Dune v1’s story is a testament to the ingenuity required to build real‑time analytics on rapidly expanding blockchain data. The platform’s ability to sustain extreme I/O loads without major incidents speaks to the competence of its engineering team. As Dune v2 comes online, the community can expect a more resilient, automated, and cost‑effective foundation that builds on the hard‑won lessons of its predecessor.
For further details, the original farewell post is available on Dune’s official blog.
Source: https://dune.com/blog/a-farewell-to-dune-v1


















