🧠 A2A Memory System

Benchmark: Before vs After β€” Live Production Tests
2026-02-09 | Oracle VM | SQLite + FTS5 + Knowledge Graph
262x
Graph Queries
4ms vs 1001ms
2.7x
Repeated Queries
67% fewer API calls
0β†’10
Knowledge Recall
Amnesia β†’ Full memory
1.8x
Cross-Agent
Instant context sharing
8ms
Search Speed
FTS5 per operation
33%
API Saved
600 calls/month
πŸ“Š All Tests
⚑ Stress Test
πŸ’° ROI
🎯 Conclusions
πŸ”„ Test 1: Repeated Query Γ—3
2.7x faster
Customer asks "Who are our competitors?" three times β€” a typical scenario.
πŸ”΄ Without Memory
3,004ms
15 API calls (full research Γ—3)
🟒 With Memory
1,107ms
5 API calls (research once, cache twice)
Without3,004ms
With Memory1,107ms
πŸ’‘ Query 2 from graph: 3ms (vs 1,001ms). Query 3 via FTS5: 3ms. Saves 67% of API calls.
πŸ”— Test 2: Cross-Agent Context
1.8x faster
Researcher finds data, Writer needs it. Without memory β€” Writer repeats the search.
πŸ”΄ Without Memory
1,102ms
Writer wastes 500ms re-searching
🟒 With Memory
625ms
Writer reads 2 observations instantly
Without1,102ms
With Memory625ms
πŸ’‘ Agents share context through persistent knowledge graph β€” no duplicate work.
🧠 Test 3: 10 Sessions β€” Knowledge Recall
0 β†’ 10
After 10 work sessions, ask: "What technologies have we studied?"
πŸ”΄ Without Memory
0 knowledge
Complete amnesia β€” forgets everything
🟒 With Memory
10 entities
Full recall in 4ms + relations + FTS
πŸ’‘ Knowledge accumulates over time. The longer the customer uses the system, the smarter it gets.
πŸ•ΈοΈ Test 4: Graph Impact Analysis
262x faster
Question: "What will Bug #42 in the API service affect?" β€” needs transitive dependency traversal.
πŸ”΄ Without Memory
1,001ms
3 sequential agent queries, incomplete result
🟒 With Memory
4ms
BFS depth=2: 5 entities, 100% coverage
Without1,001ms
With Memory4ms
πŸ’‘ Graph traversal: Bugβ†’APIβ†’DB, APIβ†’Auth, Projectβ†’API β€” complete dependency map in a single query.
⚑ Stress Test β€” 170 Operations on Live System
15.7ms
Write (observe)
100 ops β†’ 1,566ms total
8.0ms
Search (FTS5)
50 ops β†’ 400ms total
9.3ms
Graph (BFS)
20 ops β†’ 186ms total
+16KB
DB Growth
100 observations added
55
Entities After
From 35 β†’ 55
137
Observations
From 37 β†’ 137
πŸ’‘ System stable under sustained load. SQLite + FTS5 holds sub-16ms latency. 5MB RAM footprint. Zero errors.
πŸ’° ROI β€” Monthly API Savings
Model: 20 tasks/day Γ— 30 days = 600 tasks/month. 40% repeat rate. 3 API calls per task.
1,800
API calls without memory
β†’
1,200
API calls with memory
β†’
600
calls saved (33%)
πŸ’‘ At 100 customers: 60,000 API calls/month saved. Memory pays for itself by extending free tier limits.
πŸ•ΈοΈ
262x
Graph queries are the killer feature. Impact analysis in 4ms vs manual agent polling in 1s.
πŸ”„
2.7x
Repeated queries β€” the most common case. Customers return to the same topics.
πŸ”—
1.8x
Cross-agent context. Writer produces better output with Researcher's data instantly.
🧠
0β†’10
Knowledge accumulation. The system gets smarter the longer you use it.
⚑
8-16ms
Production-ready. SQLite + FTS5 on any lightweight VM.
πŸ’°
33%
API savings. Memory extends free tier limits at scale.

Technology Stack

SQLite + FTS5 (Full-Text Search) + Knowledge Graph (entities, relations, observations)
18 REST API endpoints Β· 42/42 tests passing Β· 5MB RAM Β· Port 5950
πŸ“‚ GitHub: a2a-agent-saas/a2a-memory