Graph Embeddings + SurrealDB

AI Memory Architecture for Agentic Systems

Research Report · February 2026 · Gengar

Executive Summary: SurrealDB's multi-model architecture (document + graph + vector) enables a unified AI memory system. By combining graph embeddings with vector similarity search, agents can retrieve context through both semantic similarity and relational traversal—creating a more human-like memory that understands not just what is relevant, but how things connect.

Contents

  1. The Memory Problem
  2. Why SurrealDB
  3. Proposed Architecture
  4. Data Model & Schema
  5. Query Patterns
  6. Integration Strategy
  7. Comparison with Neo4j GraphRAG
  8. Recommendations

1. The Memory Problem

Current AI agent memory systems suffer from dimensional poverty: vector stores capture what a memory is about but flatten away how memories relate, while plain logs and key-value stores preserve order but retrieve nothing by meaning.

The ideal AI memory needs both: semantic similarity, to find memories that are about the same thing, and relational structure, to follow how memories connect across time, topic, and cause.
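To make the two retrieval modes concrete, here is a toy sketch in plain Python (hypothetical data and edges) contrasting pure vector recall with hybrid recall that also follows graph edges:

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Hypothetical memory store: id -> (embedding, content)
memories = {
    "m1": ([1.0, 0.0], "user prefers dark mode"),
    "m2": ([0.9, 0.1], "user asked about UI themes"),
    "m3": ([0.0, 1.0], "dark mode broke the charts"),  # dissimilar vector...
}
# ...but causally linked to m1 by a graph edge
edges = {"m1": ["m3"]}

def vector_recall(query, k=2):
    ranked = sorted(memories, key=lambda m: cosine(memories[m][0], query),
                    reverse=True)
    return ranked[:k]

def hybrid_recall(query, k=2):
    hits = vector_recall(query, k)
    # Expand one hop along graph edges: relational context that
    # similarity alone would miss.
    expanded = list(hits)
    for h in hits:
        for n in edges.get(h, []):
            if n not in expanded:
                expanded.append(n)
    return expanded

print(vector_recall([1.0, 0.0]))  # m3 is invisible to similarity alone
print(hybrid_recall([1.0, 0.0]))  # the graph edge surfaces m3
```

The causally linked memory m3 scores zero on similarity yet is exactly the context an agent needs; only the graph hop recovers it.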

2. Why SurrealDB

SurrealDB is uniquely positioned as a multi-model database that unifies:

| Capability       | Traditional Approach                | SurrealDB Approach          |
|------------------|-------------------------------------|-----------------------------|
| Documents        | MongoDB / PostgreSQL JSONB          | Native, schemaless records  |
| Graph Relations  | Neo4j (+ separate vector DB)        | Built-in RELATE statements  |
| Vector Search    | Pinecone / Weaviate (+ graph DB)    | Native vector indexes       |
| Full-Text Search | Elasticsearch (+ sync layer)        | Integrated FTS indexes      |
| Real-Time Sync   | WebSocket + pub/sub                 | Live queries (built-in)     |
Key Insight: SurrealDB's RELATE statement creates graph edges as first-class citizens, while vector fields enable similarity search—all in one query language (SurrealQL).

3. Proposed Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        AI AGENT (Gengar)                         │
│  ┌─────────────┐   ┌─────────────┐   ┌─────────────────────────┐ │
│  │   Working   │   │    Graph    │   │     Vector Encoder      │ │
│  │   Memory    │   │  Traversal  │   │      (Embeddings)       │ │
│  │  (Context)  │   │   Engine    │   │                         │ │
│  └──────┬──────┘   └──────┬──────┘   └───────────┬─────────────┘ │
│         │                 │                      │               │
│         └─────────────────┼──────────────────────┘               │
│                           ▼                                      │
│                ┌─────────────────────┐                           │
│                │  Memory Controller  │                           │
│                │   (Orchestration)   │                           │
│                └──────────┬──────────┘                           │
└───────────────────────────┼──────────────────────────────────────┘
                            │
                            ▼
┌─────────────────────────────────────────────────────────────────┐
│                         SURREALDB LAYER                          │
│  ┌─────────────────┐  ┌─────────────────┐  ┌────────────────┐    │
│  │   memory_node   │  │   memory_edge   │  │  vector_index  │    │
│  │  (embeddings)   │  │   (relations)   │  │  (similarity)  │    │
│  │                 │  │                 │  │                │    │
│  │ • content       │  │ • in/out refs   │  │ • cosine       │    │
│  │ • vector[]      │  │ • relation_type │  │ • euclidean    │    │
│  │ • importance    │  │ • strength      │  │ • manhattan    │    │
│  │ • timestamp     │  │ • timestamp     │  │                │    │
│  │ • metadata      │  │ • metadata      │  │                │    │
│  └─────────────────┘  └─────────────────┘  └────────────────┘    │
│                                                                  │
│  Query Patterns:                                                 │
│  • Vector: SELECT * FROM memory_node WHERE embedding <|5|>       │
│  • Graph:  SELECT * FROM memory_node->relates_to->*              │
│  • Hybrid: (vector result) + (traverse graph from result)        │
└─────────────────────────────────────────────────────────────────┘

Memory Lifecycle

  1. Creation: Content + metadata → embedding vector → stored as node
  2. Linking: RELATE creates edges (temporal, semantic, causal)
  3. Decay: Importance scores decay; weak memories fade
  4. Retrieval: Hybrid query (vector similarity + graph traversal)
  5. Reinforcement: Accessed memories gain importance, strengthen connections
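Steps 3 and 5 of the lifecycle reduce to simple importance arithmetic. A minimal sketch, assuming exponential decay; the decay rate and access boost are hypothetical constants that would be tuned empirically:

```python
import math

# Hypothetical constants; real values would be tuned empirically.
DECAY_RATE = 0.05      # per day of inactivity
ACCESS_BOOST = 0.1     # added on each recall
MAX_IMPORTANCE = 1.0

def decay(importance: float, days_since_access: float) -> float:
    """Step 3: importance decays exponentially with idle time."""
    return importance * math.exp(-DECAY_RATE * days_since_access)

def reinforce(importance: float) -> float:
    """Step 5: each access nudges importance up, capped at 1.0."""
    return min(MAX_IMPORTANCE, importance + ACCESS_BOOST)

# A memory untouched for 30 days fades, while one recalled twice strengthens.
faded = decay(0.5, 30)
strengthened = reinforce(reinforce(0.5))
print(round(faded, 3), round(strengthened, 2))
```

A background job would periodically apply `decay` to all nodes and prune those below a threshold, while `reinforce` runs on every retrieval hit.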

4. Data Model & Schema

Memory Nodes (Documents with Vectors)

DEFINE TABLE memory_node SCHEMAFULL;

DEFINE FIELD content ON memory_node TYPE string;
DEFINE FIELD embedding ON memory_node TYPE array<float>;
DEFINE FIELD embedding.* ON memory_node TYPE float;
DEFINE FIELD importance ON memory_node TYPE float DEFAULT 0.5;
DEFINE FIELD created_at ON memory_node TYPE datetime DEFAULT time::now();
DEFINE FIELD accessed_at ON memory_node TYPE datetime DEFAULT time::now();
DEFINE FIELD access_count ON memory_node TYPE int DEFAULT 0;
DEFINE FIELD memory_type ON memory_node TYPE string;
DEFINE FIELD source ON memory_node TYPE string;
DEFINE FIELD metadata ON memory_node TYPE object;

-- Vector index for similarity search
DEFINE INDEX memory_embedding_idx ON memory_node 
    FIELDS embedding 
    MTREE DIMENSION 1536 
    DIST COSINE;
    

Memory Edges (Graph Relations)

-- Temporal: memory A happened before memory B
DEFINE TABLE temporal_rel SCHEMAFULL TYPE RELATION;
DEFINE FIELD in ON temporal_rel TYPE record<memory_node>;
DEFINE FIELD out ON temporal_rel TYPE record<memory_node>;
DEFINE FIELD strength ON temporal_rel TYPE float DEFAULT 1.0;

-- Semantic: memory A is related to memory B
DEFINE TABLE semantic_rel SCHEMAFULL TYPE RELATION;
DEFINE FIELD in ON semantic_rel TYPE record<memory_node>;
DEFINE FIELD out ON semantic_rel TYPE record<memory_node>;
DEFINE FIELD relation_type ON semantic_rel TYPE string;
DEFINE FIELD strength ON semantic_rel TYPE float;

-- Causal: memory A caused/led to memory B
DEFINE TABLE causal_rel SCHEMAFULL TYPE RELATION;
DEFINE FIELD in ON causal_rel TYPE record<memory_node>;
DEFINE FIELD out ON causal_rel TYPE record<memory_node>;
DEFINE FIELD confidence ON causal_rel TYPE float;
    

Full-Text Search Index

DEFINE ANALYZER memory_analyzer 
    TOKENIZERS class, camel 
    FILTERS lowercase, snowball(english);

DEFINE INDEX memory_content_search ON memory_node 
    FIELDS content 
    SEARCH ANALYZER memory_analyzer 
    BM25;
    

5. Query Patterns

Pattern 1: Pure Vector Similarity

-- Find memories semantically similar to query embedding
SELECT *, vector::similarity::cosine(embedding, $query_vector) as similarity
FROM memory_node
WHERE embedding <|10|> $query_vector
ORDER BY similarity DESC;
    

Pattern 2: Graph Traversal

-- Find all memories temporally connected to a specific memory
SELECT * FROM memory_node:specific_id
    ->temporal_rel->memory_node
    ->temporal_rel->memory_node;

-- Find memories related through any edge type (? is the wildcard edge)
SELECT * FROM memory_node:start_id
    ->?->memory_node
    WHERE importance > 0.7;
    

Pattern 3: Hybrid (Vector + Graph)

-- Find similar memories, then traverse their connections
LET $similar = (
    SELECT VALUE id FROM memory_node
    WHERE embedding <|5|> $query_vector
);

-- Expand one hop along semantic edges, then score the union
LET $connected = array::flatten(
    (SELECT VALUE ->semantic_rel->memory_node FROM $similar)
);

SELECT *,
    vector::similarity::cosine(embedding, $query_vector) AS similarity
FROM array::union($similar, $connected)
ORDER BY importance * similarity DESC
LIMIT 20;
    

Pattern 4: Importance-Weighted Recall

-- Boost memories that are both similar AND important
SELECT *, 
    vector::similarity::cosine(embedding, $query_vector) * importance as score
FROM memory_node
WHERE embedding <|20|> $query_vector
ORDER BY score DESC
LIMIT 10;
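The same similarity-times-importance scoring can be mirrored client-side when reranking rows already fetched from the database. A plain-Python sketch with hypothetical candidate rows:

```python
# Hypothetical candidates: (id, similarity, importance), as Pattern 4
# would return them before scoring.
candidates = [
    ("m1", 0.95, 0.2),   # very similar, but low importance
    ("m2", 0.80, 0.9),   # slightly less similar, highly important
    ("m3", 0.60, 0.5),
]

def weighted_recall(rows, limit=10):
    # score = similarity * importance, matching the SurrealQL above
    scored = [(mid, sim * imp) for mid, sim, imp in rows]
    scored.sort(key=lambda r: r[1], reverse=True)
    return [mid for mid, _ in scored[:limit]]

print(weighted_recall(candidates))  # m2 outranks the more-similar m1
```

Note how the important-but-less-similar m2 wins: 0.80 × 0.9 = 0.72 beats 0.95 × 0.2 = 0.19.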
    

Pattern 5: Temporal Context Window

-- Get memories around a specific time, connected by temporal edges
SELECT * FROM memory_node
WHERE (created_at > $start_time AND created_at < $end_time)
    OR id IN (
        SELECT VALUE out FROM temporal_rel 
        WHERE in = $anchor_memory
    )
ORDER BY created_at;
    

6. Integration Strategy

Option A: Direct SurrealDB Integration (Recommended)

Architecture: OpenClaw tool → SurrealDB SDK → SurrealDB instance

Option B: Hybrid (SurrealDB + File Cache)

Architecture: Working memory (files) + Deep memory (SurrealDB)

Python SDK Example

from surrealdb import Surreal
from openai import AsyncOpenAI

class GraphMemory:
    def __init__(self, db_url: str = "ws://localhost:8000"):
        self.db = Surreal(db_url)
        self.openai = AsyncOpenAI()
    
    async def connect(self):
        # The SDK methods are coroutines, so authentication happens
        # here rather than in __init__
        await self.db.connect()
        await self.db.signin({"user": "root", "pass": "root"})
        await self.db.use("agent", "memory")
    
    async def store(self, content: str, memory_type: str = "observation"):
        # Generate embedding
        embedding = await self._embed(content)
        
        # Create memory node; created_at is filled in by the
        # schema default (time::now())
        memory = await self.db.create("memory_node", {
            "content": content,
            "embedding": embedding,
            "memory_type": memory_type,
            "importance": 0.5,
        })
        
        # Link to recent memories
        await self._create_temporal_links(memory[0]["id"])
        return memory[0]
    
    async def recall(self, query: str, limit: int = 10):
        embedding = await self._embed(query)
        
        # Hybrid: vector search, then expand along semantic edges
        # (the final statement's result holds the ranked rows)
        results = await self.db.query("""
            LET $similar = (
                SELECT VALUE id FROM memory_node 
                WHERE embedding <|5|> $embedding
            );
            LET $connected = array::flatten(
                (SELECT VALUE ->semantic_rel->memory_node FROM $similar)
            );
            SELECT *, vector::similarity::cosine(embedding, $embedding) AS sim 
            FROM array::union($similar, $connected)
            ORDER BY importance * sim DESC
            LIMIT $limit;
        """, {"embedding": embedding, "limit": limit})
        
        return results
    
    async def _embed(self, text: str) -> list[float]:
        response = await self.openai.embeddings.create(
            model="text-embedding-3-small",
            input=text,
        )
        return response.data[0].embedding
    

7. Comparison with Neo4j GraphRAG

| Aspect         | Neo4j + GraphRAG Python               | SurrealDB Native            |
|----------------|---------------------------------------|-----------------------------|
| Vector Storage | Separate (Neo4j + Pinecone/Weaviate)  | Built-in                    |
| Query Language | Cypher + Python SDK                   | SurrealQL (SQL-like)        |
| Deployment     | Complex (multi-service)               | Single binary / container   |
| Real-Time      | Polling / custom                      | Live queries (WebSocket)    |
| Maturity       | Enterprise-proven                     | Rapidly evolving (v2.x)     |
| Ecosystem      | Rich (LangChain, etc.)                | Growing                     |
| Self-Hosting   | Requires Aura or self-managed         | Single binary, edge-ready   |

8. Recommendations

Immediate Actions

  1. Deploy SurrealDB — Single container, minimal resource overhead
  2. Design schema — Use the node/edge model above as starting point
  3. Implement vector encoder — OpenAI text-embedding-3-small (1536 dims)
  4. Build memory controller — Python SDK wrapper with caching layer
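For the caching layer in step 4, one cheap win is memoizing embeddings by content hash so identical text is never re-embedded. A minimal sketch; the `fake_embed` stand-in is hypothetical, and a real controller would pass the actual embedding call:

```python
import hashlib

class EmbeddingCache:
    """Avoid re-embedding identical text; wraps any embed function."""
    def __init__(self, embed_fn):
        self.embed_fn = embed_fn
        self.cache = {}
        self.hits = 0

    def embed(self, text: str):
        key = hashlib.sha256(text.encode()).hexdigest()
        if key in self.cache:
            self.hits += 1
        else:
            self.cache[key] = self.embed_fn(text)
        return self.cache[key]

# Stand-in embedder for illustration; production code would call
# the real embedding API here.
calls = []
def fake_embed(text):
    calls.append(text)
    return [float(len(text))]

cache = EmbeddingCache(fake_embed)
cache.embed("hello")
cache.embed("hello")           # served from cache; no second call
print(len(calls), cache.hits)  # one underlying call, one cache hit
```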

Migration Path

  1. Phase 1: Keep current file-based memory for working context
  2. Phase 2: Add SurrealDB for long-term memory persistence
  3. Phase 3: Migrate retrieval logic to hybrid (vector + graph)
  4. Phase 4: Add importance decay and memory consolidation

Risk Assessment

| Risk                             | Mitigation                                  |
|----------------------------------|---------------------------------------------|
| SurrealDB v2.x breaking changes  | Pin version, test upgrades in staging       |
| Vector dimension limits          | Use 1536 (OpenAI) or test 768 (local models)|
| Query performance at scale       | Index optimization, query result caching    |
| Embedding generation cost        | Batch processing, local model fallback      |
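The batch-processing mitigation in the last row can be as simple as chunking texts before each embedding call, so one API request embeds many memories. A sketch; the batch size of 64 is an arbitrary assumption:

```python
def batches(items, size=64):
    """Yield fixed-size chunks so one API call embeds many texts."""
    for i in range(0, len(items), size):
        yield items[i:i + size]

texts = [f"memory {i}" for i in range(150)]
sizes = [len(b) for b in batches(texts, size=64)]
print(sizes)  # three embedding calls instead of 150
```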

The Bottom Line

SurrealDB's multi-model approach eliminates the need for separate vector and graph databases. For an AI agent requiring both semantic search and relational reasoning, this reduces operational complexity while enabling sophisticated memory patterns that mirror human cognition.

The trade-off is maturity—Neo4j has years of production use, while SurrealDB is newer but rapidly improving. For a lean core that prioritizes architectural elegance over enterprise legacy, SurrealDB is the sharper tool.