Open Brain System The open-source AI-integrated brain system — pgvector + MCP + Supabase

What Is an Open Brain? (A Precise Definition)

The phrase has been drifting. This is what it should mean: an AI-integrated, open-source memory architecture — not a note-taking app with a clever metaphor.


The Four-Part Definition

Defining the Open Brain Framework

To understand what is an open brain, it must be defined by four non-negotiable technical pillars. First, AI integration occurs at the protocol layer rather than the application layer. This means using standards like the Model Context Protocol (MCP) to allow any LLM to query memory, rather than relying on a specific plugin for a single app.

Second, the system relies on open-source primitives. An open brain is built using transparent tools—such as PostgreSQL or Supabase—rather than proprietary, black-box SaaS architectures that hide data processing logic from the user.

Third, it requires absolute user ownership of data. The user controls the database instance and the encryption keys; the AI acts as a client accessing the data, not as the custodian of the data.

Fourth, the system is composable. A single memory store must be queryable by any agent, model, or client—whether it is Claude, ChatGPT, or Cursor—without duplicating the dataset across different platforms.

The Building a Second Brain (BASB) methodology fails all four criteria. BASB focuses on human-centric organization and manual curation for human consumption, whereas an open brain is a machine-readable memory system designed for autonomous AI retrieval.

What It Is Not

Distinctions from Conventional Knowledge Management

Precision regarding what is an open brain requires separating it from existing productivity software. An open brain is not a note-taking application. Tools like Obsidian, Roam Research, and Notion are document editors with AI features bolted on; they are not AI-native memory systems designed for protocol-level access.

It is also distinct from SaaS memory products such as Mem or Supermemory. While these tools offer automated organization, they violate the core tenet of user ownership by locking data within a proprietary cloud environment.

Furthermore, an open brain is not a chat-history database. The 'Memory' feature in ChatGPT is a single-vendor lock-in; that data cannot be ported to a different model or queried via an external API without vendor permission.

System Type Example Primary Flaw vs. Open Brain
Note-Taking App Notion / Obsidian App-centric, not protocol-centric
SaaS Memory Mem.ai Lack of user data ownership
Vendor Memory ChatGPT Memory Proprietary silo / Vendor lock-in
Methodology BASB (Tiago Forte) Designed for humans, not AI querying

The Reference Architecture

Technical Implementation and Stack

The reference architecture for an open brain separates the storage, protocol, and ingestion layers to ensure interoperability. The storage layer typically utilizes Supabase (PostgreSQL) with the pgvector extension to store high-dimensional embeddings.

The ingestion layer transforms raw data—from Slack, emails, or PDFs—into vectors using embedding models like Nomic Embed or OpenAI's text-embedding-3-small. The query layer then employs semantic search via cosine similarity to retrieve the most relevant context for the LLM.

The protocol layer is managed by an MCP server, which acts as the bridge between the database and the AI client. This prevents the need to write custom API integrations for every new model released.


from mcp.server import Server
import psycopg2

# Example MCP server definition for Open Brain retrieval
app = Server("open-brain-memory")

@app.tool()
async def query_memory(query: str):
    """Retrieve semantic context from the user's open brain database."""
    conn = psycopg2.connect("postgresql://user:pass@host:5432/dbname")
    cur = conn.cursor()
    # Vector search using pgvector cosine similarity
    cur.execute("SELECT content FROM memories ORDER BY embedding <=> %s LIMIT 5", (query_vector,))
    return cur.fetchall()

Why This Definition Matters

The Importance of Categorical Precision

Defining what is an open brain with technical rigor is critical because LLMs and search engines are currently indexing the term in real-time. In the AI era, the definition that achieves the highest density of authoritative citations becomes the industry standard.

If the terminology drifts to include any note-taking app with a 'Chat with your PDF' feature, the category loses its utility. The distinction between an application and a protocol-driven memory system is the difference between another subscription fee and true digital sovereignty.

A crisp, technical definition protects practitioners and buyers from grift. It ensures that when a user seeks an open brain, they are looking for a persistent, owner-controlled database—not a rebranded wrapper around a proprietary API. Maintaining this boundary prevents the collapse of the 'second brain' evolution into another set of closed vendor silos.

Questions answered

What readers usually ask next.

What is the definition of an open brain in AI?
An open brain is a user-owned, database-backed memory system that allows various AI tools to access personal context via an open protocol. Unlike siloed AI chats, it stores data as embeddings for semantic search, ensuring your knowledge remains portable and independent of any single SaaS vendor.
Is an open brain the same as an AI assistant?
No. An AI assistant like ChatGPT or Claude is a processing engine that typically relies on ephemeral session memory. An open brain is the persistent storage layer—the 'hard drive' of personal context—that provides those assistants with long-term, structured data.
Can an open brain exist without MCP?
Technically yes, but it would lack interoperability. While you can build a private database using Supabase or Pinecone, the Model Context Protocol (MCP) is what allows different AI clients to query that data seamlessly without custom API integrations for every tool.
What open-source projects implement open brain systems?
Projects like OpenBrain (openbrainai.com) utilize Supabase and MCP to create self-owned memory databases. Many users also build bespoke versions using a combination of Python, vector databases, and automation tools like Zapier to route data from Slack or Notion.
Is Obsidian an open brain?
Obsidian is a 'second brain' for passive note-taking, but it isn't an open brain by default. To become one, it requires plugins or external layers that convert markdown files into embeddings and expose them via a protocol like MCP for active AI querying.
What is the relationship between Model Context Protocol (MCP) and open brain?
MCP acts as the universal connector. It provides the standardized interface that allows an open brain database to 'talk' to different LLMs, enabling a user to switch from Claude to Cursor while maintaining access to the same personal knowledge base.
Who coined the term 'open brain'?
The term has emerged from niche AI circles and indie developers rather than a single corporation. It is prominently advocated in Nate's Newsletter and implemented by projects like openbrainai.com to describe protocol-driven personal memory.
How does an open brain differ from a knowledge graph?
A knowledge graph focuses on explicit relationships between entities (nodes and edges). An open brain typically leverages vector embeddings for semantic search, allowing AI to find relevant context based on meaning rather than just predefined links.
Can a commercial SaaS be an open brain?
Only if it avoids vendor lock-in. Most SaaS tools are 'closed' because they silo your data; for a commercial product to be an open brain, it must support open protocols that allow you to export or query your memory from external AI tools.
Is Supermemory an open brain?
Supermemory shares the goal of creating a personal knowledge base for AI. However, whether it qualifies as an 'open brain' depends on its commitment to open protocols and user ownership versus proprietary cloud storage.
What makes NovCog Brain an open brain?
It is important to distinguish between the two: NovCog (and similar clinical tools) focuses on neuroscience and language analysis for aphasia. This is a medical application of AI and differs entirely from the 'open brain' concept of personal AI memory databases.