Logo
Back to podcasts

The Graph Layer Behind NASA’s Breakthroughs

with Michael Hunger

Transcript

Chapters

Trailer
[00:00:00]
Introduction & Neo4j Origins
[00:01:03]
Persisting Relationships for High-Performance Queries
[00:03:02]
Modeling Business Intent & Key Use Cases
[00:04:00]
Fraud Detection at Scale with Graph Algorithms
[00:05:00]
Graph-Enhanced RAG vs. Vector-Only Retrieval
[00:06:11]
Explainability & Drill-Down Evaluation in RAG
[00:09:02]
Fusing Structured & Unstructured Data for Context
[00:13:05]
MCP for Developer Productivity: Schema-to-Code & API Wrapping
[00:15:00]
Security & Sandboxing Best Practices for MCP
[00:21:16]
MCP Server Recommendations & Outro
[00:29:08]

In this episode

In this episode, Simon sits down with Michael Hunger, VP of Product Innovation at Neo4j, to explore how graph databases enhance RAG systems by providing rich, explainable context. They discuss how Neo4j powers more intelligent retrieval, the pitfalls of traditional vector search, and why graphs are ideal for structured + unstructured data fusion. Michael also dives into the power of MCP servers for developer productivity, security risks in LLM-agent pipelines, and how devs can start building secure, customizable MCP integrations today.

Michael Hunger explains the advantages of graph databases—particularly Neo4j—for use cases where relationships between data points matter. Unlike traditional relational databases or vector search systems, graph databases persist relationships at the time of data insertion. This structure enables fast traversal, stronger semantic connections, and more intuitive modeling of real-world domains. The result is a more expressive and performant system for querying interconnected data.

Enhancing Retrieval-Augmented Generation (RAG) with Graphs

The conversation turns to the limitations of conventional RAG pipelines, which often rely solely on dense vector search. Michael outlines key shortcomings of this approach: it lacks transparency, fails to consider context beyond immediate semantic similarity, and often underperforms when dealing with structured or semi-structured data. By integrating a graph database into the RAG system, developers can retrieve not only relevant documents but also the surrounding context—including entities, relationships, timeframes, and business logic—leading to more accurate and explainable responses.

Developer Productivity through MCP Integration

Michael discusses how he began experimenting with the Model Context Protocol (MCP) by wrapping Neo4j’s infrastructure APIs. This allowed developers to spin up test databases, scale environments, and automate integration testing directly from within their development environment or chat interface. MCP servers serve as a bridge between code and infrastructure, enabling higher developer productivity through natural language interfaces and tool orchestration. Michael notes that this pattern is increasingly popular, as more developers recognize the value of integrating dev tools into AI workflows.

Security Risks in AI Automation

A significant portion of the discussion focuses on the security implications of AI agents with access to sensitive systems. Michael highlights a recent vulnerability in the GitHub MCP server, where a prompt injection allowed unintended access to private repositories. He emphasizes the need for strong sandboxing, strict authorization scopes, and architectural separation between instructions and data. Organizations are advised to treat MCP integrations like any other software supply chain component, applying due diligence, validation, and access controls.

Building and Selecting MCP Servers

The episode closes with practical advice on selecting and building MCP servers. Michael recommends developers start by creating their own, as it fosters a better understanding of how LLM-integrated tools operate. He also points to curated MCP registries and infrastructure-focused servers (e.g., Cloudflare, GitHub) as useful starting points. Ultimately, the key to leveraging MCP effectively lies in combining trusted tools, secure practices, and context-aware integrations within the development workflow.

Chapters

Trailer
[00:00:00]
Introduction & Neo4j Origins
[00:01:03]
Persisting Relationships for High-Performance Queries
[00:03:02]
Modeling Business Intent & Key Use Cases
[00:04:00]
Fraud Detection at Scale with Graph Algorithms
[00:05:00]
Graph-Enhanced RAG vs. Vector-Only Retrieval
[00:06:11]
Explainability & Drill-Down Evaluation in RAG
[00:09:02]
Fusing Structured & Unstructured Data for Context
[00:13:05]
MCP for Developer Productivity: Schema-to-Code & API Wrapping
[00:15:00]
Security & Sandboxing Best Practices for MCP
[00:21:16]
MCP Server Recommendations & Outro
[00:29:08]