scripod.com

Building Semantic Memory for AI With Cognee

In this episode of the AI Engineering Podcast, host Tobias Macey speaks with Vasilije Markovic about the evolving challenge of memory integration in large language models (LLMs). As LLMs become more central to complex applications, maintaining context and long-term knowledge remains a persistent hurdle. Markovic shares insights from his experience building Cognee, an open-source semantic memory engine designed to enhance how AI systems retain and retrieve information over time.
Markovic discusses how current LLM architectures struggle with "catastrophic forgetting" due to limited context windows, especially in multi-turn interactions. He introduces the concept of hierarchical memory, balancing short-term retrieval with long-term storage, as a way to improve continuity and reasoning in AI applications. The conversation explores tools like Graph RAAG and semantic memory inspired by cognitive science, which structure knowledge more effectively than traditional vector databases.

Markovic then details the development of Cognee, its architecture built on adaptable data pipelines, and how it integrates with existing systems using AWS services and Python-based frameworks. He compares Cognee's approach to personalization and memory management with other tools like Mem0, and highlights use cases across industries such as agriculture and logistics. Looking ahead, he outlines plans for deeper integrations with emerging AI stacks and predicts future advances in agent communication protocols.
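The Graph RAG idea raised in the conversation can be sketched in a few lines: instead of ranking isolated text chunks by vector similarity, facts are stored as subject-predicate-object triples, and retrieval expands the graph neighborhood around a query entity. The triples and the `expand` helper below are purely illustrative, not Cognee's actual API or data.

```python
from collections import defaultdict

# Tiny illustrative knowledge graph of (subject, predicate, object)
# triples. These facts are invented for the example.
TRIPLES = [
    ("Cognee", "is_a", "semantic memory engine"),
    ("Cognee", "integrates_with", "Neo4j"),
    ("Cognee", "written_in", "Python"),
    ("Neo4j", "is_a", "graph database"),
]

def build_index(triples):
    """Index triples by subject for cheap neighborhood lookups."""
    index = defaultdict(list)
    for s, p, o in triples:
        index[s].append((p, o))
    return index

def expand(index, entity, depth=2):
    """Collect facts reachable from `entity` within `depth` hops,
    surfacing connected context that a flat vector lookup on the
    query string alone would miss."""
    facts, frontier = [], [entity]
    for _ in range(depth):
        next_frontier = []
        for node in frontier:
            for p, o in index.get(node, []):
                facts.append(f"{node} {p} {o}")
                next_frontier.append(o)
        frontier = next_frontier
    return facts

index = build_index(TRIPLES)
context = expand(index, "Cognee")  # includes the second-hop Neo4j fact
```

A query about Cognee here pulls in the fact that Neo4j is a graph database, even though that triple never mentions Cognee, which is the structural advantage over similarity-only retrieval.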
00:19 Vasilije has spent the past year building a memory engine for AI applications.
01:39 Memory in LLM systems is currently based on in-context learning, which keeps operation efficient and low-cost.
03:05 Catastrophic forgetting occurs when LLMs fail to retain prior knowledge as prompts are updated.
05:06 Hierarchical memory improves retrieval efficiency in RAG stacks.
06:52 Graph RAG offers a more structured approach to knowledge retrieval.
10:10 A semantic memory layer modeled on human cognition boosts LLM performance.
14:52 Cognee can serve as short-term memory in the stack, reducing the number of LLM calls.
17:15 The initial product was deployed on AWS using the Atkinson-Shiffrin memory model, Neo4j, and VB8.
23:32 Expect increased use of LLMs as costs decrease.
29:53 Cognee supports deployment on AWS and Kubernetes with minimal disruption to existing systems.
34:34 Cognee secured $1.5 million in funding for sustainable solutions in complex domains.
42:06 Manual work may be automated with the right technology, such as RAG and GraphRAG.
51:39 Relational tools may persist, while vector stores could be displaced by open-source alternatives.
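The hierarchical-memory idea from the 05:06 and 14:52 segments can be sketched as a two-tier store: a small short-term buffer answers repeat queries directly, and items evicted from it remain in long-term storage, so only genuinely new queries pay for an LLM call. The class and method names below are hypothetical, a minimal sketch rather than Cognee's implementation.

```python
from collections import OrderedDict

class HierarchicalMemory:
    """Two-tier memory: a bounded short-term cache backed by an
    unbounded long-term store (hypothetical sketch)."""

    def __init__(self, short_term_size=3):
        self.short_term = OrderedDict()  # recent query -> answer
        self.long_term = {}              # everything ever answered
        self.size = short_term_size
        self.llm_calls = 0

    def _call_llm(self, query):
        # Stand-in for a real model call; counts invocations so the
        # savings from memory hits are visible.
        self.llm_calls += 1
        return f"answer({query})"

    def ask(self, query):
        if query in self.short_term:         # short-term hit: free
            self.short_term.move_to_end(query)
            return self.short_term[query]
        if query in self.long_term:          # long-term hit: free
            answer = self.long_term[query]
        else:                                # miss: pay for an LLM call
            answer = self._call_llm(query)
            self.long_term[query] = answer
        self.short_term[query] = answer
        if len(self.short_term) > self.size: # evict oldest recent entry
            self.short_term.popitem(last=False)
        return answer

mem = HierarchicalMemory()
mem.ask("what is cognee")  # miss: one LLM call
mem.ask("what is cognee")  # short-term hit: no new call
```

Even after a query ages out of the short-term buffer, re-asking it is served from long-term storage rather than triggering another model call, which is the cost reduction described in the episode.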