Changelog
This changelog highlights user-facing changes only. Internal maintenance, CI/CD, and infrastructure updates are omitted.
For full release details, see GitHub Releases.
0.3.0
Features
- Add memory tags so you can label and filter memories during recall/reflect. (
20c8f8b) - Allow choosing different AI providers/models per operation. (
e6709d5) - Add Cohere support for embeddings and reranking. (
4de0730) - Add configurable embedding dimensions and OpenAI embeddings support. (
70de23e) - Support custom base URLs for OpenAI-style embeddings and Cohere endpoints. (
fa53917) - Add LiteLLM gateway support for routing LLM/embedding requests. (
d47c8a2) - Add multilingual content support to improve handling and retrieval across languages. (
c65c6a9) - Add delete memory bank capability. (
4b82d2d) - Add backup/restore tooling for memory banks. (
67b273d)
Improvements
- Add retention modes to control how memories are extracted and stored. (
fb31a35) - Add offline (optional) database migrations to support restricted/air-gapped deployments. (
233bd2e) - Add database connection configuration options for more flexible deployments. (
33fac2c) - Load .env automatically on startup to simplify configuration. (
c06d9b4) - Expose an operation ID from retain requests so async/background processing can be tracked. (
1dacd0e) - Add per-request LLM token usage metrics for monitoring and cost tracking. (
29a542d) - Add LLM call latency metrics for performance monitoring. (
5e1f13e) - Include tenant in metrics labels for better multi-tenant observability. (
1ffc2a4) - Add async processing option to MCP retain tool for background retention workflows. (
37fc7fb)
Bug Fixes
- Fix extension loading in multi-worker deployments so all workers load extensions correctly. (
f5f3fca) - Improve recall performance by batching recall queries. (
5991308) - Improve retrieval quality and stability for large memory banks (graph/MPFP retrieval fixes). (
6232e69) - Fix entities list being limited to 100 entities. (
26bf571) - Fix UI only showing the first 1000 memories. (
67c1a42) - Fix duplicated causal relationships and improve token usage during processing. (
49e233c) - Improve causal link detection accuracy. (
2a00df0) - Make retain max completion tokens configurable to prevent truncation issues. (
7715a51) - Fix Python SDK not sending the Authorization header, preventing authenticated requests. (
39e3f7c) - Fix stats endpoint missing tenant authentication in multi-tenant setups. (
d6ff191) - Fix embedding dimension handling for tenant schemas in multi-tenant databases. (
6fe9314) - Fix Groq free-tier compatibility so requests work correctly. (
d899d18) - Fix security vulnerability (qs / CVE-2025-15284). (
b3becb6) - Restore MCP tools for listing and creating memory banks. (
9fd5679)
0.2.0
Features
- Add additional model provider support, including Anthropic Claude and LM Studio. (
787ed60) - Add multi-bank access and new MCP tools for interacting with multiple memory banks via MCP. (
6b5f593) - Allow supplying custom entities when retaining memories via the retain endpoint. (
dd59bc8) - Enhance the /reflect endpoint with max_tokens control and optional structured output responses. (
d49e820)
Improvements
- Improve local LLM support for reasoning-capable models and streamline Docker startup for local deployments. (
eea0f27) - Support operation validator extensions and return proper HTTP errors when validation fails. (
ce45d30) - Add configurable observation thresholds to control when observations are created/updated. (
54e2df0) - Improve graph visualization to the control plane for exploring memory relationships. (
1a62069)
Bug Fixes
- Fix MCP server lifecycle handling so MCP lifespan is correctly tied to the FastAPI app lifespan. (
6b78f7d)
0.1.15
Features
- Add the ability to delete documents from the web UI. (
f7ff32d)
Improvements
- Improve the API health check endpoint and update the generated client APIs/types accordingly. (
e06a612)
0.1.14
Bug Fixes
- Fixes the embedded “get-skill” installer so installing skills works correctly. (
0b352d1)
0.1.13
Improvements
- Improve reliability by surfacing task handler failures so retries can occur when processing fails. (
904ea4d) - Revamp the hindsight-embed component architecture, including a new daemon/client model and CLI updates for embedding workflows. (
e6511e7)
Bug Fixes
- Fix memory retention so timestamps are correctly taken into account. (
234d426)
0.1.12
Features
- Added an extensions system for plugging in new operations/skills (including built-in tenant support). (
2a0c490) - Introduced the hindsight-embed tool and a native agentic skill for embedding/agent workflows. (
da44a5e)
Improvements
- Improved reliability when parsing LLM JSON by retrying on parse errors and adding clearer diagnostics. (
a831a7b)
Bug Fixes
- Fixed structured-output support for Ollama-based LLM providers. (
32bca12) - Adjusted LLM validation to cap max completion tokens at 100 to prevent validation failures. (
b94b5cf)
0.1.11
Bug Fixes
- Fixed the standalone Docker image and control plane standalone build process so standalone deployments build correctly. (
2948cb6)
0.1.10
This release contains internal maintenance and infrastructure changes only.
0.1.9
Features
- Simplified local MCP installation and added a standalone UI option for easier setup. (
1c6acc3)
Bug Fixes
- Fixed the standalone Docker image so it builds and starts reliably. (
b52eb90) - Improved Docker runtime reliability by adding required system utilities (procps). (
ae80876)
0.1.8
Bug Fixes
- Fix bank list responses when a bank has no name. (
04f01ab) - Fix failures when retaining memories asynchronously. (
63f5138) - Fix a race condition in the bank selector when switching banks. (
e468a4e)
0.1.7
This release contains internal maintenance and infrastructure changes only.
0.1.6
Features
- Added support for the Gemini 3 Pro and GPT-5.2 models. (
bb1f9cb) - Added a local MCP server option for running/connecting to Hindsight via MCP without a separate remote service. (
7dd6853)
Improvements
- Updated the Postgres/pg0 dependency to a newer 0.11.x series for improved compatibility and stability. (
47be07f)
0.1.5
Features
- Added LiteLLM integration so Hindsight can capture and manage memories from LiteLLM-based LLM calls. (
dfccbf2) - Added an optional graph-based retriever (MPFP) to improve recall by leveraging relationships between memories. (
7445cef)
Improvements
- Switched the embedded Postgres layer to pg0-embedded for a smoother local/standalone experience. (
94c2b85)
Bug Fixes
- Fixed repeated retries on 400 errors from the LLM, preventing unnecessary request loops and failures. (
70983f5) - Fixed recall trace visualization in the control plane so search/recall debugging displays correctly. (
922164e) - Fixed the CLI installer to make installation more reliable. (
158a6aa) - Updated Next.js to patch security vulnerabilities (CVE-2025-55184, CVE-2025-55183). (
f018cc5)
0.1.3
Improvements
- Improved CLI and UI branding/polish, including new banner/logo assets and updated interface styling. (
fa554b8)
0.1.2
Bug Fixes
- Fixed the standalone Docker image so it builds/runs correctly. (
1056a20)