Changelog¶

All notable changes to ALMA-memory will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[0.6.0] - 2026-01-31¶

Added - Workflow Context Layer (AGtestari Integration)¶

Workflow Types (alma/workflow/)
WorkflowContext: Hierarchical scoping for workflow orchestration
RetrievalScope: Enum for scope levels (NODE → RUN → WORKFLOW → AGENT → TENANT → GLOBAL)
Checkpoint: Crash recovery state persistence
CheckpointManager: Checkpoint operations with sequence tracking
WorkflowOutcome: Learning from completed workflow executions
WorkflowResult: Enum for result status (SUCCESS, FAILURE, PARTIAL, CANCELLED, TIMEOUT)
ArtifactRef: External artifact linking (screenshots, logs, reports)
ArtifactType: Enum for artifact types (FILE, SCREENSHOT, LOG, REPORT, etc.)
State Reducers (alma/workflow/reducers.py)
8 built-in reducers for parallel branch merging:
- AppendReducer: Concatenate lists
- MergeDictReducer: Merge dictionaries (later wins)
- LastValueReducer: Take last non-None value (default)
- FirstValueReducer: Take first non-None value
- SumReducer: Sum numeric values
- MaxReducer: Take maximum value
- MinReducer: Take minimum value
- UnionReducer: Set union preserving order
ReducerConfig: Configure reducers per field
StateMerger: Merge multiple branch states
merge_states(): Convenience function for simple merges
Storage Layer Updates (alma/storage/)
New scope_filter parameter on all get_* methods
Checkpoint operations: save_checkpoint(), get_checkpoint(), get_latest_checkpoint(), cleanup_checkpoints()
Workflow outcome operations: save_workflow_outcome(), get_workflow_outcome(), get_workflow_outcomes()
Artifact link operations: save_artifact_link(), get_artifact_links(), delete_artifact_link()
Full SQLite implementation with workflow tables
Full PostgreSQL implementation with pgvector support
ALMA Core Workflow API (alma/core.py)
checkpoint(): Create crash recovery checkpoints
get_resume_point(): Get checkpoint to resume from after crash
merge_states(): Merge parallel branch states with configurable reducers
learn_from_workflow(): Record workflow execution outcomes
link_artifact(): Link external artifacts to memories
get_artifacts(): Get artifacts linked to a memory
cleanup_checkpoints(): Clean up old checkpoints for completed runs
retrieve_with_scope(): Scoped memory retrieval with workflow context
Async variants for all workflow methods
MCP Workflow Tools (alma/mcp/)
alma_checkpoint: Create crash recovery checkpoints
alma_resume: Get checkpoint to resume from
alma_merge_states: Merge parallel branch states
alma_workflow_learn: Record workflow outcomes
alma_link_artifact: Link external artifacts
alma_get_artifacts: Get artifacts for a memory
alma_cleanup_checkpoints: Clean up old checkpoints
alma_retrieve_scoped: Scoped memory retrieval
Async variants for all workflow tools
Database Migrations (alma/storage/migrations/versions/v1_1_0_workflow_context.py)
PostgreSQL: alma_checkpoints, alma_workflow_outcomes, alma_artifact_links tables
SQLite: checkpoints, workflow_outcomes, artifact_links tables
Workflow scope columns added to existing tables (tenant_id, workflow_id, run_id, node_id)
Indexes for scope filtering

Changed¶

Updated MCP server version to 0.6.0
Updated package version to 0.6.0-dev
Migration system now supports v1.1.0 schema

Tests¶

144 tests for workflow types and reducers (93% coverage)
31 tests for ALMA core workflow integration
17 tests for storage workflow operations

[0.5.1] - 2026-01-29¶

Added - Phase 2: Graph Database Backends¶

Memgraph Backend (alma/graph/backends/memgraph.py)
Neo4j Bolt protocol compatible graph database
Supports authenticated and unauthenticated connections
Full GraphBackend interface implementation
32 unit tests
Kuzu Backend (alma/graph/backends/kuzu.py)
Embedded graph database (no server required, like SQLite for graphs)
In-memory mode with :memory: for testing
Persistent mode with file path for production
Full GraphBackend interface implementation
23 unit tests
Testing Module (alma/testing/)
MockStorage: Full in-memory StorageBackend implementation
MockEmbedder: Deterministic hash-based embeddings for testing
Factory functions: create_test_heuristic(), create_test_outcome(), create_test_preference(), create_test_knowledge(), create_test_anti_pattern()
27+ unit tests

Changed¶

Updated create_graph_backend() factory to support "memgraph" and "kuzu" backends
Added coverage exclusions for optional graph backends
Version bump to 0.5.1

[0.5.0] - 2026-01-28¶

Added - Phase 2: Vector Database Backends¶

Qdrant Backend (alma/storage/qdrant.py)
Full StorageBackend implementation with 1300+ lines
Vector similarity search using cosine distance
Metadata filtering for all memory queries
Optimized MatchAny queries for multi-agent memory sharing
52 unit tests with full coverage
Pinecone Backend (alma/storage/pinecone.py)
Full StorageBackend implementation for serverless vector DB
Namespace-based organization per memory type
Serverless spec support for automatic scaling
Environment variable expansion in configuration
45 unit tests
Chroma Backend (alma/storage/chroma.py)
Full StorageBackend implementation for ChromaDB
Support for persistent, client-server, and ephemeral modes
Native embedding storage and similarity search
Numpy array handling for edge cases
Graph Database Abstraction (alma/graph/)
GraphBackend abstract base class for pluggable backends
Neo4jBackend wrapping existing Neo4j code
InMemoryBackend for testing without external dependencies
BackendGraphStore adapter bridging GraphStore API to backends
create_graph_backend() factory function for easy setup
46 unit tests

Added - Phase 1: Core Features¶

Memory Consolidation Engine (alma/consolidation/)
LLM-powered deduplication that merges similar memories
Cosine similarity-based grouping with configurable thresholds (default 0.85)
Provenance tracking via merged_from metadata
Dry-run mode for safety previews
Support for heuristics, domain_knowledge, and anti_patterns
Event System (alma/events/)
EventEmitter for in-process callbacks
WebhookManager for HTTP delivery with HMAC signatures
Event types: CREATED, UPDATED, DELETED, ACCESSED, CONSOLIDATED
Retry logic with configurable count and exponential backoff
EventAwareStorageMixin for automatic event emission
TypeScript/JavaScript SDK (packages/alma-memory-js/)
Full API coverage: retrieve, learn, addPreference, addKnowledge, forget
Type-safe with comprehensive TypeScript definitions
Error hierarchy matching Python SDK (ALMAError, ConnectionError, etc.)
Automatic retry with configurable backoff
MCP protocol support for direct server communication
Multi-Agent Memory Sharing
inherit_from scope attribute for reading other agents' memories
share_with scope attribute for making memories readable
Origin tracking via metadata['shared_from']
Optimized get_*_for_agents() batch queries across all backends
include_shared parameter in retrieval engine

Changed¶

Updated version to 0.5.0
Added CI/CD workflow with GitHub Actions
Updated architecture diagram in README to show all 6 storage backends
Expanded troubleshooting section for new backends

[0.4.0] - 2026-01-28¶

Security¶

CRIT-001: Fixed critical eval() vulnerability in Neo4j graph store (graph/store.py)
Replaced unsafe eval() with json.loads() for property deserialization
Added input validation for graph entity properties
This prevented potential arbitrary code execution when loading graph data
Added comprehensive input validation to all MCP tools
Validates required parameters before processing
Returns clear error messages for invalid inputs
Prevents injection attacks through tool parameters

Bug Fixes¶

CRIT-002: Fixed SQLite embeddings delete bug (storage/sqlite_local.py)
Delete operations used singular form (heuristic) while add operations used plural (heuristics)
Embeddings were never actually deleted, causing unbounded storage growth
Now consistently uses plural form across all operations
Fixed Azure Cosmos DB missing methods (storage/azure_cosmos.py)
Implemented update_heuristic_confidence() method
Implemented update_knowledge_confidence() method
These were defined in base class but never implemented, causing AttributeError
Fixed PostgreSQL IVFFlat index creation on empty tables (storage/postgresql.py)
IVFFlat indexes require data to create clusters
Now defers index creation until first data is inserted
Falls back to exact search until index is ready

Performance¶

PostgreSQL HNSW indexes (storage/postgresql.py)
Added HNSW (Hierarchical Navigable Small World) indexes as alternative to IVFFlat
HNSW provides better recall with similar performance
Configuration option: vector_index_type: hnsw | ivfflat
Lazy FAISS index rebuild (storage/sqlite_local.py)
FAISS indexes now only rebuild when necessary
Tracks dirty state to avoid redundant rebuilds
Reduces startup time for large databases by 60-80%
Timestamp indexes (storage/sqlite_local.py, storage/postgresql.py)
Added indexes on created_at and updated_at columns
Improves performance of temporal queries and forgetting operations
Reduces query time for date-range filters by 10x+
Connection pooling improvements (storage/postgresql.py)
Increased default pool size from 5 to 10
Added configurable min/max pool sizes
Better handling of connection exhaustion

API Improvements¶

Batch operations (storage/base.py, all backends)
New batch_save_heuristics() method
New batch_save_outcomes() method
New batch_save_domain_knowledge() method
Reduces database round-trips for bulk operations by 90%
Exception hierarchy (exceptions.py)
New base class: ALMAError
Storage errors: StorageError, ConnectionError, QueryError
Validation errors: ValidationError, ConfigurationError
Learning errors: LearningError, ScopeViolationError
Retrieval errors: RetrievalError, EmbeddingError
Makes error handling more precise and debugging easier
Input validation with clear error messages
All public methods now validate inputs before processing
Clear error messages indicate which parameter failed and why
Example: ValidationError: agent name cannot be empty

Deprecation Fixes¶

Replaced datetime.utcnow() (types.py, all storage backends)
datetime.utcnow() is deprecated in Python 3.12+
Replaced with datetime.now(timezone.utc) throughout codebase
Ensures Python 3.13+ compatibility

Documentation¶

Added comprehensive technical debt assessment documentation
Created competitive analysis vs Mem0 (docs/architecture/competitive-analysis-mem0.md)
Updated system architecture documentation (docs/architecture/system-architecture.md)
Added troubleshooting section to README

Internal¶

Added pre-commit hooks for code quality
Standardized logging format across modules
Improved test fixtures for storage backends

[0.3.0] - 2026-01-15¶

Added¶

LLM-Powered Fact Extraction (alma/extraction/)
AutoLearner class for automatic learning from conversations
Supports OpenAI, Anthropic, and rule-based extraction
Configurable confidence thresholds
Graph Memory (alma/graph/)
Entity and relationship storage via Neo4j
EntityExtractor for LLM-powered entity extraction
In-memory graph store for testing
Confidence Engine (alma/confidence/)
Forward-looking strategy assessment
assess_strategy() method returns confidence scores
rank_strategies() for comparing multiple approaches
Session Initializer (alma/initializer/)
Full session initialization before starting work
Combines memory retrieval, progress check, and session handoff
initialize() method returns comprehensive context

Changed¶

Improved retrieval scoring weights (similarity: 0.4, recency: 0.3, success: 0.2, confidence: 0.1)
Enhanced cache invalidation after learning operations

Fixed¶

Fixed race condition in concurrent cache access
Fixed embedding dimension validation on startup

[0.2.0] - 2026-01-01¶

Added¶

Progress Tracking (alma/progress/)
ProgressTracker class for work item management
Priority-based and quick-win task selection strategies
Progress summary with completion percentages
Session Handoff (alma/session/)
SessionManager for cross-session context continuity
Handoff creation with next steps and blockers
Quick reload summaries for session start
Domain Memory Factory (alma/domains/)
DomainMemoryFactory for creating domain-specific ALMA instances
Pre-built schemas: coding, research, sales, general, customer_support, content_creation
Custom schema creation support
Harness Pattern (alma/harness/)
Decouples agent from domain memory schema
create_harness() factory function
Automatic memory injection during execution

Changed¶

Refactored storage backends for better abstraction
Improved MCP server error handling

[0.1.0] - 2025-12-15¶

Added¶

Initial release of ALMA-memory
Core memory types: Heuristic, Outcome, UserPreference, DomainKnowledge, AntiPattern
Memory scoping: can_learn and cannot_learn per agent
Storage backends: SQLite+FAISS, Azure Cosmos DB, File-based
Retrieval engine: Vector similarity search with multi-factor scoring
Learning protocol: Automatic heuristic formation from outcomes
Cache layer: In-memory and Redis backends
Forgetting mechanism: Prune stale/low-confidence memories
MCP server: 7 tools for Claude Code integration
Embedding providers: Local (sentence-transformers), Azure OpenAI, Mock

Legend¶

Added: New features
Changed: Changes to existing functionality
Deprecated: Features that will be removed in future versions
Removed: Features that have been removed
Fixed: Bug fixes
Security: Security-related changes
Performance: Performance improvements
Documentation: Documentation changes
Internal: Internal changes not affecting the public API