Research Blog

LLM Cognition & Prompt Theory

A practical comparison of l0l1 with generic Text2SQL tools — when the privacy guarantee matters, when the learning from query patterns matters, and when a vanilla LLM SQL bridge is enough.

SQLText2SQLcopilot

LLM Cognition 23 June 2026

mullama vs Ollama vs vLLM: Choosing a Local LLM Server in 2026

A practical comparison of mullama, Ollama, vLLM, LocalAI, and LM Studio for local LLM serving — when to use which, and why you might pick the research-instrumented option.

OllamavLLMLocalAI

LLM Cognition 23 June 2026

perishable vs Cloudflare API Gateway vs HashiCorp Vault: Short-Lived Credentials for LLMs

A practical comparison of perishable, Cloudflare API Gateway, and HashiCorp Vault for ephemeral LLM API credentials — when zero-trust LLM access needs its own purpose-built tooling, and when a general-purpose solution is enough.

zero-trustephemeral-credentialsCloudflare

LLM Cognition 23 June 2026

route-switch vs LiteLLM vs Portkey vs OpenRouter: LLM Routing in 2026

A practical comparison of route-switch, LiteLLM, Portkey, and OpenRouter for LLM routing — when you need a gateway, when you need a learned router, and when you need both.

LLM routingLiteLLMPortkey

LLM Cognition 22 June 2026

memorg vs Mem0: Structured Memory vs Vector Memory for LLM Agents

A practical comparison of memorg and Mem0 for agent memory — when structured schemas beat vector recall, when they don't, and how to combine them.

memoryagentsMem0

LLM Cognition 22 June 2026

mpl vs MCP: A Compliance and Audit Layer on Top of the Agent Protocol

MCP defines how agents talk. mpl defines what correct looks like. Why production agentic systems need both — and how to compose them.

MCPA2Aagent-protocol

LLM Cognition 22 June 2026

promptel vs DSPy: Declarative Specification vs Programmatic Compilation

A practical comparison of promptel and DSPy for prompt engineering — when to use a declarative specification language, when to use a programmatic compiler, and how they compose.

promptsDSPydeclarative

LLM Cognition 1 April 2026

Persistent Memory for Long-Running Agents

What happens when LLM agents need to remember across sessions — structured memory schemas, retrieval strategies, and the memory-context distinction.

memoryagentsretrieval

LLM Cognition 22 March 2026

Prompt Lifecycle Management: From Extraction to Deployment

A practical framework for managing prompts as versioned dependencies — tackling drift, regression, and reproducibility.

promptsversioningdevops

LLM Cognition 15 March 2026

Formalising Prompts as First-Class Research Objects

Why treating prompts as typed, portable artefacts changes how we reason about LLM behaviour — and how promptel implements this idea.

promptsformal-methodsspecification

LLM Cognition 8 March 2026

Intelligent LLM Routing: Spending Compute Where It Matters

How route-switch uses MIPROv2 to automatically select the right model for each query — balancing cost, quality, and latency.

routingMIPROv2cost-optimisation

LLM Cognition 28 February 2026

Building mullama: What We Learned Replacing Ollama from Scratch

A post-mortem on building a local LLM serving layer — llama.cpp integration, model management, and where existing tools constrain research.

llama.cppinferencelocal-llm

LLM Cognition & Prompt Theory

l0l1 vs Text2SQL Tools: A Privacy-Preserving SQL Co-Pilot for Your Own Database

mullama vs Ollama vs vLLM: Choosing a Local LLM Server in 2026

perishable vs Cloudflare API Gateway vs HashiCorp Vault: Short-Lived Credentials for LLMs

route-switch vs LiteLLM vs Portkey vs OpenRouter: LLM Routing in 2026

memorg vs Mem0: Structured Memory vs Vector Memory for LLM Agents

mpl vs MCP: A Compliance and Audit Layer on Top of the Agent Protocol

promptel vs DSPy: Declarative Specification vs Programmatic Compilation

Persistent Memory for Long-Running Agents

Prompt Lifecycle Management: From Extraction to Deployment

Formalising Prompts as First-Class Research Objects

Intelligent LLM Routing: Spending Compute Where It Matters

Building mullama: What We Learned Replacing Ollama from Scratch