mBedLM Docs Back to site
Docs/Concepts/System Architecture

System Architecture

mBedLM-core is a contract-driven platform built around layered runtime boundaries: model serving, memory and policy, orchestration, and domain adapters.

Last updated: 2026-06-23Keywords: architecture layers runtime governance orchestration

Architecture Overview

The platform separates concerns so each layer can evolve independently while preserving deterministic request behavior. Core requests flow through validated ingress, intent/routing, policy-governed inference, normalized response contracts, and observability hooks.

Runtime Layers

High-Level Topology

Client/Web App
  -> API ingress and auth
  -> Intent and route policy
  -> Tool/skill mediation (optional)
  -> Memory + inference policy
  -> Model serving backends
  -> Response normalization (content_json.v1)
  -> Telemetry and persistence

Request Lifecycle (Simplified)

  1. Validate request envelope, tenant/session context, and safety constraints.
  2. Classify intent and select route strategy (direct, tool-assisted, or orchestrated).
  3. Hydrate contextual memory and optional retrieval/tool outputs.
  4. Execute inference with runtime policy controls (timeouts, fallbacks, gating).
  5. Normalize to response contract and emit observability metadata.

Reference Startup Sequence

Validation-first startup reduces drift between environments and prevents partial-route instability.

1. Validate config and secrets
2. Confirm model artifacts and endpoint reachability
3. Start serving backends (general and specialist)
4. Start memory and tool substrate
5. Start orchestration services
6. Start web/API product surfaces
7. Enable domain modules and optional enhancers

Governance and Safety Boundaries

Extension Points

Curated Source Synthesis

This page is built from repository architecture documents and condensed into operator-safe guidance. Instead of raw document links, the key architecture outcomes are synthesized here.

What Is Intentionally Not Included