Skip to main content
Private AI operations

FlexInferPrivate AI, operated like real infrastructure

FlexInfer is the runtime anchor for private inference, while Loom, MentatLab, and fi-fhir define the context, workflow, and healthcare proof boundaries around it.

OpenAI-compatible ingressMCP context boundaryHealthcare proof path
Live route preview
Private runtime boundary
Ready
Boundary
Private
Ops
GitOps
Routes
4 lanes
Runtime
GPU-aware
Gateway
OpenAI-compatible
Context
Bounded MCP
Workflow
DAG-visible
flexinfer.route.yaml
model: llama-3.1-private
route: /v1/chat/completions
placement: gpu-pool/ready
policy: tools.allowed[3]
Platform posture

Private AI, operated like real infrastructure

The stack is intentionally layered: run and customize models inside your boundary, govern how agents reach tools and context, orchestrate repeatable DAG workflows over direct API models, and use fi-fhir when the workload is sensitive healthcare ETL instead of generic demo data.

Integration points

How platform surfaces connect in production

Use these contracts to map deployment and integration boundaries before implementation.

Core Stack

Loom suite, FlexInfer, fi-fhir, and MentatLab

Product pages cover lane fit. Docs and playground cover execution. FlexInfer anchors runtime operations, MentatLab adds the orchestration UI surface for DAG run control, and Loom Core continues to govern context and policy boundaries.

Operational surface

MentatLab mission control

Loom Core governs context routing and policy boundaries. MentatLab provides the DAG design and run-visibility layer over direct API model calls, internal services, and private FlexInfer endpoints.

Mobile

Loom Companion

SwiftUI app for fleet monitoring, session management, real-time alerts, and lightweight operator control from iPhone and iPad.

Enterprise capabilities

Operator controls already shipping across the platform stack

FlexInfer already ships runtime control and observability surfaces, while gateway, RBAC, sandbox execution, and HUD visibility work continue around the adjacent Loom Core layer.

In progress

HUD Cost Dashboard
In progress

Cost monitoring integration: loom/cost-stats RPC, CostMonitor polling, SSE events, and OverviewPanel KPI tile.

HUD RBAC + Audit Visibility
In progress

RBAC config RPC, denied-calls ring buffer, ServersPanel RBAC sub-tab, and OverviewPanel badge.

Available now

MCP Gateway
Available

Centralized MCP routing via loom proxy with streamable HTTP transport, bearer/OIDC/mTLS auth, and hub failover.

Single context ingress with controlled routing, auditing, and automatic local fallback.

Role-based Access Control (RBAC)
Available

Role-aware permissions for MCP tool access with audit trail, cost tracking, and OAuth 2.1.

Enforces least-privilege context access with auditable decision logs across teams and environments.

Sandbox Executor (Docker + K8s)
Available

mcp-devbox sandbox runtime with Docker and Kubernetes backends for isolated agent execution.

Runs builds, tests, and automation in controlled containers with consistent isolation and audit trails.

Operational Foundations
Available

OTel tracing across all 59 MCP servers, JSON log correlation, observability stack, and deployment controls.

Full production observability with distributed tracing, structured logging, and repeatable deployment workflows.

Start Here

Choose the product surface, then validate the same contracts

Product pages explain lane fit. Docs and playground pages stay aligned on config shape, schema validation, and the operational entrypoints behind each surface.

Consulting

Bring it into production

If you want this stack inside your environment, the fastest path is a scoped audit or build engagement.