Skip to main content
Private AI Platform
FlexInfer

FlexInfer

Private AI Runtime, Context, and Integration Surfaces

FlexInfer runs model lifecycle, OpenAI-compatible proxying, serverless activation, and GPU-aware scheduling inside your own cluster. Loom Core and fi-fhir extend context control and healthcare integration around that runtime boundary.

Start with the product surface when you need lane fit. Use docs and playground when you need implementation detail.

Model LifecycleOpenAI ProxyGPU SchedulingQuantizationGitOpsObservability
Platform posture

Private AI Platform for Sensitive Workloads

FlexInfer anchors the runtime layer with model lifecycle, serverless activation, routing, and GPU-aware operations, while Loom Core and fi-fhir extend context and integration boundaries around it.

Integration points

How platform surfaces connect in production

Use these contracts to map deployment and integration boundaries before implementation.

Core Stack

Loom suite, FlexInfer, fi-fhir, and MentatLab

Product pages cover lane fit. Docs and playground cover execution. FlexInfer anchors runtime operations, MentatLab adds the orchestration UI surface for DAG run control, and Loom Core continues to govern context and policy boundaries.

Operational surface

MentatLab mission control

Loom Core governs context routing and policy boundaries; MentatLab provides the operator UX for DAG design and run visibility.

Mobile

Loom Companion

SwiftUI app for fleet monitoring, session management, real-time alerts, and lightweight operator control from iPhone and iPad.

Enterprise capabilities

Operator controls already shipping across the platform stack

FlexInfer already ships runtime control and observability surfaces, while gateway, RBAC, sandbox execution, and HUD visibility work continue around the adjacent Loom Core layer.

In progress

HUD Cost Dashboard
In progress

Cost monitoring integration: loom/cost-stats RPC, CostMonitor polling, SSE events, and OverviewPanel KPI tile.

HUD RBAC + Audit Visibility
In progress

RBAC config RPC, denied-calls ring buffer, ServersPanel RBAC sub-tab, and OverviewPanel badge.

Available now

MCP Gateway
Available

Centralized MCP routing via loom proxy with streamable HTTP transport, bearer/OIDC/mTLS auth, and hub failover.

Single context ingress with controlled routing, auditing, and automatic local fallback.

Role-based Access Control (RBAC)
Available

Role-aware permissions for MCP tool access with audit trail, cost tracking, and OAuth 2.1.

Enforces least-privilege context access with auditable decision logs across teams and environments.

Sandbox Executor (Docker + K8s)
Available

mcp-devbox sandbox runtime with Docker and Kubernetes backends for isolated agent execution.

Runs builds, tests, and automation in controlled containers with consistent isolation and audit trails.

Operational Foundations
Available

OTel tracing across all 59 MCP servers, JSON log correlation, observability stack, and deployment controls.

Full production observability with distributed tracing, structured logging, and repeatable deployment workflows.

Start Here

Choose the product surface, then validate the same contracts

Product pages explain lane fit. Docs and playground pages stay aligned on config shape, schema validation, and the operational entrypoints behind each surface.

Consulting

Bring it into production

If you want this stack inside your environment, the fastest path is a scoped audit or build engagement.