Integrations
One SDK. Every provider. No prompt logging.
Framewren intercepts calls at the transport layer and records only metadata: token counts, latencies, and model identifiers. Your prompt text and completion content are never captured. Add a provider in under 2 minutes, with no infrastructure changes, no proxy layer, and no ops overhead.
LLM providers
Every major model provider, one dashboard
Framewren normalizes each provider's pricing model into a unified cost view.
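As a rough illustration of what normalizing pricing models can mean, the sketch below converts a character-based rate into a per-token equivalent so it can sit next to token-based rates. This is not Framewren's actual implementation: the `Rate` type, the `usd_per_1k_tokens` helper, the example prices, and the 4-characters-per-token ratio are all assumptions made for the example.

```python
from dataclasses import dataclass

# Assumption: a rough average for English text. Real ratios vary by
# tokenizer and language, so treat this as a tunable parameter.
ASSUMED_CHARS_PER_TOKEN = 4.0


@dataclass(frozen=True)
class Rate:
    """A provider price in USD per 1,000 units (hypothetical type)."""
    usd_per_1k: float
    unit: str  # "token" or "character"


def usd_per_1k_tokens(rate: Rate,
                      chars_per_token: float = ASSUMED_CHARS_PER_TOKEN) -> float:
    """Return the rate expressed in USD per 1,000 tokens."""
    if rate.unit == "token":
        return rate.usd_per_1k
    if rate.unit == "character":
        # 1,000 tokens correspond to roughly 1,000 * chars_per_token characters.
        return rate.usd_per_1k * chars_per_token
    raise ValueError(f"unknown pricing unit: {rate.unit!r}")


if __name__ == "__main__":
    # Made-up prices, for illustration only.
    rates = {
        "token-priced model": Rate(0.003, "token"),
        "character-priced model": Rate(0.000125, "character"),
    }
    for name, rate in rates.items():
        print(f"{name}: ${usd_per_1k_tokens(rate):.6f} per 1K tokens")
```

The conversion is only as good as the characters-per-token estimate, so a per-token equivalent of a character-based price is an approximation rather than an exact figure.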
OpenAI
GPT-4o, GPT-4 Turbo, GPT-3.5 Turbo, o1, o3: all models tracked. Per-request and aggregate monthly cost, with token-level breakdown.
● Connected
Anthropic
Claude 3.5 Sonnet, Claude 3 Opus, Haiku — input/output token costs tracked separately across all context-length tiers.
● Connected
Gemini
Gemini 1.5 Pro, Flash, Ultra — character-based pricing normalized to per-token equivalent for apples-to-apples comparison.
● Connected
Mistral
Mistral Large, Mistral Small, Mixtral 8x7B — latency and cost tracking with region-specific pricing awareness.
● Connected
Cohere
Command R, Command R+, Embed — full cost attribution and latency percentile tracking via the Framewren Python and Node SDKs.
○ Beta
Together AI
Open-source model hosting: Llama 3, Qwen, DeepSeek. Per-inference cost tracked across all model variants and hosting tiers.
○ Beta
Replicate
On-demand model hosting. Track run-level costs by model and version, including cold-start latency overhead separate from inference latency.
○ Coming soon
Self-hosted
vLLM, Ollama, or any OpenAI-compatible API. Framewren tracks latency and infrastructure cost attribution for locally hosted models.
● Connected
Orchestration frameworks
Works with the LLM frameworks you already use
Drop the Framewren SDK into your LangChain chain, LlamaIndex query engine, or custom HTTP client with zero prompt-layer changes.
LangChain
Callback handler integration
LlamaIndex
Event handler support
Haystack
Pipeline step wrapper
Vercel AI SDK
Edge-compatible middleware
Custom HTTP
Manual SDK init
Setup
Three steps. Under 10 minutes.
No infrastructure changes. No proxy layer. No prompt routing. Just a lightweight SDK that wraps your existing LLM client calls.
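To make the idea of wrapping a client call without capturing prompts more concrete, here is a minimal sketch of a metadata-only wrapper. It is not the Framewren SDK's internals: `record_metadata`, `CallMetrics`, the dictionary response shape, and the `fake_completion` stand-in are all hypothetical names invented for this example.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable, List


@dataclass
class CallMetrics:
    """Metadata only: no prompt or completion text is stored."""
    model: str
    input_tokens: int
    output_tokens: int
    latency_ms: float


def record_metadata(call: Callable[..., Any],
                    sink: List[CallMetrics]) -> Callable[..., Any]:
    """Wrap `call` so each invocation appends a CallMetrics entry to `sink`."""
    def wrapped(*args: Any, **kwargs: Any) -> Any:
        start = time.perf_counter()
        response = call(*args, **kwargs)
        latency_ms = (time.perf_counter() - start) * 1000.0
        # Assumed response shape: a dict with "model" and "usage" keys.
        usage = response["usage"]
        sink.append(CallMetrics(
            model=response["model"],
            input_tokens=usage["input_tokens"],
            output_tokens=usage["output_tokens"],
            latency_ms=latency_ms,
        ))
        # The response is returned untouched; its text is never inspected.
        return response
    return wrapped


def fake_completion(model: str, prompt: str) -> dict:
    """Stand-in for a provider call (hypothetical, for illustration only)."""
    return {
        "model": model,
        "usage": {"input_tokens": len(prompt.split()), "output_tokens": 3},
        "text": "ok",
    }


if __name__ == "__main__":
    metrics: List[CallMetrics] = []
    tracked = record_metadata(fake_completion, metrics)
    tracked(model="example-model", prompt="hello there world")
    print(metrics[0])
```

The wrapper reads only the model name and usage counts from the response and measures wall-clock time around the call, so the caller's code path and return value stay the same.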
Step 1 — Install
$ pip install framewren-sdk
# or for Node.js:
$ npm install @framewren/sdk
Step 2 — Initialize
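import framewren
from openai import OpenAI

# Authenticate with your Framewren API key
framewren.init(api_key="frw_live_your_key_here")
# Wrap your existing OpenAI client, then use `client` as usual
client = framewren.wrap(OpenAI())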
Step 3 — You're live
✓ Framewren connected
✓ Monitoring 1 active project: production
Provider: OpenAI · First metric in 12s
Start monitoring your LLM APIs today
Free tier includes 2 provider connections and 50K API calls per month. No credit card required.