What You Get
Transparent Request Interception
Seamlessly intercept outgoing API calls to vendors like OpenAI, Anthropic, and others. No changes to your application code—just point your base URL to Waterfall.
Automatic Usage Tracking
Every request and response is logged and attributed to the right team, project, or user. Get granular visibility into who is consuming what, and how much it costs.
Vendor-Agnostic Rerouting
Route requests through Waterfall to any upstream provider. Switch vendors, load-balance across providers, or fail over automatically—all without touching your code.
Real-Time Cost Attribution
Map every API call to a cost center instantly. Know your per-team, per-project, and per-feature spend as it happens—not at the end of the month.
Token & Request Metering
Track input tokens, output tokens, compute time, and request counts across all providers. Normalize metrics across different vendor billing models.
Spend Alerts & Budgets
Set budget thresholds per team or project. Get notified before you overspend and automatically enforce limits to prevent runaway costs.
Zero-Latency Overhead
Our proxy is optimized for minimal latency impact. Requests are forwarded in real-time with sub-millisecond overhead so your applications stay fast.
Drop-In Deployment
Deploy as a sidecar, a local daemon, or use our hosted proxy. Works with any language or framework—just change your API base URL and you're live.
