AI Hub — Unified LLM Gateway, Chat, Embeddings & Jobs | Velaxe

Enforce monthly token quotas via RateLimiter

8 min | Intermediate | Admin, Finance, Ops | Updated Sep 19, 2025

Overview

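Set ai_quota_tokens in InstalledApps.options to cap a workspace's monthly token usage. AI Hub enforces the value as a hard limit: once the month's usage exceeds it, further calls fail with an error.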

Prerequisites

  • Access to the workspace's InstalledApps.options JSON

Permissions required

ai.configure

Steps (2)

Estimated: 8 min
  1. Set quota

    Admin | 4 min

    In InstalledApps.options, set ai_quota_tokens to the monthly token limit, for example {"ai_quota_tokens": 250000}.
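
The guide does not say how InstalledApps.options is stored or edited, so the following is only a minimal Python sketch of the merge itself, assuming the options are available as a JSON string. The helper name set_token_quota and the "theme" key are hypothetical; only ai_quota_tokens comes from this guide. Rejecting zero and negative values is also an assumption.

```python
import json

QUOTA_KEY = "ai_quota_tokens"  # key name taken from this guide


def set_token_quota(options_json: str, monthly_tokens: int) -> str:
    """Return the options JSON with ai_quota_tokens set, keeping other keys."""
    # Assumption: a quota must be a positive integer number of tokens.
    if isinstance(monthly_tokens, bool) or not isinstance(monthly_tokens, int):
        raise TypeError("monthly_tokens must be an integer")
    if monthly_tokens <= 0:
        raise ValueError("monthly_tokens must be positive")

    # Treat an empty options value as an empty JSON object.
    options = json.loads(options_json) if options_json.strip() else {}
    if not isinstance(options, dict):
        raise ValueError("options must be a JSON object")

    options[QUOTA_KEY] = monthly_tokens
    return json.dumps(options, sort_keys=True)


# Hypothetical existing options; "theme" is a made-up key for illustration.
updated = set_token_quota('{"theme": "dark"}', 250000)
print(updated)  # {"ai_quota_tokens": 250000, "theme": "dark"}
```

Merging into the existing object, rather than overwriting it, keeps any other options the workspace already has.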

  2. Verify enforcement

    Ops | 4 min

    When the UsageDaily token total for the current month exceeds the ai_quota_tokens limit, calls throw "AI monthly quota exceeded".
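
To make the rule concrete, here is a rough Python sketch of the comparison, not AI Hub's actual RateLimiter code. It assumes UsageDaily holds one token count per day and that the monthly total is the sum of the current calendar month's rows. UsageRow, QuotaExceeded, month_to_date_tokens, and check_quota are hypothetical names, and the usage figures are invented.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class UsageRow:
    """Hypothetical shape of one UsageDaily record."""
    day: date
    tokens: int


class QuotaExceeded(RuntimeError):
    """Raised when month-to-date usage is over the configured quota."""


def month_to_date_tokens(rows: list[UsageRow], today: date) -> int:
    """Sum tokens for rows that fall in today's calendar month."""
    return sum(
        r.tokens for r in rows
        if r.day.year == today.year and r.day.month == today.month
    )


def check_quota(rows: list[UsageRow], quota_tokens: int, today: date) -> int:
    """Return month-to-date usage, or raise if it exceeds the quota."""
    used = month_to_date_tokens(rows, today)
    if used > quota_tokens:  # "exceed" read as strictly greater than
        raise QuotaExceeded("AI monthly quota exceeded")
    return used


# Invented data: the August row is ignored, September sums to 260000.
rows = [
    UsageRow(date(2025, 8, 31), 100_000),
    UsageRow(date(2025, 9, 1), 200_000),
    UsageRow(date(2025, 9, 2), 60_000),
]
try:
    check_quota(rows, 250_000, date(2025, 9, 19))
except QuotaExceeded as err:
    print(err)  # AI monthly quota exceeded
```

Under this reading, usage from earlier months never counts against the limit, so calls would succeed again once a new month starts.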

    Success criteria

    • Over-quota requests return HTTP 429 or 500 with a clear error message, and spend plateaus.
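
When checking this criterion from a client or test script, a small classifier like the Python sketch below can separate quota rejections from unrelated failures. It assumes the error text "AI monthly quota exceeded" appears somewhere in the response body, which this guide does not confirm; is_quota_rejection is a hypothetical helper.

```python
QUOTA_MESSAGE = "AI monthly quota exceeded"  # error text quoted in this guide
QUOTA_STATUSES = {429, 500}                  # status codes named in this guide


def is_quota_rejection(status_code: int, body_text: str) -> bool:
    """True if a response looks like an over-quota rejection.

    Assumption: the quota message is present in the response body text.
    """
    return status_code in QUOTA_STATUSES and QUOTA_MESSAGE in body_text


print(is_quota_rejection(429, '{"error": "AI monthly quota exceeded"}'))  # True
print(is_quota_rejection(500, "Internal error"))                          # False
```

Matching on both the status code and the message avoids counting an unrelated 500 as a quota rejection.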

About this guide

AI Hub centralizes generative AI for your workspace with a single, policy-aware gateway to multiple providers. Teams get a streamlined chat experience with searchable history and feedback, a minimal Query API for quick prompts, and embeddings for retrieval workflows. Operators gain visibility with usage & cost tracking, quotas, and exportable audit logs.

Choose the best model for each task, fail over between providers, and moderate inputs/outputs with block/warn/allow policies. Keys are encrypted at rest and scoped per workspace. Long-running tasks run on a background worker and broadcast events so other apps can react in real time.

Designed for safety and speed: opinionated defaults, least-privilege access, and drop-in APIs that make it easy to bring AI to every surface of Velaxe.