discourse-ai/lib/completions/endpoints
Sam d07cf51653
FEATURE: llm quotas (#1047)
Adds a comprehensive quota management system for LLM models that allows:

- Setting per-group (applied per user in the group) token and usage limits with configurable durations
- Tracking and enforcing token/usage limits across user groups
- Quota reset periods (hourly, daily, weekly, or custom)
-  Admin UI for managing quotas with real-time updates

This system provides granular control over LLM API usage by allowing admins
to define limits on both total tokens and number of requests per group.
Supports multiple concurrent quotas per model and automatically handles
quota resets.


Co-authored-by: Keegan George <kgeorge13@gmail.com>
2025-01-14 15:54:09 +11:00
..
anthropic.rb FEATURE: AI artifacts (#898) 2024-11-19 09:22:39 +11:00
aws_bedrock.rb FEATURE: Amazon Nova support via bedrock (#997) 2024-12-06 07:45:58 +11:00
base.rb FEATURE: llm quotas (#1047) 2025-01-14 15:54:09 +11:00
canned_response.rb FEATURE: Select stop sequences from triage script (#1010) 2024-12-06 11:13:47 -03:00
cohere.rb FEATURE: improve tool support (#904) 2024-11-12 08:14:30 +11:00
fake.rb FEATURE: partial tool call support for OpenAI and Anthropic (#908) 2024-11-14 06:58:24 +11:00
gemini.rb FEATURE: AI artifacts (#898) 2024-11-19 09:22:39 +11:00
hugging_face.rb FEATURE: improve tool support (#904) 2024-11-12 08:14:30 +11:00
mistral.rb FEATURE: Add support for Mistral models (#919) 2024-11-19 17:28:09 +11:00
ollama.rb FEATURE: improve tool support (#904) 2024-11-12 08:14:30 +11:00
open_ai.rb FEATURE: add the ability to disable streaming on an Open AI LLM 2025-01-13 17:01:01 +11:00
open_router.rb FEATURE: first class support for OpenRouter (#1011) 2024-12-10 05:59:19 +11:00
samba_nova.rb FEATURE: improve tool support (#904) 2024-11-12 08:14:30 +11:00
vllm.rb FEATURE: improve tool support (#904) 2024-11-12 08:14:30 +11:00