discourse-ai/spec
Sam d07cf51653
FEATURE: llm quotas (#1047)
Adds a comprehensive quota management system for LLM models that allows:

- Setting per-group (applied per user in the group) token and usage limits with configurable durations
- Tracking and enforcing token/usage limits across user groups
- Quota reset periods (hourly, daily, weekly, or custom)
-  Admin UI for managing quotas with real-time updates

This system provides granular control over LLM API usage by allowing admins
to define limits on both total tokens and number of requests per group.
Supports multiple concurrent quotas per model and automatically handles
quota resets.


Co-authored-by: Keegan George <kgeorge13@gmail.com>
2025-01-14 15:54:09 +11:00
..
configuration FIX: Prevent LLM enumerator from erroring when spam enabled (#1045) 2024-12-27 09:12:29 +11:00
db/migrate Migrate sentiment to a TEI backend (#886) 2024-11-04 09:14:34 -03:00
fabricators FEATURE: llm quotas (#1047) 2025-01-14 15:54:09 +11:00
fixtures FEATURE: AI Usage page (#964) 2024-11-29 06:26:48 +11:00
jobs FIX: Make sure gists are atleast five minutes old before updating them (#1029) 2024-12-13 19:36:34 -03:00
lib FEATURE: add the ability to disable streaming on an Open AI LLM 2025-01-13 17:01:01 +11:00
models FEATURE: llm quotas (#1047) 2025-01-14 15:54:09 +11:00
requests FEATURE: llm quotas (#1047) 2025-01-14 15:54:09 +11:00
serializers DEV: Fix new Rubocop offenses 2024-03-06 15:23:29 +01:00
services FEATURE: Add periodic problem checks for each LLM in use (#1020) 2024-12-16 15:00:05 -08:00
shared DEV: Cleanup deprecations (#952) 2024-12-02 14:18:03 -03:00
support FIX: Gemini inference client was missing #instance (#1019) 2024-12-10 15:42:31 -03:00
system UX: disabled preseeded edit button, add description (#1038) 2024-12-20 19:33:45 -05:00
tasks Migrate sentiment to a TEI backend (#886) 2024-11-04 09:14:34 -03:00
plugin_helper.rb DEV: Fix mismatched column types (#868) 2024-10-28 15:36:42 +02:00
plugin_spec.rb FEATURE: smart date support for AI helper (#1044) 2024-12-31 08:04:25 +11:00