discourse-ai/lib
Sam d07cf51653
FEATURE: llm quotas (#1047)
Adds a comprehensive quota management system for LLMs that allows:

- Setting per-group (applied per user in the group) token and usage limits with configurable durations
- Tracking and enforcing token/usage limits across user groups
- Configuring quota reset periods (hourly, daily, weekly, or custom)
- Managing quotas through an admin UI with real-time updates

This system provides granular control over LLM API usage by allowing admins
to define limits on both total tokens and number of requests per group.
Supports multiple concurrent quotas per model and automatically handles
quota resets.
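
The behaviour described above (per-user counters checked against a group's token and request limits, with an automatic reset once the quota window elapses) can be sketched roughly as follows. This is a minimal, in-memory illustration only: `GroupQuota`, `UserQuotaUsage`, `QuotaExceeded`, and all method and parameter names are hypothetical and are not taken from the discourse-ai codebase, which presumably persists this state in the database.

```ruby
# Hypothetical sketch: names here are illustrative assumptions,
# not the plugin's actual classes or API.
class QuotaExceeded < StandardError; end

# Limits defined for a group; each user in the group gets their own counters.
class GroupQuota
  attr_reader :max_tokens, :max_usages, :duration_seconds

  def initialize(max_tokens: nil, max_usages: nil, duration_seconds:)
    @max_tokens = max_tokens          # nil means "no token limit"
    @max_usages = max_usages          # nil means "no request limit"
    @duration_seconds = duration_seconds
  end
end

# Per-user usage within the current quota window.
class UserQuotaUsage
  attr_reader :tokens_used, :usages, :started_at

  def initialize(quota, now: Time.now)
    @quota = quota
    reset!(now)
  end

  # Raises QuotaExceeded if the user has reached a limit in the current window.
  def check!(now: Time.now)
    reset!(now) if expired?(now)
    if @quota.max_tokens && @tokens_used >= @quota.max_tokens
      raise QuotaExceeded, "token limit reached"
    end
    if @quota.max_usages && @usages >= @quota.max_usages
      raise QuotaExceeded, "usage limit reached"
    end
  end

  # Records one completed request and the tokens it consumed.
  def record!(tokens:, now: Time.now)
    reset!(now) if expired?(now)
    @tokens_used += tokens
    @usages += 1
  end

  private

  # The window has elapsed once duration_seconds have passed since it started.
  def expired?(now)
    now - @started_at >= @quota.duration_seconds
  end

  # Starts a fresh window with zeroed counters.
  def reset!(now)
    @tokens_used = 0
    @usages = 0
    @started_at = now
  end
end
```

In this sketch a caller would run `check!` before sending a request to the model and `record!` after the response arrives, so a request that starts under the limit is allowed to finish even if it pushes the user over.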


Co-authored-by: Keegan George <kgeorge13@gmail.com>
2025-01-14 15:54:09 +11:00
..
ai_bot FIX: properly spin down unused streamer threads (#1035) 2024-12-20 12:09:42 +11:00
ai_helper FEATURE: smart date support for AI helper (#1044) 2024-12-31 08:04:25 +11:00
ai_moderation DEV: Add structure for errors in spam (#1054) 2025-01-09 09:17:06 -08:00
automation FIX: Triage rule should append selected tags instead of replacing them (#1022) 2024-12-11 11:19:44 -03:00
completions FEATURE: llm quotas (#1047) 2025-01-14 15:54:09 +11:00
configuration FIX: Prevent LLM enumerator from erroring when spam enabled (#1045) 2024-12-27 09:12:29 +11:00
database DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
discord/bot FEATURE: Discord Bot integration (#831) 2024-10-16 12:41:18 -03:00
embeddings FEATURE: Unavailable state for semantic search when sort is not Relevant (#1030) 2024-12-16 14:30:11 -06:00
inference FIX: Cloudflare Workers AI embeddings (#1037) 2024-12-20 17:45:27 -03:00
sentiment UX: Make sentiment trends more readable (#1018) 2024-12-11 09:13:18 -08:00
summarization FEATURE: Show gists everywhere except suggested/related (#995) 2024-12-12 12:29:35 -03:00
tasks/modules REFACTOR: A Simpler way of interacting with embeddings tables. (#1023) 2024-12-13 10:15:21 -03:00
tokenizer FIX/REFACTOR: FoldContent revamp (#866) 2024-10-25 11:51:17 -03:00
utils FEATURE: allow artifacts to be updated (#980) 2024-12-03 07:23:31 +11:00
automation.rb FIX: AI Automation scripts were broken when using seeded models (#991) 2024-12-02 19:07:05 -03:00
engine.rb DEV: port directory structure to Zeitwerk (#319) 2023-11-29 15:17:46 +11:00
guardian_extensions.rb FEATURE: Calculate gists from non hot topics too (#958) 2024-11-26 13:44:12 -03:00
multisite_hash.rb FIX: properly cache user locale (#593) 2024-04-26 09:28:35 -03:00
post_extensions.rb FEATURE: Backfill posts sentiment. (#982) 2024-12-03 10:27:03 -03:00
summarization.rb FEATURE: Generate topic gists for the hot topics list. (#837) 2024-10-18 18:01:39 -03:00
topic_extensions.rb PERF: Preload only gists when including summaries in topic list (#948) 2024-11-25 12:24:02 -03:00