discourse-ai

Commit Graph

Author	SHA1	Message	Date
Roman Rizzi	a53719ab8e	FIX: Open AI embeddings config migration & Seeded indexes cleanup & (#1092 ) This change fixes two different problems. First, we add a data migration to migrate the configuration of sites using Open AI's embedding model. There was a window between the embedding config changes and #1087, where sites could end up in a broken state due to an unconfigured selected model setting, as reported on https://meta.discourse.org/t/-/348964 The second fix drops pre-seeded search indexes of the models we didn't migrate and corrects the ones where the dimensions don't match. Since the index uses the model ID, new embedding configs could use one of these ones even when the dimensions no longer match.	2025-01-27 15:24:43 -03:00
Kris	99e73f09ff	UX: improve embeddings config styles (#1085 ) * WIP: improve embeddings config styles * switch to textarea, fix back button * remove log, update button, fix tests * stree * fix spec * spec fix * remove comment	2025-01-24 16:24:59 +11:00
Rafael dos Santos Silva	67a1257b89	FEATURE: Gemini Tokenizer (#1088 )	2025-01-23 18:20:35 -03:00
Sam	8bf350206e	FEATURE: track duration of AI calls (#1082 ) * FEATURE: track duration of AI calls * annotate	2025-01-23 11:32:12 +11:00
Roman Rizzi	e2e753d73c	FEATURE: Formalize support for matryoshka dimensions. (#1083 ) We have a flag to signal we are shortening the embeddings of a model. Only used in Open AI's text-embedding-3-*, but we plan to use it for other services.	2025-01-22 11:26:46 -03:00
Roman Rizzi	a5e5ae72a8	FIX: Open AI embedding shortening is only available for some models (#1080 )	2025-01-21 17:50:40 -03:00
Roman Rizzi	3b66fb3e87	FIX: Restore the accidentally deleted query prefix. (#1079 ) Additionally, we add a prefix for embedding generation. Both are stored in the definitions table.	2025-01-21 14:10:31 -03:00
Roman Rizzi	f5cf1019fb	FEATURE: configurable embeddings (#1049 ) * Use AR model for embeddings features * endpoints * Embeddings CRUD UI * Add presets. Hide a couple more settings * system specs * Seed embedding definition from old settings * Generate search bit index on the fly. cleanup orphaned data * support for seeded models * Fix run test for new embedding * fix selected model not set correctly	2025-01-21 12:23:19 -03:00
Sam	2c609e165b	FEATURE: Add user location info to spam scanner context (#1076 ) This adds registration and last known IP information and email to scanning context. This provides another hint for spam scanner about possible malicious users. For example registered in India, replying from Australia or email is clearly a throwaway email address.	2025-01-21 17:51:21 +11:00
Roman Rizzi	46fcdb6ba5	FIX: Make summaries backfill job more resilient. (#1071 ) To quickly select backfill candidates without comparing SHAs, we compare the last summarized post to the topic's highest_post_number. However, hiding or deleting a post and adding a small action will update this column, causing the job to stall and re-generate the same summary repeatedly until someone posts a regular reply. On top of this, this is not always true for topics with `best_replies`, as this last reply isn't necessarily included. Since this is not evident at first glance and each summarization strategy picks its targets differently, I'm opting to simplify the backfill logic and how we track potential candidates. The first step is dropping `content_range`, which serves no purpose and it's there because summary caching was supposed to work differently at the beginning. So instead, I'm replacing it with a column called `highest_target_number`, which tracks `highest_post_number` for topics and could track other things like channel's `message_count` in the future. Now that we have this column when selecting every potential backfill candidate, we'll check if the summary is truly outdated by comparing the SHAs, and if it's not, we just update the column and move on	2025-01-16 09:42:53 -03:00
Sam	81b952d56e	FIX: only hide posts detected explicitly as spam (#1070 ) When enabling spam scanner it there may be old unscanned posts this can create a risky situation where spam scanner operates on legit posts during false positives To keep this a lot safer we no longer try to hide old stuff by the spammers.	2025-01-15 16:50:41 +11:00
Natalie Tay	c881f8b361	DEV: Add rake task to send topics or posts to spam scanner (#1059 )	2025-01-15 11:48:57 +08:00
Rafael dos Santos Silva	92f122c54d	SECURITY: Fix XSS on Shared AI Conversations local Onebox (#1069 )	2025-01-14 18:05:37 -03:00
Sam	d07cf51653	FEATURE: llm quotas (#1047 ) Adds a comprehensive quota management system for LLM models that allows: - Setting per-group (applied per user in the group) token and usage limits with configurable durations - Tracking and enforcing token/usage limits across user groups - Quota reset periods (hourly, daily, weekly, or custom) - Admin UI for managing quotas with real-time updates This system provides granular control over LLM API usage by allowing admins to define limits on both total tokens and number of requests per group. Supports multiple concurrent quotas per model and automatically handles quota resets. Co-authored-by: Keegan George <kgeorge13@gmail.com>	2025-01-14 15:54:09 +11:00
Sam	20612fde52	FEATURE: add the ability to disable streaming on an Open AI LLM Disabling streaming is required for models such o1 that do not have streaming enabled yet It is good to carry this feature around in case various apis decide not to support streaming endpoints and Discourse AI can continue to work just as it did before. Also: fixes issue where sharing artifacts would miss viewport leading to tiny artifacts on mobile	2025-01-13 17:01:01 +11:00
Keegan George	b24669c810	DEV: Add structure for errors in spam (#1054 ) This update adds some structure for handling errors in the spam config while also handling a specific error related to the spam scanning user not being an admin account.	2025-01-09 09:17:06 -08:00
Keegan George	24b69bf840	FIX: Update spam controller action should consider seeded LLM properly (#1053 ) The seeded LLM setting: `SiteSetting.ai_spam_detection_model_allowed_seeded_models` returns a _string_ with IDs separated by pipes. running `_map` on it will return an array with strings. We were previously checking for the id with custom prefix identifier, but instead we should be checking the stringified ID.	2025-01-08 13:41:25 -08:00
Sam	11d0f60f1e	FEATURE: smart date support for AI helper (#1044 ) * FEATURE: smart date support for AI helper This feature allows conversion of human typed in dates and times to smart "Discourse" timezone friendly dates. * fix specs and lint * lint * address feedback * add specs	2024-12-31 08:04:25 +11:00
Sam	f9f89adac5	FIX: keep track of silence reason when spam detection flags user (#1046 ) Previously reason was blank for silencing user	2024-12-27 17:47:16 +11:00
Keegan George	b480f13a0f	FIX: Prevent LLM enumerator from erroring when spam enabled (#1045 ) This PR fixes an issue where LLM enumerator would error out when `SiteSetting.ai_spam_detection = true` but there was no `AiModerationSetting.spam` present. Typically, we add an `LlmDependencyValidator` for the setting itself, however, since Spam is unique in that it has it's model set in `AiModerationSetting` instead of a `SiteSetting`, we'll add a simple check here to prevent erroring out.	2024-12-27 09:12:29 +11:00
Sam	47ecf86aa1	FIX: embedding validation (#1043 )	2024-12-24 09:37:23 +11:00
Roman Rizzi	ceac6e5efb	FIX: Embeddings validator test needs to use the new Vector class. (#1041 )	2024-12-23 14:19:22 -03:00
Keegan George	bdb8f1d5e0	FIX: Custom prefix causing allowed seeded LLMs not to be shown (#1039 ) * FIX: Custom prefix causing allowed seeded LLMs not to be shown * DEV: update spec * not `_map` so should be string not array	2024-12-23 16:42:26 +11:00
Kris	d15876025f	UX: disabled preseeded edit button, add description (#1038 )	2024-12-20 19:33:45 -05:00
Rafael dos Santos Silva	7607477ff9	FIX: Cloudflare Workers AI embeddings (#1037 ) Regressed on `534b0df`	2024-12-20 17:45:27 -03:00
Martin Brennan	f35db8068b	DEV: Change to use DPageSubheader (#1033 ) Previously was AdminPageSubheader until https://github.com/discourse/discourse/pull/30146	2024-12-18 17:39:31 +10:00
Sam	fae2d5ff2c	FEATURE: link correctly to filters to assist in debugging spam (#1031 ) - Add spam_score_type to AiSpamSerializer for better integration with reviewables. - Introduce a custom filter for detecting AI spam false negatives in moderation workflows. - Refactor spam report generation to improve identification of false negatives. - Add tests to verify the custom filter and its behavior. - Introduce links for all spam counts in report	2024-12-17 11:02:18 +11:00
Keegan George	90ce942108	FEATURE: Add periodic problem checks for each LLM in use (#1020 ) This feature adds a periodic problem check which periodically checks for issues with LLMs that are in use. Periodically, we will run a test to see if the in use LLMs are still operational. If it is not, the LLM with the problem is surfaced to the admin so they can easily go and update the configuration.	2024-12-16 15:00:05 -08:00
Roman Rizzi	534b0df391	REFACTOR: Separation of concerns for embedding generation. (#1027 ) In a previous refactor, we moved the responsibility of querying and storing embeddings into the `Schema` class. Now, it's time for embedding generation. The motivation behind these changes is to isolate vector characteristics in simple objects to later replace them with a DB-backed version, similar to what we did with LLM configs.	2024-12-16 09:55:39 -03:00
Roman Rizzi	94b85ece80	FIX: Make sure gists are atleast five minutes old before updating them (#1029 ) * FIX: Make sure gists are atleast five minutes old before updating them * Update app/jobs/regular/fast_track_topic_gist.rb Co-authored-by: Keegan George <kgeorge13@gmail.com> --------- Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-12-13 19:36:34 -03:00
Roman Rizzi	1c40a698ca	FIX: get strategy version through vector_rep (#1028 )	2024-12-13 18:49:18 -03:00
Roman Rizzi	eae527f99d	REFACTOR: A Simpler way of interacting with embeddings tables. (#1023 ) * REFACTOR: A Simpler way of interacting with embeddings' tables. This change adds a new abstraction called `Schema`, which acts as a repository that supports the same DB features `VectorRepresentation::Base` has, with the exception that removes the need to have duplicated methods per embeddings table. It is also a bit more flexible when performing a similarity search because you can pass it a block that gives you access to the builder, allowing you to add multiple joins/where conditions.	2024-12-13 10:15:21 -03:00
Krzysztof Kotlarek	04c4ff8cf0	UX: No admin header for edit personas tools or llms (#1021 ) In this PR, we added functionality to hide the admin header for edit/new actions - https://github.com/discourse/discourse/pull/30175 To make it work properly, we have to rename `show` to `edit` which is also a more accurate name.	2024-12-12 10:48:58 +11:00
Sam	47c1ea337e	FIX: allow scanning of trashed posts and deleted users for test (#1024 ) When a post is trashed we should still be allowed to scan it post.topic will be nil for a trashed topic even if post is trashed	2024-12-12 10:26:05 +11:00
Sam	47f5da7e42	FEATURE: Add AI-powered spam detection for new user posts (#1004 ) This introduces a comprehensive spam detection system that uses LLM models to automatically identify and flag potential spam posts. The system is designed to be both powerful and configurable while preventing false positives. Key Features: * Automatically scans first 3 posts from new users (TL0/TL1) * Creates dedicated AI flagging user to distinguish from system flags * Tracks false positives/negatives for quality monitoring * Supports custom instructions to fine-tune detection * Includes test interface for trying detection on any post Technical Implementation: * New database tables: - ai_spam_logs: Stores scan history and results - ai_moderation_settings: Stores LLM config and custom instructions * Rate limiting and safeguards: - Minimum 10-minute delay between rescans - Only scans significant edits (>10 char difference) - Maximum 3 scans per post - 24-hour maximum age for scannable posts * Admin UI features: - Real-time testing capabilities - 7-day statistics dashboard - Configurable LLM model selection - Custom instruction support Security and Performance: * Respects trust levels - only scans TL0/TL1 users * Skips private messages entirely * Stops scanning users after 3 successful public posts * Includes comprehensive test coverage * Maintains audit log of all scan attempts --------- Co-authored-by: Keegan George <kgeorge13@gmail.com> Co-authored-by: Martin Brennan <martin@discourse.org>	2024-12-12 09:17:25 +11:00
Martin Brennan	ae80494448	UX: Improve rough edges of AI usage page (#1014 ) * UX: Improve rough edges of AI usage page * Ensure all text uses I18n * Change from <button> usage to <DButton> * Use <AdminConfigAreaCard> in place of custom card styles * Format numbers nicely using our number format helper, show full values on hover using title attr * Ensure 0 is always shown for counters, instead of being blank * FEATURE: Load usage data after page load Use ConditionalLoadingSpinner to hide load of usage data, this prevents us hanging on page load with a white screen. * UX: Split users table, and add empty placeholders and page subheader * DEV: Test fix	2024-12-12 08:55:24 +11:00
Keegan George	a4440c507b	UX: Make sentiment trends more readable (#1018 ) Instead of a stacked chart showing a separate series for positive and negative, this PR introduces a simplification to the overall sentiment dashboard. It comprises the sentiment into a single series of the difference between `positive - negative` instead. This should allow for the data to be more easy to scan and look for trends	2024-12-11 09:13:18 -08:00
Roman Rizzi	5fc7a730ef	FIX: Triage rule should append selected tags instead of replacing them (#1022 )	2024-12-11 11:19:44 -03:00
Roman Rizzi	6da35d8e66	FIX: Gemini inference client was missing #instance (#1019 )	2024-12-10 15:42:31 -03:00
Keegan George	700e9de073	Revert "UX: Make sentiment trends more readable in time series data (#1013 )" (#1016 ) This reverts commit `375dd702b2`.	2024-12-10 08:15:27 -08:00
Keegan George	375dd702b2	UX: Make sentiment trends more readable in time series data (#1013 ) Instead of a stacked chart showing a separate series for positive and negative, this PR introduces a simplification to the overall sentiment dashboard. It comprises the sentiment into a single series of the difference between `positive - negative` instead. This should allow for the data to be more easy to scan and look for trends.	2024-12-10 07:22:41 -08:00
Sam	7ca21cc329	FEATURE: first class support for OpenRouter (#1011 ) * FEATURE: first class support for OpenRouter This new implementation supports picking quantization and provider pref Also: - Improve logging for summary generation - Improve error message when contacting LLMs fails * Better support for full screen artifacts on iPad Support back button to close full screen	2024-12-10 05:59:19 +11:00
Roman Rizzi	085dde7042	FEATURE: Select stop sequences from triage script (#1010 )	2024-12-06 11:13:47 -03:00
Roman Rizzi	7ebbcd2de3	FIX: Make sure prompt uploads get included in the prompt when triaging (#1008 )	2024-12-05 21:04:35 -03:00
Sam	a55216773a	FEATURE: Amazon Nova support via bedrock (#997 ) Refactor dialect selection and add Nova API support Change dialect selection to use llm_model object instead of just provider name Add support for Amazon Bedrock's Nova API with native tools Implement Nova-specific message processing and formatting Update specs for Nova and AWS Bedrock endpoints Enhance AWS Bedrock support to handle Nova models Fix Gemini beta API detection logic	2024-12-06 07:45:58 +11:00
Roman Rizzi	b32b1cf241	FIX: Add a digest check to avoid repeteadly generating embeddings (bulk) (#1001 )	2024-12-04 17:47:28 -03:00
Keegan George	d6beac48f8	DEV: Improve explain suggestion footnote replacement (#999 ) Previously, when clicking add footnote on an explain suggestion it would replace the selected word by finding the first occurrence of the word. This results in issues when there are more than one occurrences of a word in a post. This is not trivial to solve, so this PR instead prevents incorrect text replacements by only allowing the replacement if it's unique. We use the same logic here that we use to determine if something can be fast edited. In this PR we also update tests for post helper explain suggestions. For a while, we haven't had tests here due to streaming/timing issues, we've been skipping our system specs. In this PR, we add acceptance tests to handle this which gives us improved ability to publish message bus updates in the testing environment so that it can be better tested without issues.	2024-12-04 11:41:34 -08:00
Kris	8203bdfbc9	UX: move topic summary from DMenu to DModal (#992 ) Co-authored-by: Keegan George <kgeorge13@gmail.com>	2024-12-03 13:30:15 -05:00
Roman Rizzi	ce6a2eca21	FEATURE: Backfill posts sentiment. (#982 ) * FEATURE: Backfill posts sentiment. It adds a scheduled job to backfill posts' sentiment, similar to our existing rake task, but with two settings to control the batch size and posts' max-age. * Make sure model_name order is consistent.	2024-12-03 10:27:03 -03:00
Sam	7c65dd171f	FIX: regression, no longer sending examples to AI helper (#993 ) For a while now we have not been sending the examples to AI helper, which can lead to inconsistent results. Note: this also means that in non English we did not send English results, so this may end up reducing performance That said first thing we need to do is fix the regression.	2024-12-03 16:03:46 +11:00

1 2 3 4 5 ...

525 Commits