discourse-ai

Commit Graph

Author	SHA1	Message	Date
Keegan George	9ee82fd8be	DEV: Temporarily suppress diff animation as we fix issues (#1341 ) The diff animation introduced in https://github.com/discourse/discourse-ai/pull/1332 and with attempts to improve it in https://github.com/discourse/discourse-ai/pull/1338 still has various issues. As we work on a fix, we want to revert the animation to simply stream the diff without animation so users are not left with a janky unusable experience.	2025-05-15 14:55:30 -07:00
Keegan George	dfea784fc4	DEV: Improve diff streaming accuracy with safety checker (#1338 ) This update adds a safety checker which scans the streamed updates. It ensures that incomplete segments of text are not sent yet over message bus as this will cause breakage with the diff streamer. It also updates the diff streamer to handle a thinking state for when we are waiting for message bus updates.	2025-05-15 11:38:46 -07:00
Roman Rizzi	ff2e18f9ca	FIX: Structured output discrepancies. (#1340 ) This change fixes two bugs and adds a safeguard. The first issue is that the schema Gemini expected differed from the one sent, resulting in 400 errors when performing completions. The second issue was that creating a new persona won't define a method for `response_format`. This has to be explicitly defined when we wrap it inside the Persona class. Also, There was a mismatch between the default value and what we stored in the DB. Some parts of the code expected symbols as keys and others as strings. Finally, we add a safeguard when, even if asked to, the model refuses to reply with a valid JSON. In this case, we are making a best-effort to recover and stream the raw response.	2025-05-15 11:32:10 -03:00
Sam	1b3fdad5c7	FEATURE: allow researcher to also research specific topics (#1339 ) * FEATURE: allow researcher to also research specific topics Also improve UI around research with more accurate info * this ensures that under no conditions PMs will be included	2025-05-15 17:48:21 +10:00
Sam	2c6459429f	DEV: use a proper object for tool definition (#1337 ) * DEV: use a proper object for tool definition This moves away from using a loose hash to define tools, which is error prone. Instead given a proper object we will also be able to coerce the return values to match tool definition correctly * fix xml tools * fix anthropic tools * fix specs... a few more to go * specs are passing * FIX: coerce values for XML tool calls * Update spec/lib/completions/tool_definition_spec.rb Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-05-15 17:32:39 +10:00
Sam	c34fcc8a95	FEATURE: forum researcher persona for deep research (#1313 ) This commit introduces a new Forum Researcher persona specialized in deep forum content analysis along with comprehensive improvements to our AI infrastructure. Key additions: New Forum Researcher persona with advanced filtering and analysis capabilities Robust filtering system supporting tags, categories, dates, users, and keywords LLM formatter to efficiently process and chunk research results Infrastructure improvements: Implemented CancelManager class to centrally manage AI completion cancellations Replaced callback-based cancellation with a more robust pattern Added systematic cancellation monitoring with callbacks Other improvements: Added configurable default_enabled flag to control which personas are enabled by default Updated translation strings for the new researcher functionality Added comprehensive specs for the new components Renames Researcher -> Web Researcher This change makes our AI platform more stable while adding powerful research capabilities that can analyze forum trends and surface relevant content.	2025-05-14 12:36:16 +10:00
Roman Rizzi	aef84bc5bb	FEATURE: Examples support for personas. (#1334 ) Examples simulate previous interactions with an LLM and come right after the system prompt. This helps grounding the model and producing better responses.	2025-05-13 10:06:16 -03:00
Rafael dos Santos Silva	67587367ab	FEATURE: New setting to control model for translations (#1333 )	2025-05-12 12:12:30 -03:00
Sam	cf45e6884c	FIX: persona triage should be logged to automation (#1326 ) We were logging persona triage as "bot" in logs, causing some confusions around real world usage This amends it so we log usage to "automation - AUTOMATION NAME"	2025-05-08 12:51:36 +10:00
Sam	2a62658248	FEATURE: support configurable thinking tokens for Gemini (#1322 )	2025-05-08 07:39:50 +10:00
Rafael dos Santos Silva	1b71330cea	FIX: Correct prompt format for img2text used in our AI Bot PDF Rag pipeline (#1323 )	2025-05-07 16:48:56 -03:00
Roman Rizzi	5bc9fdc06b	FIX: Return structured output on non-streaming mode (#1318 )	2025-05-06 15:34:30 -03:00
Keegan George	adc2716cec	FIX: Invalid access error in logs (#1317 )	2025-05-06 10:13:03 -07:00
Roman Rizzi	c0a2d4c935	DEV: Use structured responses for summaries (#1252 ) * DEV: Use structured responses for summaries * Fix system specs * Make response_format a first class citizen and update endpoints to support it * Response format can be specified in the persona * lint * switch to jsonb and make column nullable * Reify structured output chunks. Move JSON parsing to the depths of Completion * Switch to JsonStreamingTracker for partial JSON parsing	2025-05-06 10:09:39 -03:00
Sam	c6a307b473	FIX: handle unexpected errors when browsing web (#1314 )	2025-05-06 18:12:26 +10:00
Sam	e9fed4f5fa	FEATURE: ensure researcher and github helper know the date (#1312 ) The date / time is important cause it creates a bias towards action. Without it, model may think it is getting future annoucements vs up to date info.	2025-05-06 14:39:30 +10:00
Rafael dos Santos Silva	4eac377987	DEV: Zero delays on fake endpoint used in tests (#1311 )	2025-05-05 17:47:32 -03:00
Roman Rizzi	48305dc7d3	FIX: resource_url replacemente in Persona's system prompt (#1310 )	2025-05-05 11:41:04 -03:00
Sam	a6fa619c31	FEATURE: enforce jpg/png for all images (#1309 ) This ensures webp / gif are converted to png prior to sending to LLMs given webp and gif are not evenly supported. png/jpg is universally supported and are the only supported format. longer term we need to add support for audio/video/pdf which is supported by some models. * more specs	2025-05-05 17:46:37 +10:00
Sam	9196546f6f	FIX: better LLM feedback for image generation failures (#1306 ) * FIX: handle error conditions when generating images gracefully * FIX: also handle error for edit_image * Update lib/inference/open_ai_image_generator.rb Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com> * lint --------- Co-authored-by: Krzysztof Kotlarek <kotlarek.krzysztof@gmail.com>	2025-05-01 19:25:38 +10:00
Sam	491dac298f	FIX: system persona state leaking between sites (#1304 ) System personas leaned on reused classes, this was a problem in a multisite environement cause state, such as "enabled" ended up being reused between sites. New implementation ensures state is pristine between sites in a multisite * more handling for new superclass story * small oversight, display name should be used for display	2025-05-01 13:24:53 +10:00
Sam	7dc3c30fa4	FEATURE: correctly decorate AI bots (#1300 ) AI bots come in 2 flavors 1. An LLM and LLM user, in this case we should decorate posts with persona name 2. A Persona user, in this case, in PMs we decorate with LLM name (2) is a significant improvement, cause previously when creating a conversation you could not tell which LLM you were talking to by simply looking at the post, you would have to scroll to the top of the page. * lint * translation missing	2025-04-30 16:36:38 +10:00
Sam	09a6841480	FIX: s3 was missing a const (#1298 )	2025-04-30 07:58:24 +10:00
Sam	17f04c76d8	FEATURE: add OpenAI image generation and editing capabilities (#1293 ) This commit enhances the AI image generation functionality by adding support for: 1. OpenAI's GPT-based image generation model (gpt-image-1) 2. Image editing capabilities through the OpenAI API 3. A new "Designer" persona specialized in image generation and editing 4. Two new AI tools: CreateImage and EditImage Technical changes include: - Renaming `ai_openai_dall_e_3_url` to `ai_openai_image_generation_url` with a migration - Adding `ai_openai_image_edit_url` setting for the image edit API endpoint - Refactoring image generation code to handle both DALL-E and the newer GPT models - Supporting multipart/form-data for image editing requests * wild guess but maybe quantization is breaking the test sometimes this increases distance * Update lib/personas/designer.rb Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com> * simplify and de-flake code * fix, in chat we need enough context so we know exactly what uploads a user uploaded. * Update lib/personas/tools/edit_image.rb Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com> * cleanup downloaded files right away * fix implementation --------- Co-authored-by: Alan Guo Xiang Tan <gxtan1990@gmail.com>	2025-04-29 17:38:54 +10:00
Mark VanLandingham	b7b9179bc8	FEATURE: Allow for persona & llm selection in bot conversations page (#1276 )	2025-04-24 11:17:24 -05:00
Sam	65718f6dbe	FIX: eat all leading spaces llms provide when they stream them (#1280 ) * FIX: eat all leading spaces llms provide when they stream them * improve so we don't stop replying...	2025-04-24 22:07:26 +10:00
Sam	2060426709	FIX: guard against situations where there is no reply, pass thread id (#1279 )	2025-04-24 20:31:14 +10:00
Sam	2a5c60db10	FEATURE: display more places where AI is used / Chat streamer (#1278 ) * FEATURE: display more places where AI is used - Usage was not showing automation or image caption in llm list. - Also: FIX - reasoning models would time out incorrectly after 60 seconds (raised to 10 minutes) * correct enum not to enumerate non configured models * FEATURE: implement chat streamer This implements a basic chat streamer, it provides 2 things: 1. Gives feedback to the user when LLM is generating 2. Streams stuff much more efficiently to client (given it may take 100ms or so per call to update chat)	2025-04-24 16:22:19 +10:00
Rafael dos Santos Silva	4470e8af9b	FIX: Tables should group only per their key on usage page (#1277 )	2025-04-23 15:47:34 -03:00
Keegan George	d26c7ac48d	FEATURE: Add spending metrics to AI usage (#1268 ) This update adds metrics for estimated spending in AI usage. To make use of it, admins must add cost details to the LLM config page (input, output, and cached input costs per 1M tokens). After doing so, the metrics will appear in the AI usage dashboard as the AI plugin is used.	2025-04-17 15:09:48 -07:00
Sam	0f34ce999f	FIX: omit thinking tokens from chat (#1264 ) * FIX: omit thinking tokens from chat Thinking tokens cause a lot of confusion in chat, get rid of them * also catch partial tool just in case	2025-04-15 17:13:20 +10:00
Sam	274a54a324	FEATURE: Update model names and specs (#1262 ) * FEATURE: Update model names and specs - not a bug, but made it explicit that tools and thinking are not a chat thing - updated all models to latest in presets (Gemini and OpenAI) * allow larger context windows	2025-04-15 16:33:44 +10:00
Keegan George	1300cc8a36	FEATURE: Add streaming to composer helper (#1256 ) This update adding streaming to the AI helper inside the composer.	2025-04-14 08:18:50 -07:00
Sam	38b492529f	FEATURE: improve context management (#1260 ) 1. Add age of post to topic context (1 month ago, 1 year ago, etc) 2. Refactor code for simplicity 3. Fix handling of post context in DMs which was not using new handling of uploads	2025-04-14 21:45:48 +10:00
Sam	67e3a610cb	FIX: invalid context construction for responders (#1257 ) Previous to this fix we assumed the name field contained usernames when in fact it was stored in the id field. This fixes the context contruction and also adds some basic user information to the context to assist responders in understanding the cast of chars	2025-04-12 08:15:31 +10:00
Keegan George	4de39a07e5	FEATURE: Configure persona backed features in admin panel (#1245 ) In this feature update, we add the UI for the ability to easily configure persona backed AI-features. The feature will still be hidden until structured responses are complete.	2025-04-10 08:16:31 -07:00
Sam	e15984029d	FEATURE: allow tools to amend personas (#1250 ) Add API methods to AI tools for reading and updating personas, enabling more flexible AI workflows. This allows custom tools to: - Fetch persona information through discourse.getPersona() - Update personas with modified settings via discourse.updatePersona() - Also update using persona.update() These APIs enable new use cases like "trainable" moderation bots, where users with appropriate permissions can set and refine moderation rules through direct chat interactions, without needing admin panel access. Also adds a special API scope which allows people to lean on API for similar actions Additionally adds a rather powerful hidden feature can allow custom tools to inject content into the context unconditionally it can be used for memory and similar features	2025-04-09 15:48:25 +10:00
Roman Rizzi	f9d641dd3a	FIX: Restore gists previous group access behavior. (#1247 ) Previously, allowing "everyone" to access gists meant anons would see them too. With the move to Personas, we used "[]" to reflect that. With discourse/discourse#32199 adding the "everyone" option to the personas-allowed groups, we are switching back to the original behavior. Leaving allowed groups empty should always mean nobody can use the feature.	2025-04-07 12:04:30 -03:00
Sam	ed907dd004	FEATURE: allow to send LLM reports to groups (#1246 ) * FEATURE: allow to send LLM reports to groups * spec regression	2025-04-07 15:31:30 +10:00
Roman Rizzi	0d60aca6ef	FEATURE: Personas powered summaries. (#1232 ) * REFACTOR: Move personas into it's own module. * WIP: Use personas for summarization * Prioritize persona default LLM or fallback to newest one * Simplify summarization strategy * Keep ai_sumarization_model as a fallback	2025-04-02 12:54:47 -03:00
Sam	8b573fe743	FIX: maintain newest uploads correctly when constructing context (#1242 )	2025-04-02 10:09:38 -03:00
Roman Rizzi	30242a27e6	REFACTOR: Move personas into its own module. (#1233 ) This change moves all the personas code into its own module. We want to treat them as a building block features can built on top of, same as `Completions::Llm`. The code to title a message was moved from `Bot` to `Playground`.	2025-03-31 14:42:33 -03:00
Sam	5b6d39a206	FEATURE: flexible image handling within messages (#1214 ) * DEV: refactor bot internals This introduces a proper object for bot context, this makes it simpler to improve context management as we go cause we have a nice object to work with Starts refactoring allowing for a single message to have multiple uploads throughout * transplant method to message builder * chipping away at inline uploads * image support is improved but not fully fixed yet partially working in anthropic, still got quite a few dialects to go * open ai and claude are now working * Gemini is now working as well * fix nova * more dialects... * fix ollama * fix specs * update artifact fixed * more tests * spam scanner * pass more specs * bunch of specs improved * more bug fixes. * all the rest of the tests are working * improve tests coverage and ensure custom tools are aware of new context object * tests are working, but we need more tests * resolve merge conflict * new preamble and expanded specs on ai tool * remove concept of "standalone tools" This is no longer needed, we can set custom raw, tool details are injected into tool calls	2025-03-31 12:39:07 -03:00
Keegan George	bab6f0be43	FIX: Ensure category badging present in sentiment reports (#1222 ) This PR ensures that the category badges are present in the sentiment analysis report. Since the core change in https://github.com/discourse/discourse/pull/31795, there was a regression in the post list drill-down where category badges were not being shown. This PR fixes that and also ensures icons/emojis are shown when categories make use of them. This PR also adds the category badge in the table list.	2025-03-26 12:37:41 -07:00
Natalie Tay	f7536181af	DEV: Update solved report due to changes in solved plugin (#1212 )	2025-03-25 15:15:30 +08:00
Sam	1dde82eb58	FEATURE: allow specifying tool use none in completion prompt This PR adds support for disabling further tool calls by setting tool_choice to :none across all supported LLM providers: - OpenAI: Uses "none" tool_choice parameter - Anthropic: Uses {type: "none"} and adds a prefill message to prevent confusion - Gemini: Sets function_calling_config mode to "NONE" - AWS Bedrock: Doesn't natively support tool disabling, so adds a prefill message We previously used to disable tool calls by simply removing tool definitions, but this would cause errors with some providers. This implementation uses the supported method appropriate for each provider while providing a fallback for Bedrock. Co-authored-by: Natalie Tay <natalie.tay@gmail.com> * remove stray puts * cleaner chain breaker for last tool call (works in thinking) remove unused code * improve test --------- Co-authored-by: Natalie Tay <natalie.tay@gmail.com>	2025-03-25 08:06:43 +11:00
Régis Hanol	2dd7068b46	DEV: add `guardian` argument to `TopicsFilter` callback (#1206 ) To match the updates in discourse/discourse#31908	2025-03-24 10:33:18 +01:00
Rafael dos Santos Silva	0e2dd7378f	DEV: Support for extra model params in LLM completions (#1208 ) This will be useful to start experiment with non-standard features like structured outputs.	2025-03-21 13:49:23 -03:00
Alan Guo Xiang Tan	01eced74a3	FIX: Ensure that we shutdown thread pool (#1207 )	2025-03-21 11:08:36 +08:00
Sam	269f169abf	DEV: add API to get topic info from a custom tool (#1197 ) Previously we could only get post info	2025-03-17 18:47:23 +11:00

1 2 3 4 5 ...

654 Commits