discourse-ai

Commit Graph

Author	SHA1	Message	Date
Keegan George	9ee82fd8be	DEV: Temporarily suppress diff animation as we fix issues (#1341 ) The diff animation introduced in https://github.com/discourse/discourse-ai/pull/1332 and with attempts to improve it in https://github.com/discourse/discourse-ai/pull/1338 still has various issues. As we work on a fix, we want to revert the animation to simply stream the diff without animation so users are not left with a janky unusable experience.	2025-05-15 14:55:30 -07:00
Keegan George	dfea784fc4	DEV: Improve diff streaming accuracy with safety checker (#1338 ) This update adds a safety checker which scans the streamed updates. It ensures that incomplete segments of text are not sent yet over message bus as this will cause breakage with the diff streamer. It also updates the diff streamer to handle a thinking state for when we are waiting for message bus updates.	2025-05-15 11:38:46 -07:00
Keegan George	1300cc8a36	FEATURE: Add streaming to composer helper (#1256 ) This update adding streaming to the AI helper inside the composer.	2025-04-14 08:18:50 -07:00
Sam	5b6d39a206	FEATURE: flexible image handling within messages (#1214 ) * DEV: refactor bot internals This introduces a proper object for bot context, this makes it simpler to improve context management as we go cause we have a nice object to work with Starts refactoring allowing for a single message to have multiple uploads throughout * transplant method to message builder * chipping away at inline uploads * image support is improved but not fully fixed yet partially working in anthropic, still got quite a few dialects to go * open ai and claude are now working * Gemini is now working as well * fix nova * more dialects... * fix ollama * fix specs * update artifact fixed * more tests * spam scanner * pass more specs * bunch of specs improved * more bug fixes. * all the rest of the tests are working * improve tests coverage and ensure custom tools are aware of new context object * tests are working, but we need more tests * resolve merge conflict * new preamble and expanded specs on ai tool * remove concept of "standalone tools" This is no longer needed, we can set custom raw, tool details are injected into tool calls	2025-03-31 12:39:07 -03:00
Sam	5e80f93e4c	FEATURE: PDF support for rag pipeline (#1118 ) This PR introduces several enhancements and refactorings to the AI Persona and RAG (Retrieval-Augmented Generation) functionalities within the discourse-ai plugin. Here's a breakdown of the changes: 1. LLM Model Association for RAG and Personas: - New Database Columns: Adds `rag_llm_model_id` to both `ai_personas` and `ai_tools` tables. This allows specifying a dedicated LLM for RAG indexing, separate from the persona's primary LLM. Adds `default_llm_id` and `question_consolidator_llm_id` to `ai_personas`. - Migration: Includes a migration (`20250210032345_migrate_persona_to_llm_model_id.rb`) to populate the new `default_llm_id` and `question_consolidator_llm_id` columns in `ai_personas` based on the existing `default_llm` and `question_consolidator_llm` string columns, and a post migration to remove the latter. - Model Changes: The `AiPersona` and `AiTool` models now `belong_to` an `LlmModel` via `rag_llm_model_id`. The `LlmModel.proxy` method now accepts an `LlmModel` instance instead of just an identifier. `AiPersona` now has `default_llm_id` and `question_consolidator_llm_id` attributes. - UI Updates: The AI Persona and AI Tool editors in the admin panel now allow selecting an LLM for RAG indexing (if PDF/image support is enabled). The RAG options component displays an LLM selector. - Serialization: The serializers (`AiCustomToolSerializer`, `AiCustomToolListSerializer`, `LocalizedAiPersonaSerializer`) have been updated to include the new `rag_llm_model_id`, `default_llm_id` and `question_consolidator_llm_id` attributes. 2. PDF and Image Support for RAG: - Site Setting: Introduces a new hidden site setting, `ai_rag_pdf_images_enabled`, to control whether PDF and image files can be indexed for RAG. This defaults to `false`. - File Upload Validation: The `RagDocumentFragmentsController` now checks the `ai_rag_pdf_images_enabled` setting and allows PDF, PNG, JPG, and JPEG files if enabled. Error handling is included for cases where PDF/image indexing is attempted with the setting disabled. - PDF Processing: Adds a new utility class, `DiscourseAi::Utils::PdfToImages`, which uses ImageMagick (`magick`) to convert PDF pages into individual PNG images. A maximum PDF size and conversion timeout are enforced. - Image Processing: A new utility class, `DiscourseAi::Utils::ImageToText`, is included to handle OCR for the images and PDFs. - RAG Digestion Job: The `DigestRagUpload` job now handles PDF and image uploads. It uses `PdfToImages` and `ImageToText` to extract text and create document fragments. - UI Updates: The RAG uploader component now accepts PDF and image file types if `ai_rag_pdf_images_enabled` is true. The UI text is adjusted to indicate supported file types. 3. Refactoring and Improvements: - LLM Enumeration: The `DiscourseAi::Configuration::LlmEnumerator` now provides a `values_for_serialization` method, which returns a simplified array of LLM data (id, name, vision_enabled) suitable for use in serializers. This avoids exposing unnecessary details to the frontend. - AI Helper: The `AiHelper::Assistant` now takes optional `helper_llm` and `image_caption_llm` parameters in its constructor, allowing for greater flexibility. - Bot and Persona Updates: Several updates were made across the codebase, changing the string based association to a LLM to the new model based. - Audit Logs: The `DiscourseAi::Completions::Endpoints::Base` now formats raw request payloads as pretty JSON for easier auditing. - Eval Script: An evaluation script is included. 4. Testing: - The PR introduces a new eval system for LLMs, this allows us to test how functionality works across various LLM providers. This lives in `/evals`	2025-02-14 12:15:07 +11:00
Sam	11d0f60f1e	FEATURE: smart date support for AI helper (#1044 ) * FEATURE: smart date support for AI helper This feature allows conversion of human typed in dates and times to smart "Discourse" timezone friendly dates. * fix specs and lint * lint * address feedback * add specs	2024-12-31 08:04:25 +11:00
Keegan George	f1c7ee8624	DEV: Better control what prompts can appear in post/composer (#969 ) This PR updates the logic for the location map so it permits only the desired prompts through to the composer/post menu. Anything else won't be shown by default. This PR also adds relevant tests to prevent regression.	2024-11-27 16:14:21 -08:00
Keegan George	dabef02919	DEV: Prevent `detect_text_locale` from appearing in menus (#967 ) ### 🔍 Overview With the recent changes to allow DiscourseAi in the translator plugin, `detect_text_locale` was needed as a CompletionPrompt. However, it is leaking into composer/post helper menus. This PR ensures we don't not show it in those menus.	2024-11-28 09:27:08 +11:00
Rafael dos Santos Silva	aef9a03d4c	FEATURE: Truncate AI Captions to a reasonable max size (#907 )	2024-11-12 15:52:46 -03:00
Rafael dos Santos Silva	96f5f8cbd0	FIX: Basic cleanup of AI Caption to remove line breaks and pipes (#857 )	2024-10-23 18:38:29 -03:00
Roman Rizzi	c6aeabbfc0	FIX: Malformed message in systemless + inline img scenario (#771 )	2024-08-23 16:41:57 -03:00
Roman Rizzi	5c196bca89	FEATURE: Track if a model can do vision in the llm_models table (#725 ) * FEATURE: Track if a model can do vision in the llm_models table * Data migration	2024-07-24 16:29:47 -03:00
PangBo	4ebbdc043e	fix: locale handling in assistant.rb (#705 )	2024-07-05 11:16:09 +02:00
Keegan George	eab2f74b58	DEV: Use site locale for composer helper translations (#698 )	2024-07-04 08:23:37 -07:00
Rafael dos Santos Silva	a708d4dfa2	FIX: Use base64 encoded images in AI Image Caption via LLaVa (#693 ) * FIX: Use base64 encoded images in AI Image Caption via LLaVa This fixed a regression introduced in #646 where we started sending schemaless URLs for our LLaVa service, which doesn't handle it well. Moving to base64 encoded images solves: - The service needing to download images Now the service running LLaVa doesn't need internet access - Secure uploads compat Every image is treated the same, less branching for secure uploads - Image Size problems Discourse is now responsible for ensure a max size for images - Troublesome dev env Previously to this commit you would need a dev env that was internet acessible to use llava image captions	2024-06-27 16:24:44 -03:00
Sam	b487de933d	FEATURE: add support for all vision models (#646 ) Previoulsy on GPT-4-vision was supported, change introduces support for Google/Anthropic and new OpenAI models Additionally this makes vision work properly in dev environments cause we sent the encoded payload via prompt vs sending urls	2024-05-28 10:31:15 -03:00
Sam	8eee6893d6	FEATURE: GPT4o support and better auditing (#618 ) - Introduce new support for GPT4o (automation / bot / summary / helper) - Properly account for token counts on OpenAI models - Track feature that was used when generating AI completions - Remove custom llm support for summarization as we need better interfaces to control registration and de-registration	2024-05-14 13:28:46 +10:00
Sam	85734fef52	FIX: properly cache user locale (#593 ) This blob is localized according to user locale, so we can end up bleeding incorrect data in the cache	2024-04-26 09:28:35 -03:00
Rafael dos Santos Silva	595cde0fd6	FIX: Users with empty locales would error out during prompt localization (#584 )	2024-04-22 13:55:10 -03:00
Sam	484fd1435b	DEV: improve internal design of ai persona and bug fix (#495 ) * DEV: improve internal design of ai persona and bug fix - Fixes bug where OpenAI could not describe images - Fixes bug where mentionable personas could not be mentioned unless overarching bot was enabled - Improves internal design of playground and bot to allow better for non "bot" users - Allow PMs directly to persona users (previously bot user would also have to be in PM) - Simplify internal code Co-authored-by: Martin Brennan <martin@discourse.org>	2024-02-28 16:46:32 +11:00
Sam	d036f3fb8e	FEATURE: AI helper support in non English languages (#489 ) * FEATURE: AI helper support in non English languages This attempts some prompt engineering to coerce AI helper to answer in the appropriate language. Note mileage will vary, in testing GPT-4 produces the best results GPT-3.5 can return OKish results. * Extend non english support for GPT-4V image caption * Update db/fixtures/ai_helper/603_completion_prompts.rb --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com>	2024-02-27 16:31:51 -03:00
Keegan George	a9b2d6a30a	FEATURE: AI image caption (#470 ) This PR adds a new feature where you can generate captions for images in the composer using AI. --------- Co-authored-by: Rafael Silva <xfalcox@gmail.com>	2024-02-19 14:56:28 -03:00
Sam	1f74a77e17	DEV: correct flaky spec (#475 ) We were not properly expiring prompt cache	2024-02-19 15:21:55 +11:00
Keegan George	d66915ecc1	DEV: Make prompts available on `CurrentUserSerializer` (#472 )	2024-02-16 10:57:14 -08:00
Roman Rizzi	04eae76f68	REFACTOR: Represent generic prompts with an Object. (#416 ) * REFACTOR: Represent generic prompts with an Object. * Adds a bit more validation for clarity * Rewrite bot title prompt and fix quirk handling --------- Co-authored-by: Sam Saffron <sam.saffron@gmail.com>	2024-01-12 14:36:44 -03:00
Keegan George	7201d482d5	FEATURE: Add DallE support to AI helper's illustrate post (#404 )	2024-01-05 09:03:23 -08:00
Sam	03fc94684b	FIX: AI helper not working correctly with mixtral (#399 ) * FIX: AI helper not working correctly with mixtral This PR introduces a new function on the generic llm called #generate This will replace the implementation of completion! #generate introduces a new way to pass temperature, max_tokens and stop_sequences Then LLM implementers need to implement #normalize_model_params to ensure the generic names match the LLM specific endpoint This also adds temperature and stop_sequences to completion_prompts this allows for much more robust completion prompts * port everything over to #generate * Fix translation - On anthropic this no longer throws random "This is your translation:" - On mixtral this actually works * fix markdown table generation as well	2024-01-04 09:53:47 -03:00
Keegan George	1a5985134a	FIX: Show illustrate post only if stability API key present (#395 )	2024-01-02 11:24:16 -08:00
Keegan George	5a84969c96	FIX: Illustrate post icon and translation not appearing correctly (#371 )	2023-12-19 12:55:43 -08:00
Keegan George	7b4710d5c9	FEATURE: Generate post illustrations (#367 )	2023-12-19 11:17:34 -08:00
Keegan George	408d9f68eb	FEATURE: Proofread with post AI helper (#359 )	2023-12-14 19:30:52 -08:00
Keegan George	74a7ac4a3d	FEATURE: Add custom prompts to post helper options (#355 ) * FEATURE: Add custom prompts to post helper options * 💄Make pretty * 💄Make pretty!	2023-12-14 13:47:20 -03:00
Keegan George	6aaf1f002e	FEATURE: Add streaming to post AI helper's explain option (#344 ) Co-authored-by: Rafael dos Santos Silva <xfalcox@gmail.com> Co-authored-by: Roman Rizzi <roman@discourse.org>	2023-12-12 09:28:39 -08:00
Sam	6ddc17fd61	DEV: port directory structure to Zeitwerk (#319 ) Previous to this change we relied on explicit loading for a files in Discourse AI. This had a few downsides: - Busywork whenever you add a file (an extra require relative) - We were not keeping to conventions internally ... some places were OpenAI others are OpenAi - Autoloader did not work which lead to lots of full application broken reloads when developing. This moves all of DiscourseAI into a Zeitwerk compatible structure. It also leaves some minimal amount of manual loading (automation - which is loading into an existing namespace that may or may not be there) To avoid needing /lib/discourse_ai/... we mount a namespace thus we are able to keep /lib pointed at ::DiscourseAi Various files were renamed to get around zeitwerk rules and minimize usage of custom inflections Though we can get custom inflections to work it is not worth it, will require a Discourse core patch which means we create a hard dependency.	2023-11-29 15:17:46 +11:00

34 Commits