discourse-ai/spec/lib/modules
Roman Rizzi 9a79afcdbf
DEV: Better strategies for summarization (#88)
* DEV: Better strategies for summarization

The strategy responsibility needs to be "Given a collection of texts, I know how to summarize them most efficiently, using the minimum amount of requests and maximizing token usage".

There are different token limits for each model, so it all boils down to two different strategies:

Fold all these texts into a single one, doing the summarization in chunks, and then build a summary from those.
Build it by combining texts in a single prompt, and truncate it according to your token limits.

While the latter is less than ideal, we need it for "bart-large-cnn-samsum" and "flan-t5-base-samsum", both with low limits. The rest will rely on folding.

* Expose summarized chunks to users
2023-06-27 12:26:33 -03:00
..
ai_bot FIX: adjust token limits to account for functions (#96) 2023-06-23 10:02:04 +10:00
ai_helper FEATURE: Anthropic Claude for AIHelper and Summarization modules (#39) 2023-04-10 11:04:42 -03:00
embeddings FIX: ensure we only attempt embedding once every 15 minutes (#76) 2023-05-23 10:43:24 +10:00
nsfw FEATURE: Handle invalid media in NSFW module (#57) 2023-05-11 15:35:39 -03:00
sentiment DEV: DiscourseAI -> DiscourseAi rename to have consistent folders and files (#9) 2023-03-14 16:03:50 -03:00
summarization DEV: Better strategies for summarization (#88) 2023-06-27 12:26:33 -03:00
toxicity DEV: Fix toxicity test (#78) 2023-05-23 11:02:11 -03:00