discourse-ai/spec/lib/utils
Sam 0c9466059c
DEV: improve artifact editing and eval system (#1130)
- Add non-contiguous search/replace support using ... syntax
- Add judge support for evaluating LLM outputs with ratings
- Improve error handling and reporting in eval runner
- Add full section replacement support without search blocks
- Add fabricators and specs for artifact diffing
- Track failed searches to improve debugging
- Add JS syntax validation for artifact versions in eval system
- Update prompt documentation with clear guidelines

* improve eval output

* move error handling

* llm as a judge

* fix spec

* small note on evals
2025-02-19 15:44:33 +11:00
..
diff_utils DEV: improve artifact editing and eval system (#1130) 2025-02-19 15:44:33 +11:00
dns_srv_spec.rb FEATURE: Add basic connection check to DNS SRV resources (#563) 2024-04-12 10:39:19 -03:00
pdf_to_text_spec.rb DEV: Skip PDF tests (#1129) 2025-02-18 10:17:11 +10:00