- Add non-contiguous search/replace support using ... syntax
- Add judge support for evaluating LLM outputs with ratings
- Improve error handling and reporting in eval runner
- Add full section replacement support without search blocks
- Add fabricators and specs for artifact diffing
- Track failed searches to improve debugging
- Add JS syntax validation for artifact versions in eval system
- Update prompt documentation with clear guidelines

* improve eval output
* move error handling
* llm as a judge
* fix spec
* small note on evals

| Name |
|---|
| artifact_update_strategies |
| jobs/regular |
| personas |
| tools |
| bot_spec.rb |
| entry_point_spec.rb |
| playground_spec.rb |
| question_consolidator_spec.rb |
| site_setting_extension_spec.rb |