- Add non-contiguous search/replace support using ... syntax - Add judge support for evaluating LLM outputs with ratings - Improve error handling and reporting in eval runner - Add full section replacement support without search blocks - Add fabricators and specs for artifact diffing - Track failed searches to improve debugging - Add JS syntax validation for artifact versions in eval system - Update prompt documentation with clear guidelines * improve eval output * move error handling * llm as a judge * fix spec * small note on evals |
||
---|---|---|
.. | ||
artifact_update_strategies | ||
personas | ||
tools | ||
bot.rb | ||
entry_point.rb | ||
playground.rb | ||
post_streamer.rb | ||
question_consolidator.rb | ||
response_http_streamer.rb | ||
site_settings_extension.rb | ||
tool_runner.rb |