- Add non-contiguous search/replace support using ... syntax
- Add judge support for evaluating LLM outputs with ratings
- Improve error handling and reporting in eval runner
- Add full section replacement support without search blocks
- Add fabricators and specs for artifact diffing
- Track failed searches to improve debugging
- Add JS syntax validation for artifact versions in eval system
- Update prompt documentation with clear guidelines

* improve eval output
* move error handling
* llm as a judge
* fix spec
* small note on evals

Name | ||
---|---|---|
ai_artifact_fabricator.rb | ||
ai_persona_fabricator.rb | ||
ai_summary_fabricator.rb | ||
ai_tool_fabricator.rb | ||
classification_result_fabricator.rb | ||
embedding_definition_fabricator.rb | ||
llm_model_fabricator.rb | ||
llm_quota_fabricator.rb | ||
llm_quota_usage_fabricator.rb | ||
rag_document_fragment_fabricator.rb |