- Add non-contiguous search/replace support using ... syntax - Add judge support for evaluating LLM outputs with ratings - Improve error handling and reporting in eval runner - Add full section replacement support without search blocks - Add fabricators and specs for artifact diffing - Track failed searches to improve debugging - Add JS syntax validation for artifact versions in eval system - Update prompt documentation with clear guidelines * improve eval output * move error handling * llm as a judge * fix spec * small note on evals |
||
|---|---|---|
| .. | ||
| ai_artifact_fabricator.rb | ||
| ai_persona_fabricator.rb | ||
| ai_summary_fabricator.rb | ||
| ai_tool_fabricator.rb | ||
| classification_result_fabricator.rb | ||
| embedding_definition_fabricator.rb | ||
| llm_model_fabricator.rb | ||
| llm_quota_fabricator.rb | ||
| llm_quota_usage_fabricator.rb | ||
| rag_document_fragment_fabricator.rb | ||