Class: Raif::Evals::LlmJudges::Binary

Inherits:
Raif::Evals::LlmJudge show all
Defined in:
lib/raif/evals/llm_judges/binary.rb

Constant Summary

Constants included from Concerns::LlmResponseParsing

Concerns::LlmResponseParsing::ASCII_CONTROL_CHARS

Instance Attribute Summary

Attributes inherited from Task

#files, #images

Instance Method Summary collapse

Methods inherited from Raif::Evals::LlmJudge

#default_llm_model_key, #judgment_confidence, #judgment_reasoning, #low_confidence?

Methods inherited from Task

json_response_schema, prompt, #re_run, run, #run, #status, system_prompt

Methods included from Concerns::LlmResponseParsing

#parse_html_response, #parse_json_response, #parsed_response

Methods included from Concerns::HasAvailableModelTools

#available_model_tools_map

Methods included from Concerns::HasRequestedLanguage

#requested_language_name, #system_prompt_language_preference

Methods included from Concerns::HasLlm

#default_llm_model_key, #llm

Methods inherited from ApplicationRecord

table_name_prefix

Instance Method Details

#build_prompt ⇒ Object



33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
# File 'lib/raif/evals/llm_judges/binary.rb', line 33

# Builds the user-facing evaluation prompt for the binary judge.
#
# The prompt states the evaluation criteria (strict or lenient wording
# depending on +strict_mode+), optionally appends few-shot examples and
# any additional context, then presents the content under evaluation
# together with the pass/fail question.
#
# @return [String] the assembled prompt
def build_prompt
  prompt = <<~PROMPT
    Evaluation criteria: #{criteria}

    #{strict_mode ? "Apply the criteria strictly without any leniency." : "Apply reasonable judgment while adhering to the criteria."}
  PROMPT

  if examples.present?
    prompt += "\nHere are examples of how to evaluate:"
    examples.each do |example|
      prompt += format_example(example)
    end
  end

  prompt += additional_context_prompt if additional_context.present?

  # rstrip keeps the trailing heredoc newline out of the final prompt
  prompt += <<~PROMPT.rstrip

    Now evaluate this content:
    #{content_to_judge}

    Does this content meet the evaluation criteria?
  PROMPT

  prompt
end

#build_system_prompt ⇒ Object



17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
# File 'lib/raif/evals/llm_judges/binary.rb', line 17

# Builds the system prompt that frames the LLM as a binary pass/fail
# evaluator and pins the expected JSON response schema
# (+passes+, +reasoning+, +confidence+).
#
# @return [String] the system prompt, with surrounding whitespace stripped
def build_system_prompt
  <<~PROMPT.strip
    You are an expert evaluator assessing whether content meets specific criteria.
    Your task is to make binary pass/fail judgments with clear reasoning.

    First, provide detailed reasoning/explanation of your evaluation. Then, provide a precise pass/fail judgment.

    Respond with JSON matching this schema:
    {
      "passes": boolean,
      "reasoning": "detailed explanation",
      "confidence": 0.0-1.0
    }
  PROMPT
end

#passes? ⇒ Boolean

Judgment accessor methods



61
62
63
# File 'lib/raif/evals/llm_judges/binary.rb', line 61

# Whether the judged content passed the binary evaluation.
#
# @return [Boolean, nil] the judge's "passes" verdict, or nil when the
#   task has not completed
def passes?
  return unless completed?

  parsed_response["passes"]
end