def test_task_done(self, judge_model): """Book a table — agent calls the tool and gives confirmation.""" test_case = LLMTestCase( input="Book a table for 2 at Pasta Place for Friday at 7pm.", ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results