Research-use only. Internal Safety Collapse (ISC) is released exclusively for accelerating red-teaming process, evaluation, and mitigation work. We do not condone or permit any use of these materials ...
DeepEval is a simple-to-use, open-source LLM evaluation framework, for evaluating large-language model systems. It is similar to Pytest but specialized for unit testing LLM apps. DeepEval incorporates ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results