The Beyond the Imitation Game Benchmark (BIG-bench) is a collaborative benchmark intended to probe large language models and extrapolate their future capabilities. The more than 200 tasks included in ...
With the rapid development of large language models (LLMs) in legal applications, systematically evaluating their reasoning ability in judgment prediction has become increasingly urgent. Currently, ...