Compared to other AI chatbots, GPT-4 performs best on a test of legal reasoning – but it still falls short of the knowledge required for human lawyers. Early attempts to use AI chatbots in courtrooms have sometimes proven disastrous, and this finding adds to evidence that AI isn’t ready to handle the complexities of real-world legal arguments.
Artificial intelligence researchers and lawyers worked together to design LegalBench, which evaluates how well AI chatbots …