CyberJurors: A Multi-Agent Simulation Task for E-Commerce Disputes Verdict
Abstract
The intelligent verdict is essential for handling voluminous demands of E-commerce dispute. Unlike the legal dispute, it necessitates identifying pivotal clues from redundant multimodal evidence chains, relying on informal transaction rules for dispute verdicts. The complex ``clues-dispute" causal logic and flexible verdict rules render existing methods inadequate. Motivated by this, we propose a pioneering task, E-commerce Dispute Verdict (EDV), and introduce VerdictBench, the first Multimodal Disputes Verdicts Benchmark for E-commerce, to facilitate the intelligent verdicts. Building upon this, we propose CyberJurors, a framework that integrates an Individual Verdict Chain-of-Thought (IV-CoT) and Jury Consensus Verdict (JCV) to clarify the dispute logic and regulate the fair verdict process. For the individual juror, IV-CoT decomposes the EDV task into a structured reasoning, enabling fine-grained clues perception and explicit causal logic between clues and dispute. For the collective jury, JCV simulates multi-round discussion and voting among jurors guided by Verdict Precedents, effectively mitigating individual biases. Extensive experiments on VerdictBench demonstrate that CyberJurors significantly improves verdict accuracy, fairness, and interpretability, outperforming existing MLLMs by up to 9.48\% in accuracy.