Meituan LongCat Launches Open-Source VitaBench 2.0 for Real-Life Agent Evaluation

2026-06-25 20:11

Woofun AI reports that Meituan's LongCat team has open-sourced VitaBench 2.0, the successor to the version released in October 2025. The benchmark evaluates agents in real-life scenarios by modeling long-term, dynamic user behavior, and it systematically assesses how well large language models personalize their responses and act proactively over sustained interactions.
