MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models
…Xiyu Ren, …, Yiming Du, …

Abstract

A new benchmark evaluates memory capabilities in vision-language models through multi-session conversations, revealing limitations of both long-context and memory-augmented approaches. (AI-generated summary) …