Full Results

Complete leaderboard results, showing all ranked methods across datasets and metrics.

Showing 6 of 6 results
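| Rank | Method | Type | Backbone | Score |
|------|--------|------|----------|-------|
| 1 | A-Mem | Text-based | GPT-4.1-Nano | 57.57 |
| 2 | MuRAG | Multimodal | GPT-4.1-Nano | 55.27 |
| 3 | NGM | Multimodal | GPT-4.1-Nano | 50.49 |
| 4 | NaiveRAG | Text-based | GPT-4.1-Nano | 45.69 |
| 5 | Full (MM) | Multimodal | GPT-4.1-Nano | 39.88 |
| 6 | Full (Text) | Text-based | GPT-4.1-Nano | 34.64 |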