Full Results
Complete leaderboard results, showing all ranked methods across datasets and metrics.
Showing 6 of 6 results
| Rank | Method | Type | Backbone | Score |
|---|---|---|---|---|
| 1 | A-Mem | Text-based | GPT-4.1-Nano | 57.57 |
| 2 | MuRAG | Multimodal | GPT-4.1-Nano | 55.27 |
| 3 | NGM | Multimodal | GPT-4.1-Nano | 50.49 |
| 4 | NaiveRAG | Text-based | GPT-4.1-Nano | 45.69 |
| 5 | Full (MM) | Multimodal | GPT-4.1-Nano | 39.88 |
| 6 | Full (Text) | Text-based | GPT-4.1-Nano | 34.64 |