Vai al contenuto

Toggle the table of contents

BABILong

Da Wiki AI.

Versione del 20 mar 2024 alle 11:39 di Michela (discussione | contributi)

(diff) ← Versione meno recente | Versione attuale (diff) | Versione più recente → (diff)

BABILong è un benchmark progettato per valutare le capacità del modello nell'estrazione ed elaborazione di fatti distribuiti all'interno di testi estesi.

Link

Paper

In Search of Needles in a 11M Haystack:Recurrent Memory Finds What LLMs Miss: paper originale

Github

BABILong: a long-context needle-in-a-haystack benchmark for LLMs

Estratto da "https://wiki.mindmaker.it/index.php?title=BABILong&oldid=1452"

Benchmark