In Search of Needles in a 11M Haystack:Recurrent Memory Finds What LLMs Miss: paper originale
BABILong: a long-context needle-in-a-haystack benchmark for LLMs