LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models

Da Wiki AI.

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models