LLMLingua: Efficient Token Removal for Large Language Models
LLMLingua uses a compact language model to remove unnecessary tokens in prompts, leading to efficient inference with large language models and up to 20x compression without significant performance loss.
Read More