EvalPlus LLM Coding is a rigorous evaluation framework for Language Model Learning (LLM) code. It offers enhanced testing with HumanEval+ and MBPP+, providing 80x and 35x more tests than the original versions respectively. The software also includes packages, images, and tools that can easily and safely evaluate LLMs on these benchmarks.