NVIDIA Boosts LLM Inference Performance With New TensorRT-LLM Software Library

Posted on September 9, 2023 by admin

TensorRT-LLM provides up to 8x higher performance for AI inference on NVIDIA hardware.