Neowin News Feed for: TensorRT

NVIDIA announces TensorRT-LLM for Windows that boosts LLMs by up to 4 times with RTX GPUs

John Callaham — Tue, 17 Oct 2023 20:50:01 +0000

NVIDIA has announced TensorRT-LLM for Windows. This open-source library will allow PC developers with NVIDIA GeForce RTX graphics cards to boost the performance of LLMs by up to four times.

Nvidia announces TensorRT 8, slashes BERT inference times down to a millisecond

Ather Fawaz — Tue, 20 Jul 2021 13:00:01 +0000

Providing over twice the precision and inference speed of the previous generation, Nvidia's new TensorRT 8 deep learning SDK clocked an inference time of 1.2 ms on BERT-Large.