Skip to content

ONNX Runtime v1.18.1

Latest
Compare
Choose a tag to compare
@sophies927 sophies927 released this 28 Jun 00:29
3871274

What's new?

Announcements:

  • ONNX Runtime Python packages now have numpy dependency >=1.21.6, <2.0. Support for numpy 2.0 will be added in a future release.
  • CUDA 12.x ONNX Runtime GPU packages are now built against cuDNN 9.x (1.18.0 packages previously depended on cuDNN 8.x). CUDA 11.x ONNX Runtime GPU packages continue to depend on CuDNN 8.x.
  • Windows packages require installation of Microsoft Visual C++ Redistributable Runtime 14.38 or newer.

TensorRT EP:

  • TensorRT Weightless API integration.
  • Support for TensorRT hardware compatible engines.
  • Support for INT64 types in TensorRT constant layer calibration.
  • Now using latest commit of onnx-tensorrt parser, which includes several issue fixes.
  • Additional TensorRT support and performance improvements.

Packages:

  • Publish CUDA 12 Java packages to Azure DevOps feed.
  • Various packaging pipeline fixes.

This patch release also features various other bug fixes, including a CUDA 12.5 build error fix.

Big thank you to @yf711 for driving this release as the release manager and to all our contributors!

@yf711 @jchen351 @mszhanyi @snnn @wangyems @jywu-msft @skottmckay @chilo-ms @moraxu @kevinch-nv @pengwa @wejoncy @pranavsharma @Craigacp @jslhcl @adrianlizarraga @inisis @jeffbloo @mo-ja @kunal-vaishnavi @sumitsays @neNasko1 @yufenglee @dhruvbird @wangshuai09 @xiaoyu-work @axinging @yuslepukhin @YUNQIUGUO @shubhambhokare1 @fs-eire @afantino951 @tboby @HectorSVC @baijumeswani