[tensorrt] Updates tensorrt to 9.2.0 #3091

Open
wants to merge 167 commits into
base: master

Conversation

frankfliu
Contributor

Description

Brief description of what this PR is about

  • If this change is a backward incompatible change, why must this change be made?
  • Interesting edge cases to note here

SidneyLann and others added 30 commits September 19, 2023 17:36
---------

Co-authored-by: Administrator <Administrator@tech8>
Co-authored-by: KexinFeng <fenkexin@amazon.com>
* Implement PtNDArrayEx.multiboxDetection

* MultiboxDetection - code cleanup

* MultiboxDetection - code cleanup

* MultiboxDetection - code cleanup

* MultiboxDetection - code cleanup

* format code

* Fix, add tests, and pass CI

---------

Co-authored-by: Zach Kimberg <kimbergz@amazon.com>
This fixes the markdown headers to be h1 so they render correctly in docs.
…valibrary#2806)

* [api] Added Early stopping configuration (deepjavalibrary#38)

* [api] Added Builder for Early stopping configuration (deepjavalibrary#38)

* Explicitly set NDManager for dataset in EarlyStoppingListenerTest to make the test run on JDK11 in gradle.
This creates an abstraction for combining devices into a single device. The main
use case for now is in DJL Serving TP_parallel. It will allow us to create a
WorkerGroup and a PyPredictor for a set of devices and then track the usage of
devices properly. It could also be used later for multi-gpu training or other
multi-device cases.
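
The idea in the commit message above can be illustrated with a minimal sketch. It is not DJL's implementation: `Device`, `MultiDevice`, and `UsageTracker` are hypothetical stand-ins, written in Python for brevity. It assumes a combined device is an ordered tuple of single devices, and that "tracking usage" means marking every member device busy while a worker group holds the combined device.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass(frozen=True)
class Device:
    """A single device such as gpu0 (hypothetical stand-in, not DJL's Device)."""
    kind: str
    index: int

    @property
    def name(self) -> str:
        return f"{self.kind}{self.index}"


@dataclass(frozen=True)
class MultiDevice:
    """Several devices presented as one logical device (hypothetical)."""
    devices: Tuple[Device, ...]

    @property
    def name(self) -> str:
        # Derive the combined name from the member names, e.g. "gpu0+gpu1".
        return "+".join(d.name for d in self.devices)


class UsageTracker:
    """Records which owner currently holds each physical device."""

    def __init__(self) -> None:
        self._in_use: Dict[Device, str] = {}

    def acquire(self, group: MultiDevice, owner: str) -> bool:
        # All-or-nothing: refuse if any member device is already taken.
        if any(d in self._in_use for d in group.devices):
            return False
        for d in group.devices:
            self._in_use[d] = owner
        return True

    def release(self, group: MultiDevice) -> None:
        # Free every member device; devices that were not held are ignored.
        for d in group.devices:
            self._in_use.pop(d, None)

    def is_free(self, device: Device) -> bool:
        return device not in self._in_use
```

Under these assumptions, a worker group for tensor parallelism would acquire one `MultiDevice` covering all of its GPUs. A second group whose device set overlaps would be refused until the first is released.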
* Updates doc versions to 0.24.0

Also moves android gradle.properties to the new 0.25.0.

* Remove android change
* Updates XGBoost to 2.0.1

* Use devtools 8

* Updates based on new XGBoost JNI API.

---------

Co-authored-by: Frank Liu <frankfliu2000@gmail.com>
* Added element-wise gauss error function (ERF)

* Added element-wise arctan2

* Format java

* Fixed docs

* added * to other_ptr in Atan2
* Added 2D FFT

* Format java

* Add default fft2

* Convert array to vectors

* Add inverse fft2

* Add better assertion in ifft2 test

* Add really better assertion in ifft2 test

* Move cast below ifft2 for unsupported exception

* Format java

* changed dims to axes

* changed dims to axes
* only build triton binaries

* install requests library

* remove script
frankfliu and others added 25 commits March 26, 2024 10:08
* Increase DJL version to 0.27.0

* Update README
Avoid crash when tokens exceed model max length.
…avalibrary#3088)

* [fix] Fix HfModelZoo and LlamaModelZoo concurrent init problem

---------

Co-authored-by: Frank Liu <frankfliu2000@gmail.com>
@frankfliu frankfliu requested review from zachgk and a team as code owners April 15, 2024 22:05
@@ -6,16 +6,11 @@ on:
 jobs:
   build-tensorrt-jni-linux:
     runs-on: ubuntu-latest
-    container: deepjavalibrary/ubuntu18.04:tensorrt-cuda116
+    container: deepjavalibrary/ubuntu20.04:tensorrt-cuda122
Member

how about using 22.04?
