djl | Tensorflow Ecosystem Directory

Bot releases are hidden (Show)

djl - DJL v0.30.0 Release Latest Release

Published by xyang16 about 1 month ago

Key Changes

Engine Updates:
- OnnxRuntime 1.19.0 https://github.com/deepjavalibrary/djl/pull/3446
- Huggingface Tokenizers 0.20.0 https://github.com/deepjavalibrary/djl/pull/3452
Adds mask generation task for SAM2 model https://github.com/deepjavalibrary/djl/pull/3450
Text Embedding Inference:
- Add Mistral, Qwen2, GTE, Camembert embedding model support
- Add reranker model support

Enhancement

[api] Avoid non-ascii characters by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3395
[djl-converter] Exit with error if convert model failed by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3399
[api] Support TEI input format to reranking model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3400
[rust] Adds sigmoid and softmax operator for Rust engine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3407
[test] Detect GPUs with specified engine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3409
[api] Adds Criteria.isDownload() api by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3403
[rust] Build .so file for each cuda arch by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3410
[rust] Add mistral embedding model by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3412
[tokenizers] Add supported arch in djl-convert by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3416
[tokenizers] Replace pt file names to safetensors by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3417
[rust] Load model on given device by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3419
[rust] Add qwen2 model by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3420
[rust] Support pre-downloaded rust shared library by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3421
[pytorch] Adds pad operator by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3423
[rust] Provides better error message for unsupported ops by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3424
[api] Adds center fit image operation for Yolo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3425
[rust] Add GTE and Gemma2 model by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3422
[djl-convert] Sets default max model size limit for importing by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3428
[djl-import] Includes requires version when importing model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3431
[android] Upgrade DJL version to 0.30.0 by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3432
[rust] Make cublaslt wrapper non static by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3434
[djl-convert] Exclude models in includeTokenTypes by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3435
[rust] Make tensor contiguous in rotary embedding by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3436
[rust] Allows -1 dim for normalize() by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3442
Refactored Identifiers by @congyuluo in https://github.com/deepjavalibrary/djl/pull/3381
[rust] Adds text classification models to Rust model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3444
[examples] Adds segment anything 2 example by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3449
[api] Refactor ImageFeatureExtractor by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3455
[api] Adds base64 image support for ImageTranslator by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3456
[djl-import] Improve model import speed by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3457
[api] Updates dependencies version to latest by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3454
[api] Optimized text embedding post processing performance by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3459
add drawMarks to android BitMapImageFactory by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/3460
[ci] moving to temporary iam credentials for publishing steps by @siddvenk in https://github.com/deepjavalibrary/djl/pull/3462
[OnnxRuntime] Update debug log message by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3463
Increase DJL version to 0.30.0 by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3465
[examples] Adds gradle tasks for each example by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3466
Upgrade dependency versions by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3467
[tokenizers] Converting encoding to int32 NDList by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3468

Bug Fixes

[api] Fixes logging calling convention by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3394
[djl-converter] Fixes import text embedding model from local folder by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3388
[djl-converter] Fixes djl-convert command line return code by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3406
[rust] Fix camembert and distilbert model loading by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3415
[rust] Fix camembert model loading by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3418
[rust] Fixes memory leak by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3433
[djl-convert] Fixes huggingface model converter by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3440
[rust] Fix bert model classifier loading by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3441
[xgb] Fixes alternative NDArray conversion issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3453
[djl-import] Fixes missing arguments for onnx import by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3458
[ci][fix] use v2 for aws credentials due to glib issues with node 20 by @siddvenk in https://github.com/deepjavalibrary/djl/pull/3464

Documentation

[examples] Moves nlp examples into nlp folder by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3393
[docs] Build versions.json before mike deploy by @Varun-Dutta in https://github.com/deepjavalibrary/djl/pull/3392
[example] Enable PyTorch for some training example by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3398
[docs] Updates docs website url by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3404
[docs] Fixes broken links in markdown files. by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3408
[djl-import] Fixes missing trust-remote-code arg for import model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3427
[docs] Updates trace whisper model document by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3426
[tensorflow] Updates tensorflow document by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3430
[docs] Adds segment anything document by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3451

CI/CD

[ci] Fixes serving publish for awscurl release version by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3411
[ci] Remove no_response workflow by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3429

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.29.0...v0.30.0

djl - v0.29.0

Published by ydm-amazon 3 months ago

Key Changes

Upgrades for engines
- Upgrades PyTorch engine to 2.3.1
- Upgrades TensorFlow engine to 2.16.1
- Introduces Rust engine CUDA support
- Upgrades OnnxRuntime version to 1.18.0 and added CUDA 12.4 support
- Upgrades javacpp version to 1.5.10
- Upgrades HuggingFace tokenizer to 0.19.1
- Fixes several issues for LightGBM engine
- Deprecated llamacpp engine
Enhancements for engines and API
- Adds Yolov8 segmentation and pose detection support
- Adds metric type to Metic class
- Improves drawJoints and drawMask behavior for CV model
- Improves HuggingFace model importing and conversion tool
- Improves HuggingFace NLP model batch inference performance
- Adds built-in ONNX extension support
- Adds several NDArray operators in PyTorch engine
- Adds fp16 and bf16 support for OnnxRuntime engine
- Adds CrossEncoder support for NLP models

Enhancements

Adds metric type to Metic class by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3244
Improves drawJoints behavior by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3305
[api] Allows to control json pretty print with env var by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3288
[api] Avoid null dimensions for Metric by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3246
[api] Improve NDArray.toDebugString() output by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3290
[api] Loads native engine in deterministic order by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3300
[api] Refactor drawMask() for instance segmentation by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3304
[api] Refactor nms for yolo translator by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3297
add close method to all nd manager by @lanking520 in https://github.com/deepjavalibrary/djl/pull/3225
ported tools/stats.gradle by @elect86 in https://github.com/deepjavalibrary/djl/pull/3219
use standard GSON output by @lanking520 in https://github.com/deepjavalibrary/djl/pull/3284
[enhancement] Optimize memory copy overhead to enhance performance. by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3289
Gradle Kotlin script plus other stuff by @elect86 in https://github.com/deepjavalibrary/djl/pull/3167
Improved incremental build by @benjie332 in https://github.com/deepjavalibrary/djl/pull/3231
Refactored Identifiers by @congyuluo in https://github.com/deepjavalibrary/djl/pull/3276
Refactored Identifiers by @congyuluo in https://github.com/deepjavalibrary/djl/pull/3282
[gradle] Remove unused gradle files by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3280
[jacoco] exclude spark extension since it doesnot contain test by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3230
[Lgbm] support multi classification by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3234
[Lgbm] support multi type prediction by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3237
[llamacpp] Removing llamacpp support in DJL by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3312
[mxnet-model-zoo] Adds missing translatorFactory in metadata by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3279
[onnx] Adds fp16 and bfp16 support for OnnxRuntime by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3281
[onnxruntime] Add debug message for OnnxRuntime by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3217
[onnxruntime] Adds yolov8n pose model for OnnxRuntime by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3309
[onnxruntime] Adds yolov8n-seg model to onnxruntime model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3310
[onnxruntime] Load onnx extenstion if available by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3333
[pytorch] Adds Yolov8n-seg model to model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3308
[pytorch] Adds back PyTorch 2.1.2 support by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3285
[pytorch] Adds yolov8n pose estimation model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3298
[pytorch] Implements gammaln operator for PyTorch by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3262
[pytorch] Split maven publish into two parts by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3273
[rust] Add tokenizer cuda build workflow by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3322
[rust] Allows -2 as dims for sum() by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3221
[rust] Change loging level to debug by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3336
[rust] Download cu124 jni library for cuda by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3327
[rust] Remove 0-dimension tensor compare in NDArrayTests by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3320
[rust] Update gpu build pipeline to cu122 by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3334
[rust] Upgrade candle version by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3248
[rust] Use fused layer by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3260
[spark] Do not support model_url by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3224
[spark] Update dependency versions by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3241
[spark] Updates spark version to 3.5.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3240
[spark] Use batch predict API by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3242
[text-embedding] Remove CrossEncoderTranslatorFactory in favor of TextEmbeddingTranslatorFactory by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3239
[tokenizer] Adds maxos-13 support back by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3328
[tokenizer] Ensure GPU is used in TextEmbeddingTranslator by @david-sitsky in https://github.com/deepjavalibrary/djl/pull/3212
[tokenizer] Process text embedding input and output in stacked NDArray by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3213
[tokenizer] Recover accidentally deleted file by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3311
[tokenizer] Supports cross encoder for text classification model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3338
[tokenizers] Download jni lib files for cuda by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3326

Bug Fixes

[api] Fix unitest in GPU docker running on CPU case by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3228
[api] Fixes IdEmbedding memory leak by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3257
[api] Fixes nightly tests on GPU machine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3302
[api] Fixes unitest by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3210
[fix] fix lgbm bytebuffer native order by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3258
Fix Application.of missing some applications by @tadayosi in https://github.com/deepjavalibrary/djl/pull/3277
[mxnet] Fixes GloveWordEmbeddingTranslator bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3287
[pytorch-model-zoo]: fix PtSsdTranslator.Builder.self() by @eversnarf in https://github.com/deepjavalibrary/djl/pull/3204
[pytorch] Fixes PyTorch 2.3.1 windows dependencies by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3269
[pytorch] Fixes PyTorch 2.3.1 windows dependencies by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3270
[pytorch] Fixes uploadS3 gradle task by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3263
[rust] Fix NDArrayTests failure on cuda by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3319
[rust] Fix deleteModel error by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3229
[rust] Fix output tensor dtype by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3249
[rust] Fix tokenizer cuda pipeline name by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3325
[rust] Fixes test failure on GPU by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3301
[timeseries] Fixes contentLength issue for inference by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3306
[timeseries] Fixes duration format issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3307
[tensorrt] Fixes gradle biuld script by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3253
[tokenizer] Fixes detect include token type logic by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3318
[tokenizer] Fixes tokenizer build workflow by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3323
[tokenizers] Fixes huggingface build for Windows by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3330
[tokenizers] Fixes memory leak when there is overflowing tokens by @baldersheim in https://github.com/deepjavalibrary/djl/pull/3317
[xgb] Fixes gradle build script by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3254

Documentation

[doc] add output formatter schema to LMI docs.djl.ai by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/3268
[doc] add release notes to docs.djl.ai by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/3266
[docs] Bump up DJL version to 0.28.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3247
[docs] Update example reference by @emmanuel-ferdman in https://github.com/deepjavalibrary/djl/pull/3275
[docs] add dark theme and fixed broken link by @Varun-Dutta in https://github.com/deepjavalibrary/djl/pull/3295
[example] Adds PyTorch action recognition model to model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3292
[examples] Enabled training unit tests on macOS M1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3256
[examples] Fixes ObjectDetection example for macOS m1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3206
[examples] Fixes nightly build failure on Windows by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3267
[examples] Remove symbolic training for MXNet by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3299
Update README.md by @bradh in https://github.com/deepjavalibrary/djl/pull/3200

CI/CD

[android] Updates android with PyTorch 2.2.2 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3236
[api] Updates slf4j version to 2.0.13 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3329
[bom] Uses release version for tensorflow by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3313
[ci] Disable github actions runner for non-djl repo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3220
[ci] Fixes nightly publish for nodejs20 issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3314
[ci] Fixes publish maven native package by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3264
[ci] Fixes pytorch JNI build by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3339
[ci] Fixes rust jni build for nodejs20 issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3315
[ci] Fixes serving nightly publish by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3245
[ci] Fixes windows pytoch jni build by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3209
[ci] Minor github action workflow changes by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3331
[ci] Remove fastertransformer build workflow by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3291
[ci] Update to amazon-ecr-login@v2 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3250
[ci] Updates OnnxRuntime to 1.18.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3235
[ci] Updates dependency versions to latest by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3332
[ci] Updates spotbugs to 6.0.15 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3233
[djl] update djl version and readmes by @tosterberg in https://github.com/deepjavalibrary/djl/pull/3202
[MCM 0.29.0] Remove -SNAPSHOT for release v0.29.0 by @ydm-amazon in https://github.com/deepjavalibrary/djl/pull/3345
Add more test logging by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3321
[pytorch] Updates PyTorch to 2.3.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3265
[release] Bump up versions to 0.29.0 in documents to point to new url by @ydm-amazon in https://github.com/deepjavalibrary/djl/pull/3344
[tensorflow] Updates tensorflow to 2.16.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3283

New Contributors

@bradh made their first contribution in https://github.com/deepjavalibrary/djl/pull/3200
@eversnarf made their first contribution in https://github.com/deepjavalibrary/djl/pull/3204
@benjie332 made their first contribution in https://github.com/deepjavalibrary/djl/pull/3231
@emmanuel-ferdman made their first contribution in https://github.com/deepjavalibrary/djl/pull/3275
@tadayosi made their first contribution in https://github.com/deepjavalibrary/djl/pull/3277
@congyuluo made their first contribution in https://github.com/deepjavalibrary/djl/pull/3276
@Varun-Dutta made their first contribution in https://github.com/deepjavalibrary/djl/pull/3295
@baldersheim made their first contribution in https://github.com/deepjavalibrary/djl/pull/3317

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.28.0...v0.29.0

djl - DJL v0.28.0 Release

Published by tosterberg 5 months ago

Key Changes

Upgrades for engines
- PyTorch 2.2.2 https://github.com/deepjavalibrary/djl/pull/3155
- Sentencepiece 0.2.0 https://github.com/deepjavalibrary/djl/pull/3163
Enhancements for engines and API
- Adds experimental Rust engine https://github.com/deepjavalibrary/djl/pull/3078

Enhancement

[api] Automatically detect translatorFactory based on task by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3136
[api] Adds OnesBlockFactory to make it easy for testing by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3140
Ensure the alternative ND manager can use GPUs by @david-sitsky in https://github.com/deepjavalibrary/djl/pull/3138
[api] Tries to use the same device for alternative NDManager by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3146
[api] Supports serialize NaN in json by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3156
[rust] Add rust engine implemenation by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3078
[rust] Adds Rust model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3132
[rust] Support load DJL model for RsModel by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3147
[rust] RsModel delete model in close by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3170
[tokenizers] Updates tokenizer to 0.19.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3143
[tokenizer] Allows use HF_TOKEN to access gated model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3150
[tokenizers] Create djl_converter package by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3172
[tokenizer] Refactor djl_convert python code by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3179
Updates on djl_converter by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3187
[pytorch] Updates PyTorch to 2.2.2 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3155
[pytorch] Update PyTorch engine README for version 2.2.2 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3165
[pytorch] optimize memory copy cost for pytorch NDArray by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3137
[pytorch] Updates PyTorch to 2.3.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3192
[sentencepiece] Updates sentencepiece to 0.2.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3163
[huggingface] Adds more option to convert onnx model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3180

Bug Fixes

[gitignore] Avoid checking binary files. by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3134
[api] Closes file stream by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3130
[api] Fixes logging invoke convention by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3148
[api] Fixes Criteria.toString() bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3151
[api] Fixes tarslip issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3075
[examples] Fixes TextGeneration EOS bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3177
[tokenizer] Fixes model zoo import script by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3126
[Lgbm] fix LgbmNDArray replaced.close() release data problem by @ewan0x79 in https://github.com/deepjavalibrary/djl/pull/3174
[rust] Fixes compile warnings by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3189
[ci] Fixes pytorch jni build for 1.13.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3184
[ci] Fixes awscurl publish location by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3182
[ci] Fixes build on macOS aarch64 machine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3191
[ci] Fixes nightly pytorch jni build by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3196

Documentation

[examples] Re-organize CV examaples by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3135
[examples] Prepare for MXNet deprecation by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3157
[doc] Removes mention of future lab by @zachgk in https://github.com/deepjavalibrary/djl/pull/3154
[docs] Updates docs for setup java on mac by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3188
[website] Remove live demo from djl.ai web page by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3171
Fixed Typo in Docs by @fensch in https://github.com/deepjavalibrary/djl/pull/3193
Update README.md by @elect86 in https://github.com/deepjavalibrary/djl/pull/3195

CI/CD

[ci] Update github action runner to macOS x86_64 instance by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3144
[ci] Updates google code formatter to 1.22.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3149
[ci] Upgrades gradle to 8.5 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3153
[ci] Updates dependencies version by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3164
[ci] Adds cuda version as github actions parameter for Pytorch JNI build by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3185

New Contributors

@david-sitsky made their first contribution in https://github.com/deepjavalibrary/djl/pull/3138
@elect86 made their first contribution in https://github.com/deepjavalibrary/djl/pull/3195

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.27.0...v0.28.0

djl - v0.27.0

Published by xyang16 7 months ago

Key Changes

Upgrades for engines
- OnnxRuntime 1.17.1 https://github.com/deepjavalibrary/djl/pull/3019
Enhancements for engines and API
- Supports PyTorch stream imperative model load https://github.com/deepjavalibrary/djl/pull/2981
- Support encode/decode String tensor https://github.com/deepjavalibrary/djl/pull/3034

Enhancement

Suppress serial warning for JDK21 by @zachgk in https://github.com/deepjavalibrary/djl/pull/2935
[api] Moves commons-compress dependency to standalone class. by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2951
[api] Allows to load .pt or .onnx file from jar url by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2955
[tokenizer] Return if exceed max token length by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2957
[tokenizer] Adds getters for HuggingfaceTokenizer by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2958
[pytorch] Upgrade android build to 0.26.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2975
[pytorch] Avoid loading .lib file from PYTORCH_LIBRARY_PATH by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2987
[api] Adds utility method to Model for accessing properties by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3007
[api] Adds suffix to percentile metric name by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3011
[api] Adds dimension for prediction metric by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3013
Thread-safe FaceDetectionTranslator by @StefanOltmann in https://github.com/deepjavalibrary/djl/pull/3016
[api] Upgrades commons compress to 1.26.0 for CVE by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3018
Avoid duplicated loading native library by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3020
[api] Allows to use relative jar uri for cache folder name by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3026
support includeTokenTypes in TextEmbeddingBatchTranslator by @morokosi in https://github.com/deepjavalibrary/djl/pull/3032
[tokenizer] Adds includeTokenTypes for all translators by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3035
Updates dependencies version to latest by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3040
[pytorch] Allows to exclude certain DLL from pytorch directory by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3043
Update checkstyle tool version to 10.14.2 by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3047
Upgrade dependency version by @xyang16 in https://github.com/deepjavalibrary/djl/pull/3049

Bug Fixes

[fix][ci] fix typo in publish metric workflow by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2976
[fix][ci] avoid early exit of script for failure case by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2979
[ci][fix] update path to android sdk manager cli by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2980
[dataset] Fixes broken link for mnist dataset by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2984
[database] Fixes mnist URL for local unitest by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2988
fix #2968 by @SidneyLann in https://github.com/deepjavalibrary/djl/pull/2986
[dataset] Fixes wikitext-2 by @zachgk in https://github.com/deepjavalibrary/djl/pull/2996
[spark] Fixes python tarslip security concern by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2995
Fixes failing CI by @ydm-amazon in https://github.com/deepjavalibrary/djl/pull/3001
Fixes cases where the getEngine method in the EngineProvider class returns null when called concurrently. by @onaple in https://github.com/deepjavalibrary/djl/pull/3005
[api] Fixes typo in CudaUtils by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3008
[model-zoo] Fixes typo in README by @fensch in https://github.com/deepjavalibrary/djl/pull/3009
[ci] Fixes nightly build for onnx 1.17.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3021
[pytorch] Fixes detecting wrong flavor on macOS issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3027
[bom] Fixes djl-serving packages in BOM by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3039

Documentation

Bump DJL version to 0.27.0 by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2933
[doc] include trtllm convert manual by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2941
[docs] Updates README by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2954
[doc] Make LMI a separate tab and include I/O schema by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2960
[docs] Fixes cuda version for pytorch native library by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2963
docs: add AWS Graviton3 PyTorch inference tuning details by @snadampal in https://github.com/deepjavalibrary/djl/pull/2982
[docs] Update Huggingface tokenizer cache directory document by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2994
[docs] Disable progress bar for jupyter notebook convertion by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3017
[example] Adds document about how to trace gpt2 model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/3028
[docs] update mkdocs structure for new lmi documentation by @siddvenk in https://github.com/deepjavalibrary/djl/pull/3029

CI/CD

removing pytorch 2.0.1 from 0.27.0 by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2940
Moves to Actions hosted M1 runner by @zachgk in https://github.com/deepjavalibrary/djl/pull/2948
[ci] Disable run scheduled github actions in fork by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2943
[ci] add cloudwatch metrics for scheduled workflow failures by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2966
[ci] Upgrade github actions nodejs 16 to nodejs 2 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2967
[ci] Upgrade codeql-actions to v3 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2973
[ci] Upgrade aws-actions/configure-aws-credentials to v4 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2972
[ci] refactor cloudwatch metric publishing to avoid needing changes i… by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2974
[ci] Downgrade github actions version for centos7 and amazonlinux by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2977
[ci] move cw publish step to github hosted runner by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2978
[CI] downgrade the version to V3 by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2990
[CI] change to cache v3 for the versions by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2991
Uses gradle dependency submission by @zachgk in https://github.com/deepjavalibrary/djl/pull/2983
Excludes test dependencies from dependency submission by @zachgk in https://github.com/deepjavalibrary/djl/pull/2999
Update continuous OSX to 13 by @zachgk in https://github.com/deepjavalibrary/djl/pull/3004
Removes dependency submission by @zachgk in https://github.com/deepjavalibrary/djl/pull/3006

New Contributors

@snadampal made their first contribution in https://github.com/deepjavalibrary/djl/pull/2982
@ydm-amazon made their first contribution in https://github.com/deepjavalibrary/djl/pull/3001
@onaple made their first contribution in https://github.com/deepjavalibrary/djl/pull/3005
@fensch made their first contribution in https://github.com/deepjavalibrary/djl/pull/3009
@StefanOltmann made their first contribution in https://github.com/deepjavalibrary/djl/pull/3016
@morokosi made their first contribution in https://github.com/deepjavalibrary/djl/pull/3032

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.26.0...v0.27.0

djl - DJL v0.26.0 Release

Published by siddvenk 9 months ago

Key Changes

LlamaCPP Support. You can use DJL to run supported LLMs using the LlamaCPP engine. See the Chatbot example here to learn more.
Manual Engine Initialization. You can configure DJL to not load any engines at startup, and query/register engines programmatically at runtime
Engine Updates:
- PyTorch 2.1.1
- Huggingface Tokenizers 0.15.0
- OnnxRuntime 1.16.3
- XGBoost 2.0.3

Enhancement

Add erf and atan2 by @TalGrbr in https://github.com/deepjavalibrary/djl/pull/2842
Add FFT2 and FFT2 inverse by @TalGrbr in https://github.com/deepjavalibrary/djl/pull/2845
[tokenizer] Update import script for huggingface_hub api change by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2850
[tokenizer] Not returns overflow tokens by default by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2857
[pytorch] Updates PyTorch engine to 2.1.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2864
Adds Device.getDevices() for all Device by @zachgk in https://github.com/deepjavalibrary/djl/pull/2820
Creates DJL manual engine initialization by @zachgk in https://github.com/deepjavalibrary/djl/pull/2885
[pytorch] Allows to load libstdc++.so.6 form different location by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2929
Add Evaluator support to update multiple accumulators by @petebankhead in https://github.com/deepjavalibrary/djl/pull/2894
Adds llama.cpp engine by @bryanktliu in https://github.com/deepjavalibrary/djl/pull/2904
Yelov8 Translator optimization by @gevant in https://github.com/deepjavalibrary/djl/pull/2908
[pytorch] Adds Yolov8n model to pytorch model zoo. by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2910
[onnx] Adds yolov8n to model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2909
[llama.cpp] Adds unit-test and standardize input parameters by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2905
[llama.cpp] Adds llama.cpp huggingface model zoo by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2911
[XGBoost] Updates XGBoost to 2.0.3 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2915
[pytorch] Upgrade pytorch andorid to 2.1.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2914
add awscurl release by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2917
[awscurl] change build to jar by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2918
[bom] Adds llama engine to BOM by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2916
[api] Adds ModelZooResolver interface by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2922
[api] Use folk java process to avoid jvm consume GPU memory by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2882
[onnxruntime] Updates OnnxRuntime to 1.16.3 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2888
Tokenizers: Updated huggingface_models.py to support Safetensors models as well as pytorch by @dameikle in https://github.com/deepjavalibrary/djl/pull/2880
[tokenizer] Uses fp32 for TextembeddingTranslator clip() by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2881
[tokenizer] Updates huggingface tokenizer to 0.15.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2867

Bug Fixes

[tokenizer] Fixes tokenizer bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2843
Fixes archiveBaseName in native builds by @zachgk in https://github.com/deepjavalibrary/djl/pull/2859
[pytorch] Ensure shared library loading order for aarch64 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2892
[api] Handles both JNA conflict and missing case by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2896
Minor fixes to improve Apple Silicon MPS support by @petebankhead in https://github.com/deepjavalibrary/djl/pull/2873
[tokenizer] Handles import huggingface model zoo exception case by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2872
[api] Update offline property name to avoid conflict with other app. by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2877
[tensorflow] Revert InstanceHolder for TensorFlow engine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2884
[pytorch] Revert InstanceHolder for PyTorch engine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2876
[pytorch] Fixes windows load nvfuser_codegen bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2868

Documentation

[docs] Update serving configuration nav by @zachgk in https://github.com/deepjavalibrary/djl/pull/2853
Updates DJL version to 0.25.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2860
Bump up DJL version to 0.26.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2861
[docs] Move jupyter notebooks to DJL Demo by @zachgk in https://github.com/deepjavalibrary/djl/pull/2854
[docs] Include LMI documents by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2870
[docs] Updates documents to use JDK 17 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2898
Updates DJL version to 0.26.0 by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2930
update master branch on the website to have large model inference guide by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2865

CI/CD

[ci] Allows build project with JDK 21 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2903
[ci] Fixes pytorch android build by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2921
[ci] Fix build failure for build-pytorch-jni-linux by @maaquib in https://github.com/deepjavalibrary/djl/pull/2920
[ci] Fixes native ci build failure by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2924
[CI] Fixes flaky early stopping test by @zachgk in https://github.com/deepjavalibrary/djl/pull/2866
[ci] Fixes flaky early stopping training test by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2879
[ci] Use JDK 17 for github actions workflow by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2897
[ci] Fixes github action for centos and amazonlinux by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2913
[ci] Use macos-13 to avoid flaky test by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2927
[test] Fixes EarlyStopping flaky test by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2926
[api] Updates dependencies to latest version by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2928
[api] Updates common-compress version to address CVE issues by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2871
only build triton binaries by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2847

New Contributors

@TalGrbr made their first contribution in https://github.com/deepjavalibrary/djl/pull/2842
@petebankhead made their first contribution in https://github.com/deepjavalibrary/djl/pull/2873
@dameikle made their first contribution in https://github.com/deepjavalibrary/djl/pull/2880
@gevant made their first contribution in https://github.com/deepjavalibrary/djl/pull/2908
@maaquib made their first contribution in https://github.com/deepjavalibrary/djl/pull/2920

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.25.0...v0.26.0

djl - DJL v0.25.0 Release

Published by siddvenk 10 months ago

Key Changes

Engine Upgrades
- [XGB] support for .xgb file extension https://github.com/deepjavalibrary/djl/pull/2810
- [Tokenizers] Upgrade tokenizers to 1.14.1 https://github.com/deepjavalibrary/djl/pull/2818
- [XGB] Updates XGBoost to 2.0.1 https://github.com/deepjavalibrary/djl/pull/2833
Early Stopping support for Training by @jagodevreede https://github.com/deepjavalibrary/djl/pull/2806

Enhancement

[tokenizer] Allows import non-english model by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2797
[api] Allows cancel Input by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2805
[huggingface] Adds CrossEncoderTranslator by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2817
Creates MultiDevice by @zachgk in https://github.com/deepjavalibrary/djl/pull/2819
[api] Refactor PublisherBytesSupplier.java by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2831
[api] Replace double-check singlton with lazy initialization by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2826

Bug fixes

[api] Fixed NDList decode numpy file bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2804

Documentation and Examples

Updates doc versions to 0.24.0 by @zachgk in https://github.com/deepjavalibrary/djl/pull/2829
[docs] Fixes markdown headers by @zachgk in https://github.com/deepjavalibrary/djl/pull/2812
Bump up DJL version to 0.25.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2809
Update README with release update by @zachgk in https://github.com/deepjavalibrary/djl/pull/2823

CI

[FT Deps] allow to just build for 1 flow by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2798
[ci] Fixes out of diskspace issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2808
Add Triton gpu flag build on by @lanking520 in https://github.com/deepjavalibrary/djl/pull/2815

New Contributors

@jagodevreede made their first contribution in https://github.com/deepjavalibrary/djl/pull/2806

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.24.0...v0.25.0

djl - DJL v0.24.0 Release

Published by zachgk about 1 year ago

Key Features

Engine Upgrades
- Makes PyTorch 2.0.1 the default version https://github.com/deepjavalibrary/djl/pull/2710
- OnnxRuntime to 1.16.0 https://github.com/deepjavalibrary/djl/pull/2784
SafeTensors support https://github.com/deepjavalibrary/djl/pull/2763
YoloV8 Support https://github.com/deepjavalibrary/djl/pull/2776

Enhancement

[spark] Update djl version in dockerfile by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2712
[pytorch] Makes PyTorch 2.0.1 default version for DJL 0.24.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2710
pytorch support inference on separate cuda stream by @jiyuanq in https://github.com/deepjavalibrary/djl/pull/2706
[spark] Update javacv version to 1.5.9 for spark docker by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2713
[pytorch] Upgrade pytorch andorid to 2.0.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2717
[api] Makes getNeuronDevices() public by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2721
[api] Log warning message if failed to load specified class by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2724
[api] Workaround detect neuron issue on SageMaker by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2729
Setup custom ft build for Llama support by @rohithkrn in https://github.com/deepjavalibrary/djl/pull/2732
[api] Fixes NeuronUtils issue when running as non-root user by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2735
Adds Utils.getEnvOrSystemProperty with default by @zachgk in https://github.com/deepjavalibrary/djl/pull/2742
Issue #2693 Implement PtNDArrayEx.multiBoxPrior with validation by @juliangamble in https://github.com/deepjavalibrary/djl/pull/2715
[api] Implements NDArray.toType() for NDArrayAdapter by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2746
[onnxruntime] Upgrades onnxruntime version to 1.15.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2743
[api] Output endPosition induced by reaching EOS token by @KexinFeng in https://github.com/deepjavalibrary/djl/pull/2730
[api] Adds Safetensors support by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2763
[SentencePiece] Make SpProcessor public by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2765
[tokenizer] Print out warning in model_zoo_importer by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2759
To support Yolov8 by @SidneyLann in https://github.com/deepjavalibrary/djl/pull/2776
[onnxruntime] Upgrades OnnxRuntime to 1.16.0 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2784
Build FT for sm90 by @rohithkrn in https://github.com/deepjavalibrary/djl/pull/2785
PtndArrayEx.multiboxDetection() implementation by @juliangamble in https://github.com/deepjavalibrary/djl/pull/2769

Bug fixes

[api] Fixes ChunkedBytesSupplier read timeout bug by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2716
[fix] Set past_kv name for corner case. by @KexinFeng in https://github.com/deepjavalibrary/djl/pull/2722
Fix AmazonReviews by @zachgk in https://github.com/deepjavalibrary/djl/pull/2725
Fix issue with setPadding and setTruncation overriding configurations… by @siddvenk in https://github.com/deepjavalibrary/djl/pull/2741
Fixes #2744, support onnx model for TextEmbeddingTranslator by @bryanktliu in https://github.com/deepjavalibrary/djl/pull/2749
[api] Fixes NDArray.toDevice() missing name issue by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2751
[pytorch] Avoid toByteBuffer() crash for large tensor by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2780

Documentation and Examples

Update DJL version to 0.23.0 in documents by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2694
[docs] Updates README for pytorch 2.0.1 by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2705
Update docs and Bump up version to 0.24.0 by @sindhuvahinis in https://github.com/deepjavalibrary/djl/pull/2708
[docs] Updates troubleshooting README to remove outdated content by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2734
[docs] Update IntelliJ debug view image by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2747
[examples] Fixes whipser model on GPU machine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2752

CI

[api] Restore Lm search unittest to recover coverage rate by @KexinFeng in https://github.com/deepjavalibrary/djl/pull/2723
[ci] Fixes PMD warnings by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2764
[ci] Fixes gradle deprecation warnings by @frankfliu in https://github.com/deepjavalibrary/djl/pull/2774

New Contributors

@jiyuanq made their first contribution in https://github.com/deepjavalibrary/djl/pull/2706
@rohithkrn made their first contribution in https://github.com/deepjavalibrary/djl/pull/2732
@SidneyLann made their first contribution in https://github.com/deepjavalibrary/djl/pull/2776

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.23.0...v0.24.0

djl - DJL v0.23.0 release

Published by sindhuvahinis over 1 year ago

Key Features

Upgrades for engines
- Upgrades PyTorch engine to 2.0.1
- Upgrades javacpp version to 1.5.9 (#2636)
- Upgrades HuggingFace tokenizer to 0.13.3 (#2697)
- Upgrades OnnxRuntime version to 1.15.0 and other dependencies version (#2658)
Enhancements for engines and API
- Adds XGBoost aarch64 support (#2659)
- Adds fastText macOS M1 supports (#2639)
- Creates asynchronous predictStreaming (#2615)
Introduces text-generation search algorithm
- Implements text-generation search algorithm (#2637)
- Enhancement features for LMSearch (#2642)

Enhancement

DJL API improvements:
- Adds uint16, uint32, uint64, int16, bf16 data type (#2570)
- Adds NDArray topK operator for PyTorch (#2634)
- Adds support for unsigned datatype (#2574)
- Allows subclass access member variable of Predictor (#2582)
- Makes PredictorContext constructor public (#2586)
- Refactor ChunkedBytesSupplier to avoid unnecessary conversion (#2587)
- Move compileJava() into ClassLoaderUtils (#2600)
- Enable boolean input on ort (#2644)
- Adds more logs for platform detection (#2646)
- Improves DJL URL error message (#2678)
- Avoid exception if SecurityManager is applied (#2665)
- Masks sensitive env vars in debug print out (#2657)
- open isImage() method to package children for reuse-enabling custom datasets (#2662)
- Migrate google analytics (#2654)
PyTorch engine improvements
- Load dependencies with specific order (#2599)
- Improves IValue tuple of tuple support (#2651)
- Add basic median support (#2701)
Spark extension enhancements
- Support requirements.txt in model tar file (#2528)
- Upgrade dependency version in Dockerfile (#2569)
- Use batch predict in spark (#2545)
- Change implicit conversions to explicit (#2595)
Huggingface tokenizer enhancements
- Allow creating BPE huggingface tokenizers (2550)
Tensorflow engine enhancements
- Reload javacpp properties (2668)

Breaking change

Bug fixes

Avoids exception for cuda version lower than 10.x (#2583)
Reverts "[bom] Simplify BOM build script (#2438)" 2598
CI fails looking for v3 reverting to v2 (#2604)
Fixes the dependencies issue (#2609)
Fixes the usage of the repeat function for embedding (#2590)
Adds missing djl-zero to bom (#2625)
Fixes tabnet predictor (#2643)
Fixes error message in X.dot(w) (#2688)
Fixes liquid parsing issues in pytorch ndarray cheatsheet (#2690)
Fixes getIOU bug (#2674)
Fixes setup formatting (#2653)
Fixes broken link (#2622)
Fixes LocalRepository detection (#2593)
Fix jupyter notebook links (#2704)

Documentation and Examples

Adds docs on JNI compilation (#2510)
Updates import tensorflow model README (#2614)
Updates pytorch native JNI development document (#2613)
Setup - Running on M1 Macs (#2652)
Adds pytorch vs djl ndarray cheatsheet (#2661)
Updates timeseries README (#2667)
Adds PT NDArray cheat sheet to docs (#2670)
Cleans stable_diffusion and add missing .md language blocks (#2635)
Updates README (#2596)
Fixes typos (#2603)
Fixes markdown format (#2608)

CI improvements

Upgrades github action gradle plugin to v3 (#2576)
Avoids upload djl-serving.tar to S3 if already exist (#2578)
Upgrades spotbugs to 5.0.14 (#2594)
Upgrades gradle to 8.1.1 (#2611)
Publishes PyTorch 2.0.1 jni package (#2699)
Removes cu102 test (#2689)
Fixes nightly publish ci bug (#2691)
Fixes djl-serving release publish workflow script (#2568)
Minor fix to the instance spinning (#2606)
Adds Triton and FasterTransformers source build instruction (#2605)
Publishes triton executable (#2617)
Adds http endpoint in the build (#2618)

Contributors

Thank you to the following community members for contributing to this release:
@frankfliu
@KexinFeng
@lanking520
@xyang16
@zachgk
@takanori-ugai
@tosterberg
@siddvenk

New Contributors

@larochef made their first contribution in https://github.com/deepjavalibrary/djl/pull/2550
@Crusader99 made their first contribution in https://github.com/deepjavalibrary/djl/pull/2603
@juliangamble made their first contribution in https://github.com/deepjavalibrary/djl/pull/2652
@i10416 made their first contribution in https://github.com/deepjavalibrary/djl/pull/2661

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.22.1...v0.23.0

djl - DJL v0.22.1 release

Published by frankfliu over 1 year ago

Key Features

Upgrades and enhancements for Engines
- Upgrades PyTorch to 1.13.1 (#2245)
- Upgrades TensorFlow engine to 2.10.1 (#2440)
- Upgrades XGBoost to 1.7.5 (#2522)
- DJLServing release 0.22.1

Enhancement

Introduces several enhancement for HuggingFace tokenizer:
- Allows tokenizer native library load from different classloader (#2465)
- Makes Huggingface model zoo lazy load (#2469)
- Make Huggingface tokenizers translator factory serializable (#2442)
Introduces several enhancement for Spark extension:
- Adds audio predictors (#2466)
- Adds more image predictors and change some APIs (#2456)
- Adds more text predictors (#2443)
- Adds np_util (#2419)
- Adds pyspark TextEmbedder and update ImageClassifier (#2414)
- Adds text generation in pyspark (#2477)
- Adds text2text generation (#2506)
- Adds whisper python code (#2513)
- Upgrades spark version to 3.3.2 (#2523)
DJL API improvements:
- Adds support for unique, bmm, xlogy (#2415)
- Fixes NDArray.toByteArray() bug (#2436)
- Adds NDArray.copyTo() support for NDArrayAdapter (#2437)
- Improves Classifications.toString() print out (#2439)
- Makes Batchifier serializable (#2441)
- Loads inputShapes in the loadMetadata method of Linear block (#2448)
- Adds chunked output support (#2453)
- Makes audio and cv translator factory serializable (#2455)
- Adds NamedEntity.toString() function (#2468)
- Streaming Predict and streamable BytesSupplier (#2470)
- Mitigates ZipInputStream CVE. (#2473)
- Adds getProperties() to Model interface (#2476)
- Adds non-blocking poll() for BytesSupplier (#2478)
- Makes PassthroughNDManager aware of engine and device (#2484)
- Fixes telemetry opt out (#2490)
- Uses sha-256 to avoid security warning (#2495)
- Moves NeuronUtils to api package (#2496)
- Adds encode and decode to Input and Output (#2502)
- Fails model loading if specified translator not found (#2515)
- Adds a way to check if streaming is supported (#2518)
- Fixed detect platform for different CUDA version (#2527)
- Fixes neuron core detection in docker container (#2536)
PyTorch engine improvements:
- Upgrades PyTorch engine to 2.0.0 (#2525)
- Implements unique operator for PyTorch engine (#2417)
- Adds yolov5s to pytorch model zoo (#2433)
- Respect PYTORCH_FLAVOR override to download libtorch (#2486)
- Print log if graph optimizer is enabled (#2501)
OnnxRuntime engine improvements:
- Adds support for OnnxRuntime Profiler (#2472)
MXNet engine improvements:
- Enables boolean index on mxnet (#2427)

Breaking change

Bug fixes

Fixes pytorch-native-cu118 package in BOM (#2535)
Fixes spark package name in BOM (#2534)
Fixes OnnxRuntime version (#2524)
Fixes memory leak in get with and to long, double, float, ... (#2428)

Documentation and Examples

Adds timeseries examples document (#2411)
Fixes link to doc Mask detection with YOLOv5 (#2529)
Adds DeferredTranslatorFactory to tokenizers example (#2511)
Updates depednency manage for spark extension (#2531)
Updates FAQ and troubleshooting documents (#2454)
Cleans inference performance optimization doc (#2519)
Adds Yolov5 on Face Mask Detection (#2452)

CI improvements:

Simplifies BOM build script (#2438)
Avoids re-publish serving tarball (#2479)
Fixes gradle 8.0 native publish issue (#2457)
Fixes gradle 8.0 publish to release issue (#2460)
Upgrades gradle to 8.0.2 (#2449)
Uses recommended way to create task in build.gradle (#2451)

Contributors

@frankfliu
@KexinFeng
@lanking520
@nezda
@tipame
@xyang16
@zachgk

New Contributors

@tipame made their first contribution in https://github.com/deepjavalibrary/djl/pull/2415
@nezda made their first contribution in https://github.com/deepjavalibrary/djl/pull/2511

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.21.0...v0.22.1

djl - DJL v0.21.0 release

Published by frankfliu over 1 year ago

Key Features

Upgrades and enhancements for Engines
- Upgrades PyTorch to 1.13.1 (#2245)
- Upgrades ONNXRuntime to 1.14.0 (#2393)
- Upgrades HuggingFace tokenizer version to 0.13.2 (#2369)
- Upgrades XGBoost to 1.7.3 (#2371)
- Removes Neo-DLR engine from DJL #2373
Introduces several improvements for extensions:
- Adds batch support huggingface tokenizer
- Adds API improvement for Spark extensions
- Add a few image processing methods in OpenCV extension (#2320)
- Adds stft and fft forier transform for audio extension (#2259)
Implements NDScope to automatically close NDArray in the scope (#2321)
Allows MXNet runs on Ampere GPU (#2313)
DJLServing release
- Adds faster transformer support (#424)
- Adds Deepspeed ahead of time partition script in DLC (#466)
- Adds SageMaker MME support (#479)
- Adds support for stable-diffusion-2-1-base model (#484)
- Adds support for stable diffusion depth model (#488)
- Adds out of memory protection for modle loading (#496)
- Makes load_on_devices per model setting (#493)
- Adds several per model settings
- Improves management console model loading and inference UI (#431, #432)
- Updates deepspeed to 0.8.0 (#465)

Enhancement

Introduces several enhancements for timeseries extension:
- Adds probability distribution support for timeseries (#2025)
- Add time series dataset support for timeseries package (#2026)
- Add some basic block and deepAR model (#2027)
- Enable pytorch deepar model inference in time series package (#2149)
Introduces several enhancement for HuggingFace tokenizer:
- Adds batch encoding support(#2342, #2343, #2337, #2338)
- Adds batchEncode for text pair (#2339)
- Adds mean_sqrt_len and weightedmean pooling for TextEmbedding (#2272)
- Adds more pooling mode form TextEmbedding (#2261)
- Allows Huggingface model zoo list models in offline mode (#2322)
- Update TextEmbedding pooling model name (#2314)
Introduces a few new examples:
- Adds clip model to examples (#2239)
- Adds openai whisper model to examples (#2293)
- Adds stable diffusion examples (#2246)
Introduces several enhancement for Spark extension:
- Add pyspark support (#2301)
- Adds spark extension docker image (#2243)
- Adds Numpy binary translator (#2399)
- Adds huggingface tokenizer support for Spark (#2311)
- Refactor Spark extension API (#2370)
DJL API improvements:
- Adds limit and callback for Metrics API (#2362)
- Adds newBaseManager(String engineName) api (#2275)
- Falls back to PassthroughNDManager if there is no engine (#2354)
- Improves Criteria.build() error message (#2397)
- Improves hybrid engine operators (#2279)
- Improve NDArray encode/decode performance (#2361)
- Refactors Engine class (#2303)
- Implements gatherNd, partial flatten and enable BertOnCode training (#2216)
- Makes TestDataset constructor protected (#2271)
- Creates SimplePaddingStackBatchifier (#2384)
- Creates standard for PreTrained behavior (#2360)
- Creates the TabularTranslator (#2344)
- Enables tuning distill_bert embedding layer (#2203)
- Handle RuntimeException on ImageFactory::newInstance (#2241)
- Implement AdamW on Pytorch and MXNet (#2206)
- Improves TranslatorExpansions with pre-processing and post-processing (#2213)
- Opens LayerNorm.Builder for inheritance (#2309)
- Adds feature to identify NDArray if double closed (#2352)
- Adds back support for getting managed Arrays (#2386)
- Opens Conv2d block constructor for inheritance (#2231)
PyTorch engine improvements:
- Adds scatter function to PtNDArray (#2332)
- Search for model.pt or model.onnx when loading the model (#2364)
- Better handle String tensor operations (#2380)
- Fixes typo in error message (#2355)
- Makes JNI comptible with PyTorch 1.11.0 (#2263)
- No longer search java.library.path (#2235)
- Uses runMethod to replace forward function (#2234)
- Workaround hann_window issue for PyTorch 1.12.1 (#2262)
- Adds support for torch::cuda::empty_cache(). (#2305)
- Fixes memory leak in PyTorch indexing fuction (#2300)
LightGBM engine improvements:
- Fixes atomic move issue (#2258)
- Fixes byte order for fp32 and fp64 array creation (#2278)
Updates library dependencies:
- Reduces aws s3 extension dependencies (#2378)
- Reduces hadoop extension dependencies (#2377)
- Removes unnecessary dependency (#2329)
- Upgrade dependencies versions (#2371)
Adds aarch64 and macOS M1 support for SentencePiece (#2325, #2324)
Adds centos 7 support for SentencePiece (#2402)
Adds aarch64 support for audio extension (#2250)

Breaking change

Removes Neo-DLR engine from DJL #2373
Moves RawDataset from basicdata to api module (#2375)
Changes Spark extension API (#2388)

Bug fixes

Fixes nested model directory issue (#2214)
Fixes NPE if block is cleard before training (#2365)
Fixes toDebugString() IllegalStateException if NDArray is closed (#2347)
Fixes typo in javadoc (#2286)
Fixes unittest failure on GPU (#2255)
Fixes inconsistencies using RANK in engine providers (#2244)
Fixes the content format (#2229)
Fixes the name passing bug (#2222)
Fixes TrainAmazonReviewRanking example for PyTorch (#2173)
Fixes performance issues from freezing MXNet (#2394)

Documentation and Examples

Fixes ONNXRuntime Android doc (#2194)
Updates readme of tranferFreshFruilt (#2236)
Updates cache menagement adds ONNX and Huggingface cache directory (#2334)
Updates FAQ (#2328)
Fix typo in javadoc of Block and AbstractBlock (#2288)
Adds extensions to docs web page (#2284)
Adds missing DJLServing docs menu items (#2363)
Improves documents (#2396)
Updates PyTorch graph exector optimization document (#2374)
Updates README for engines (#2282)
Fixes whisper model translator (#2341)
Documents workaround AudioGrabber bug for whisper model (#2330)
Adds tensorflow dependency to pom.xml (#2268)
Fixes PyTorch nightly test (#2264)
Fixes Onnx version in README (#2253)
Update onnx version in README file (#2223)
Fix djl-zero doc (#2201)

CI improvements:

Adds lost package for Codespaces Dockerfile (#2260)
Uses ubuntu-latest as runner (#2199)
Updates configure-aws-credentials to v1-node16 (#2196)
Adds formatCpp exclusion to gradle plugin (#2215)
Allows publish djl-serving from a branch (#2225)
Build cu102 JNI for PyTorch 1.11.0 and 1.12.1 (#2254)
Fixes jdk18 compile error (#2306)
Replace jacoc root report with aggregation report plugin (#2280)
Updates github action setup-python to v4 (#2202)
Upgrade gradle to 7.6 (#2302)
Upgrade testng to 7.7.0 (#2230)
Fixes PMD 6.21.0 reported issues (#2296)
Removes DLR github actions (#2398)
Sets seed in TransferFreshFruitTest for consistent results (#2238)
fastText should not depends on any engine #2353
Avoid unittest polluting cache directory. (#2228)
Minor refactor encoding class (#2336)
Split TranslatorTest into smaller test classes. (#2335)

Contributors

@dayo05
@demq
@enpasos
@frankfliu
@KexinFeng
@lanking520
@Noricks
@siddvenk
@SuperMaskv
@xyang16
@zachgk

New Contributors

@dayo05 made their first contribution in https://github.com/deepjavalibrary/djl/pull/2241
@Noricks made their first contribution in https://github.com/deepjavalibrary/djl/pull/2260
@SuperMaskv made their first contribution in https://github.com/deepjavalibrary/djl/pull/2288

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.20.0...v0.21.0

djl - DJL v0.20.0 release

Published by xyang16 almost 2 years ago

Key Features

Upgrades and enhancements for Engines
- Upgrades PyTorch to 1.13.0 (#2157)
- Add support for Apple's Metal Performance Shaders (MPS) in PyTorch (#2037)
- Add system property to config GraphExecutorOptimize (#2156)
- Upgrades ONNXRuntime to 1.13.1 (#2115)
- Upgrades Paddle to 2.3.2 (#2116)
- Upgrades TensorFlow to 2.7.4 (#2121)
- Upgrades HuggingFace tokenizer version to 0.13.1 (#2127)
- Upgrades XGBoost to 1.7.1 (#2143)
DJLServing
- Adds large model inference support with MPI mode (#291)
- Adds built-in DeepSpeed handler (#292)
- Publishes PaddlePaddle docker image (#342)
Adds TabNet Training (#2057)
Publishes DJL Zero (#2091)
Adds Spark extension (#2162)
Introduces several improvements for timeseries extension
Adds ImageFeatureExtractor example and resnet base model to model zoo

Enhancement

Introduces several enhancements for timeseries extension:
- Adds probability distribution support for timeseries (#2025)
- Add time series dataset support for timeseries package (#2026)
- Update M5Forecast dataset and its unittest (#2105)
- Add some basic block and deepAR model (#2027)
- Enable pytorch deepar model inference in time series package (#2149)
Introduces several enhancement for HuggingFace tokenizer:
- Enhance huggingface text embedding translator to support max length padding (#2049)
- Add cli options to only validate jit model on CPU (#2052)
- Add batch decoding methods for tokenizers (#2154)
Adds new models to DJL model zoo:
- Adds TabNet model for tabular dataset in modelzoo (#2036)
- Adds yolo5s to OnnxRuntime model zoo (#2046)
- Object Detection (#1930)
- Adds image classification resnet18 base model to model zoo (#2079)
DJL API improvements:
- Adds Sparsemax block (#2028)
- Updates the SemanticSegmentationTranslator (#2032)
- Creates Ensembleable (#2043)
- Handle error when forget to initialize a child block (#2045)
- Adds draw mask for BitMapWrapper (#2071)
- Allows show NDArray content in debugger (#2078)
- Rename transparency to opacity in CategoryMask (#2081)
- Allows to show NDArray content in Debugger 2 (#2080)
- Transfer learning with pytorch engine on fresh fruit dataset (#2070)
- Ensure GradientCollector can clear gradients (#2101)
- Handles conflict JNA package issue (#2118)
- Adds Multiplication block (#2110)
- Allows non-ServingTranslatorFactory for DJLServing (#2148)
- Adds cumprod operator (#2152)
- Adds Randperm on PyTorch and MxNet (#2084)
- Creates translator options (#2145)
CI improvements:
- Add Mac M1 build (#2039)
- Publishes serving tar and zip (#2014)
- Uploads djl-bench release artifacts to S3 (#2020)
- Upgrade deprecated github actions (#2119)
- Upgrade github actions to latest version (#2122)
- Compile JNI only when file changes (#2161)
- Speed up continuous build by not uploading jacoco report (#2166)
- Respect -SNAPSHOT version in jar manifest (#2177)
- Allows JNI to be compiled on headless jdk (#2098)
- Move model zoo download test to canary (#2169)
- Upgrades PyTorch for Android to 1.13.0 (#2171)
- Add some unit tests (#2063)
- Test accumulating gradient collector (#2111)
- Refactor unit test TestRequirements, add missing TestRequirements (#2120)
Upgrade protobuf version to 3.20.2 (#2035)
Update deeplabv3 model zoo metadata (#2051)
Remove String tensor limitation for model output (#2056)
Disables mapLocation when using MPS device (#2061)
Adds disablePerSessionThreads option to model loading for ONNXRuntime (#2104)
LightGBM inference result matches input type (#2129)
Apply no_optimizer_guard only for Android (#2153)
Update dependency version (#2176)
Reduce nested exception level (#2181)

Documentation and Examples

Updates Semantic Segmentation app (#263)
Adds the object detection app demo, use onnxruntime engine (#266)
Adds stable diffusion demo (#269)
Adds Spark extension example (#272)
Adds DJLServing Postman examples (#276)
Adds DJLServing Java client demo (#277)
Adds DJLServing Python client demo (#278)
Update README.md (#2010)
Update dependency docs for timeseries package (#2004)
Update javadoc links (#2017)
Use latest javadoc links (#2021)
Info added (#2022)
Add Mac M1 info in docs (#2040)
Some doc fixes (#2042)
Improve memory management and batchify docs (#2076)
Move serving docs to top level and reorganize (#2100)
Update docs top level memu (#2102)
Change timeseries dataset source example and add test (#2109)
Fix a document issue (#2114)
Update dependency document (#2134)
Upgrade pytorch 1.13.0 documents (#2158)
Add readme for TransferFreshFruit (#2160)
Made the sites copyright year dyanamic (#2188)

Breaking change

NDArray.toDebugString() signature has been changed (#2078)

Bug Fixes

Fixes flooded warning message in PyTorch (#2136)
Fixes youtube link in quick start (#2012)
Fixes dlr-native build script (#2015)
Adds missing condition for benchmark release (#2023)
Fixes folder not exist bug when use external pytorch native library (#2033)
Fixes bug in OD TF saved model example (#2050)
Fixes MXNet engine cu112 document (#2066)
Fixes CategoryMask (#2073)
Fixes ames and various tabular improvements (#2054)
Fixes randomColor (#2075)
Fixes remove warning spam during training (#2097)
Adds missing test requirements to opencv test (#2108)
Fixes android crash issue (#2113)
Avoid static setup/tearDown in testng (#2126)
Add missing dependency to timeseries (#2128)
Fixes mac os build failure (#2131)
Fixes char offset (#2137)
Adds repo_url in order to fix broken edit links (#2140) (#2141)
Fixes format bug and change the name of retrain (#2147)
Fixes ndarry operator warnings show up in Jupyter notebook (#2150)
Fixes HfModelZoo NPE bug with shadowjar (#2163)
Fixes Temporary File Information Disclosure Vulnerability (#2164)
Fixes movielens download issue (#2167)
Fixes Yolov3 javadoc warning (#2168)
CVE-2007-4559 Patch (#2189)

Contributors

@asbachb
@Carkham
@demq
@dependabot
@frankfliu
@JLLeitschuh
@KexinFeng
@lanking520
@patins1
@siddvenk
@tosterberg
@warthecatalyst
@wxm2018
@xyang16
@ylwu-amzn
@zachgk

New Contributors

@dependabot made their first contribution in (#2035)
@ylwu-amzn made their first contribution in (#2049)
@tosterberg made their first contribution in (#2097)
@asbachb made their first contribution in (#2128)
@JLLeitschuh made their first contribution in (#2164)

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.19.0...v0.20.0

djl - DJL v0.19.0 release

Published by xyang16 about 2 years ago

Key Features

Creates new LightGBM engine (#1895)
Upgrades and enhancements for Engines
- Upgrades PyTorch to 1.12.1 (#1894)
- Upgrades ONNXRuntime to 1.12.1 (#1879)
- Upgrades Apache MXNet to 1.9.1 (#1898)
- Publishes new xgboost-gpu package to maven (#1918)
- Adds ARM support for ONNXRuntime (#1856)
- Disable autograd by default when PyTorch engine start (#1872)
Introduces several enhancement for HuggingFace tokenizer
- Introduces HuggingFace model zoo (#1984)
- Adds a few built-in Translators for HuggingFace NLP models
- Adds macOS M1 support for HuggingFace tokenizer
- Adds ARM support for HuggingFace tokenizer
- Adds centos 7 support for HuggingFace tokenizer (#1874)
- Adds decode API for HuggingFace tokenizer (#1843)
- Adds padding and truncation support for HuggingFace tokenizer (#1870)
- Support stride in tokenizers (#2006)
Introduces time series extension (#1903)
Adds new Audio API and improves audio extension (#1974)
Adds Android support for ONNXRuntime (#1844)
JDK18 support (#1892)
Adds python script to import HuggingFace model into DJL model zoo (#1835)
DJLServing
- Adds management console plugin, which allows user manage models with web UI (#205)
- Adds KServe plugin (#177)
- Publishes DeepSpeed docker image to dockerhub (#223)

Enhancement

Adds a few more built-in Translators:
- Adds HuggingFace QuestionAnsweringTranslator (#1828)
- Adds HuggingFace FillMaskTranslator (#1876)
- Adds HuggingFace TokenClassificationTranslator (#1906)
- Adds HuggingFace TextClassificationTranslator (#1983)
- Adds HuggingFace TextEmbeddingTranslator (#1953)
- Adds speech recognition translator (#1899)
Adds new models to DJL model zoo:
- Adds PyTorch deeplabvs model into DJL model zoo (#1818)
- Adds MobileNetV1 into model zoo (#1817)
Image handling enhancement:
- Improves ImageFactory to allow convert float32 NDArray to Image. (#1814)
- Handle both HWC and CHW image (#1833)
DJL API improvements:
- Adds NDArray normalize() operator (#1924)
- Adds DeferredTranslatorFactory to let serving.properties take effect (#1868)
- Makes PtBertQATranslator compatible with huggingface model (#1827)
- Improves debug log for model loading options. (#1825)
- Allows to load block only model for PyTorch (#1831)
- Adds IdentityBlockFactory for demo/test purpose (#1854)
- Support queryString for JarRepository (#1842)
- Set arguments in serving.properties as model properties (#1853)
- Allow overriding special token flags in encode and decode methods (#1855)
- Adds support for intermediate sequential block results (#1943)
- Adds load SentencePiece model from InputStream (#1949)
- Allows use cached PyTorch native libraries in offline mode by caching "files.txt". (#1982)
- Makes Encoding class constructor protected. (#1945)
- Adds string tensor support for PyTorch (#1968)
- Adds Loss function: Coverage.java (#1653)
- Adds Loss function: QuantileLoss.java (#1652)
- Validate data type for NDArray.set(Buffer) API (#1975)
- Adds offline mode to to ensure not download engine files from network (#1987)
- Adds encodeDual support for HuggingFace tokenizer (#1826)
- Bulk batch creation and array indexing on mxnet engine (#1869)
- Adds NDArray gammaln and sample distribution function support. (#1990)
- Padding when the size of input is 2 in LSTM (#2000)
- Creates a SystemNDManager interface (#1888)
Adds python script to import huggingface model into DJL model zoo
- Added fill-mask support for converting huggingface model to model zoo (#1849)
- Adds support for converting huggingface token-classification models (#1902)
- Adds support for converting huggingface sentence-similarity models (#1913)
- Adds support for converting huggingface text-classification models (#1972)

Documentation and Examples

Adds Neural machine translation example (#1851)
Adds New Bert example using Goemotions (#1682)
Adds Semantic Segmentation example (#1808)
Adds tokenizer readme for usage (#1981)
Updates troubleshooting.md to remove -native-auto package (#1793)
Document PYTORCH_PRECXX11 usage in README (#1807)
Immutable array output from InferenceMode PyTorch (#1822)
Fixes NDIndex javadoc issue (#1841)
Updates pose estimation example to detect joints for all people (#2002)
Adds Semantic segmentation and Speech recognition to README (#2003)
Updates links in README (#2005)
Adds an example of time series model inference (#1971)

Breaking change

NDManager.vaildateBufferSize() has been renamed to NDManager.validateBuffer()
Remove unnecessary DeviceType interface (#1978)

Bug Fixes

Adds missing text_embedding application in Application.of() (#1917)
Fixes capped manager bug for TensorFlow (#1952)
Fixes NDArray.set() bug (#1789)
Fixes breaking behavior for NDIndex in 0.18.0 (#1801)
Backward compatible with Apache MXNet indexing. (#1802)
Fixes OrtNDArray double close issue (#1809)
Fixes ImageFactory.fromNDArray() bug (#1824)
Fixes NDArrayAdapter toDevice() and toType() behavior (#1839)
Fixes the parsing issue (#1857)
Fixes OrtNDArray double free issue (#1861)
Fixes memory leak when using NDManager.newBaseManager() (#1887)
Fixes PyTorch download library fallback to CPU bug (#1951)
Fixes bug in Criteria (#1964)
Fixes issue in TrainMnistWithLSTM (#1965)
Fixes closing error in Apache MXNet when indexing results in an empty array (#1966)
Fixes path parsing bug on Windows (#1985)
Fixes memory leak in Apache MXNet layerNorm() (#1993)
Fix DynamicBuffer position error when expand for the first time (#2007)
Fix some bugs in pytorch based examples (#2009)

Contributors

@925781609
@bryanktliu
@Carkham
@demq
@frankfliu
@gforman44
@JohnDoll2023
@KexinFeng
@Konata-CG
@lanking520
@oyy2000
@patins1
@pdradx
@siddvenk
@takanori-ugai
@warthecatalyst
@wxm2018
@xyang16
@zachgk

New Contributors

@bryanktliu made their first contribution in (#1828)
@xyang16 made their first contribution in (#1852)
@wxm2018 made their first contribution in (#1844)
@925781609 made their first contribution in (#1887)
@demq made their first contribution in (#1945)
@Carkham made their first contribution in (#1903)
@gforman44 made their first contribution in (#1653)
@takanori-ugai made their first contribution in (#2000)

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.18.0...v0.19.0

djl - DJL v0.18.0 release

Published by siddvenk over 2 years ago

Key Features

Adds macOS M1 chip support for PyTorch https://github.com/deepjavalibrary/djl/pull/1656, https://github.com/deepjavalibrary/djl/pull/1696
JDK 17 support https://github.com/deepjavalibrary/djl/pull/1672
Full support of PyTorch Get Indexing for NDArrays https://github.com/deepjavalibrary/djl/pull/1719
Full support of PyTorch Set Indexing for NDArrays https://github.com/deepjavalibrary/djl/pull/1755
Updates Dataset documentation https://github.com/deepjavalibrary/djl/pull/1686
Moves djl-bench to DJL Serving https://github.com/deepjavalibrary/djl/pull/1743
Engines and Extensions
- TensorFlow 2.7.0 https://github.com/deepjavalibrary/djl/pull/1674
- New djl-audio extension https://github.com/deepjavalibrary/djl/pull/1681
- Adds GPU support for XGBoost https://github.com/deepjavalibrary/djl/pull/1680
- tokenizers 0.12.0 https://github.com/deepjavalibrary/djl/pull/1739
- sentencepiece 0.1.96 https://github.com/deepjavalibrary/djl/pull/1745
- TensorRT 8.4.1 https://github.com/deepjavalibrary/djl/pull/1758
Newly Added Datasets
- Goemotions dataset https://github.com/deepjavalibrary/djl/pull/1598
- Daily Delhi Climate Dataset https://github.com/deepjavalibrary/djl/pull/1667
- Tablesaw Dataset https://github.com/deepjavalibrary/djl/pull/1679
- Universal Dependencies Corpus for English https://github.com/deepjavalibrary/djl/pull/1595
- Movielens 100k dataset https://github.com/deepjavalibrary/djl/pull/1718

Enhancement

Increases build version to 0.18.0 https://github.com/deepjavalibrary/djl/pull/1645
Support of take from pytorch https://github.com/deepjavalibrary/djl/pull/1627
Upgrades JNA to 5.11.0 https://github.com/deepjavalibrary/djl/pull/1655
Improves ServingTranslator output handling https://github.com/deepjavalibrary/djl/pull/1654
Adds width/height conversion to ObjectDetection https://github.com/deepjavalibrary/djl/pull/1651
Add openCV find rectangle method to improve PaddleORC performance https://github.com/deepjavalibrary/djl/pull/1662
Removes unnecessary logics in Paddle https://github.com/deepjavalibrary/djl/pull/1676
Adds Cyclical Tracker https://github.com/deepjavalibrary/djl/pull/1671
Adds support of take on MXNet engine https://github.com/deepjavalibrary/djl/pull/1649
Implements GhostBatchNorm https://github.com/deepjavalibrary/djl/pull/1666
Allows indexer to attach specific manager https://github.com/deepjavalibrary/djl/pull/1688
Upgrades android module to use DJL 0.18.0 https://github.com/deepjavalibrary/djl/pull/1693
Uses pytorch to test API and aws-ai module https://github.com/deepjavalibrary/djl/pull/1695
Avoid download cudf dependency for XGBoost at build time https://github.com/deepjavalibrary/djl/pull/1694
Bumps up versions https://github.com/deepjavalibrary/djl/pull/1691
Refactors ServingTranslatorFactory https://github.com/deepjavalibrary/djl/pull/1702
Adds "capped" state to NDManager https://github.com/deepjavalibrary/djl/pull/1683
Upgrades NDK version to 21.1.6352462 https://github.com/deepjavalibrary/djl/pull/1707
Adds LinearCollection block https://github.com/deepjavalibrary/djl/pull/1658
Adds android test code https://github.com/deepjavalibrary/djl/pull/1714
Changes DJL repo names from aws-samples https://github.com/deepjavalibrary/djl/pull/1716
Adds serving deb file publish for CI https://github.com/deepjavalibrary/djl/pull/1721
Upgrades codeql github action to v2 https://github.com/deepjavalibrary/djl/pull/1730
Fixes publish serving deb https://github.com/deepjavalibrary/djl/pull/1725
Adds ai.djl.audio and ai.djl.tablesaw to BOM https://github.com/deepjavalibrary/djl/pull/1728
Upgrades java formatter to 1.15.0 https://github.com/deepjavalibrary/djl/pull/1727
Adds name to LambdaBlock https://github.com/deepjavalibrary/djl/pull/1726
Adds disable static option in MXNet to allow some model running https://github.com/deepjavalibrary/djl/pull/1735
Improves Criteria.toBuilder() api https://github.com/deepjavalibrary/djl/pull/1741
Fixes serving publish github actions https://github.com/deepjavalibrary/djl/pull/1742
Enables better textual description of neural net https://github.com/deepjavalibrary/djl/pull/1720
Ignores hidden files for nested model directory https://github.com/deepjavalibrary/djl/pull/1754
Creates action to auto-close issues without response https://github.com/deepjavalibrary/djl/pull/1751
Builds jni for aarch64 https://github.com/deepjavalibrary/djl/pull/1756
Removes unnecessary packages from tensorrt dockerfile https://github.com/deepjavalibrary/djl/pull/1760
Adds log for custom Translator loading https://github.com/deepjavalibrary/djl/pull/1761
Stores indices with batch https://github.com/deepjavalibrary/djl/pull/1750
Adds put feature with linear indexing on PyTorch engine https://github.com/deepjavalibrary/djl/pull/1749
Adds NDList to IValue unit test https://github.com/deepjavalibrary/djl/pull/1762
Makes tensorflow NDArray always dense https://github.com/deepjavalibrary/djl/pull/1763
JDK version updated https://github.com/deepjavalibrary/djl/pull/1767
Adds IValue Dict(str, IValue) support https://github.com/deepjavalibrary/djl/pull/1765
Creates tabular dataset https://github.com/deepjavalibrary/djl/pull/1699
Creates PreparedFeaturizer https://github.com/deepjavalibrary/djl/pull/1700
Normalizes Numeric Featurizer https://github.com/deepjavalibrary/djl/pull/1701
Adds support for registerCustomOpLibrary for ONNXRuntime. https://github.com/deepjavalibrary/djl/pull/1771
Implements inverse operation https://github.com/deepjavalibrary/djl/pull/1768
Supports Image output for ImageServingTranslator https://github.com/deepjavalibrary/djl/pull/1772
Allows user specify model name in serving.properties file https://github.com/deepjavalibrary/djl/pull/1780
Adds model zoo implementation https://github.com/deepjavalibrary/djl/pull/1781
Change the sagemaker model to s3 https://github.com/deepjavalibrary/djl/pull/1769
Improvements to image coloring https://github.com/deepjavalibrary/djl/pull/1784
Updates bert classification notebook to reflect changes in CSVDataset https://github.com/deepjavalibrary/djl/pull/1786
Paddle model zoo should not have compile time dependency on opencv https://github.com/deepjavalibrary/djl/pull/1785

Documentation and Examples

Updates README for 0.17.0 Release https://github.com/deepjavalibrary/djl/pull/1646
Increases DJL Version for main branchhttps://github.com/deepjavalibrary/djl/pull/1644
Fixes broken and redirected links https://github.com/deepjavalibrary/djl/pull/1647
Clarifies typo in example documentation https://github.com/deepjavalibrary/djl/pull/1685
Fixes javadoc error in JDK 1.8 https://github.com/deepjavalibrary/djl/pull/1698
Update description for latest javadoc location https://github.com/deepjavalibrary/djl/pull/1708
Creates README for DJL Android PyTorch 1.11 builds https://github.com/deepjavalibrary/djl/pull/1704
Adds serving to docs site https://github.com/deepjavalibrary/djl/pull/1715
Fixes broken javadoc links in jupyter notebooks https://github.com/deepjavalibrary/djl/pull/1722
Readme updates for PyTorch 1.11 https://github.com/deepjavalibrary/djl/pull/1709
Updates CVSDataset example README file https://github.com/deepjavalibrary/djl/pull/1729
Updates document to use MXNet 1.9.0 https://github.com/deepjavalibrary/djl/pull/1737
Adds documentation on loading TF extension libraries for running certa… https://github.com/deepjavalibrary/djl/pull/1776
Adds semantic segmentation example https://github.com/deepjavalibrary/djl/pull/1764

Breaking Changes

The following changes to api.djl.basicdataset.tabular may cause backwards incompatibility:

Features and Featurizers have been refactored out of the CSVDataset class. The are now present in ai.djl.basicdataset.tabular.utils
CSVDataset now extends a new abstract class, TabularDataset
api.djl.basicdataset.utils.DynamicBuffer implementation has moved to api.djl.basicdataset.tabular.utils.DynamicBuffer

Bug Fixes

[TensorFlow] fix GPU memory leak https://github.com/deepjavalibrary/djl/pull/1648
[tensorrt] Fixes native library path https://github.com/deepjavalibrary/djl/pull/1650
Fixes bug in NDArray.oneHot() API https://github.com/deepjavalibrary/djl/pull/1661
Fix errors in "getIoU" function https://github.com/deepjavalibrary/djl/pull/1687
Follow symlinks when loading models. https://github.com/deepjavalibrary/djl/pull/1692
[pytorch] Fixes model loading bug for 1.11.0 https://github.com/deepjavalibrary/djl/pull/1705
Ensure PreparedOneHotStringFeaturizer encodes categorical mappings co… https://github.com/deepjavalibrary/djl/pull/1723
[tensorflow] Avoid NPE in TfEngine https://github.com/deepjavalibrary/djl/pull/1734
[m1] Fix test failure on macOS M1 machine by @frankfliu in https://github.com/deepjavalibrary/djl/pull/1777

Contributors

@dandansamax
@DiaaAj
@frankfliu
@WHALEEYE
@patins1
@JohnDoll2023
@KexinFeng
@Konata-CG
@pdradx
@lanking520
@siddvenk
@LanAtGitHub
@warthecatalyst
@zachgk
@freemanliu
@liumingxiy

New Contributors

@Konata-CG made their first contribution in https://github.com/deepjavalibrary/djl/pull/1598
@pdradx made their first contribution in https://github.com/deepjavalibrary/djl/pull/1661
@JohnDoll2023 made their first contribution in https://github.com/deepjavalibrary/djl/pull/1685
@liumingxiy made their first contribution in https://github.com/deepjavalibrary/djl/pull/1687
@LanAtGitHub made their first contribution in https://github.com/deepjavalibrary/djl/pull/1679
@freemanliu made their first contribution in https://github.com/deepjavalibrary/djl/pull/1692
@DiaaAj made their first contribution in https://github.com/deepjavalibrary/djl/pull/1666
@warthecatalyst made their first contribution in https://github.com/deepjavalibrary/djl/pull/1767
@siddvenk made their first contribution in https://github.com/deepjavalibrary/djl/pull/1718

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.17.0...v0.18.0

djl - DJL v0.17.0 release

Published by zachgk over 2 years ago

Key Features

Adds linux AArch64 support for PyTorch
Upgrades Engine Releases
- XGBoost version 1.6.0 (#1624)
- PaddlePaddle version 2.2.2 (#1601)
- ONNXRuntime version 1.11.0 (#1602)
- PyTorch version 1.11.0 (#1583)
- Apache MXNet version 1.9.0 (#1429)
Newly Added Datasets
- PennTreebank dataset (#1580)
- WikiText-2 dataset (#1545)
Support freezing parameters for transfer learning (#1544)

Enhancement

Upgrade djl-bench to 0.15.0 (#1476)
Add SessionOptions support for OnnxRuntime (#1479)
Device Name parsing (#1490)
arena allocator setter (#1510)
Upgrade native build mxnet version (#1517)
Refactor ml extension support (#1521)
Parse default device with Device.fromName (#1529)
Upgrade dependency versions (#1533)
release Update document version to 0.16.0 (#1536)
Bump up version to 0.17.0 (#1537)
Building DJL For aarch64 (#1526)
api Make resource loading compatible with java9 module (#1541)
pytorch Allows multiple native jars to package into fat jar (#1543)
Add document with installation instructions (#1548)
Separate AbstractSymbolBlock from AbstractBlock (#1555)
Add better error message for libstdc++ tf errors (#1570)
basicdataset Add Stanford Question Answering Dataset (#1554)
xgb Set default missing value to NaN (#1571)
allow empty batch data (#1569)
pytorch Allows load libtroch from pip installation package (#1577)
HF Tokenizer: get charspans (#1584)
benchmark Allows benchmark run on aarch64 for PyTorch (#1591)
add pytorch cuDNN acceleration (#1592)
add troubleshoot issue (#1600)
add paddle CU110 (#1604)
Update badge (#1610)
ONNXRuntime add tensorRT option (#1611)
api Refactor metrics API (#1613)
api Fixes metric dimension (#1614)
bom Update bom dependency (#1615)
pytorch Use precxx11 build for aarch64 native library (#1619)
pytorch use precxx11 for aarch64 (#1620)
Update inference_performance_optimization.md (#1621)
Fix testFreezeParameters for multi-gpu (#1623)
Support gather of pytorch (#1622)

Documentation and Examples

Update README for 0.15.0 release (#1477)
Update README to use 0.16.0-SNAPSHOT version (#1486)
typo fix (#1519)
docs Fixes dataset document. (#1523)
jupyter remove unecessary maven import (#1540)
examples Update maven pom file dependency version (#1546)
Add release note to README (#1565)
update pytorch build instruction on android (#1630)
bump up versioning (#1633)
update android package versions (#1635)
docs Update pytorch and paddle version (#1634)

Breaking Changes

Custom symbol blocks should extend AbstractSymbolBlock instead of AbstractBlock

Bug Fixes

Fix flaky test for tensorflow 2.7.0 on GPU (#1475)
Fixes topK items for DetectedObjects and make it configurable to Classifications (#1478)
Fix load native library failure on Android (#1485)
Adding huggingface tokenizer extension to BOM (#1487)
Fixes #1149, fix NPE bug (#1492)
Fix djl site vue version (#1495)
Fixes MLP example code in README (#1497)
Fixes jni dependency in README document (#1513)
Fixes memory leak in hybrid engine (#1518)
Fix the version issue reported from get-pip.py (#1530)
Fix FastText JNI build (#1531)
api Fixes loading BlockFactory bug (#1547)
tensorflow Fixes tensorflow session always on gpu(0) bug (#1558)
xgb Fixes missing anonymous classes (#1572)
examples Fix ImageClassification invalid probability (#1575)
fix naming (#1581)
ONNXRuntime fix naming (#1608)
basicdataset Fixed out of bound limit (#1599)
api Avoid NPE in Metric.toString() (#1626)
integration Enable gather unit test for windows (#1638)
Add rpath fix to Native Publish PyTorch (#1639)
api Fixes JDK 18 compiler warnings (#1640)

Contributors

@AKAGIwyf
@andreabrduque
@dandansamax
@frankfliu
@hd1080p
@KexinFeng
@lanking520
@patins1
@sindhuvahinis
@WHALEEYE
@zachgk

New Contributors

@sindhuvahinis made their first contribution in https://github.com/deepjavalibrary/djl/pull/1519
@KexinFeng made their first contribution in https://github.com/deepjavalibrary/djl/pull/1530
@hd1080p made their first contribution in https://github.com/deepjavalibrary/djl/pull/1526
@dandansamax made their first contribution in https://github.com/deepjavalibrary/djl/pull/1545
@WHALEEYE made their first contribution in https://github.com/deepjavalibrary/djl/pull/1554
@patins1 made their first contribution in https://github.com/deepjavalibrary/djl/pull/1569
@AKAGIwyf made their first contribution in https://github.com/deepjavalibrary/djl/pull/1580

Full Changelog: https://github.com/deepjavalibrary/djl/compare/750c153a7...v0.17.0

djl - DJL v0.16.0 release

Published by frankfliu over 2 years ago

Key Features

Upgrades Apache MXNet engine to 1.9.0 with CUDA 11.2 support
Improves ONNXRuntime engine memory configurations
Improves fastText engine’s API
Fixes several critical bugs

Enhancement

Upgrades Apache MXNet engine to 1.9.0 with CUDA 11.2 support (#1517)
Improves ONNXRuntime engine:
- Adds arena allocator support to SessionOptions (#1510)
- Adds SessionOptions support for OnnxRuntime (#1479)
Introduces several API improvements:
- Parse default device with Device.fromName (#1529)
- Refactor ml extension support to be consistent with DJL api (#1521)
- Device Name parsing (#1490)
- Make DetectedObjects topK configurable (#1478)
Uses JDK 11 for github actions in CI build (#1489)
Adds huggingface tokenizer extension to BOM (#1487)
Removes unnecessary github actions workflow (#1484)
Upgrades DJL android to 0.15.0 (#1483)

Documentation and examples

Fixes outdated dataset document (#1523)
Fixes repository README typo (#1519)
Fixes PyTorch JNI dependency in README document (#1513)
Fixes MLP example code in README (#1497)
Fixes djl.ai website vue version (#1495)
Updates inferentia demo to DJL 0.15.0 (#210)
Updates android README to use 0.15.0 (#1486)
Publishes D2L Chinese book with latest chapters https://d2l-zh.djl.ai

Breaking change

fastText specific inference and training APIs are removed, use standard DJL API instead

Bug Fixes

Fixes memory leak in hybrid engine (#1518)
Fixes fastText JNI build (#1531)
Fixes the python version bug in the benchmark workflow files (#1530)
Fixes sentencepiece NPE bug (#1492)
Fixes PyTorch load native library failure on Android (#1485)
Fixes topK items for DetectedObjects (#1478)

Contributors

This release is thanks to the following contributors:

Andréa Duque (@andreabrduque)
Dennis Kieselhorst (@deki)
Frank Liu (@frankfliu)
Jake Lee (@stu1130)
Kexin Feng (@KexinFeng)
Qing Lan (@lanking520)
Sindhu Somasundaram (@sindhuvahinis)
Zach Kimberg (@zachgk)

New Contributors

Andréa Duque made their first contribution in https://github.com/deepjavalibrary/djl/pull/1510

Full Changelog: https://github.com/deepjavalibrary/djl/compare/v0.15.0..v0.16.0

djl - DJL v0.15.0 release

Published by frankfliu over 2 years ago

DJL v0.15.0 updates the engines PyTorch to 1.10.0, ONNXRuntime to 1.10.0, TensorFlow to 2.7.0, TensorFlowLite to 2.6.2 and introduces several new features:

Key Features

Introduces Huggingface tokenizers extension which allows user to leverage high performance fast tokenizer in Java
Upgrades PyTorch engine to 1.10.0 with CUDA 11.3 support
Upgrades TensorFlow to 2.7.0 with CUDA 11.3 support
Upgrades ONNXRuntime engine to 1.10.0
Upgrades TensorFlowLite to 2.6.2
Provides better PyTorch engine backward compatibility support
Adds load model from InputStream support
Adds Windows support for SentencePiece
Removes -auto packages to simplify DJL dependencies
Fixes log4j CVEs

Enhancement

Improves PyTorch engine:
- Adds support to load custom build PyTorch native library from specified location
- Adds support to use precxx11 version of PyTorch on GPU
- Provides offline native package for older version of PyTorch to run with latest DJL (#1385)
- Report correct engine version for custom build PyTorch native library
- Adds Tuple support to IValue (#1436)
Adds support to load model from InputStream for some of the engines (#1400):
- Adds load from InputStream for PyTorch model
- Adds load from InputStream for TensorFlowLite model (#1402)
- Adds load from InputStream for ONNXRuntime model (#1402)
- Adds load from InputStream for SentencePiece (#1139)
Introduces several new features in djl-serving:
- Automatically detect model’s engine
- Reduces netty dependencies to minimize package size
- Installs engine dependency on-demand for non-commonly used engines
- Adds support for nested folder in model archive file
- Released djl-serving to homebrew
Introduces server new features in djl-bench
- Release djl-bench to homebrew and snapcraft
Improve opencv extension to expose API from opencv to end users
Introduces several API improvements:
- Improves TrainingResult with toString() print out (#1369)
- Allows to register engine at runtime (#1386)
- Allows to dynamic add model zoo (#1397)
- Creates IndexEvaluator and IndexLoss (#1414)
- Introduces Huggingface Tokenizer (#1406)
Publishes several new docker images to dockerhub (https://hub.docker.com/u/deepjavalibrary)
- djl-serving docker image
- djl-serving for inferentia docker image
- DJL windows docker image with jdk11

Documentation and examples

Updates inferentia demo to support neuron runtime 2.x
Updates jupyter notebooks with NoBatchifyTranslator to simplify example code (#1370)
Updates benchmark README (#1410)
Fixes rank classification jupyter notebook (#1368)
Updates packaging model document (#1471)
Fixes d2l book (https://d2l.djl.ai/) image display issue (#183)
Translates d2l Chinese book (https://d2l-zh.djl.ai/) chapter 9 to chapter 14 to Chinese

Breaking change

Bug Fixes

Fixes several ci issues on Github Actions (#1358, #1362, #1363, #1364)
Fixes crash when IValue list input is empty (#1440)
Fixes NullPointException bug in getContextClassLoader() (#1445)
Fixes model name detection issue in loading model from jar (#1446)
Fixes unset numEmbeddings in TrainableWordEmbedding.java (#1450)
Fixes protobuf-java CVEs

Contributors

This release is thanks to the following contributors:

Andrey Zakharov (@zakhio)
daofaziran1 (@daofaziran1)
enpasos (@enpasos)
Frank Liu (@frankfliu)
Hampus Londögård (@Lundez)
Jake Lee (@stu1130)
James Zow(@jzow)
Qing Lan (@lanking520)
Zach Kimberg (@zachgk)
Viet Yen Nguyen (@nguyenvietyen)

New Contributors

@Lundez made their first contribution in https://github.com/deepjavalibrary/djl/pull/1412
@zakhio made their first contribution in https://github.com/deepjavalibrary/djl/pull/1436
@daofaziran1 made their first contribution in https://github.com/deepjavalibrary/djl/pull/1450

Full Changelog

https://github.com/deepjavalibrary/djl/compare/0855108d0...v0.15.0

djl - DJL v0.14.0 release

Published by frankfliu almost 3 years ago

DJL v0.14.0 updates the engines PyTorch to 1.9.1 and introduces several new features:

Key Features

Upgrades PyTorch engine to 1.9.1
Adds support for Neuron SDK 1.16.1
Adds autoscale in djl-serving for AWS Inferentia model
Introduces OpenCV extension to provide high performance image processing
Adds support for older version of PyTorch engine, user now can use PyTorch 1.8.1 with latest DJL
Adds support for precxx11 PyTorch native library in auto detection mode
Adds AWS Inferentia support in djl-bench
Adds support for TorchServe .mar format, user can deploy TorchServe model archive in djl-serving

Enhancement

Introduces several new features in djl-serving:
- Adds autoscale feature for AWS Inferentia (#31)
- Creates SageMaker hosting compatible docker image for AWS Inferentia (#36)
- Adds auto detect number of neuron cores feature for AWS Inferentia (#34)
- Adds autoscale support for SageMaker style .tar.gz model (#35)
- Adds support to load torchserve model (#32)
- Adds support to pip installed dependency per model (#37)
- Adds custom environment variable support for python engine (#29)
- Adds nested folder support in model archive file (#38)
- Improves model status with model version support (#25)
- Adds model warn up feature for python engine. (#23)
- Adds WorkLoadManager.unregisterModel (#33)
- Adds testing tool to test python model locally (#22)
- Adds set python executable path for python engine (#21)
- Creates Workflow for ModelServing (#26)
Adds OpenCV extension (#1331)
Introduces several new features in djl-bench:
- Adds support for AWS Inferentia (#1329)
Introduces several new features in Apache MXNet engine:
- Implements LayerNorm for Apache MXNet (#1342)
Introduces several new features in PyTorch engine:
- Upgrades PyTorch to 1.9.1 (#1297)
- Implements padding to bert tokenizer (#1328)
- Makes pytorch-native-auto package optional (#1326)
- Adds support to use different version of PyTorch native library (#1323)
- Adds map_location support for load model from InputStream (#1314)
- Makes map_location optional (#1312)
Introduces several new features in TensorFlow Lite engine:
- Makes tensor-native-auto package optional (#1301)
Introduces several API improvements:
- Adds support for nested folder in model archive (#1349)
- Improves translator output error message (#1348)
- Improves Predictor API to support predict with device (#1346)
- Improves BufferedImageFactory.fromNDArray performance (#1339)
- Adds support for downloading .mar file (#1338)
- Adds debugging toString to Input and Output (#1327)
- Refactors BERT Translator and Tokenizer (#1318)
- Makes question answering model serving ready (#1311)
- Refactors minMaxWorkers from ModelInfo to WorkerPool (#30)

Documentation and examples

Adds huggingface Inferentia serving example (#184)
Adds AWS SageMaker hosting document
Adds python hybrid engine demo (#182)
Improves DJL examples project gradle build script (#1344)

Breaking change

PyTorch 1.9.1 no longer supports Amazon Linux 2, AL2 user has to use pytorch-native-cpu-precxx11
Image.Type is removed and Image.duplicate() function no longer take Image.Type as input
Image.getSubimage() is renamed to Image.getSubImage()
PaddlePaddle model loading may break due to prefix changes.

Bug Fixes

Fixes 2nd inference throw exception bug (#1351)
Fixes calculation for SigmoidBinaryCrossEntropyLoss from sigmoid (#1345)
Fixes jar model url download bug (#1336)
Fixes memory in Trainer.checkGradients (#1319)
Fixes NDManager is closed bug (#1308)
Fixes PyTorch GPU model loading issue (#1302)
Fixes MXNet EngineException message (#1300)
Fixes python resnet18 demo model GPU bug (#24)
Fixes python engine get_as_bytes() bug (#20)

Contributors

This release is thanks to the following contributors:

Frank Liu (@frankfliu)
Jake Lee (@stu1130)
enpasos (@enpasos)
Qing Lan (@lanking520)
Zach Kimberg (@zachgk)

djl - DJL v0.13.0 release

Published by frankfliu about 3 years ago

DJL v0.13.0 brings the new TensorRT and Python engines, and updates the engines PyTorch to 1.9.0, ONNXRuntime to 1.9.0, PaddlePaddle to 2.0.2, and introduces several new features:

Key Features

Introduces TensorRT engine
Introduces Python engine which allows you to run Python scripts with DJL
Upgrades PyTorch engine to 1.9.0 with CUDA 11.1 support
Upgrades ONNXRuntime engine to 1.9.0 with UINT8 support
Upgrades PaddlePaddle engine to 2.0.2
Introduces the djl-bench snap package: sudo snap install djlbench --classic
Introduces dynamic batch feature for djl-serving (#1154)
DJL serving becomes a standalone repository: https://github.com/deepjavalibrary/djl-serving (#1170)
Allows load ModelZoo model using url (#1120)
Support .npy and .npz file format (#1131)
djl-serving is available in dockerhub
Publishes d2l book Chinese translation preview (chapter 1-5) : https://d2l-zh.djl.ai

Enhancement

Introduces several new features in djl-serving:
- Improves djl-serving API to make it easy to get HTTP headers (#1134)
- Loads models on all GPUs at startup for djl-serving (#1132)
- Enables asynchronous logging for djl-serving
- Makes djl-serving access log in separate log file (#1150)
- Adds configuration to support number worker threads for GPU inference (#1153)
- Improves auto-scale algorithm for djl-serving (#1149)
Introduces several new features in djl-bench:
- Adds a command line option to djl-bench to generate NDList file (#1155)
- Adds warmup to benchmark (#1152)
- Improves djll-bench to support djl:// urls (#1146)
- Adds support to benchmark on multiple GPUs (#1144)
- Adds support to benchmark onnx on GPU machines (#1148)
- Adds support to benchmark TensorRT models (#1257)
- Adds support to benchmark Python models (#1267)
Introduces several new features in PyTorch engine:
- Supports PyTorch custom input data type with IValue (#1208)
Introduces several new features in OnnxRuntime:
- Adds UINT8 support for OnnxRuntime (#1271)
Introduces several new features in PaddlePaddle:
- Adds more model loading options for PaddlePaddle (#1173)
- Adds load functionalities to PaddlePaddle (#1140)
- Adds remove pass option to PaddlePaddle (#1141)
Introduces several API improvements:
- Adds missing NDList.get(String) API (#1194)
- Adds support to directly load models from a TFHub url (#1231)
- Improves repository API to support passing argument in the URL query string (#1139)
- Avoids loading the default engine if it is not being used (#1136)
- Improves IO by adding a buffer to read/write (#1135)
- Improves NDArray.toString() debug mode performance (#1142)
- Makes GPU device detection engine specific to avoid confusion when using multiple engines (#1138)

Documentation and examples

Adds Style Transfer example with CycleGAN (#1180)

Breaking change

Removes support for Apache MXNet 1.6.0
Deprecates Device.getDevices() API - Use Engine.getDevices() instead
Renames SimpleVocabulary to DefaultVocabulary

Bug Fixes

Fixes broken link in documents
Fixes TensorFlow NDArray was created on CPU instead of GPU bug (#1279)
Fixes default image processing pipeline (#1268)
Fixed XGBoost NDArray multiple read bug (#1239)
Fixes platform matching bug (#1167)
Fixes NullPointerException in NDArray.toString() (#1157)
Fixes PaddlePaddle crash due to GC (#1162)
Fixes NDArrayAdapter.getSparseFormat() unsupported bug (#1151)
Fixes mixed device issue in multiple engine use case (#1123)
Fixes handle duplicate plugin issue for djl-serving (#1108)
Fixes XGBoost NDArray creation bug (#1109)
Fixes runtime exception running benchmark arm machine(#1107)
Fixes unregister model regression (#1101)

Contributors

This release is thanks to the following contributors:

Akshay Rajvanshi (@aksrajvanshi)
Aziz Zayed (@AzizZayed)
Elchanan Haas (@ElchananHaas)
Erik Bamberg (@ebamberg)
Frank Liu (@frankfliu)
Jake Lee (@stu1130)
Kimi MA (@kimim)
Paul Greyson
Qing Lan (@lanking520)
Raymond Liu (@raymondkhliu)
Sindhu Somasundaram (@sindhuvahinis)
Zach Kimberg (@zachgk)

djl - DJL v0.12.0 release

Published by frankfliu over 3 years ago

DJL v0.12.0 added GPU support to PaddlePaddle and ONNXRuntime, and introduces several new features:

Key Features

Updates PaddlePaddle engine with GPU support.
Updates ONNXRuntime engine with GPU support.
Upgrades ONNXRuntime engine to 1.8.0.
Upgrades XGBoost engine to 1.4.1.
Introduces AWS Inferentia support, see our example for detail.
Adds FLOAT16 datatype support in NDArray.
Support UTF16 surrogate characters in NLP tokenization.
Makes benchmark as a standalone tool.
Releases djl-serving docker image to docker hub.

Enhancement

DJL Benchmark now can benchmark any datatype as input.
Makes Grayscale image processing match openCV’s behavior (#965)
Improves PyTorch engine to load extra shared library for custom operators (#983)
Improves djl-serving REST API to support load model on specified engine (#977)
Improves djl-serving to support load multiple version of a model on the same endpoint (#1052)
Improves djl-serving to support auto-scale workers based on traffic (#986)
Implements several operators:
- Adds the truncated normal operator (#1005)
- Adds the one hot operator for PyTorch (#1014)
- Adds the LayerNorm operator in PyTorch (#1069)
Introduces several API improvements
- Improves Criteria.loadModel() API (#1018)
- Refactors ModleLoader and TranslatorFactory (#712)
- Improves BlockFactory API (#1045)
- Makes SpProcessor public API (#1060)

Documentation and examples

Adds Low cost inference with AWS Inferentia demo.
Adds BigGAN demo in examples (#1038)
Adds Super-resolution demo in examples (#1049)

Breaking change

Direct access ModelZoo ModelLoader is no longer supported, use Criteria API instead.
Deprecates ModelZoo.loadModel() API in favor of using Criteria.loadModel().

Bug Fixes

Fixes missing softmax in action_recognition model zoo model (#969)
Fixes saveModel NPE bug (#989)
Fixes NPE bug in block.toString() function (#1076)
Adds back String tensor support to TensorFlow engine (lost in 0.11.0 during refactor) (#1040)
Sets ai.djl.pytorch.num_interop_threads default value for djl-serving (#1059)

Known issues

The TensorFlow engine has a known memory leak issue due to the JavaCPP dependency. The memory leak issue has been fixed in javacpp 1.5.6-SNAPSHOT. You have to manually include javacpp 1.5.6-SNAPSHOT to avoid the memory leak. See: https://github.com/deepjavalibrary/djl/tree/master/tensorflow/tensorflow-engine#installation for more details.

Contributors

This release is thanks to the following contributors:

Akshay Rajvanshi(@aksrajvanshi (https://github.com/ghost))
Aziz Zayed(@AzizZayed (https://github.com/AzizZayed))
Erik Bamberg(@ebamberg (https://github.com/ebamberg))
Frank Liu(@frankfliu (https://github.com/frankfliu))
Hodovo(@Hodovo (https://github.com/Hodovo))
Jake Lee(@stu1130 (https://github.com/stu1130))
Qing Lan(@lanking520 (https://github.com/lanking520))
Tibor Mezei (@zemei (https://github.com/zemei))
Zach Kimberg(@zachgk (https://github.com/zachgk))

djl - DJL v0.11.0 release note

Published by frankfliu over 3 years ago

DJL v0.11.0 brings the new engines XGBoost 1.3.1, updates PyTorch to 1.8.1, TensorFlow to 2.4.1, Apache MXNet 1.8.0, PaddlePaddle to 2.0.2 and introduces several new features:

Key Features

Supports XGBoost 1.3.1 engine inference: now you can run prediction using models trained in XGBoost.
Upgrades PyTorch to 1.8.1 with CUDA 11.1 support.
Upgrades TensorFlow to 2.4.1 with CUDA 11.0 support.
Upgrades Apache MXNet to 1.8.0 with CUDA 11.0 support.
Upgrades PaddlePaddle to 2.0.2.
Upgrades SentencePiece to 0.1.95.
Introduces the djl-serving brew package: now you can install djl-serving with brew install djl-serving.
Introduces the djl-serving plugins.
Introduces Amazon Elastic Inference support.

Enhancement

Improves TensorFlow performance by reducing GC and fixed memory leaking issue (#892)
djl-serving now can run all the engines out-of-box (#886)
Improves DJL training by using multi-threading on each GPU (#743)
Implements several operators:
- Adds boolean set method to NDArray (#784)
- Adds batch dot product operator (#849)
- Adds norm operator to PyTorch (#692)
- Adds one hot operator (#684)
- Adds weight decay to Loss (#788)
Adds setGraphExecutorOptimize option for PyTorch engine. (#904)
Introduces String tensor support for ONNXRuntime (#724)
Introduces several API improvements
- Creates ObjectDetectionDataset (#683)
- Improves Block usability (#712)
- Adds BlockFactory feature in model loading (#805)
- Allows PyTorch stream model loading (#729)
- Adds NDList decode from InputStream (#734)
- Adds SymbolBlock Serialization (#687)
Introduces model searching feature in djl central (#799)

Documentation and examples

Introduces DJL tutorials - How to load model on DJL Youtube Channel
Adds the PaddlePaddle load model documentation (#811)
Adds the documentations for profiler (#722)
Adds face detection and face recognition examples (#814)
Adds model training visualization demo using Vue

Breaking change

Renames CheckpointsTrainingListener to SaveModelTrainingListener (#686)
Removes erroneous random forest application (#726)
Deletes DataManager class (#691)
Classes under ai.djl.basicdataset packages has been moved into each sub-packages.

Bug Fixes

Fixes BufferOverflowException when handling handling subimage (#866)
Fixes ONNXRuntime 2nd engine dependency from IrisTranslator (#853)
Fixes sequenceMask error when n dimension is 2 (#828)
Fixes TCP port range buf in djl-serving (#773)
Fixes one array case for concat operator (#739)
Fixes non-zero operator for PyTorch (#704)

Known issues

TensorFlow engine has known memory leak issue due to JavaCPP dependency. The memory leak issue has been fixed in javacpp 1.5.6-SNAPSHOT. User has to manually include javacpp 1.5.6-SNAPSHOT to avoid memory leak. See: https://github.com/deepjavalibrary/djl/tree/master/tensorflow/tensorflow-engine#installation for more detail.