serving

A flexible, high-performance serving system for machine learning models

APACHE-2.0 License

Stars
6.2K
Committers
228

Bot releases are hidden (Show)

serving - 2.5.1

Published by netfs over 3 years ago

(Please note: Due to build issues, we had to skip 2.5.0 version number for this release)

TensorFlow Serving using TensorFlow 2.5.0

Major Features and Improvements

  • Upgrade to CUDA 11.2. (commit: 1975e3e5701d1cb266d356834ec50a326016983f)
  • Update TF Text to v2.4.3. (commit: ccfb606b2c86f4ffb57ecd8454c8809ab8c13090)
  • Experimental support for XLA/CPU models. (commit: 3c1b2b34ec516afbb73543cc62d3979f013a33f1)
  • Add metrics to the Prometheus API (commit: dfb41f191ad9b61e9a8ca2cda4fa381733b9179d)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)

Breaking Changes

  • No Breaking changes

Bug Fixes and Other Changes

  • Fix typo in REQUIRED_PACKAGES for grpcio (commit: b9ed0f80852d7a33b0745833ff9c0243178b0c5d)
  • update resnet_k8s.yaml file (commit: e7b7b337eb82bec1556b6ab47353138c5e1a7f8b)
  • Fix a compile warning thrown by gcc-9 (commit: 38a017db5c35f60a9641eef0d70f4e8eaecd5608)
  • Update json_tensor.cc (commit: a0a9d14322bf30299d0056182952c00d1961bf35)
  • Add TfLiteInterpreterPool to make concurrent use of TfliteSession better (commit: d9efa43a77e609260614d9633a3ee7c4cc3e2843)
  • Enable download of TF Serving sources at arbitrary commit for CPU docker image. (commit: de1ab9e0b5abf8ee7540cc711fda2be9fcbd1b8c)
  • Updated tests to newer API (commit: 30dd2fe67ad00b38324d01351ca52c571701f97f)
  • When GRPC messages come to TF Serving GRPC server, server will create a new threads to handle each message. (commit: ac0eb73e71e0658196f02aa6518e119351e14a95)
  • Add dedicated aliases field to ModelServerConfig. (commit: 358f7d1fe94dc2c904b1c4de68c7bbfe42088b2d)
  • Update docker command line to work with GPUs (Fixes #1768). (commit: b41a28bb2c815bd76010d3a39a3ea52b5da85f35)
  • option to disable grpc over http (commit: f08729066771c55a44941a7661ac4d0bc48b85dd)
  • use RecordRuntimeLatency (commit: d353931269d1a9b10381102a9f8fd80b27bcf28e)
  • Improve error message for file not exists. (commit: 78d47f7031d5a60b8cfff4fd0f0a2e1ec0586c01)
  • Transition TensorFlow Serving to TensorFlow's new WORKSPACE protocol. (commit: 50a7ef370139073640e62178ab95b5894f587667)
  • fix http_rest_api_handler_test (commit: 53da4f80a6d6a15cebfb9add0ef5e0e59abe3eb2)
  • add status label in request_latency (commit: 0483edb643fdfcda34ca5a2cd7923c783f0e48bb)
  • fix timing of request_latency (commit: e20c91efcc9221109aa19cff6cca4f2495e367f3)
  • fix http_rest_api_handler_test (commit: 6254411fa3183ef51b46216a54f71675afe5b735)
  • Clarifying object values in REST requests to include B64 encoding and similar key:value pair objects. (commit: 0536678a05073feaa3f2815cb9012abd7a86907a)
  • Register custom TfLite ParseExample and add benchmark (commit: 20fe3ca4f292b1ae575217ecb69981d59addc720)
  • Update docker.md (commit: 7ebcd15c3d432959cf44ba784c869433730f73f4)
  • Pre-allocate memory for certain vectors where the size is known. (commit: e208b6ed8a621e91684dfa233b0923ea367f6a43)
  • Updating serving_basic for adjusting serving_basic.md file and making it up2date with TF2.x - including: (commit: cea306a2f3857e5715233b9f45b49097e4d5b4c8)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)
  • Fixing MKL builds due to missing 'build_with_openmp' option (commit: 0ed23dfe3d02e8bfcf19907da4b3d764146eefc0)
  • Implement batch parallelism for tflite sessions (commit: fec1d5db3cec6c81635e6f77923f3112f09a0289)
  • Use LOG_EVERY_N_SEC insted of LOG with local static time variable. (commit: c6b3936accc6aa75cf174b1ce3dffc4ca588fbdc)
  • Fix TensorFlow Serving build with MKL+OpenMP (commit: ddad074e9f2347e8898811c97db1e454f00c9062)
  • add cors headers (commit: d65914b5212213ea26e61956ade0668978001fba)
  • Remove extra build options for TF Serving (commit: 63acc954d7082f515a0c599991bab2ad4e87d449)
  • Remove hashtable custom op dependencies (commit: bb51722b1dea8d139c46e2fe6fb52fa379da164c)
  • allow http OPTIONS request and add default OPTIONS request handling (commit: 6287cb42734438ae2430ee480bfa57b3d734efbc)
  • Enable aspired version which failed to load to attempt reload. (commit: 2530a336ccff3fa202fd24967562d1bbe0055a6c)
  • Fixed a compilation error in aspired_versions_manager.cc (commit: 4ca9a4b41402597fea4933893cac2639d23d7ded)
  • handle options requests and check if Origin header exists (commit: c51214134671d565f541fa5ef2a1a63226fb53e3)
  • Add "_r" root event annotation to ProcessBatch events. (commit: e5c3aec49d28b06f98967a609129862d81b61019)
  • Bump minimum bazel version 3.7.2. (commit: 5edcd13d323bb133a6505fd558422d008307ce72)
  • Update TensorFlow serving documentation with instructions to bind a host volume. The profiler in the docker will write to this volume, and on the host side TensorBoard will load the profile from the location. (commit: 7c21b223617d8293b45064b8d40c5c1297117677)
  • Fix package build due to config move in: (commit: 18dd7666bc7ed64c1254fb21895d1602f160b2f7)
  • Fix memory leak from allocating input tensors (commit: 2f9b6a0ea802ee726d1aba277650a8ddf3a411f9)
  • Allowing lossy floating point conversions for JSON inputs since JSON has only a single numeric type. This is inline with textproto conversion (and even C++ implicit double -> float conversion). (commit: 57dac6cdca7ffaed0e186ff370cf346db57d1cbc)
  • Adding enable_profiler command line flag. (commit: 7e8720dacc1e644d2858d4856fc56d6c01a81441)
  • Add logging in ServerCore. (commit: 623da67418df5be71494a4d82065351d6e4425c3)
  • Fix zlib. (commit: bc24390a45f5f9dbbf2fa1a05e39c9eb128187ee)
  • Fix broken GPU build by add TF cuda options: (commit: 05377a99a61217976cb4ddd2ea2c7f1ce71ae0eb)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Abhinav Pundir, Abolfazl Shahbazi, Aurélien Geron, Bairen Yi, gbaned, handong, Hao Ziyu, Junqin Zhang, kiddos, Oliver Sampson, OniB, Runzhen Wang, skawasak, zou000

serving - 2.5.0-rc3

Published by netfs over 3 years ago

TensorFlow Serving using TensorFlow 2.5.0-rc3

Major Features and Improvements

  • Upgrade to CUDA 11.2. (commit: 1975e3e5701d1cb266d356834ec50a326016983f)
  • Update TF Text to v2.4.3. (commit: ccfb606b2c86f4ffb57ecd8454c8809ab8c13090)
  • Experimental support for XLA/CPU models. (commit: 3c1b2b34ec516afbb73543cc62d3979f013a33f1)
  • Add metrics to the Prometheus API (commit: dfb41f191ad9b61e9a8ca2cda4fa381733b9179d)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)

Breaking Changes

  • No Breaking changes

Bug Fixes and Other Changes

  • Fix typo in REQUIRED_PACKAGES for grpcio (commit: b9ed0f80852d7a33b0745833ff9c0243178b0c5d)
  • update resnet_k8s.yaml file (commit: e7b7b337eb82bec1556b6ab47353138c5e1a7f8b)
  • Fix a compile warning thrown by gcc-9 (commit: 38a017db5c35f60a9641eef0d70f4e8eaecd5608)
  • Update json_tensor.cc (commit: a0a9d14322bf30299d0056182952c00d1961bf35)
  • Add TfLiteInterpreterPool to make concurrent use of TfliteSession better (commit: d9efa43a77e609260614d9633a3ee7c4cc3e2843)
  • Enable download of TF Serving sources at arbitrary commit for CPU docker image. (commit: de1ab9e0b5abf8ee7540cc711fda2be9fcbd1b8c)
  • Updated tests to newer API (commit: 30dd2fe67ad00b38324d01351ca52c571701f97f)
  • When GRPC messages come to TF Serving GRPC server, server will create a new threads to handle each message. (commit: ac0eb73e71e0658196f02aa6518e119351e14a95)
  • Add dedicated aliases field to ModelServerConfig. (commit: 358f7d1fe94dc2c904b1c4de68c7bbfe42088b2d)
  • Update docker command line to work with GPUs (Fixes #1768). (commit: b41a28bb2c815bd76010d3a39a3ea52b5da85f35)
  • option to disable grpc over http (commit: f08729066771c55a44941a7661ac4d0bc48b85dd)
  • use RecordRuntimeLatency (commit: d353931269d1a9b10381102a9f8fd80b27bcf28e)
  • Improve error message for file not exists. (commit: 78d47f7031d5a60b8cfff4fd0f0a2e1ec0586c01)
  • Transition TensorFlow Serving to TensorFlow's new WORKSPACE protocol. (commit: 50a7ef370139073640e62178ab95b5894f587667)
  • fix http_rest_api_handler_test (commit: 53da4f80a6d6a15cebfb9add0ef5e0e59abe3eb2)
  • add status label in request_latency (commit: 0483edb643fdfcda34ca5a2cd7923c783f0e48bb)
  • fix timing of request_latency (commit: e20c91efcc9221109aa19cff6cca4f2495e367f3)
  • fix http_rest_api_handler_test (commit: 6254411fa3183ef51b46216a54f71675afe5b735)
  • Clarifying object values in REST requests to include B64 encoding and similar key:value pair objects. (commit: 0536678a05073feaa3f2815cb9012abd7a86907a)
  • Register custom TfLite ParseExample and add benchmark (commit: 20fe3ca4f292b1ae575217ecb69981d59addc720)
  • Update docker.md (commit: 7ebcd15c3d432959cf44ba784c869433730f73f4)
  • Pre-allocate memory for certain vectors where the size is known. (commit: e208b6ed8a621e91684dfa233b0923ea367f6a43)
  • Updating serving_basic for adjusting serving_basic.md file and making it up2date with TF2.x - including: (commit: cea306a2f3857e5715233b9f45b49097e4d5b4c8)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)
  • Fixing MKL builds due to missing 'build_with_openmp' option (commit: 0ed23dfe3d02e8bfcf19907da4b3d764146eefc0)
  • Implement batch parallelism for tflite sessions (commit: fec1d5db3cec6c81635e6f77923f3112f09a0289)
  • Use LOG_EVERY_N_SEC insted of LOG with local static time variable. (commit: c6b3936accc6aa75cf174b1ce3dffc4ca588fbdc)
  • Fix TensorFlow Serving build with MKL+OpenMP (commit: ddad074e9f2347e8898811c97db1e454f00c9062)
  • add cors headers (commit: d65914b5212213ea26e61956ade0668978001fba)
  • Remove extra build options for TF Serving (commit: 63acc954d7082f515a0c599991bab2ad4e87d449)
  • Remove hashtable custom op dependencies (commit: bb51722b1dea8d139c46e2fe6fb52fa379da164c)
  • allow http OPTIONS request and add default OPTIONS request handling (commit: 6287cb42734438ae2430ee480bfa57b3d734efbc)
  • Enable aspired version which failed to load to attempt reload. (commit: 2530a336ccff3fa202fd24967562d1bbe0055a6c)
  • Fixed a compilation error in aspired_versions_manager.cc (commit: 4ca9a4b41402597fea4933893cac2639d23d7ded)
  • handle options requests and check if Origin header exists (commit: c51214134671d565f541fa5ef2a1a63226fb53e3)
  • Add "_r" root event annotation to ProcessBatch events. (commit: e5c3aec49d28b06f98967a609129862d81b61019)
  • Bump minimum bazel version 3.7.2. (commit: 5edcd13d323bb133a6505fd558422d008307ce72)
  • Update TensorFlow serving documentation with instructions to bind a host volume. The profiler in the docker will write to this volume, and on the host side TensorBoard will load the profile from the location. (commit: 7c21b223617d8293b45064b8d40c5c1297117677)
  • Fix package build due to config move in: (commit: 18dd7666bc7ed64c1254fb21895d1602f160b2f7)
  • Fix memory leak from allocating input tensors (commit: 2f9b6a0ea802ee726d1aba277650a8ddf3a411f9)
  • Allowing lossy floating point conversions for JSON inputs since JSON has only a single numeric type. This is inline with textproto conversion (and even C++ implicit double -> float conversion). (commit: 57dac6cdca7ffaed0e186ff370cf346db57d1cbc)
  • Adding enable_profiler command line flag. (commit: 7e8720dacc1e644d2858d4856fc56d6c01a81441)
  • Add logging in ServerCore. (commit: 623da67418df5be71494a4d82065351d6e4425c3)
  • Fix zlib. (commit: bc24390a45f5f9dbbf2fa1a05e39c9eb128187ee)
  • Fix broken GPU build by add TF cuda options: (commit: 05377a99a61217976cb4ddd2ea2c7f1ce71ae0eb)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Abhinav Pundir, Abolfazl Shahbazi, Aurélien Geron, Bairen Yi, gbaned, handong, Hao Ziyu, Junqin Zhang, kiddos, Oliver Sampson, OniB, Runzhen Wang, skawasak, zou000

serving - 2.5.0-rc2

Published by shadowdragon89 over 3 years ago

TensorFlow Serving using TensorFlow 2.5.0-rc2

Major Features and Improvements

  • Upgrade to CUDA 11.2. (commit: 1975e3e5701d1cb266d356834ec50a326016983f)
  • Update TF Text to v2.4.3. (commit: ccfb606b2c86f4ffb57ecd8454c8809ab8c13090)
  • Experimental support for XLA/CPU models. (commit: 3c1b2b34ec516afbb73543cc62d3979f013a33f1)
  • Add metrics to the Prometheus API (commit: dfb41f191ad9b61e9a8ca2cda4fa381733b9179d)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)

Breaking Changes

  • No Breaking changes

Bug Fixes and Other Changes

  • Fix typo in REQUIRED_PACKAGES for grpcio (commit: b9ed0f80852d7a33b0745833ff9c0243178b0c5d)
  • update resnet_k8s.yaml file (commit: e7b7b337eb82bec1556b6ab47353138c5e1a7f8b)
  • Fix a compile warning thrown by gcc-9 (commit: 38a017db5c35f60a9641eef0d70f4e8eaecd5608)
  • Update json_tensor.cc (commit: a0a9d14322bf30299d0056182952c00d1961bf35)
  • Add TfLiteInterpreterPool to make concurrent use of TfliteSession better (commit: d9efa43a77e609260614d9633a3ee7c4cc3e2843)
  • Enable download of TF Serving sources at arbitrary commit for CPU docker image. (commit: de1ab9e0b5abf8ee7540cc711fda2be9fcbd1b8c)
  • Updated tests to newer API (commit: 30dd2fe67ad00b38324d01351ca52c571701f97f)
  • When GRPC messages come to TF Serving GRPC server, server will create a new threads to handle each message. (commit: ac0eb73e71e0658196f02aa6518e119351e14a95)
  • Add dedicated aliases field to ModelServerConfig. (commit: 358f7d1fe94dc2c904b1c4de68c7bbfe42088b2d)
  • Update docker command line to work with GPUs (Fixes #1768). (commit: b41a28bb2c815bd76010d3a39a3ea52b5da85f35)
  • option to disable grpc over http (commit: f08729066771c55a44941a7661ac4d0bc48b85dd)
  • use RecordRuntimeLatency (commit: d353931269d1a9b10381102a9f8fd80b27bcf28e)
  • Improve error message for file not exists. (commit: 78d47f7031d5a60b8cfff4fd0f0a2e1ec0586c01)
  • Transition TensorFlow Serving to TensorFlow's new WORKSPACE protocol. (commit: 50a7ef370139073640e62178ab95b5894f587667)
  • fix http_rest_api_handler_test (commit: 53da4f80a6d6a15cebfb9add0ef5e0e59abe3eb2)
  • add status label in request_latency (commit: 0483edb643fdfcda34ca5a2cd7923c783f0e48bb)
  • fix timing of request_latency (commit: e20c91efcc9221109aa19cff6cca4f2495e367f3)
  • fix http_rest_api_handler_test (commit: 6254411fa3183ef51b46216a54f71675afe5b735)
  • Clarifying object values in REST requests to include B64 encoding and similar key:value pair objects. (commit: 0536678a05073feaa3f2815cb9012abd7a86907a)
  • Register custom TfLite ParseExample and add benchmark (commit: 20fe3ca4f292b1ae575217ecb69981d59addc720)
  • Update docker.md (commit: 7ebcd15c3d432959cf44ba784c869433730f73f4)
  • Pre-allocate memory for certain vectors where the size is known. (commit: e208b6ed8a621e91684dfa233b0923ea367f6a43)
  • Updating serving_basic for adjusting serving_basic.md file and making it up2date with TF2.x - including: (commit: cea306a2f3857e5715233b9f45b49097e4d5b4c8)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)
  • Fixing MKL builds due to missing 'build_with_openmp' option (commit: 0ed23dfe3d02e8bfcf19907da4b3d764146eefc0)
  • Implement batch parallelism for tflite sessions (commit: fec1d5db3cec6c81635e6f77923f3112f09a0289)
  • Use LOG_EVERY_N_SEC insted of LOG with local static time variable. (commit: c6b3936accc6aa75cf174b1ce3dffc4ca588fbdc)
  • Fix TensorFlow Serving build with MKL+OpenMP (commit: ddad074e9f2347e8898811c97db1e454f00c9062)
  • add cors headers (commit: d65914b5212213ea26e61956ade0668978001fba)
  • Remove extra build options for TF Serving (commit: 63acc954d7082f515a0c599991bab2ad4e87d449)
  • Remove hashtable custom op dependencies (commit: bb51722b1dea8d139c46e2fe6fb52fa379da164c)
  • allow http OPTIONS request and add default OPTIONS request handling (commit: 6287cb42734438ae2430ee480bfa57b3d734efbc)
  • Enable aspired version which failed to load to attempt reload. (commit: 2530a336ccff3fa202fd24967562d1bbe0055a6c)
  • Fixed a compilation error in aspired_versions_manager.cc (commit: 4ca9a4b41402597fea4933893cac2639d23d7ded)
  • handle options requests and check if Origin header exists (commit: c51214134671d565f541fa5ef2a1a63226fb53e3)
  • Add "_r" root event annotation to ProcessBatch events. (commit: e5c3aec49d28b06f98967a609129862d81b61019)
  • Bump minimum bazel version 3.7.2. (commit: 5edcd13d323bb133a6505fd558422d008307ce72)
  • Update TensorFlow serving documentation with instructions to bind a host volume. The profiler in the docker will write to this volume, and on the host side TensorBoard will load the profile from the location. (commit: 7c21b223617d8293b45064b8d40c5c1297117677)
  • Fix package build due to config move in: (commit: 18dd7666bc7ed64c1254fb21895d1602f160b2f7)
  • Fix memory leak from allocating input tensors (commit: 2f9b6a0ea802ee726d1aba277650a8ddf3a411f9)
  • Allowing lossy floating point conversions for JSON inputs since JSON has only a single numeric type. This is inline with textproto conversion (and even C++ implicit double -> float conversion). (commit: 57dac6cdca7ffaed0e186ff370cf346db57d1cbc)
  • Adding enable_profiler command line flag. (commit: 7e8720dacc1e644d2858d4856fc56d6c01a81441)
  • Add logging in ServerCore. (commit: 623da67418df5be71494a4d82065351d6e4425c3)
  • Fix zlib. (commit: bc24390a45f5f9dbbf2fa1a05e39c9eb128187ee)
  • Fix broken GPU build by add TF cuda options: (commit: 05377a99a61217976cb4ddd2ea2c7f1ce71ae0eb)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Abhinav Pundir, Abolfazl Shahbazi, Aurélien Geron, Bairen Yi, gbaned, handong, Hao Ziyu, Junqin Zhang, kiddos, Oliver Sampson, OniB, Runzhen Wang, skawasak, zou000

serving - 2.5.0-rc1

Published by netfs over 3 years ago

TensorFlow Serving using TensorFlow 2.5.0-rc1

Major Features and Improvements

  • Upgrade to CUDA 11.2 and and cuDNN 8.1.0 (commit: 1975e3e5701d1cb266d356834ec50a326016983f)
  • Update TF Text to v2.4.3. (commit: ccfb606b2c86f4ffb57ecd8454c8809ab8c13090)
  • Experimental support for serving JAX and XLA/CPU models. (commit: 3c1b2b34ec516afbb73543cc62d3979f013a33f1)
  • Add latency and availability metrics to the Prometheus API (#1623) (commit: dfb41f191ad9b61e9a8ca2cda4fa381733b9179d)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)
  • Add Cross-Origin Resource Sharing (CORS) headers to REST API (#1817)

Breaking Changes

  • No breaking changes

Bug Fixes and Other Changes

  • Fix typo in REQUIRED_PACKAGES for grpcio (commit: b9ed0f80852d7a33b0745833ff9c0243178b0c5d)
  • Update resnet_k8s.yaml file (commit: e7b7b337eb82bec1556b6ab47353138c5e1a7f8b)
  • Fix a compile warning thrown by gcc-9 (commit: 38a017db5c35f60a9641eef0d70f4e8eaecd5608)
  • Fix typo (commit: dbcd54fc323a87b459de36a6f4ab9d03cc4679d2)
  • Update json_tensor.cc (commit: a0a9d14322bf30299d0056182952c00d1961bf35)
  • Add TfLiteInterpreterPool to make concurrent use of TfliteSession better (commit: d9efa43a77e609260614d9633a3ee7c4cc3e2843)
  • Enable download of TF Serving sources at arbitrary commit for CPU docker image. (commit: de1ab9e0b5abf8ee7540cc711fda2be9fcbd1b8c)
  • Updated tests to newer API (commit: 30dd2fe67ad00b38324d01351ca52c571701f97f)
  • Control number of grpc threads for request handling to avoid OOM (Fixes #1785). (commit: ac0eb73e71e0658196f02aa6518e119351e14a95)
  • Add dedicated aliases field to ModelServerConfig. (commit: 358f7d1fe94dc2c904b1c4de68c7bbfe42088b2d)
  • Update docker command line to work with GPUs (Fixes #1768). (commit: b41a28bb2c815bd76010d3a39a3ea52b5da85f35)
  • Option to disable grpc over http (Fixes #1764) (commit: f08729066771c55a44941a7661ac4d0bc48b85dd)
  • Remove an unused experimental config option "experimental_fixed_input_tensors_filepath". (commit: 3234fca77b7bd433bd0b657c506d3526128e4626)
  • Removing CurriedSession, since it is no longer used. (commit: 87793ad8cd9b8c023cd03b6bb640c2a3eb66d619)
  • Improve error message for file not exists. (commit: 78d47f7031d5a60b8cfff4fd0f0a2e1ec0586c01)
  • Fix inference request delay when model is switched (Fixes #1796). (commit: 803dd4298af40e7b9fd193f16b86fcb795923dcc)
  • Transition TensorFlow Serving to TensorFlow's new WORKSPACE protocol. (commit: 50a7ef370139073640e62178ab95b5894f587667)
  • Clarifying object values in REST requests to include B64 encoding and similar key:value pair objects. (commit: 0536678a05073feaa3f2815cb9012abd7a86907a)
  • Remove experimental comment on TfLiteSession (commit: ab7f9a5e4946d8be08b135d18ac8172a1b494e05)
  • Register custom TfLite ParseExample and add benchmark (commit: 20fe3ca4f292b1ae575217ecb69981d59addc720)
  • Use respectful terms. (commit: b73bd7b5f9cce14a28c15a52f683d32a81773700)
  • Pre-allocate memory for certain vectors where the size is known. (commit: e208b6ed8a621e91684dfa233b0923ea367f6a43)
  • Updating serving_basic for adjusting serving_basic.md file and making it up2date with TF2.x - including: (commit: cea306a2f3857e5715233b9f45b49097e4d5b4c8)
  • Use NullSafeStringView for potentially null pointer returned from libevent (commit: a46fdb26bd8b78d57a47c13b116748ed9551cdc6)
  • Replace nullptr constructor for string_view with empty strings (commit: a98d1649061133c589c1a3872aee9ecd774bc52a)
  • Fixing MKL builds due to missing 'build_with_openmp' option (commit: 0ed23dfe3d02e8bfcf19907da4b3d764146eefc0)
  • Implement batch parallelism for tflite sessions (commit: fec1d5db3cec6c81635e6f77923f3112f09a0289)
  • Fix GPU docker image massive increase in size (#1813) (commit: 5a0dfd9468d718da1bd39687da0f80fe489ea0af)
  • Fix TensorFlow Serving build with MKL+OpenMP (commit: ddad074e9f2347e8898811c97db1e454f00c9062)
  • Remove hashtable custom op dependencies (commit: bb51722b1dea8d139c46e2fe6fb52fa379da164c)
  • Enable aspired version which failed to load to attempt reload. (commit: 2530a336ccff3fa202fd24967562d1bbe0055a6c)
  • Fixed a compilation error in aspired_versions_manager.cc (commit: 4ca9a4b41402597fea4933893cac2639d23d7ded)
  • Add "_r" root event annotation to ProcessBatch events. (commit: e5c3aec49d28b06f98967a609129862d81b61019)
  • Bump minimum bazel version 3.7.2. (commit: 5edcd13d323bb133a6505fd558422d008307ce72)
  • Dont hardcode path to python3. (commit: 63b2d1c097d540841a2657334f81f0421ebe11f0)
  • Fix package build due to config move in: (commit: 18dd7666bc7ed64c1254fb21895d1602f160b2f7)
  • Add model_service_cc_grpc_proto (commit: a670ff5537beb41e90020065e25a682498b9de38)
  • Fix memory leak from allocating input tensors (commit: 2f9b6a0ea802ee726d1aba277650a8ddf3a411f9)
  • Allowing lossy floating point conversions for JSON inputs (commit: 57dac6cdca7ffaed0e186ff370cf346db57d1cbc)
  • Adding enable_profiler command line flag. (commit: 7e8720dacc1e644d2858d4856fc56d6c01a81441)
  • Add logging in ServerCore. (commit: 623da67418df5be71494a4d82065351d6e4425c3)
  • Removes mention of ASCII (commit: 8e97b59fc3376d7bea8ea8c067da09c225c1da8a)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Abhinav Pundir, Abolfazl Shahbazi, Aurélien Geron, Bairen Yi, gbaned, handong, Hao Ziyu, Junqin Zhang, kiddos, Oliver Sampson, OniB, Runzhen Wang, skawasak, zou000

serving - 2.5.0-rc0

Published by nniuzft over 3 years ago

TensorFlow Serving using TensorFlow 2.5.0-rc0

Major Features and Improvements

  • Upgrade to CUDA 11.2. (commit: 1975e3e5701d1cb266d356834ec50a326016983f)
  • Experimental support for serving JAX and XLA/CPU models. (commit: 3c1b2b34ec516afbb73543cc62d3979f013a33f1)
  • Add latency and availability metrics to the Prometheus API (#1623) (commit: dfb41f191ad9b61e9a8ca2cda4fa381733b9179d)
  • Update TF Text to v2.4.3. (commit: ccfb606b2c86f4ffb57ecd8454c8809ab8c13090)
  • Support URL reserved characters for REST API (#1726) (commit: dd9c467e15b996be40ceb356f69d3d85c503e72d)
  • Add Cross-Origin Resource Sharing (CORS) headers to REST API (#1817)

Breaking Changes

  • No breaking changes

Bug Fixes and Other Changes

  • Fix typo in REQUIRED_PACKAGES for grpcio (commit: b9ed0f80852d7a33b0745833ff9c0243178b0c5d)
  • Update resnet_k8s.yaml file (commit: e7b7b337eb82bec1556b6ab47353138c5e1a7f8b)
  • Fix a compile warning thrown by gcc-9 (commit: 38a017db5c35f60a9641eef0d70f4e8eaecd5608)
  • Fix typo (commit: dbcd54fc323a87b459de36a6f4ab9d03cc4679d2)
  • Update json_tensor.cc (commit: a0a9d14322bf30299d0056182952c00d1961bf35)
  • Add TfLiteInterpreterPool to make concurrent use of TfliteSession better (commit: d9efa43a77e609260614d9633a3ee7c4cc3e2843)
  • Enable download of TF Serving sources at arbitrary commit for CPU docker image. (commit: de1ab9e0b5abf8ee7540cc711fda2be9fcbd1b8c)
  • Updated tests to newer API (commit: 30dd2fe67ad00b38324d01351ca52c571701f97f)
  • Control number of grpc threads for request handling to avoid OOM (Fixes #1785). (commit: ac0eb73e71e0658196f02aa6518e119351e14a95)
  • Add dedicated aliases field to ModelServerConfig. (commit: 358f7d1fe94dc2c904b1c4de68c7bbfe42088b2d)
  • Update docker command line to work with GPUs (Fixes #1768). (commit: b41a28bb2c815bd76010d3a39a3ea52b5da85f35)
  • Option to disable grpc over http (Fixes #1764) (commit: f08729066771c55a44941a7661ac4d0bc48b85dd)
  • Remove an unused experimental config option "experimental_fixed_input_tensors_filepath". (commit: 3234fca77b7bd433bd0b657c506d3526128e4626)
  • Removing CurriedSession, since it is no longer used. (commit: 87793ad8cd9b8c023cd03b6bb640c2a3eb66d619)
  • Improve error message for file not exists. (commit: 78d47f7031d5a60b8cfff4fd0f0a2e1ec0586c01)
  • Fix inference request delay when model is switched (Fixes #1796). (commit: 803dd4298af40e7b9fd193f16b86fcb795923dcc)
  • Transition TensorFlow Serving to TensorFlow's new WORKSPACE protocol. (commit: 50a7ef370139073640e62178ab95b5894f587667)
  • Clarifying object values in REST requests to include B64 encoding and similar key:value pair objects. (commit: 0536678a05073feaa3f2815cb9012abd7a86907a)
  • Remove experimental comment on TfLiteSession (commit: ab7f9a5e4946d8be08b135d18ac8172a1b494e05)
  • Register custom TfLite ParseExample and add benchmark (commit: 20fe3ca4f292b1ae575217ecb69981d59addc720)
  • Use respectful terms. (commit: b73bd7b5f9cce14a28c15a52f683d32a81773700)
  • Pre-allocate memory for certain vectors where the size is known. (commit: e208b6ed8a621e91684dfa233b0923ea367f6a43)
  • Updating serving_basic for adjusting serving_basic.md file and making it up2date with TF2.x - including: (commit: cea306a2f3857e5715233b9f45b49097e4d5b4c8)
  • Use NullSafeStringView for potentially null pointer returned from libevent (commit: a46fdb26bd8b78d57a47c13b116748ed9551cdc6)
  • Replace nullptr constructor for string_view with empty strings (commit: a98d1649061133c589c1a3872aee9ecd774bc52a)
  • Fixing MKL builds due to missing 'build_with_openmp' option (commit: 0ed23dfe3d02e8bfcf19907da4b3d764146eefc0)
  • Implement batch parallelism for tflite sessions (commit: fec1d5db3cec6c81635e6f77923f3112f09a0289)
  • Fix GPU docker image massive increase in size (#1813) (commit: 5a0dfd9468d718da1bd39687da0f80fe489ea0af)
  • Fix TensorFlow Serving build with MKL+OpenMP (commit: ddad074e9f2347e8898811c97db1e454f00c9062)
  • Remove hashtable custom op dependencies (commit: bb51722b1dea8d139c46e2fe6fb52fa379da164c)
  • Enable aspired version which failed to load to attempt reload. (commit: 2530a336ccff3fa202fd24967562d1bbe0055a6c)
  • Fixed a compilation error in aspired_versions_manager.cc (commit: 4ca9a4b41402597fea4933893cac2639d23d7ded)
  • Add "_r" root event annotation to ProcessBatch events. (commit: e5c3aec49d28b06f98967a609129862d81b61019)
  • Bump minimum bazel version 3.7.2. (commit: 5edcd13d323bb133a6505fd558422d008307ce72)
  • Dont hardcode path to python3. (commit: 63b2d1c097d540841a2657334f81f0421ebe11f0)
  • Fix package build due to config move in: (commit: 18dd7666bc7ed64c1254fb21895d1602f160b2f7)
  • Add model_service_cc_grpc_proto (commit: a670ff5537beb41e90020065e25a682498b9de38)
  • Fix memory leak from allocating input tensors (commit: 2f9b6a0ea802ee726d1aba277650a8ddf3a411f9)
  • Allowing lossy floating point conversions for JSON inputs (commit: 57dac6cdca7ffaed0e186ff370cf346db57d1cbc)
  • Adding enable_profiler command line flag. (commit: 7e8720dacc1e644d2858d4856fc56d6c01a81441)
  • Add logging in ServerCore. (commit: 623da67418df5be71494a4d82065351d6e4425c3)
  • Removes mention of ASCII (commit: 8e97b59fc3376d7bea8ea8c067da09c225c1da8a)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Abhinav Pundir, Abolfazl Shahbazi, Aurélien Geron, Bairen Yi, gbaned, handong, Hao Ziyu, Junqin Zhang, kiddos, Oliver Sampson, OniB, Runzhen Wang, skawasak, zou000

serving - 2.2.2

Published by netfs over 3 years ago

TensorFlow Serving using TensorFlow 2.2.2.

Major Features and Improvements

  • No major features or improvements.

Breaking Changes

  • No breaking changes

Bug Fixes and Other Changes

  • Reduces startup time on GPUs (commit: 54de1065213e5a8a888203a2795eb556881fc08a)
serving - 2.4.1

Published by nniuzft over 3 years ago

TensorFlow Serving using TensorFlow 2.4.1.

Major Features and Improvements

  • No major features or improvements.

Breaking Changes

  • No breaking changes

Bug Fixes and Other Changes

  • This release is based on TF version 2.4.1
serving - 2.4.0

Published by guanxinq almost 4 years ago

Major Features and Improvements

  • Update TF Text to v2.3.0.
  • Upgrade to CUDA Version 11.0.
  • Update CUDNN_VERSION to 8.0.4.30.
  • Adds user guide for Remote Predict Op.
  • Add support for serving regress/classify (native keras) TF2 models.

Breaking Changes

Bug Fixes and Other Changes

  • Adding /usr/local/cuda/extras/CUPTI/lib64 to LD_LIBRARY_PATH in order to unblock profiling (commit: 1270b8ce192225edcaafb00a50822216dd0b1de0)
  • Improve error message when version directory is not found (commit: d687d3e8827c82f4f1b68337c67b2cbe6e4126e7)
  • Migrate the remaining references of tf.app to compat.v1. (commit: 06fbf878a98c8bd4202e33bc1c097a6ce184d06e)
  • Cleanup TraceMe idioms (commit: f22f802c73bfdd548f85dacffc24022b0d79dfc7)
  • Adds LICENSE file to tensorflow-serving-api python package. (commit: 41188d482beb693d4e79e6934d25f1edd44321ac)
  • Enable a way to 'forget' unloaded models in the ServableStateMonitor. (commit: 53c5a65e8158dc1a2a85a2394482cc6acc1736bc)
  • Added abstract layer for remote predict op over different RPC protocols with template. (commit: c54ca7ec95928b6eec39f350140835ebbe3caeb0)
  • Add an example which call the Remote Predict Op directly. (commit: d5b980f487996aa1f890a559eae968735dfebf5d)
  • For batching session in TF serving model server, introduce options to enable large batch splitting. (commit: f84187e8d3e19a298656a661a888c0563c21910e)
  • Add multi-inference support for TF2 models that use (commit: abb8d3b516a310ec4269cd6bf892644d5150485a)
  • Use absl::optional instead of tensorflow::serving::optional. (commit: c809305a50412a2b47f2287c76ea0be1070aabd6)
  • Use absl::optional instead of tensorflow::serving::optional. (commit: cf1cf93eac1896c3c482d17b440489edea110670)
  • Remove tensorflow::serving::MakeCleanup and use tensorflow::gtl::MakeCleanup. (commit: 6ccb003eb45f4961128e5cc2edf5d8b61ef51111)
  • Use absl::optional and remove tensorflow::serving::optional. (commit: e8e5222abbb39e84d1d4e5e9813626b2cc51daac)
  • Deprecate tensorflow::CreateProfilerService() and update serving client. (commit: 98a55030e10a61ee0c3f6b8fc57e2cf63fc59719)
  • Change the SASS & PTX we ship with TF (commit: 086929269b5f2c0f5d71c30accb79d74694c9ece)
  • Adding custom op support. (commit: 892ea42864676b67cbccdfa0794a15d30e65a1b6)
  • Upgrade to PY3 for tests. (commit: 02624a83f70060095df7c132fa46a7a09f9bff6a)
  • Makes clear how to make a default config file for serving multiple models. (commit: 084eaeb15fdc87d83b8c19f558dc1f56bd3a024e)
  • Use TraceMeEncode in BatchingSession's TraceMe. (commit: 78ff058501274aa37b6bbc18aec225604d4cda47)
  • Export metrics for runtime latency for predict/classify/regress. (commit: c317582981cfc1550b27d9d73f71c6ca38e5c8c5)
  • Refactor net_http/client to expose request/response functionality as a public API (not yet finalized) for usage testing ServerRequestInterface and HttpServerInterface instances. (commit: 0b951c807375f1f305280a96124d8b6d6e045bd2)
  • In model warm-up path, re-write error code out-of-range (intended when reading EOF in a file) to ok. (commit: d9bde73569385b4ef3ef8e36d2c832a8ae9a92ad)
  • fix Client Rest API endpoint (commit: b847bac5f2e1dc6a98f431b1fdf42ceebceceeb6)
  • Support multiple SignatureDefs by key in TFLite models (commit: 2e14cd9dc2647127d7cb8c44ceab5dfcf6ac28c4)
  • Add dedicated aliases field to ModelServerConfig. (commit: 718152dc386f9fa7b21ed36d9d85518e987d7bf5)
  • Remove deprecated flag fail_if_no_model_versions_found from tensorflow serving binary (commit: 4b624628977a12b1757b9ddcd3312b3768de8231)
  • Fix TraceMe instrumentation for the padding size. (commit: 0cb94cd79aacb965b3923d4a51b4091cf84d5e22)
  • Add vlog to dump updated model label map (for debugging) each time the map is updated. (commit: ac10e74078123189dc1c8a3cd29d530b7c972782)
  • Add python wrapper for remote predict op and clean the build and include files. (commit: d0daa1064ecdd56ecb5c0a8aca37c3e198cb313d)
  • Add portpicker module required to run modelserver e2e tests. (commit: 82f8cc039d091916b8186dfa1ff4b6c006e7277c)
  • changing "infintiy" to "really high value" (commit: c96474cfcca46b1216e52634efb68986cf8aa9b8)
  • Minimal commandline client to trigger profiler on the modelserver. (commit: c0a5619a01e3af69459aa6396d614945370bbd02)
  • Add signature name to RPOp. (commit: 84dfc8b66ff6c1a693766613034ddc3ff044a330)
  • When RPC error occurs, the output tensors should still get allocated. (commit: 9113de22353350443bdd42c5d594ec653e57c0da)
  • Fix BM_MobileNet benchmark (commit: af665627b8152d4c62d207a97c6e712cb2e9a120)
  • Add JSPB BUILD targets for inference and example proto files. (commit: f1009eb0e6bdae2e35dbfb9f4ad7270e74705e2e)
  • Fall back to legacy TFLite tensor naming when parsing signature defs in TFLiteSession. (commit: 3884187cb9253bb9baa240b2009cfc6d4847b9f9)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Adarshreddy Adelli, Lescurel

serving - 2.4.0-rc4

Published by minglotus-6 almost 4 years ago

TensorFlow Serving using TensorFlow 2.4.0-rc4.

Major Features and Improvements

  • Update TF Text to v2.3.0.
  • Upgrade to CUDA Version 11.0.
  • Update CUDNN_VERSION to 8.0.4.30.
  • Adds user guide for Remote Predict Op
  • Upgrade to PY3 for tests.

Breaking Changes

Bug Fixes and Other Changes

  • Migrate the remaining references of tf.app to compat.v1. (commit: 06fbf878a98c8bd4202e33bc1c097a6ce184d06e)
  • Enable a way to 'forget' unloaded models in the ServableStateMonitor. (commit: 53c5a65e8158dc1a2a85a2394482cc6acc1736bc)d91cd4a00)
  • Added abstract layer for remote predict op over different RPC protocols with template. (commit: c54ca7ec95928b6eec39f350140835ebbe3caeb0)
  • Adds user guide for Remote Predict Op (commit: fc82463d197dde52a613b1717f8705964c57178f)
  • Add support for serving regress/classify (native) TF2 models. (commit: b724ceda57b4fdc9b5bd09da231c7186435cd807)
  • For batching session in TF serving model server, introduce options to enable large batch splitting. (commit: f84187e8d3e19a298656a661a888c0563c21910e)
  • Add multi-inference support for TF2 models that use (commit: abb8d3b516a310ec4269cd6bf892644d5150485a)
  • Change the SASS & PTX we ship with TF (commit: 086929269b5f2c0f5d71c30accb79d74694c9ece)
  • Adding custom op support. (commit: 892ea42864676b67cbccdfa0794a15d30e65a1b6)
  • Broaden net_http client visibility and use HTTPStatusCode in ClientResponse status (commit: 72927141f29d83614cf8304725b1a7e6357ce6ac)
  • In model warm-up path, re-write error code out-of-range (intended when reading EOF in a file) to ok. (commit: d9bde73569385b4ef3ef8e36d2c832a8ae9a92ad)
  • Support multiple SignatureDefs by key in TFLite models (commit: 2e14cd9dc2647127d7cb8c44ceab5dfcf6ac28c4)
  • Add dedicated aliases field to ModelServerConfig. (commit: 718152dc386f9fa7b21ed36d9d85518e987d7bf5)
  • Add portpicker module required to run modelserver e2e tests. (commit: 82f8cc039d091916b8186dfa1ff4b6c006e7277c)
  • Minimal commandline client to trigger profiler on the modelserver. (commit: c0a5619a01e3af69459aa6396d614945370bbd02)
  • Add signature name to RPOp. (commit: 84dfc8b66ff6c1a693766613034ddc3ff044a330)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.4.0-rc3

Published by shadowdragon89 almost 4 years ago

TensorFlow Serving using TensorFlow 2.4.0-rc3

Major Features and Improvements

  • Update TF Text to v2.3.0.
  • Upgrade to CUDA Version 11.0.
  • Update CUDNN_VERSION to 8.0.4.30.
  • Upgrade to PY3 for tests.

Breaking Changes

Bug Fixes and Other Changes

  • Migrate the remaining references of tf.app to compat.v1. (commit: 06fbf878a98c8bd4202e33bc1c097a6ce184d06e)
  • Enable a way to 'forget' unloaded models in the ServableStateMonitor. (commit: 53c5a65e8158dc1a2a85a2394482cc6acc1736bc)d91cd4a00)
  • Added abstract layer for remote predict op over different RPC protocols with template. (commit: c54ca7ec95928b6eec39f350140835ebbe3caeb0)
  • Adds user guide for Remote Predict Op (commit: fc82463d197dde52a613b1717f8705964c57178f)
  • Add support for serving regress/classify (native) TF2 models. (commit: b724ceda57b4fdc9b5bd09da231c7186435cd807)
  • For batching session in TF serving model server, introduce options to enable large batch splitting. (commit: f84187e8d3e19a298656a661a888c0563c21910e)
  • Add multi-inference support for TF2 models that use (commit: abb8d3b516a310ec4269cd6bf892644d5150485a)
  • Change the SASS & PTX we ship with TF (commit: 086929269b5f2c0f5d71c30accb79d74694c9ece)
  • Adding custom op support. (commit: 892ea42864676b67cbccdfa0794a15d30e65a1b6)
  • Broaden net_http client visibility and use HTTPStatusCode in ClientResponse status (commit: 72927141f29d83614cf8304725b1a7e6357ce6ac)
  • In model warm-up path, re-write error code out-of-range (intended when reading EOF in a file) to ok. (commit: d9bde73569385b4ef3ef8e36d2c832a8ae9a92ad)
  • Support multiple SignatureDefs by key in TFLite models (commit: 2e14cd9dc2647127d7cb8c44ceab5dfcf6ac28c4)
  • Add dedicated aliases field to ModelServerConfig. (commit: 718152dc386f9fa7b21ed36d9d85518e987d7bf5)
  • Add portpicker module required to run modelserver e2e tests. (commit: 82f8cc039d091916b8186dfa1ff4b6c006e7277c)
  • Minimal commandline client to trigger profiler on the modelserver. (commit: c0a5619a01e3af69459aa6396d614945370bbd02)
  • Add signature name to RPOp. (commit: 84dfc8b66ff6c1a693766613034ddc3ff044a330)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.4.0-rc2

Published by shadowdragon89 almost 4 years ago

TensorFlow Serving using TensorFlow 2.4.0-rc2

Major Features and Improvements

  • Update TF Text to v2.3.0.
  • Upgrade to CUDA Version 11.0.
  • Update CUDNN_VERSION to 8.0.4.30.
  • Upgrade to PY3 for tests.

Breaking Changes

Bug Fixes and Other Changes

  • Update TF Text to v2.3.0. (commit: 390732ace75ca9110c706f0aeb65d35a5390ca81)
  • Migrate the remaining references of tf.app to compat.v1. (commit: 06fbf878a98c8bd4202e33bc1c097a6ce184d06e)
  • Enable a way to 'forget' unloaded models in the ServableStateMonitor. (commit: 53c5a65e8158dc1a2a85a2394482cc6acc1736bc)d91cd4a00)
  • Added abstract layer for remote predict op over different RPC protocols with template. (commit: c54ca7ec95928b6eec39f350140835ebbe3caeb0)
  • Adds user guide for Remote Predict Op (commit: fc82463d197dde52a613b1717f8705964c57178f)
  • Add support for serving regress/classify (native) TF2 models. (commit: b724ceda57b4fdc9b5bd09da231c7186435cd807)
  • For batching session in TF serving model server, introduce options to enable large batch splitting. (commit: f84187e8d3e19a298656a661a888c0563c21910e)
  • Add multi-inference support for TF2 models that use (commit: abb8d3b516a310ec4269cd6bf892644d5150485a)
  • Change the SASS & PTX we ship with TF (commit: 086929269b5f2c0f5d71c30accb79d74694c9ece)
  • Adding custom op support. (commit: 892ea42864676b67cbccdfa0794a15d30e65a1b6)
  • Upgrade to PY3 for tests. (commit: 02624a83f70060095df7c132fa46a7a09f9bff6a)
  • Upgrade to CUDA Version 11.0. (commit: b291c4df771b3a6767782bdcfa086f93512ca182)
  • Broaden net_http client visibility and use HTTPStatusCode in ClientResponse status (commit: 72927141f29d83614cf8304725b1a7e6357ce6ac)
  • In model warm-up path, re-write error code out-of-range (intended when reading EOF in a file) to ok. (commit: d9bde73569385b4ef3ef8e36d2c832a8ae9a92ad)
  • Update CUDNN_VERSION to 8.0.4.30. (commit: 84e0189d1b9828813125efef9ff2d541f101705a)
  • Support multiple SignatureDefs by key in TFLite models (commit: 2e14cd9dc2647127d7cb8c44ceab5dfcf6ac28c4)
  • Add dedicated aliases field to ModelServerConfig. (commit: 718152dc386f9fa7b21ed36d9d85518e987d7bf5)
  • Add portpicker module required to run modelserver e2e tests. (commit: 82f8cc039d091916b8186dfa1ff4b6c006e7277c)
  • Minimal commandline client to trigger profiler on the modelserver. (commit: c0a5619a01e3af69459aa6396d614945370bbd02)
  • Add signature name to RPOp. (commit: 84dfc8b66ff6c1a693766613034ddc3ff044a330)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.3.0

Published by yimingz-a about 4 years ago

TensorFlow Serving using TensorFlow 2.3.0

Bug Fixes and Other Changes

  • Add a ThreadPoolFactory abstraction for returning inter- and intra- thread pools, and update PredictRequest handling logic to use the new abstraction. (commit: 8e3a00cd8ef00523227cbe1f694ab56454a880c3)
  • Update Dockerfile.devel* with py3.6 installed. (commit: b3f46d44d07480266b28776caa13211339777bc5)
  • Add more metrics for batching. (commit: f0bd9cf8b85710b638938361d356dbf15fda2e86)
  • Rename method to clarify intent. (commit: 9feac12f2223124c7ecc85a687e1ee2b24e3f7ad)
  • Plug ThreadPoolFactory into Classify request handling logic. (commit: 975f474a4ea9ef134439e266ec4a471741253ecf)
  • Plug ThreadPoolFactory into Regress request handling logic. (commit: ff9ebf2db8bf7cbc7bb199bbb207409eae25d5cc)
  • Plug ThreadPoolFactory into MultiInference request handling logic. (commit: 9a2db1da9b7e992d29ad4ccfcb125734d0cd760e)
  • Add a tflite benchmark for Mobilenet v1 quant (commit: e26682237cf756eca2dc12c83e8d5d24f00c1261)
  • Allow batch size of zero in row format JSON (commit: fee9d12070a76c1cf56bc8ae40f306a09dfd07b1)
  • Add tests for zero-sized batch (commit: b064c1d3df03b0401c5ca61de0d5ab36cd5645a5)
  • Support for MLMD(https://www.tensorflow.org/tfx/guide/mlmd) broadcast in TensorFlow serving. (commit: 4f8d3b789964d173f2d0bd87a42abfbd6a2b1e71)
  • Fix docker based builds (fixes #1596) (commit: ca2e0032d1ead843398d7744e8c51ead28daf63c)
  • Fix order dependency in batching_session_test. (commit: 58540f746c65516dc3fcda7751c6983050307409)
  • Split BasicTest in machine_learning_metadata_test into multiple test methods without order dependency. (commit: 745c735e315941925e324cbebe78a1f09d5a7443)
  • Revert pinning the version for "com_google_absl". (commit: ff9e950fa692c6f9387239bb9fa877975e8cf1c1)
  • Minimize the diffs between mkl and non-mkl Dockerfiles (commit: e7830148e53acfec7d3af7dd512a7e825f75da2a)
  • Pin "com_google_absl" at the same version(with same patch) with Tensorflow. (commit: f46b88af8af94be3c6497cc6c50a4e5c0625b2d5)
  • Update TF Text to v2.2.0. (commit: f8ea95d906421ff9517b0027662546741c486edf)
  • fix broken web link (commit: 0cb123f18df4032d8f22c1b2e19b4f41bd6c3da3)
  • Test zero-sized batch with tensors of different shapes (commit: 1f7aebd906a70ba0fa04105ceee6227960b764f7)
  • Test inconsistent batch size between zero and non-zero (commit: 91afd42dab8ce50f86bbf65065dce0c28163314b)
  • Fix broken GetModelMetadata request processing (#1612) (commit: c1ce0752f1076bd6f92e1af5f73e3a3c552f4691)
  • Adds support for SignatureDefs stored in metadata buffers to tflite sessions (commit: 4867fedbff8a33f499972268abe96618abcb81aa)
  • Update ICU library to include knowledge of built-in data files. (commit: c32ebd5e9f09828c80413ca989b99e8544502c1a)
  • Add support for version labels to REST API (Fixes #1555). (commit: 3df036223b66738de1b873e9b163230fb7661cb4)
  • Update TF Text regression model to catch errors thrown from within ops. (commit: 425d596b9b0aef2bf3ea675c985f01e55f880a4e)
  • Upgrade to CUDA Version 10.1. (commit: fd5a2a2508daf21ad174b4ec7b62501486137c01)
  • Migrates profiler_client trace to the new api in tensorflow_model_server_test. (commit: 8d7d1d6bbc50756e73aed4b9eb5a2c8ff25cdc79)
  • Update the testing model for TRT to fix the test. (commit: 28f812d8ce8f256e2d9256d6a98cd8f75f747842)
  • Add release notes for TF Serving 2.2.0 (commit: 54475e6508889c13992aced1da12a372d997e4e3)
  • Update bazel version requirement and version used in the docker images to match with TF (3.0.0). (commit: 56854d3fa27cce8c1f7816214f59e6e82c4bf5fc)
  • Fixes instructions on sample commands to serve a model with docker. (commit: a5cd1caafacd7480f5d8d2dd164adce3410b024f)
  • Change use_tflite_model to prefer_tflite_model to allow multi-tenancy of Tensorflow models with Tensorflow Lite models. (commit: 8589d8177bd300625b4c7596240150f8a8002d19)
  • Introducing Arena usage to TensorFlow Serving's HTTP handlers. (commit: a33978ca4c29387845e9b51d5653b997d4b3f814)
  • Fix tensorflow::errors:* calls, which use StrCat instead of StrFormat (commit: 2c0bcec68c040306e009b5a10d4bc80bc58fe0c5)
  • Instrumentation for BatchingSession: (commit: 3ca9e89d1b6147706981467a84c6421c44d3794a)
  • adjust error message for incorrect keys of instances object (commit: 83863b8fec26a8ea2d3957366173f9a52658b469)
  • Update rules_pkg to latest (0.2.5) release. (commit: 932358ec7511e54ad9c93ea606cc677da2d1fcb2)
  • In batching session, implement the support for 'enable_large_batch_splitting'. (commit: d7c6a65b816849cf2b84015a5b2972be7950dc89)
  • Update version for 2.3.0-rc0 release. (commit: 3af330317628a713a6e318097c7cd6fa8571165d)
  • Set cuda compute capabilities for cuda build config. (commit: 731a34f0b3f43a6f7a8da85655d3a4a5c72d066a)
  • Update version for 2.3.0 release. (commit: 8b4c7095b9931442a77288624fdd1a207671eb4c)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.3.0-rc0

Published by minglotus-6 about 4 years ago

TensorFlow Serving using TensorFlow 2.3.0-rc2

Major Features and Improvements

  • Update TF Text to v2.2.0. (commit: f8ea95d906421ff9517b0027662546741c486edf)

Breaking Changes

Bug Fixes and Other Changes

  • Add a ThreadPoolFactory abstraction for returning inter- and intra- thread pools, and update PredictRequest handling logic to use the new abstraction. (commit: 8e3a00cd8ef00523227cbe1f694ab56454a880c3)
  • Update Dockerfile.devel* with py3.6 installed. (commit: b3f46d44d07480266b28776caa13211339777bc5)
  • Add more metrics for batching. (commit: f0bd9cf8b85710b638938361d356dbf15fda2e86)
  • Plug ThreadPoolFactory into Classify request handling logic. (commit: 975f474a4ea9ef134439e266ec4a471741253ecf)
  • Plug ThreadPoolFactory into Regress request handling logic. (commit: ff9ebf2db8bf7cbc7bb199bbb207409eae25d5cc)
  • Plug ThreadPoolFactory into MultiInference request handling logic. (commit: 9a2db1da9b7e992d29ad4ccfcb125734d0cd760e)
  • Allow batch size of zero in row format JSON (commit: fee9d12070a76c1cf56bc8ae40f306a09dfd07b1)
  • Support for MLMD(https://www.tensorflow.org/tfx/guide/mlmd) broadcast in TensorFlow serving. (commit: 4f8d3b789964d173f2d0bd87a42abfbd6a2b1e71)
  • Adds support for SignatureDefs stored in metadata buffers to tflite sessions (commit: 4867fedbff8a33f499972268abe96618abcb81aa)
  • Add support for version labels to REST API (Fixes #1555). (commit: 3df036223b66738de1b873e9b163230fb7661cb4)
  • Update TF Text regression model to catch errors thrown from within ops. (commit: 425d596b9b0aef2bf3ea675c985f01e55f880a4e)
  • Upgrade to CUDA Version 10.1. (commit: fd5a2a2508daf21ad174b4ec7b62501486137c01)
  • Migrates profiler_client trace to the new api in tensorflow_model_server_test. (commit: 8d7d1d6bbc50756e73aed4b9eb5a2c8ff25cdc79)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.2.0

Published by nrobeR over 4 years ago

TensorFlow Serving using TensorFlow 2.2.0

Major Features and Improvements

  • Upgrade to CUDA Version 10.1. (commit: 3ab70a7811f63b994da076e2688ccc66feccee96)
  • Update TF Text to v2.2.0. (commit: fd9842816eddb4782579eadd119156190d6d2fec)

Breaking Changes

Bug Fixes and Other Changes

  • This release is based on TensorFlow version 2.2.0
  • Add a SourceAdapter that adds a prefix to StoragePath. (commit: f337623da81521eefd8cdc2da1c4a450ecf1d028)
  • Switch users of tensorflow::Env::Now*() to EnvTime::Now*(). (commit: 8a0895eb8127941b2e9dada20718dd28f3dbaee1)
  • Remove SessionBundle support from Predictor. (commit: 2090d67f4e5e8ee5aa7faf8437bea096a438450a)
  • Replace the error_codes.proto references in tf serving. (commit: ab475bf6c5e5e4b3b42ffa2aecf18b39fd481ad3)
  • Adds performance guide and documentation for TensorBoard integration (commit: f1e4eb2259da90bb9c5fe028ba824ac18a436f67)
  • Remove SessionBundleSourceAdapter as we load Session bundles via (commit: d50aa2b0b986b11368ddcf6b6eb20b9381af474c)
  • Use SavedModelBundleSourceAdapterConfig instead of (commit: 8ed3ceea985529a350290cf782cb34c3c66827d4)
  • Update minimum bazel version to 1.2.1. (commit: 1a36026198df5f7dec1e626ef9b112fecdd2916b)
  • Drop support for beta gRPC APIs. (commit: 13d01fc64330ff883bd1553122d9fd114a5a7368)
  • API spec for httpserver response-streaming (with flow-control). (commit: fd597f074ce127056515bc52ee3a3d4ff4b727bb)
  • Change Python version to PY3. (commit: 7516746a311f96b57a60598feba40cbdd3989e73)
  • Update Python tests in PY3. (commit: 0cf65d2105c191c00fba8918ba75fc955bbeace3)
  • Upgrade bazel version for Dockerfiles. (commit: e507aa193b9f3520d40e3da5e4d2263280ff35e4)
  • Change dockerfile for PY3. (commit: 7cbd06e8b7720b82b1d2dfae54c3a828d3a52eb4)
  • Reduce contention in FastReadDynamicPtr by sharding the ReadPtrs, by default one per CPU. (commit: d3b374bc70348f2e5e22b7e9ebb191ee9d5b3268)
  • Ensure that all outstanding ReadPtrs are destroyed before allowing a (commit: e41ee40826652b6aa5a3f071107074923d6ff6c7)
  • Allow splitting fields from batched session metadata into individual sessions (commit: caf2a92ba07ca4d10515f0b018c920e9b325c6c8)
  • Allow passing ThreadPoolOptions in various Session implementations. (commit: 2b6212cf0aa88b719ee00267f83c89d4f7599ef1)
  • Update bazel version used in the docker images. (commit: 162f72949c6ecbe9e610182c923dec0aa5924cf2)
  • Format error strings correctly in JSON response (Fixes #1600). (commit: 1ff4d31cd9a0a736162813c149139cce0ccaaa2c)
  • Fix broken GetModelMetadata request processing (#1612) (commit: 55c40374b548b89e8de6d899ef2b0b355c0fa9e5)
  • Support Python 3.7 in tensorflow-serving-api package (Fixes #1640) (commit: f775bb25e80a6c7b3c66842eb9085d44d9752ec2)
  • Update ICU library to include knowledge of built-in data files. (commit: 774f2489384cf985c534298d1303474c268efe5c)
  • Adds storage.googleapis.com as the primary download location for the ICU, and resets the sha256 to match this archive. (commit: 028d05095c4e302c06096e5ea32917718828ea47)
serving - 2.2.0-rc2

Published by lilao over 4 years ago

TensorFlow Serving using TensorFlow 2.2.0-rc4

serving - 2.2.0-rc1

Published by minglotus-6 over 4 years ago

Release 2.2.0-rc1

Major Features and Improvements

Breaking Changes

Bug Fixes and Other Changes

  • Add a SourceAdapter that adds a prefix to StoragePath. (commit: f337623)
    In ServableStateMonitor, introduce a helper method 'GetAvailableServables' in ServableStateMonitor. (commit: 4dcfa25)
  • Adds performance guide and documentation for TensorBoard integration (commit: f1e4eb2)
  • Use SavedModelBundleFactory for session source adapter instead (commit: a59e0dc)
  • Update TF Text to 2.0.1 (commit: bd206e4)
  • API spec for httpserver response-streaming (with flow-control). (commit: fd597f0)
  • Update TF Text to v2.1.1 which removes ICU variable conflict with core TF. (commit: 2fd33e1)
  • Reduce contention in FastReadDynamicPtr by sharding the ReadPtrs, by default one per CPU. (commit: d3b374b)
  • Refined the httpserver flow-control API for streaming writes. (commit: e085cb3)
  • Explicitly disable Tensorflow v2 behaviors for all TF1.x binaries and tests (commit: 3ffd969)
  • Allow splitting fields from batched session metadata into individual sessions (commit: caf2a92)
  • For google-internal, optimize proto parse code for lazy scenario, by annotate byte-type fields with Cord. (commit: 726ca19)
  • Allow passing ThreadPoolOptions in various Session implementations. (commit: 2b6212c)
  • Small performance optimization. (commit: 4196a72)

Thanks to our Contributors

This release contains contributions from many people at Google.

serving - 2.2.0-rc0

Published by nrobeR over 4 years ago

TensorFlow Serving using TensorFlow 2.2.0-rc0.

Major Features and Improvements

Breaking Changes

Bug Fixes and Other Changes

  • Add a SourceAdapter that adds a prefix to StoragePath. (commit: f337623da81521eefd8cdc2da1c4a450ecf1d028)
  • Switch users of tensorflow::Env::Now*() to EnvTime::Now*(). (commit: 8a0895eb8127941b2e9dada20718dd28f3dbaee1)
  • Update the serving model test to use profiler_client.start_tracing instead of (commit: 495e15bbd0f431fa2e93c79cc216f6dbcfc9b6a6)
  • Remove SessionBundle usage from regressor/classifier. (commit: b1b6ffd7dfa654a4f4629519d902d958c07ee7c8)
  • Remove SessionBundle support from Predictor. (commit: 2090d67f4e5e8ee5aa7faf8437bea096a438450a)
  • In ServableStateMonitor, introduce a helper method 'GetAvailableServables' in ServableStateMonitor. (commit: 4dcfa25b2b65f5a288c774489ad114a2744b56f9)
  • Adds performance guide and documentation for TensorBoard integration (commit: f1e4eb2259da90bb9c5fe028ba824ac18a436f67)
  • Fix docs to refer to SavedModelBundleSourceAdatperConfig as (commit: 4de4c7b6d2f5932f8f2be939335e38c94f1f2dac)
  • Update minimum bazel version to 1.2.1. (commit: 1a36026198df5f7dec1e626ef9b112fecdd2916b)
  • Drop support for beta gRPC APIs. (commit: 13d01fc64330ff883bd1553122d9fd114a5a7368)
  • Change Python version to PY3. (commit: 7516746a311f96b57a60598feba40cbdd3989e73)
  • Change dockerfile for PY3. (commit: 7cbd06e8b7720b82b1d2dfae54c3a828d3a52eb4)
  • Update TF Text to v2.1.1 which removes ICU variable conflict with core TF. (commit: 2fd33e18a3019d90a4588eeebd364d64b37605ac)
  • Reduce contention in FastReadDynamicPtr by sharding the ReadPtrs, by default one per CPU. (commit: d3b374bc70348f2e5e22b7e9ebb191ee9d5b3268)
  • Ensure that all outstanding ReadPtrs are destroyed before allowing a (commit: e41ee40826652b6aa5a3f071107074923d6ff6c7)
  • Refined the httpserver flow-control API for streaming writes. (commit: e085cb37b4b5c62154aa063dcbb781b24d5b3fd4)
  • Makes clear that version labels are not supported by REST API surface (commit: 9781ed1857ab0b1195107821ad0f3720872546a1)
  • Allow splitting fields from batched session metadata into individual sessions (commit: caf2a92ba07ca4d10515f0b018c920e9b325c6c8)
  • Allow passing ThreadPoolOptions in various Session implementations. (commit: 2b6212cf0aa88b719ee00267f83c89d4f7599ef1)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Haiming, Shyam Saladi, Tensorflow-Copybara

serving - 2.1.0

Published by shadowdragon89 almost 5 years ago

TensorFlow Serving using TensorFlow 2.1.0

Major Features and Improvements

Breaking Changes

Bug Fixes and Other Changes

  • Fix link for TFRecord in Saved Model Warmup documentation. (commit: 127a112a91bda3d7d3c3a56802632376bbe3e36e)
  • Fix typo in http server log message. (commit: 509f6da062dc9b091ad6961a94740cf64e265c36)
  • Be able to discard aspired-versions request from SourceRouter (commit: 10e4987502ee91fe74c6c179ed4ba52f17cc75b4)
  • Use public tf.config APIs (commit: 87a4b2b28729bd269ab367742998b6f8426ea1b7)
  • Fix copying of string tensor outputs by explicitly copying each (commit: 9544077bdb6eef9b20a0688a042155ee6dea011a)
  • Migrate from std::string to tensorflow::tstring. (commit: e24571ac9ce390733f3b02188c7d740f08fff62d)
serving - TensorFlow Serving using TensorFlow 2.1.0-rc1

Published by minglotus-6 almost 5 years ago

Release 2.1

Major Features and Improvements

  • Be able to discard aspired-versions request from SourceRouter (commit: 10e4987502ee91fe74c6c179ed4ba52f17cc75b4)
  • Adds integration with TensorBoard profiler service (commit: 91adea9716b57dd58427714427b944fbe9b3f89e)

Breaking Changes

Bug Fixes and Other Changes

  • Fix link for TFRecord in Saved Model Warmup documentation. (commit: 127a112a91bda3d7d3c3a56802632376bbe3e36e)
  • Use public tf.config APIs (commit: 87a4b2b28729bd269ab367742998b6f8426ea1b7)

Thanks to our Contributors

This release contains contributions from many people at Google, as well as:

Minglotus-6

serving - 1.13.1

Published by lilao almost 5 years ago

TensorFlow Serving using TensorFlow 1.13.2