Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
MIT License
Bot releases are visible (Hide)
Published by michaelfeil 5 months ago
EngineArray
Multi-Model [1/3] by @michaelfeil in https://github.com/michaelfeil/infinity/pull/200
BatchHandler
into ModelWorker
by @michaelfeil in https://github.com/michaelfeil/infinity/pull/202
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.32...0.0.33
Published by michaelfeil 6 months ago
You can now run a model with a alias. This will help you communicating with the API.
infinity_emb --served-model-name "your_nickname"
You can now use preload
models. This acts as a "run download and load into ram" test. Upon execution, all files are cached, which will speedup consecutive loads. For additonal speedups, use --no-model-warmup
to skip model warmup after loading.
infinity_emb --preload-only --model--name-or-path BAAI/bge-large-en-v1.5
PR's
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.31...0.0.32
Published by michaelfeil 7 months ago
ENUM..TypeHint
into a function by @michaelfeil in https://github.com/michaelfeil/infinity/pull/172
/docs
and optional imports by @michaelfeil in https://github.com/michaelfeil/infinity/pull/175
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.30...0.0.31
Published by michaelfeil 7 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.29...0.0.30
Published by michaelfeil 7 months ago
This will be the last release with fastembed - fastembed and optimum provide similar capabilities. Please use optimum going forward.
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.28...0.0.29
Published by michaelfeil 7 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.27...0.0.28
Published by michaelfeil 7 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.26...0.0.27
Published by michaelfeil 8 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.25...0.0.26
Published by michaelfeil 8 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.24...0.0.25
Published by michaelfeil 8 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.23...0.0.24
Published by michaelfeil 9 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.22...0.0.23
Published by michaelfeil 9 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.21...0.0.22
Published by michaelfeil 9 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.20...0.0.21
Published by michaelfeil 9 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.19...0.0.20
Published by michaelfeil 9 months ago
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.18...0.0.19
Published by michaelfeil 9 months ago
0.0.18
Published by michaelfeil 9 months ago
0.0.18
Published by michaelfeil 9 months ago
0.0.18
Published by michaelfeil 10 months ago
Breaking: Switched to Cuda 12.1 and torch 2.1.2
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.16...0.0.17
Published by michaelfeil 10 months ago
capabilities
by @michaelfeil in https://github.com/michaelfeil/infinity/pull/53
Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.15...0.0.16