infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

MIT License

Downloads
11.5K
Stars
1.1K

Bot releases are visible (Hide)

infinity - 0.0.15

Published by michaelfeil 10 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.14...0.0.15

infinity - 0.0.14

Published by michaelfeil 10 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.13...0.0.14

infinity - 0.0.14

Published by michaelfeil 10 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.13...0.0.14

infinity - 0.0.13

Published by michaelfeil 10 months ago

infinity - 0.0.12

Published by michaelfeil 10 months ago

What's Changed

Breaking:

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.11...0.0.12

infinity - 0.0.11

Published by michaelfeil 10 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.10...0.0.11

infinity - 0.0.10

Published by michaelfeil 11 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.9...0.0.10

infinity - 0.0.9

Published by michaelfeil 11 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.8...0.0.9

infinity - 0.0.8

Published by michaelfeil 11 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.7...0.0.8

infinity - 0.0.7

Published by michaelfeil 11 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.6...0.0.7

infinity - 0.0.6

Published by michaelfeil 11 months ago

infinity - 0.0.5

Published by michaelfeil 12 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.4...0.0.5

infinity - 0.0.4

Published by michaelfeil 12 months ago

What's Changed

PRs:

Issues:
Closes https://github.com/michaelfeil/infinity/issues/5 ONNX Support via https://github.com/qdrant/fastembed/
Closes https://github.com/michaelfeil/infinity/issues/22 making pytorch and optional dependency

tl,dr

  • fastembed as backend besides ct2 or torch
  • v1/models returns "backend"
  • makes torch an optional dependency
  • calculates "min" sleep time dynamically on startup _> slightly optimized.
  • default model is now "BAAI/bge-small-en-v1.5"

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.3...0.0.4

infinity - 0.0.3

Published by michaelfeil 12 months ago

What's Changed

Docs:

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.2rc0...0.0.3

infinity - 0.0.2

Published by michaelfeil 12 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.1...0.0.2rc0

infinity - 0.0.1

Published by michaelfeil about 1 year ago

Initial release of Infinity

infinity - 0.0.1-dev3

Published by michaelfeil about 1 year ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.1-dev2...0.0.1-dev3

infinity - 0.0.1-dev2 - Speedups

Published by michaelfeil about 1 year ago

adds new dependency (orjson) for faster response serialization - 300%
uses torch.inference_mode() and delayed moving to CPU - 10%
adds uvicorn[standard] - slightly faster 2-5%?
Updates readme

https://github.com/michaelfeil/infinity/pull/2

infinity - 0.0.1-dev1

Published by michaelfeil about 1 year ago

This is a release for testing the CI of Infinity.