infinity

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

MIT License

Downloads: 11.5K
Stars: 1.1K

infinity - 0.0.33

Published by michaelfeil 5 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.32...0.0.33

infinity - 0.0.32

Published by michaelfeil 6 months ago

What's Changed

You can now run a model under an alias, which makes it easier to refer to the model when communicating with the API.

infinity_emb --served-model-name "your_nickname"
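As a rough illustration of how a client might use the alias, the sketch below builds (without sending) an embeddings request that names the model by its alias. The base URL `http://localhost:7997`, the `/embeddings` route, and the `model`/`input` payload fields are assumptions based on an OpenAI-style embeddings API and are not stated in these notes; `build_embedding_request` is a hypothetical helper.

```python
import json
import urllib.request


def build_embedding_request(alias, texts, base_url="http://localhost:7997"):
    """Build (but do not send) a POST request that addresses the model by its alias.

    Assumptions: base_url, the /embeddings route, and the payload field names
    follow an OpenAI-style embeddings API; adjust them to your deployment.
    """
    payload = {"model": alias, "input": list(texts)}
    return urllib.request.Request(
        url=f"{base_url}/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# The alias matches the value passed to --served-model-name.
request = build_embedding_request("your_nickname", ["hello world"])

# To send it against a running server:
# with urllib.request.urlopen(request) as response:
#     body = json.load(response)
```

Keeping the alias in one place on the client side means the underlying model can be swapped on the server without changing client code.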

You can now preload models. This acts as a "download and load into RAM" test. On execution, all files are cached, which speeds up subsequent loads. For additional speedups, use --no-model-warmup to skip model warmup after loading.

infinity_emb --preload-only --model-name-or-path BAAI/bge-large-en-v1.5
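One place the preload step could be useful is a build or CI script that warms the model cache before the server starts. The sketch below only assembles and optionally runs the command; it assumes the flag is spelled `--model-name-or-path`, that `--no-model-warmup` can be combined with `--preload-only`, and that `infinity_emb` exits with a non-zero code on failure. `preload_command` and `preload` are hypothetical helper names.

```python
import subprocess


def preload_command(model, skip_warmup=False):
    """Return the argv list for a preload-only run of infinity_emb."""
    argv = ["infinity_emb", "--preload-only", "--model-name-or-path", model]
    if skip_warmup:
        # Assumption: this flag is accepted together with --preload-only.
        argv.append("--no-model-warmup")
    return argv


def preload(model, skip_warmup=False):
    """Run the preload; check=True raises CalledProcessError on a non-zero exit."""
    subprocess.run(preload_command(model, skip_warmup), check=True)


command = preload_command("BAAI/bge-large-en-v1.5", skip_warmup=True)
# preload("BAAI/bge-large-en-v1.5", skip_warmup=True)  # requires infinity_emb on PATH
```

Passing the arguments as a list avoids shell quoting issues with model names.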

PRs

New Contributors

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.31...0.0.32

infinity - 0.0.31

Published by michaelfeil 7 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.30...0.0.31

infinity - 0.0.30

Published by michaelfeil 7 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.29...0.0.30

infinity - 0.0.29

Published by michaelfeil 7 months ago

What's Changed

This will be the last release with fastembed support. fastembed and optimum provide similar capabilities, so please use optimum going forward.

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.28...0.0.29

infinity - 0.0.28

Published by michaelfeil 7 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.27...0.0.28

infinity - 0.0.27

Published by michaelfeil 7 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.26...0.0.27

infinity - 0.0.26

Published by michaelfeil 8 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.25...0.0.26

infinity - 0.0.25

Published by michaelfeil 8 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.24...0.0.25

infinity - 0.0.24

Published by michaelfeil 8 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.23...0.0.24

infinity - 0.0.23

Published by michaelfeil 9 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.22...0.0.23

infinity - 0.0.22

Published by michaelfeil 9 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.21...0.0.22

infinity - 0.0.21

Published by michaelfeil 9 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.20...0.0.21

infinity - 0.0.20

Published by michaelfeil 9 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.19...0.0.20

infinity - 0.0.19 - yanked

Published by michaelfeil 9 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.18...0.0.19

infinity - 0.0.18

Published by michaelfeil 9 months ago

0.0.18

infinity - 0.0.17

Published by michaelfeil 10 months ago

What's Changed

Breaking: Switched to CUDA 12.1 and torch 2.1.2.

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.16...0.0.17

infinity - 0.0.16

Published by michaelfeil 10 months ago

What's Changed

Full Changelog: https://github.com/michaelfeil/infinity/compare/0.0.15...0.0.16