friendli-client

Friendli: the fastest serving engine for generative AI

APACHE-2.0 License

Downloads
47.4K
Stars
40
Committers
9

Bot releases are visible (Hide)

friendli-client - Release v1.2.0 🚀

Published by kooyunmo 9 months ago

Release Notes v1.2.0 🚀

Features 🌟

  1. Distinguish Merged QKV for more precise and efficient handling of query, key, and value transformations.
  2. Now LoRA can be applied to the MPT model.
  3. Introduced support for Mixtral model checkpoint conversion.

Bug Fixes and Improvements 🐛🔨

  1. AWQ bug during using gpt-j.
  2. Addressed a critical issue where CUDA Out of Memory (OOM) errors occurred while using AWQ.
  3. Minor update to phi model type.
friendli-client - Release 1.1.0 🚀

Published by kooyunmo 10 months ago

  • The features related to Friendli Dedicated Endpoints are temporarily removed from the client package. Please use periflow-client instead to use the features.

[!NOTE]
We are actively integrating the features in Friendli Dedicated Endpoints (previously known as PeriFlow Cloud) to Friendli Suite.
Please use periflow-client pacakge instead of friendli-client to use features for Friendli Dedicated Endpoints.
Those features will be supported with friendli-client very shortly.

  • CLI commands are updated.
    • Commands related to the Friendli Dedicated Endpoints are removed temporarily.
    • Commands for API calls to the serverless endpoints are added. Check the example usage below:
friendli api chat-completions create \
  -g "user Tell me how to make a delicious pancake" \
  -m llama-2-13b-chat
friendli-client - Release 1.0.0 🚀

Published by kooyunmo 10 months ago

🌟 Exciting Major Version Update: Introducing Friendli Suite! 🌈

We're thrilled to announce the official release of Friendli Suite, bringing a wealth of enhancements and features to your fingertips.
With this major update, we've given our package and GitHub repository a facelift, transitioning from periflow-client to the all-new and improved friendli-client.

Here's a rundown of the key changes:

🔄 CLI Command Prefix Update

The CLI command prefix has undergone a transformation! Say goodbye to the old pf and embrace the fresh friendli. Now, to sign in, simply use friendli login instead of the previous pf login.

🐍 Python SDK Breaking Change

In the Python SDK, we've introduced a breaking change that aligns its semantics with the OpenAI Python SDK v1. Fear not, as this update brings more consistency and compatibility. Dive into the details and explore examples on our documentation page.

import os
from friendli import Friendli

client = Friendli(api_key=os.environ.get("FRIENDLI_API_KEY"))

chat_completion = client.chat.completions.create(
    model="llama-2-13b-chat",
    messages=[
        {
            "role": "user",
            "content": "Tell me how to make a delicious pancake"
        }
    ],
    stream=False,
)

print(chat_completion.choices[0].message.content)

🚀 Upgrade now to unlock a world of possibilities with Friendli Suite! If you encounter any challenges or have questions, don't hesitate to reach out. Happy coding! 🤖💬

friendli-client - Release 0.1.13 🚀

Published by kooyunmo 11 months ago

  • Support checkpoints with safetensors format.

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.12...0.1.13

friendli-client - Release 0.1.12 🚀

Published by kooyunmo 12 months ago

  • Now you can convert HuggingFace adapter models to PeriFlow-compliant format. Run pf convert-adapter --help to see details.
friendli-client - Release 0.1.11 🚀

Published by kooyunmo 12 months ago

  • Minor: Strict package version check is disabled.
  • The "offloading" option is added to the quantization config file. This option enables the GPU to CPU offloading to save the GPU memory usage.
friendli-client - Release 0.1.10 🚀

Published by kooyunmo about 1 year ago

This release includes a hotfix of 0.1.9.

friendli-client - Release 0.1.9 🚀

Published by kooyunmo about 1 year ago

What's Changed

  • Now we support AWQ quantization!
friendli-client - Release 0.1.8 🚀

Published by kooyunmo about 1 year ago

What's Changed

  • Now API request is sent using protobuf.
  • New API support for tokenization and detokenization.
  • Quantization can be applied during the checkpoint conversion by pf checkpoint convert. Currently, only SmoothQuant is supported.

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.7...0.1.8

friendli-client - Release 0.1.7 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.6...0.1.7

friendli-client - Release 0.1.6 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.5...0.1.6

friendli-client - Release 0.1.5 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.4...0.1.5

friendli-client - Release 0.1.4 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.3...0.1.4

friendli-client - Release 0.1.3 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.2...0.1.3

friendli-client - Release 0.1.2 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.1...0.1.2

friendli-client - Release 0.1.1 🚀

Published by kooyunmo about 1 year ago

What's Changed

Full Changelog: https://github.com/friendliai/periflow-client/compare/0.1.0...0.1.1

friendli-client - Release 0.1.0 🚀

Published by kooyunmo over 1 year ago

Package Rankings
Top 38.35% on Pypi.org
Related Projects