Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
APACHE-2.0 License
Note: BigDL v2.4.0 has been updated to include functional and security updates. Users should update to the latest version.
Published by glorysdj over 1 year ago
Note: BigDL v2.3.0 has been updated to include functional and security updates. Users should update to the latest version.
Nano
trace
and quantization
process (for PyTorch and TensorFlow model optimizations)Orca:
Chronos
bigdl.chronos.aiops
module for AIOps use case on top of Chronos algorithms.Friesian:
PPML
Published by Le-Zheng almost 2 years ago
Note: BigDL v2.2.0 has been updated to include functional and security updates. Users should update to the latest version.
Published by glorysdj about 2 years ago
Note: BigDL v2.1.0 has been updated to include functional and security updates. Users should update to the latest version.
Published by glorysdj over 2 years ago
Note: BigDL v2.0.0 has been updated to include functional and security updates. Users should update to the latest version.
Published by Le-Zheng over 3 years ago
Published by Le-Zheng over 3 years ago
Published by Le-Zheng almost 4 years ago
Published by Le-Zheng almost 4 years ago
Published by wzhongyuan almost 5 years ago
Continue RNN optimization. We support both LSTM and GRU integration with MKL-DNN which acheives ~3x performance
ONNX support. We support loading third party framework models via ONNX
Richer data preprocssing support and segmentation inference pipeline support
Published by wzhongyuan over 5 years ago
Continue VNNI acceleration support, we add optimization for more CNN models including object detection models, enhance model scales generation support for VNNI.
Add attention based model support, we add Transformer implementation for both lanuage model and translation model.
RNN optimization, We support LSTM integration with MKL-DNN which acheives ~3x performance speedup.
Published by wzhongyuan over 5 years ago
Published by yiheng about 6 years ago
mapPartition
Published by yiheng over 6 years ago
Published by yiheng over 6 years ago
Published by qiuxin2012 almost 7 years ago
Published by yiheng almost 7 years ago
Published by qiuxin2012 over 7 years ago
Published by yiheng over 7 years ago
Release Notes