Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
APACHE-2.0 License
Bot releases are hidden (Show)
target_ids
argument is now optional. (#1825)sequence_length
for transformer exports is now automatically inferred if it is not supplied. (#1826)SmoothQuant
updated to support proper device forwarding where it would not work properly in FSDP setups and crash. (#1830)nsamples
increased to 512, the stability of OBCQ improved, resulting in a higher likelihood of it converging correctly. (#1812)SmoothQuant
NaN values are resolved during computation. (#1872)TypeError
with OBCQ when no sequence_length
is provided is now resolved. (#1899)Published by jeanniefinks 10 months ago
This is a patch release for 1.6.0 that contains the following changes:
LICENSE-NEURALMAGIC
to LICENSE
in the NOTICE file. (#1915)Published by jeanniefinks 10 months ago
Version support added:
Ultralytics YOLOv8 training and sparsification pipelines added. (Documentation) (#1517, #1522, #1520, #1528, #1521, #1561, #1579, #1597, #1599, #1629, #1637, #1638, #1673, #1686, #1656, #1787)
NOTICE updated to reflect now public-facing Ultralytics Enterprise Enterprise Software License Agreement for YOLOv3/v5/v8.
Initial sparsification framework v2 added for better generative AI support and improved functionality and extensibility. (Documentation available in v1.7) (#1713, #1751, #1742, #1763, #1759, #1769)
BLOOM, CodeGen, OPT, Falcon, GPTNeo, LLAMA, MPT, and Whisper large language and generative models are supported through transformers training, sparsification, and export pipelines. (Documentation) (#1562, #1571, #1585, #1584, #1616, #1633, #1590, #1644, #1615, #1664, #1646, #1631, #1648, #1683, #1687, #1677, #1692, #1694, #1699, #1703, #1709, #1691, #171, #1720, #1746)
QuantizationModifier for PyTorch sparsification pathways implemented to enable cleaner, more robust, and simpler arguments for quantizing models in comparison to the legacy quantization modifier. (Documentation) (#1568, #1594, #1639, #1693, #1745, #1738)
CLIP pruning, quantization, and export supported. (Documentation) ( #1581, #1626, #1711)
INT4 quantization support added for model sparsification and export. (Documentation available in v1.8 with LLM support expansion)(#1670)
DDP support added to Torchvision image classification training and sparsification pipelines. (Documentation available in v1.8 with new research paper)(#1698, #1784)
SparseGPT, OBC, and OBQ one-shot/post-training pruning and quantization modifiers added for PyTorch pathways. (Documentation) (#1705, #1736, #1737, #1761, #1770, #1781, #1776, #1777, #1758)
SparseML upgraded for SparseZoo V2 model file structure changes, which expands the number of supported files and reduces the number of bytes that need to be downloaded for model checkpoints, folders, and files. (#1719)
Docker builds updated to consistently rebuild for new releases and nightlies. (#1506, #1531, #1543, #1537, #1665, #1684)
README and documentation updated to include: Slack Community name change, Contact Us form introduction, Python version changes; corrections for YOLOv5 torchvision, transformers, and SparseZoo broken links; and installation command. (#1536, #1577, #1578, #1610, #1617, #1612, #1602, #1659, #1721, #1725 , #1726, #1785)
Improved support for large ONNX files to improve loading performance and limit memory performance issues, especially for LLMs. (#1515, #1540, #1514, #1586)
Transformers datasets can now be created without a model needing to be passed in. (#1544, #1545)
Torchvision training and sparsification pipelines updated to enable patch versions of torchvision as installable dependencies, whereas before the version was restricted to 0.14.0 and now supports 0.14.x. (#1556)
Image classification training and sparsification pipelines for torchvision now support arguments for RGB emans and standard deviations to be passed in, enabling overriding of the default ImageNet values that were hardcoded. (#1546)
YOLOv5 training and sparsification pipelines migrated to install from nm-yolov5
on PyPI and remove the autoinstall from the nm-yolov5
GitHub repository that would happen on invocation of the relevant pathways, enabling more predictable environments. (#1518, #1564, #1566)
Transformers training and sparsification pipelines migrated to install from nm-transformers
on PyPI and remove the autoinstall from the nm-transformers
GitHub repository that would happen on invocation of the relevant pathways, enabling more predictable environments. (#1518, #1553, #1564, #1566, #1730)
Deprecated and no longer supported:
sparseml.benchmark
commands and utilities; may be refactored in a future release (#1625)Pydantic version pinned to <2.0 preventing potential issues with untested versions. (#1645)
Automatic link checking added to GitHub actions. (#1525)
ONNX export for MobileBERT results in an exported ONNX model that previously had poor performance in DeepSparse. (#1539)
OpenCV is now installed for image classification pathways when running pip install sparseml[torchvision]
. Before it would crash with a missing dependency error of opencv unless installed. (#1575)
Scipy version dependency issues resolved with scikit-image
which would result in incompatibility errors on install of scikit-image
for computer vision pathways. (#1570)
Transformers export pathways for quantized models addressed where the export would improperly crash and not export for all transformers models. (#1654)
Transformers data support for jsonl files through the question answering pathways was resulting in a JSONDecodeError; these are now loading correctly. (#1667, #1669)
Unit and integration tests updated to remove temporary test files and limit test file creation which were not being properly deleted. (#1609, #1668, #1672, #1696)
Image classification pipelines no longer crash with an extra argument error when using CIFAR10 or CIFAR100 datasets. (#1671)
Published by jeanniefinks about 1 year ago
This is a patch release for 1.5.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
This is a patch release for 1.5.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
This is a patch release for 1.5.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
This is a patch release for 1.5.0 that contains the following changes:
datasets_dir
argument in YOLOv8 training command to address missing args error. (#1620)Published by jeanniefinks over 1 year ago
export NM_DISABLE_ANALYTICS=True
(#1487)pip install sparseml[transformers]
and pip install sparseml[yolov5]
will need to be used.scikit-learn
now replaced with sklearn
to stay current with dependency name changes. (#1294)Published by jeanniefinks over 1 year ago
This is a patch release for 1.4.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
This is a patch release for 1.4.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
This is a patch release for 1.4.0 that contains the following changes:
Published by jeanniefinks over 1 year ago
recipe_template
CLI no longer has improper code documentation, impairing operability. (#1170)Published by jeanniefinks almost 2 years ago
This is a patch release for 1.3.0 that contains the following changes:
Published by jeanniefinks almost 2 years ago
Published by jeanniefinks almost 2 years ago
Published by jeanniefinks about 2 years ago
This is a patch release for 1.1.0 that contains the following changes:
Published by jeanniefinks about 2 years ago
Published by jeanniefinks over 2 years ago
This is a patch release for 1.0.0 that contains the following changes:
Published by jeanniefinks over 2 years ago
onnxruntime
to optional install extra. onnxruntime
no longer a root dependency and will only be imported when using specific pathways.sparseml
src folder for yolov5.--do_train
or --do_eval arguments
were passed in.Published by jeanniefinks over 2 years ago
This is a patch release for 0.12.0 that contains the following changes: