A python package to build AI-powered real-time audio applications
MIT License
This version fixes a crash caused by a change in matplotlib's API (issue #234).
It also introduces a test suite and CI improvements for development.
Quantized ONNX versions of pyannote/segmentation
and pyannote/embedding
were added under assets/models
to run tests independently from the HuggingFace space.
Full Changelog: https://github.com/juanmc2005/diart/compare/v0.9...v0.9.1
Published by juanmc2005 11 months ago
Major changes in this new version! Including compatibility with pyannote 3.*, SpeechBrain, WeSpeaker and NeMo embedding models, totaling 8 new models to create speaker diarization pipelines and 1 new model for voice activity detection.
This version also adds compatibility with ONNX models and a documentation page at diart.readthedocs.io
Thank you @sorgfresser for your huge contribution in #188 !
Full Changelog: https://github.com/juanmc2005/diart/compare/v0.8...v0.9
Published by juanmc2005 almost 1 year ago
diart.stream
by @juanmc2005 in #183PipelineConfig.from_dict()
by @juanmc2005 in #189Thank you @sneakers-the-rat for your extremely valuable feedback and help as part of the JOSS review!
Full Changelog: https://github.com/juanmc2005/diart/compare/v0.7...v0.8
Published by juanmc2005 over 1 year ago
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.6...v0.7
Published by juanmc2005 almost 2 years ago
cropping_mode
to DelayedAggregation
by @bhigy in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/105
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.5.1...v0.6
Published by juanmc2005 about 2 years ago
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.5...v0.5.1
Published by juanmc2005 about 2 years ago
study_or_path
as a Path for conversion from string by @AMITKESARI2000 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/74
diart.benchmark
when output is provided by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/86
Thank you @AMITKESARI2000, @ckliao-nccu and @zaouk for all the bug fixes!
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.4...v0.5
Published by juanmc2005 over 2 years ago
resolve_features
with TemporalFeatureFormatter
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/59
pyannote.audio
optional (still mandatory to run default pipeline) by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/61
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.3...v0.4
Published by juanmc2005 over 2 years ago
OverlapAwareSpeakerEmbedding
class by @juanmc2005 in #51RealTimeInference
and Benchmark
by @juanmc2005 in #55Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.2.1...v0.3
Published by juanmc2005 almost 3 years ago
buffer_output
causing a crash by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/24
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.2...v0.2.1
Published by juanmc2005 almost 3 years ago
operators.aggregate()
with functional.DelayedAggregation
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/16
DelayedAggregation
by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/18
OutputBuilder
+ better demo performance by @juanmc2005 in https://github.com/juanmc2005/StreamingSpeakerDiarization/pull/20
Full Changelog: https://github.com/juanmc2005/StreamingSpeakerDiarization/compare/v0.1...v0.2
Published by juanmc2005 almost 3 years ago
Initial release!
Thanks to @igordertigor for the package building configuration.