v5.2.0
This release adds TextToSpeech and Cross-Encoder pipelines. The performance of the embeddings.batchtransform
method was significantly improved, enabling a speed up in building semantic graphs. Default configuration is now available for Embeddings, allowing an Embeddings instance to be created with no arguments like Pipelines.
See below for full details on the new features, improvements and bug fixes.
New Features
- Add Cross-Encoder support to Similarity pipeline (#372)
- Create compression package (#376)
- Add TextToSpeech pipeline (#389)
- Add TextToSpeech Notebook (#391)
- Add default configuration for Embeddings (#393)
Improvements
- Filter HF API list models request (#381)
- Split pipeline extras by function area (#387)
- Update data package to handle label arrays (#388)
- Modify transcription pipeline to accept raw waveform data (#390)
- Transcription pipeline improvements (#392)
- Allow searching by embedding (#396)
- Modified logger configuration in init.py (libraries shouldn't modify root logger) - Thank you @adin786! (#397)
- Pass evaluation metrics to underlying Trainer (#398)
- Improve batchtransform performance (#399)
Bug Fixes
- Example 31 - Duplicate image detection not working (#357)
- All sorts of issues with Example 18 - Export and run models with ONNX (#369)
- Fix issue with select distinct bug (#379)
- Update build script and tests to address issues with latest version of FastAPI (#380)
- Fix issue with similar and bracket SQL expressions embedded in functions (#382)
- Fix bug with embeddings functions and application config bug (#400)