Stars
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
🦜🔗 Build context-aware reasoning applications
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
singing voice change based on whisper, and lora for singing voice clone
Stable Diffusion web UI
flutydeer / audio-slicer
Forked from openvpi/audio-slicerA simple GUI application that slices audio with silence detection