- Beijing
-
10:57
- 8h ahead - alanlee.fun
- @bluekirin93
- https://www.zhihu.com/people/lyjwf1216
Lists (32)
Sort Name ascending (A-Z)
Acceleration
模型加速。Audio/Video
Benchmark
ChatGPT
Collection
各种收集类仓库。Crawler
爬虫和代理池相关。DanBan
Dataset
数据集收集。DataVis
数据可视化工具相关。DL Serving
用于深度学习模型部署的框架等。Download Tool
下载工具。Encoding
Face Cropping
从图片中裁剪出人脸。Font
字体相关。GUI
Hexo Theme
Keyword Extraction
Korean
LanguageDetection
语言检测工具。LLM application dev tool
MacTools
Minimalism
一些极简主义、小巧的工具。Monitor
News
新闻分析相关。OpenIE
Open Information ExtractionPDF Tools
PDF 处理工具。Profiling
Prompt
RAG
Templates
Some useful templates which save your time.TG Bot
TTS
Stars
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Bringing BERT into modernity via both architecture changes and scaling
Convert any git repository into an engaging podcast
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Python APIs for web automation, testing, and bypassing bot-detection.
Curated list of datasets and tools for post-training.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
The fast, Pythonic way to build Model Context Protocol servers 🚀
Webui for using XTTS and for finetuning it
Slightly improved official version for finetune xtts
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
Foundational model for human-like, expressive TTS
Quality News - Towards a fairer ranking formula for Hacker News
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Concatenate a directory full of files into a single prompt for use with LLMs
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
Code for the manim-generated scenes used in 3blue1brown videos
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Inference and training library for high-quality TTS models.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
An open source multi-tool for exploring and publishing data