Solve puzzles. Learn CUDA
Running large language models on a single GPU
Open Source Differentiable Computer Vision Library
Standardized Serverless ML Inference Platform on Kubernetes
ReFT: Representation Finetuning for Language Models
Toolkit for conversational AI
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Easily compute clip embeddings and build a clip retrieval system
Making large AI models cheaper, faster and more accessible
Large Language Model Text Generation Inference
A sound cloning tool with a web interface, using your voice
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Voice Recognition to Text Tool
Instill Core is a full-stack AI infrastructure tool for data
Openai style api for open large language models
Data manipulation and transformation for audio signal processing
Sharp Monocular Metric Depth in Less Than a Second
Faster Whisper transcription with CTranslate2
Minimal Python framework for scalable AI inference servers fast
Generative AI reference workflows
Unified Model Serving Framework
Open source AI VTuber platform with voice chat and Live2D avatars
The official repo of Qwen chat & pretrained large language model
Diffusion Transformer with Fine-Grained Chinese Understanding
A unified framework for scalable computing