Skip to content

README

工具

Video

  • Open-Sora
    • Open-Sora: Democratizing Efficient Video Production for All

Application

  • chatbox
    • User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
    • 【个人】多端&多模型聊天框架
  • anything-llm
    • The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
    • 这是一个全栈应用程序,可以将任何文档、资源(如网址链接、音频、视频)或内容片段转换为上下文,以便任何大语言模型(LLM)在聊天期间作为参考使用。此应用程序允许您选择使用哪个LLM或向量数据库,同时支持多用户管理并设置不同权限。
  • cherry-studio
    • 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers. Support deepseek-r1
    • 支持多服务商的AI对话框客户端
  • lobe-chat
    • 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
    • 【企业级】多模态交互框架
  • dify
    • Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
    • Dify是一个开源的LLM应用程序开发平台。Dify的直观界面结合了AI工作流程、RAG管道、代理功能、模型管理、可观察性功能等,让您快速从原型到生产。
  • open-webui
    • User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Deploy

  • ollama
    • Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
    • 开始使用Llama 3.3、DeepSeek-R1、Phi-4、Gemma 2和其他大型语言模型。
  • LM Studio
    • 大模型部署工具
    • lms LM Studio CLI
  • lmdeploy
    • LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
    • 大模型部署工具
  • sglang
    • SGLang is a fast serving framework for large language models and vision language models.
    • 大模型部署工具
  • vllm
    • A high-throughput and memory-efficient inference and serving engine for LLMs
    • 大模型部署工具
  • ktransformers
    • A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

集群

  • exo
    • Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

分布式

  • ColossalAI
    • Making large AI models cheaper, faster and more accessible
    • 分布式训练和推理框架

RAG

  • ragflow
    • 基于深度文档理解构建的开源 RAG(Retrieval-Augmented Generation)引擎。RAGFlow 可以为各种规模的企业及个人提供一套精简的 RAG 工作流程,结合大语言模型(LLM)针对用户各类不同的复杂格式数据提供可靠的问答以及有理有据的引用
    • 【个人】轻量级便捷RAG 推理框架
  • kotaemon
    • An open-source RAG-based tool for chatting with your documents.
    • 【企业级】RAG&GraphRAG 知识库对话框架
  • open-webui
    • User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
    • 【企业级】RAG&Agent 多功能综合前端框架

微调

  • unsloth
    • Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
  • LLaMA-Factory
    • Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
  • ms-SWIFT
    • Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

模型社区

向量数据库

  • milvus
    • Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search