Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.
kotaemon
An open-source RAG-based tool for chatting with your documents.swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & ScrapperGOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Modelsapiens
High-resolution models for human tasks.manim
Animation engine for explanatory math videosDeep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.supervision
We write your reusable computer vision tools. 💜openfreemap
LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.Liger-Kernel
Efficient Triton Kernels for LLM Trainingyt-dlp
A feature-rich command-line audio/video downloadersherlock
Hunt down social media accounts by username across social networksPython
All Algorithms implemented in Pythonminimind
「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!manim
A community-maintained Python framework for creating mathematical animations.MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。agents
Build real-time multimodal AI applications 🤖🎙️📹LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)dspy
DSPy: The framework for programming—not prompting—foundation modelswhisper
Robust Speech Recognition via Large-Scale Weak Supervisionpaper-qa
LLM Chain for answering questions from documents with citationsweather_landscape
Visualizing Weather Forecasts Through Landscape Imageryclaude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.surya
OCR, layout analysis, reading order, line detection in 90+ languagesCogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)hackingtool
ALL IN ONE Hacking Tool For HackersOpenBBTerminal
Investment Research for Everyone, Everywhere.langchain
⚡ Building applications with LLMs through composability ⚡whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLiteerpnext
Free and Open Source Enterprise Resource Planning (ERP)localstack
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offlinetinystatus
Tiny status page generated by a Python scriptstable-diffusion-webui
Stable Diffusion web UIlangflow
⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) systemtorchtitan
A native PyTorch Library for large model traininggptme
A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities.wordfreq
Access a database of word frequencies, in various natural languages.storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionsystem-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.cupy
NumPy & SciPy for GPUposthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!authentik
The authentication glue you need.llm
Access large language models from the command-linelerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learningpygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysisspeech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-odiamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.marker
Convert PDF to markdown quickly with high accuracypublic-apis
A collective list of free APIstransformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phonefaster-whisper
Faster Whisper transcription with CTranslate2spann3r
3D Reconstruction with Spatial Memoryhttpdbg
A very simple tool to debug HTTP(S) client requests.core
🏡 Open source home automation that puts local control and privacy first.ao
PyTorch native quantization and sparsity for training and inferencenanodjango
Run Django models and views from a single file, and convert it to a full project.deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for PythonPaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.frigate
NVR with realtime local object detection for IP camerasfastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for productionlitgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.haystack
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.faceswap
Deepfakes Software For Allyou-get
⏬ Dumb downloader that scrapes the weblinkding
Self-hosted bookmark manager that is designed be to be minimal, fast, and easy to set up using Docker.litecli
CLI for SQLite Databases with auto-completion and syntax highlightingstreamlit
Streamlit — A faster way to build and share data apps.mitmproxy
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...searxng
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.posting
The modern API client that lives in your terminal.Scrapegraph-ai
Python scraper based on AInicegui
Create web-based user interfaces with Python. The nice way.stable-diffusion-webui-forge
videos
Code for the manim-generated scenes used in 3blue1brown videosreflex
🕸️ Web apps in pure Python 🐍diagrams
🎨 Diagram as Code for prototyping cloud system architecturesswift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.fiftyone
The open-source tool for building high-quality datasets and computer vision modelsnanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.machina
OpenCV+YOLO+LLAVA+FAISS powered video surveillance systemMemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programmingvideo2x
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley II, 2018.instructor
structured outputs for llmsfacebookresearch
Python, Jupyter Notebook, C++microsoft
C#, Python, TypeScriptdonnemartin
Python, R, Javaopenai
Python, Jupyter Notebook, TypeScriptpublic-apis
Pythonhuggingface
Python, Jupyter Notebook, RustTheAlgorithms
Python, HTML, TypeScriptvinta
Python, JavaScript, HCLpytorch
Python, C++, Jupyter NotebookPaddlePaddle
Python, C++, Jupyter Notebookjackfrued
Python, HTML, Objective-Ctensorflow
Python, Jupyter Notebook, TypeScriptTHUDM
Python, Jupyter Notebook, OthersAUTOMATIC1111
C#, Python, C++ytdl-org
Pythongoogle-research
Python, Jupyter Notebook, C++lucidrains
Python, Nim, Otherstiangolo
Python, Dockerfile, Shell521xueweihan
Python, HTML, JavaScriptNVlabs
Python, Jupyter Notebook, C++open-mmlab
Python, Jupyter Notebook, C++pallets
Python, HTMLgoogle-deepmind
Python, Jupyter Notebook, C++psf
Python, HTML, TypeScriptLove Open Source and this site? Check out how you can help us