Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.
Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)kotaemon
An open-source RAG-based tool for chatting with your documents.swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrappersapiens
High-resolution models for human tasks.GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end ModelMinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。authentik
The authentication glue you need.CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)Liger-Kernel
Efficient Triton Kernels for LLM Trainingmanim
Animation engine for explanatory math videossupervision
We write your reusable computer vision tools. 💜yt-dlp
A feature-rich command-line audio/video downloaderml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.torchchat
Run PyTorch LLMs locally on servers, desktop and mobileMiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phonespeech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-oopenfreemap
13ft
My own custom 12ft.io replacementdspy
DSPy: The framework for programming—not prompting—foundation modelsLLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) systemOpenBBTerminal
Investment Research for Everyone, Everywhere.LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)whisper
Robust Speech Recognition via Large-Scale Weak Supervisionsherlock
Hunt down social media accounts by username across social networksclaude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.Python
All Algorithms implemented in Pythonmagic-wormhole
get things from one computer to another, safelymanim
A community-maintained Python framework for creating mathematical animations.langchain
⚡ Building applications with LLMs through composability ⚡agents
Build real-time multimodal AI applications 🤖🎙️📹lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.langflow
⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.paper-qa
LLM Chain for answering questions from documents with citationsstable-diffusion-webui
Stable Diffusion web UIwhisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.surya
OCR, layout analysis, reading order, line detection in 90+ languagesweather_landscape
Visualizing Weather Forecasts Through Landscape Imagerystorm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.hackingtool
ALL IN ONE Hacking Tool For Hackerssystem-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.posting
The modern API client that lives in your terminal.localstack
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offlineposthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.public-apis
A collective list of free APIsTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and productionultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLiteerpnext
Free and Open Source Enterprise Resource Planning (ERP)lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learningmarker
Convert PDF to markdown quickly with high accuracystable-fast-3d
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglementtorchtitan
A native PyTorch Library for large model trainingShow-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.tinystatus
Tiny status page generated by a Python scriptfrigate
NVR with realtime local object detection for IP camerasopensnitch
OpenSnitch is a GNU/Linux interactive application firewall inspired by Little Snitch.transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.faster-whisper
Faster Whisper transcription with CTranslate2gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMsparler-tts
Inference and training library for high-quality TTS models.llm
Access large language models from the command-linesearxng
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.gptme
A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities.haystack
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.wordfreq
Access a database of word frequencies, in various natural languages.metube
Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)stable-diffusion-webui-forge
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programmingcupy
NumPy & SciPy for GPUroop
one-click face swapfastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for productionNomadNet
Communicate Freelypyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音core
🏡 Open source home automation that puts local control and privacy first.pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysisInternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型streamlit
Streamlit — A faster way to build and share data apps.PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content ExtractionInstantSplat
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Secondslitgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838LiveTalking
Real time interactive streaming digital humannano-graphrag
A simple, easy-to-hack GraphRAG implementationpi-ci
Prepare Raspberry Pi 3, 4 & 5 configurations using a virtual machine.sparrow
Data processing with ML and LLMinstructor
structured outputs for llmsdeepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Pythonreflex
🕸️ Web apps in pure Python 🐍mitmproxy
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.Fooocus
Focus on prompting and generatingao
PyTorch native quantization and sparsity for training and inferencefacebookresearch
Python, Jupyter Notebook, C++microsoft
C#, Python, TypeScriptdonnemartin
Python, Java, Ropenai
Python, Jupyter Notebook, JavaScriptpublic-apis
Pythonhuggingface
Python, Jupyter Notebook, RustTheAlgorithms
Python, HTML, TypeScriptvinta
Python, JavaScript, Dockerfilepytorch
Python, C++, Jupyter NotebookPaddlePaddle
Python, C++, Jupyter Notebookjackfrued
Python, HTML, Objective-Ctensorflow
Python, Jupyter Notebook, C++THUDM
Python, Jupyter Notebook, OthersAUTOMATIC1111
C#, Python, C++ytdl-org
Pythongoogle-research
Python, Jupyter Notebook, C++lucidrains
Python, Nim, Otherstiangolo
Python, Shell, Dockerfile521xueweihan
Python, HTML, CSSNVlabs
Python, Jupyter Notebook, C++open-mmlab
Python, Jupyter Notebook, C++pallets
Python, HTMLgoogle-deepmind
Python, Jupyter Notebook, C++psf
Python, HTML, RustLove Open Source and this site? Check out how you can help us