Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.

Trending Repositories

1

kotaemon

An open-source RAG-based tool for chatting with your documents.
🔥🔥🔥
2

swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
🔥🔥🔥
3

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
🔥🔥
4

crawl4ai

🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
🔥🔥
5

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
📣
6

sapiens

High-resolution models for human tasks.
📣
7

manim

Animation engine for explanatory math videos
📣
8

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image (uncensored)
📣
9

ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
📣
10

supervision

We write your reusable computer vision tools. 💜
📣
11

openfreemap

📣
12

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
📣
13

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
⬆️
14

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
⬆️
15

Liger-Kernel

Efficient Triton Kernels for LLM Training
⬆️
16

yt-dlp

A feature-rich command-line audio/video downloader
⬆️
17

sherlock

Hunt down social media accounts by username across social networks
⬆️
18

Python

All Algorithms implemented in Python
⬆️
19

minimind

「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
⬆️
20

manim

A community-maintained Python framework for creating mathematical animations.
⬆️
21

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
⬆️
22

agents

Build real-time multimodal AI applications 🤖🎙️📹
⬆️
23

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
⬆️
24

dspy

DSPy: The framework for programming—not prompting—foundation models
⬆️
25

whisper

Robust Speech Recognition via Large-Scale Weak Supervision
⬆️
26

paper-qa

LLM Chain for answering questions from documents with citations
⬆️
27

weather_landscape

Visualizing Weather Forecasts Through Landscape Imagery
⬆️
28

claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
⬆️
29

surya

OCR, layout analysis, reading order, line detection in 90+ languages
⬆️
30

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
⬆️
31

hackingtool

ALL IN ONE Hacking Tool For Hackers
⬆️
32

OpenBBTerminal

Investment Research for Everyone, Everywhere.
⬆️
33

langchain

⚡ Building applications with LLMs through composability ⚡
⬆️
34

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
⬆️
35

LitServe

Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
⬆️
36

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
⬆️
37

erpnext

Free and Open Source Enterprise Resource Planning (ERP)
⬆️
38

localstack

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
⬆️
39

tinystatus

Tiny status page generated by a Python script
⬆️
40

stable-diffusion-webui

Stable Diffusion web UI
⬆️
41

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
⬆️
42

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system
⬆️
43

torchtitan

A native PyTorch Library for large model training
⬆️
44

gptme

A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities.
⬆️
45

wordfreq

Access a database of word frequencies, in various natural languages.
⬆️
46

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
⬆️
47

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⬆️
48

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
⬆️
49

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
⬆️
50

cupy

NumPy & SciPy for GPU
⬆️
51

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
⬆️
52

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
⬆️
53

authentik

The authentication glue you need.
⬆️
54

llm

Access large language models from the command-line
⬆️
55

lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
⬆️
56

pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
⬆️
57

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o
⬆️
58

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.
⬆️
59

marker

Convert PDF to markdown quickly with high accuracy
⬆️
60

public-apis

A collective list of free APIs
⬆️
61

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
⬆️
62

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
⬆️
63

faster-whisper

Faster Whisper transcription with CTranslate2
⬆️
64

spann3r

3D Reconstruction with Spatial Memory
⬆️
65

httpdbg

A very simple tool to debug HTTP(S) client requests.
⬆️
66

core

🏡 Open source home automation that puts local control and privacy first.
⬆️
67

ao

PyTorch native quantization and sparsity for training and inference
⬆️
68

nanodjango

Run Django models and views from a single file, and convert it to a full project.
⬆️
69

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
⬆️
70

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
⬆️
71

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
⬆️
72

frigate

NVR with realtime local object detection for IP cameras
⬆️
73

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production
⬆️
74

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
⬆️
75

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
⬆️
76

LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
⬆️
77

faceswap

Deepfakes Software For All
⬆️
78

you-get

⏬ Dumb downloader that scrapes the web
⬆️
79

linkding

Self-hosted bookmark manager that is designed be to be minimal, fast, and easy to set up using Docker.
⬆️
80

litecli

CLI for SQLite Databases with auto-completion and syntax highlighting
⬆️
81

streamlit

Streamlit — A faster way to build and share data apps.
⬆️
82

mitmproxy

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
⬆️
83

ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
⬆️
84

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
⬆️
85

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
⬆️
86

posting

The modern API client that lives in your terminal.
⬆️
87

Scrapegraph-ai

Python scraper based on AI
⬆️
88

nicegui

Create web-based user interfaces with Python. The nice way.
⬆️
89

stable-diffusion-webui-forge

⬆️
90

videos

Code for the manim-generated scenes used in 3blue1brown videos
⬆️
91

reflex

🕸️ Web apps in pure Python 🐍
⬆️
92

diagrams

🎨 Diagram as Code for prototyping cloud system architectures
⬆️
93

swift-ocr-llm-powered-pdf-to-markdown

An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.
⬆️
94

fiftyone

The open-source tool for building high-quality datasets and computer vision models
⬆️
95

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
⬆️
96

machina

OpenCV+YOLO+LLAVA+FAISS powered video surveillance system
⬆️
97

MemGPT

Letta (fka MemGPT) is a framework for creating stateful LLM services.
⬆️
98

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
⬆️
99

video2x

A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley II, 2018.
⬆️
100

instructor

structured outputs for llms
⬆️