Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.

Trending Repositories

1

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image (uncensored)
🔥🔥🔥
2

kotaemon

An open-source RAG-based tool for chatting with your documents.
🔥
3

swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
📣
4

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
📣
5

crawl4ai

🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
📣
6

sapiens

High-resolution models for human tasks.
⬆️
7

GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
⬆️
8

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
⬆️
9

authentik

The authentication glue you need.
⬆️
10

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
⬆️
11

Liger-Kernel

Efficient Triton Kernels for LLM Training
⬆️
12

manim

Animation engine for explanatory math videos
⬆️
13

supervision

We write your reusable computer vision tools. 💜
⬆️
14

yt-dlp

A feature-rich command-line audio/video downloader
⬆️
15

ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
⬆️
16

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
⬆️
17

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile
⬆️
18

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
⬆️
19

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o
⬆️
20

openfreemap

⬆️
21

13ft

My own custom 12ft.io replacement
⬆️
22

dspy

DSPy: The framework for programming—not prompting—foundation models
⬆️
23

LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
⬆️
24

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system
⬆️
25

OpenBBTerminal

Investment Research for Everyone, Everywhere.
⬆️
26

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
⬆️
27

whisper

Robust Speech Recognition via Large-Scale Weak Supervision
⬆️
28

sherlock

Hunt down social media accounts by username across social networks
⬆️
29

claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
⬆️
30

Python

All Algorithms implemented in Python
⬆️
31

magic-wormhole

get things from one computer to another, safely
⬆️
32

manim

A community-maintained Python framework for creating mathematical animations.
⬆️
33

langchain

⚡ Building applications with LLMs through composability ⚡
⬆️
34

agents

Build real-time multimodal AI applications 🤖🎙️📹
⬆️
35

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
⬆️
36

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
⬆️
37

LitServe

Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
⬆️
38

llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
⬆️
39

paper-qa

LLM Chain for answering questions from documents with citations
⬆️
40

stable-diffusion-webui

Stable Diffusion web UI
⬆️
41

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
⬆️
42

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
⬆️
43

surya

OCR, layout analysis, reading order, line detection in 90+ languages
⬆️
44

weather_landscape

Visualizing Weather Forecasts Through Landscape Imagery
⬆️
45

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
⬆️
46

hackingtool

ALL IN ONE Hacking Tool For Hackers
⬆️
47

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
⬆️
48

posting

The modern API client that lives in your terminal.
⬆️
49

localstack

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
⬆️
50

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
⬆️
51

public-apis

A collective list of free APIs
⬆️
52

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⬆️
53

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
⬆️
54

erpnext

Free and Open Source Enterprise Resource Planning (ERP)
⬆️
55

lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
⬆️
56

marker

Convert PDF to markdown quickly with high accuracy
⬆️
57

stable-fast-3d

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
⬆️
58

torchtitan

A native PyTorch Library for large model training
⬆️
59

Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
⬆️
60

tinystatus

Tiny status page generated by a Python script
⬆️
61

frigate

NVR with realtime local object detection for IP cameras
⬆️
62

opensnitch

OpenSnitch is a GNU/Linux interactive application firewall inspired by Little Snitch.
⬆️
63

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
⬆️
64

LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
⬆️
65

faster-whisper

Faster Whisper transcription with CTranslate2
⬆️
66

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
⬆️
67

LongWriter

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
⬆️
68

parler-tts

Inference and training library for high-quality TTS models.
⬆️
69

llm

Access large language models from the command-line
⬆️
70

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
⬆️
71

gptme

A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities.
⬆️
72

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
⬆️
73

wordfreq

Access a database of word frequencies, in various natural languages.
⬆️
74

metube

Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)
⬆️
75

stable-diffusion-webui-forge

⬆️
76

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
⬆️
77

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
⬆️
78

cupy

NumPy & SciPy for GPU
⬆️
79

roop

one-click face swap
⬆️
80

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production
⬆️
81

NomadNet

Communicate Freely
⬆️
82

pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
⬆️
83

core

🏡 Open source home automation that puts local control and privacy first.
⬆️
84

pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
⬆️
85

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
⬆️
86

streamlit

Streamlit — A faster way to build and share data apps.
⬆️
87

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction
⬆️
88

InstantSplat

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
⬆️
89

litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
⬆️
90

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
⬆️
91

LiveTalking

Real time interactive streaming digital human
⬆️
92

nano-graphrag

A simple, easy-to-hack GraphRAG implementation
⬆️
93

pi-ci

Prepare Raspberry Pi 3, 4 & 5 configurations using a virtual machine.
⬆️
94

sparrow

Data processing with ML and LLM
⬆️
95

instructor

structured outputs for llms
⬆️
96

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
⬆️
97

reflex

🕸️ Web apps in pure Python 🐍
⬆️
98

mitmproxy

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
⬆️
99

Fooocus

Focus on prompting and generating
⬆️
100

ao

PyTorch native quantization and sparsity for training and inference
⬆️