Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.

Trending Repositories

1

you-get

⏬ Dumb downloader that scrapes the web
🔥🔥🔥
2

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
🔥🔥🔥
3

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔥🔥
4

pgai

Bring AI models closer to your PostgreSQL data
🔥🔥
5

kotaemon

An open-source RAG-based tool for chatting with your documents.
🔥
6

dspy

DSPy: The framework for programming—not prompting—foundation models
🔥
7

changedetection.io

The best and simplest free open source website change detection, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monitor which websites had a text change for free. Free Open source web page change detection, Website defacement monitoring, Price change and Price Drop notification
🔥
8

videos

Code for the manim-generated scenes used in 3blue1brown videos
📣
9

mac_computer_use

📣
10

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
📣
11

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
⬆️
12

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
⬆️
13

core

🏡 Open source home automation that puts local control and privacy first.
⬆️
14

stable-diffusion-webui

Stable Diffusion web UI
⬆️
15

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
⬆️
16

crawl4ai

🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
⬆️
17

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
⬆️
18

backtrader

Python Backtesting library for trading strategies
⬆️
19

manim

A community-maintained Python framework for creating mathematical animations.
⬆️
20

whisper

Robust Speech Recognition via Large-Scale Weak Supervision
⬆️
21

manim

Animation engine for explanatory math videos
⬆️
22

podcastfy

An Open Source alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
⬆️
23

OpenBBTerminal

Investment Research for Everyone, Everywhere.
⬆️
24

waydroid

Waydroid uses a container-based approach to boot a full Android system on a regular GNU/Linux system like Ubuntu.
⬆️
25

open_clip

An open source implementation of CLIP.
⬆️
26

IC-Light

More relighting!
⬆️
27

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
⬆️
28

parler-tts

Inference and training library for high-quality TTS models.
⬆️
29

speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o
⬆️
30

pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
⬆️
31

yt-dlp

A feature-rich command-line audio/video downloader
⬆️
32

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
⬆️
33

langchain

⚡ Building applications with LLMs through composability ⚡
⬆️
34

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
⬆️
35

stable-diffusion-webui-forge

⬆️
36

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
⬆️
37

torchtitan

A native PyTorch Library for large model training
⬆️
38

swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
⬆️
39

sclg

The Scene Language: Representing Scenes with Programs, Words, and Embeddings (arXiv preprint)
⬆️
40

HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
⬆️
41

activitywatch

The best free and open-source automated time tracker. Cross-platform, extensible, privacy-focused.
⬆️
42

surya

OCR, layout analysis, reading order, line detection in 90+ languages
⬆️
43

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
⬆️
44

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production
⬆️
45

typer

Typer, build great CLIs. Easy to code. Based on Python type hints.
⬆️
46

cumulative-table-design

This repository helps teach people how to correctly define and create cumulative tables!
⬆️
47

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
⬆️
48

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
⬆️
49

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
⬆️
50

otterwiki

A minimalistic wiki powered by python, markdown and git.
⬆️
51

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
⬆️
52

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
⬆️
53

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
⬆️
54

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
⬆️
55

kitty

Cross-platform, fast, feature-rich, GPU based terminal
⬆️
56

pytest

The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
⬆️
57

frigate

NVR with realtime local object detection for IP cameras
⬆️
58

airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
⬆️
59

faster-whisper

Faster Whisper transcription with CTranslate2
⬆️
60

nano-graphrag

A simple, easy-to-hack GraphRAG implementation
⬆️
61

pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
⬆️
62

sapiens

High-resolution models for human tasks.
⬆️
63

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
⬆️
64

mlflow

Open source platform for the machine learning lifecycle
⬆️
65

qlib

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.
⬆️
66

peewee

a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
⬆️
67

CTFd

CTFs as you need them
⬆️
68

textual

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
⬆️
69

BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
⬆️
70

openfreemap

⬆️
71

speechbrain

A PyTorch-based Speech Toolkit
⬆️
72

python-dotenv

Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
⬆️
73

sd-webui-controlnet

WebUI extension for ControlNet
⬆️
74

chatgpt-on-wechat

基于大模型搭建的微信聊天机器人,同时支持微信、企业微信、公众号、飞书、钉钉接入,可选择GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/通义千问/Gemini/GLM-4/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
⬆️
75

signified

⬆️
76

buenavista

A Postgres Proxy Server in Python
⬆️
77

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
⬆️
78

viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech
⬆️
79

subsets

Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.
⬆️
80

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
⬆️
81

opensips-ai-voice-connector-ce

OpenSIPS AI Voice Connector Community Edition Platform
⬆️
82

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)
⬆️
83

openai-python

The official Python library for the OpenAI API
⬆️
84

posting

The modern API client that lives in your terminal.
⬆️
85

deREferencing

IDA Pro plugin that implements more user-friendly register and stack views
⬆️
86

Mora

Mora: More like Sora for Generalist Video Generation
⬆️
87

s3-connector-for-pytorch

The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
⬆️
88

MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
⬆️
89

gptme

A CLI and web UI to interact with LLMs in a Chat-style interface, with code execution capabilities.
⬆️
90

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⬆️
91

streaming

A Data Streaming Library for Efficient Neural Network Training
⬆️
92

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
⬆️
93

locust

Write scalable load tests in plain Python 🚗💨
⬆️
94

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
⬆️
95

diagrams

🎨 Diagram as Code for prototyping cloud system architectures
⬆️
96

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.
⬆️
97

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
⬆️
98

TV

TVBox、影视仓、影迷配置文件,自动更新,可自定义配置,可用于IPTV电视,自定义频道菜单,定时自动更新模板直播源接口,过滤ipv4/ipv6接口,按响应时间、分辨率综合排序,保证时效性 | Auto-updating TVBox configs, customizable for IPTV, custom channel menus, scheduled live source updates, filters ipv4/ipv6 interfaces, sorts by response time, resolution, ensures timeliness
⬆️
99

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration
⬆️
100

pyxel

A retro game engine for Python
⬆️