Discover the top trending Python repositories and projects on Github. Explore the latest trends in Python development.

Trending Repositories

1

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
🔥🔥🔥
2

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔥
3

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
🔥
4

torchtune

A Native-PyTorch Library for LLM Fine-tuning
🔥
5

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All
📣
6

QAnything

Question and Answer based on Anything.
📣
7

searxng

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
📣
8

langchain

⚡ Building applications with LLMs through composability ⚡
📣
9

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.
📣
10

WireViz

Easily document cables and wiring harnesses
📣
11

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
📣
12

aiXcoder-7B

official repository of aiXcoder-7B Code Large Language Model
📣
13

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
⬆️
14

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
⬆️
15

dspy

DSPy: The framework for programming—not prompting—foundation models
⬆️
16

stable-diffusion-webui

Stable Diffusion web UI
⬆️
17

MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
⬆️
18

instructor

structured outputs for llms
⬆️
19

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.
⬆️
20

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.
⬆️
21

whisper

Robust Speech Recognition via Large-Scale Weak Supervision
⬆️
22

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
⬆️
23

yt-dlp

A feature-rich command-line audio/video downloader
⬆️
24

public-apis

A collective list of free APIs
⬆️
25

docker-android

Android in docker solution with noVNC supported and video recording
⬆️
26

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
⬆️
27

manim

A community-maintained Python framework for creating mathematical animations.
⬆️
28

PurpleLlama

Set of tools to assess and improve LLM security.
⬆️
29

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
⬆️
30

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
⬆️
31

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
⬆️
32

pyreft

ReFT: Representation Finetuning for Language Models
⬆️
33

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
⬆️
34

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
⬆️
35

attorch

A small subset of PyTorch's neural network modules, written purely in Python using OpenAI's Triton.
⬆️
36

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。
⬆️
37

DeepFaceLive

Real-time face swap for PC streaming or video calls
⬆️
38

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V
⬆️
39

ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
⬆️
40

dust3r

DUSt3R: Geometric 3D Vision Made Easy
⬆️
41

Fooocus

Focus on prompting and generating
⬆️
42

gaussian-opacity-fields

Gaussian Opacity Fields for Efficient and Compact Surface Reconstruction in Unbounded Scenes
⬆️
43

video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
⬆️
44

privateGPT

Interact with your documents using the power of GPT, 100% privately, no data leaks
⬆️
45

chroma

the AI-native open-source embedding database
⬆️
46

surya

OCR and line detection in 90+ languages
⬆️
47

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
⬆️
48

gpt-researcher

GPT based autonomous agent that does online comprehensive research on any given topic
⬆️
49

reflex

🕸 Web apps in pure Python 🐍
⬆️
50

manim

Animation engine for explanatory math videos
⬆️
51

marker

Convert PDF to markdown quickly with high accuracy
⬆️
52

pcb2blender

KiCad to Blender 3D model workflow
⬆️
53

lama-cleaner

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
⬆️
54

lm-evaluation-harness

A framework for few-shot evaluation of language models.
⬆️
55

OmniLMM

Large Multi-modal Models for Strong Performance and Efficient Deployment
⬆️
56

faster-whisper

Faster Whisper transcription with CTranslate2
⬆️
57

qiling

A True Instrumentable Binary Emulation Framework
⬆️
58

sparrow

Data extraction with ML and LLM
⬆️
59

Python

All Algorithms implemented in Python
⬆️
60

llama

Inference code for LLaMA models
⬆️
61

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⬆️
62

text-generation-inference

Large Language Model Text Generation Inference
⬆️
63

mergekit

Tools for merging pretrained large language models.
⬆️
64

rembg

Rembg is a tool to remove images background
⬆️
65

MiniGPT4-video

⬆️
66

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann
⬆️
67

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
⬆️
68

gdb-dashboard

Modular visual interface for GDB in Python
⬆️
69

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
⬆️
70

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.
⬆️
71

gorilla

Gorilla: An API store for LLMs
⬆️
72

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production
⬆️
73

filesystem_spec

A specification that python filesystems should adhere to.
⬆️
74

moondream

tiny vision language model
⬆️
75

flash-attention

Fast and memory-efficient exact attention
⬆️
76

asitop

Perf monitoring CLI tool for Apple Silicon
⬆️
77

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
⬆️
78

chatgpt-on-wechat

基于大模型搭建的微信聊天机器人,同时支持微信、企业微信、公众号、飞书、钉钉接入,可选择GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/通义千问/Gemini/GLM-4/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
⬆️
79

frigate

NVR with realtime local object detection for IP cameras
⬆️
80

trl

Train transformer language models with reinforcement learning.
⬆️
81

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
⬆️
82

streamlit

Streamlit — A faster way to build and share data apps.
⬆️
83

stable-audio-tools

Generative models for conditional audio generation
⬆️
84

generative-models

Generative Models by Stability AI
⬆️
85

pyinfra

pyinfra automates infrastructure super fast at massive scale. It can be used for ad-hoc command execution, service deployment, configuration management and more.
⬆️
86

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".
⬆️
87

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
⬆️
88

codellama

Inference code for CodeLlama models
⬆️
89

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
⬆️
90

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型
⬆️
91

Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
⬆️
92

supervision

We write your reusable computer vision tools. 💜
⬆️
93

typer

Typer, build great CLIs. Easy to code. Based on Python type hints.
⬆️
94

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
⬆️
95

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?
⬆️
96

SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
⬆️
97

WingetUI

WingetUI: A better UI for your package managers
⬆️
98

kitty

Cross-platform, fast, feature-rich, GPU based terminal
⬆️
99

DrissionPage

基于python的网页自动化工具。既能控制浏览器,也能收发数据包。可兼顾浏览器自动化的便利性和requests的高效率。功能强大,内置无数人性化设计和便捷功能。语法简洁而优雅,代码量少。
⬆️
100

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
⬆️