Awesome Python

Admin Panels

  • ajenti ajenti 7,346
    star
    updated 28 days ago MIT License

    Ajenti Core and stock plugins

  • updated over 1 year ago GNU Affero Genera...

    Modern responsive template for the Django admin interface with improved functionality. We are proud to announce completely new Jet. Please check out Live Demo

  • updated 3 months ago BSD 3-Clause "New...

    Simple and extensible administrative interface framework for Flask

  • flower flower 6,049
    star
    updated about 2 months ago Other

    Real-time monitor and web admin for Celery distributed task queue

  • updated 8 months ago MIT License

    Jet Admin โ€“ No-code Business App builder

  • xadmin xadmin 4,739
    star
    updated 3 months ago BSD 3-Clause "New...

    Drop-in replacement of Django admin comes with lots of goodies, fully extensible with plugin support, pretty UI based on Twitter Bootstrap.

Algorithms and Design Patterns

  • algorithms algorithms 23,459
    star
    updated 20 days ago MIT License

    Minimal examples of data structures and algorithms in Python

  • updated about 2 months ago

    A curated list of awesome places to learn and/or practice algorithms.

  • PyPattyrn PyPattyrn 2,176
    star
    updated over 1 year ago MIT License

    A simple library for implementing common design patterns.

  • Python Python 176,952
    star
    updated 8 days ago MIT License

    All Algorithms implemented in Python

  • python-ds python-ds 2,649
    star
    updated over 1 year ago MIT License

    No non-sense and no BS repo for how data structure code should be in Python - simple and elegant.

  • updated 3 months ago

    A collection of design patterns/idioms in Python

  • updated 7 months ago Other

    Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set

  • updated 3 months ago MIT License

    A lightweight, object-oriented finite state machine implementation in Python with many extensions

ASGI Servers

  • daphne daphne 2,234
    star
    updated about 2 months ago BSD 3-Clause "New...

    Django Channels HTTP/WebSocket server

  • uvicorn uvicorn 7,411
    star
    updated 3 months ago BSD 3-Clause "New...

    An ASGI web server, for Python. ๐Ÿฆ„

Asynchronous Programming

  • updated 12 months ago Other

    A curated list of awesome Python asyncio frameworks, libraries, software and resources

  • trio trio 5,810
    star
    updated 12 days ago Other

    Trio โ€“ a friendly Python library for async concurrency and I/O

  • uvloop uvloop 9,912
    star
    updated 15 days ago Apache License 2.0

    Ultra fast asyncio event loop.

Audio

  • updated 25 days ago MIT License

    cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

  • beets beets 12,317
    star
    updated 12 days ago MIT License

    music library manager and MusicBrainz tagger

  • dejavu dejavu 6,268
    star
    updated 7 months ago MIT License

    Audio fingerprinting and recognition in Python

  • eyeD3 eyeD3 504
    star
    updated 7 months ago GNU General Publi...

    eyeD3 is a Python module and command line program for processing ID3 tags. Information about mp3 files (i.e bit rate, sample frequency, play time, etc.) is also provided. The formats supported are ID3v1 (1.0/1.1) and ID3v2 (2.3/2.4).

  • kapre kapre 907
    star
    updated 5 months ago MIT License

    kapre: Keras Audio Preprocessors

  • librosa librosa 6,354
    star
    updated 4 months ago ISC License

    Python library for audio and music analysis

  • updated 7 months ago GNU General Publi...

    ๐ŸŽš๏ธ Open Source Audio Matching and Mastering

  • mutagen mutagen 1,422
    star
    updated about 2 months ago GNU General Publi...

    Python module for handling audio metadata

  • updated 5 months ago Apache License 2.0

    Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

  • pydub pydub 8,125
    star
    updated 2 months ago MIT License

    Manipulate audio with a simple and easy high level interface

  • updated 4 months ago MIT License

    Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA, Wave and AIFF files with python 2 or 3

Authentication

  • authlib authlib 3,619
    star
    updated 11 months ago BSD 3-Clause "New...

    The ultimate Python library in building OAuth, OpenID Connect clients and servers. JWS,JWE,JWK,JWA,JWT included.

  • updated 7 months ago MIT License

    Integrated set of Django applications addressing authentication, registration, account management as well as 3rd party (social) account authentication.

  • pyjwt pyjwt 4,760
    star
    updated 4 months ago MIT License

    JSON Web Token implementation in Python

  • updated 3 months ago MIT License

    A JOSE implementation in Python

  • updated 5 months ago MIT License

    Python module for generating and verifying JSON Web Tokens

  • updated almost 2 years ago MIT License

    A fully tested, abstract interface to creating OAuth clients and servers.

  • updated over 1 year ago BSD 3-Clause "New...

    Social auth made simple

Build Tools

  • updated 4 months ago Apache License 2.0

    Your Gateway to Embedded Software Development Excellence ๐Ÿ‘ฝ

  • pybuilder pybuilder 1,616
    star
    updated about 1 month ago Apache License 2.0

    Software build automation tool for Python.

Built-in Classes Enhancement

  • attrs attrs 5,051
    star
    updated 10 days ago MIT License

    Python Classes Without Boilerplate

  • bidict bidict 1,374
    star
    updated 2 months ago Mozilla Public Li...

    The bidirectional mapping library for Python.

  • Box Box 2,267
    star
    updated 7 months ago MIT License

    Python dictionaries with advanced dot notation access

  • updated about 1 year ago MIT License

    Python library that provides a method of accessing lists and dicts with a dotted path notation.

Caching

  • beaker beaker 511
    star
    updated 23 days ago Other

    WSGI middleware for sessions and caching

  • updated about 1 year ago BSD 3-Clause "New...

    Automatic caching and invalidation for Django models through the ORM.

  • updated 10 months ago BSD 3-Clause "New...

    A slick ORM cache with automatic granular event-driven invalidation.

  • updated 6 months ago BSD 3-Clause "New...

    A Python wrapper around the libmemcached interface from TangentOrg.

ChatGPT and LLMs

  • updated over 1 year ago MIT License

    Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"

  • updated about 2 years ago BSD 3-Clause "New...
  • gpt-2 gpt-2 20,844
    star
    updated 2 months ago Other

    Code for the paper "Language Models are Unsupervised Multitask Learners"

  • updated over 4 years ago MIT License

    Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

  • updated 4 months ago MIT License

    Specify what you want it to build, the AI asks for clarification, and then builds it.

  • pdfGPT pdfGPT 6,485
    star
    updated 2 months ago MIT License

    PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!

  • PPLM PPLM 1,102
    star
    updated about 1 month ago Apache License 2.0

    Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

  • updated 3 days ago Apache License 2.0

    Multilingual Sentence & Image Embeddings with BERT

  • updated 9 months ago Apache License 2.0

    Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

  • sockeye sockeye 1,181
    star
    updated 9 months ago Apache License 2.0

    Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

  • texar texar 2,381
    star
    updated over 2 years ago Apache License 2.0

    Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

  • updated 2 months ago Apache License 2.0

    Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

  • XLM XLM 2,763
    star
    updated 10 months ago Other

    PyTorch original implementation of Cross-lingual Language Model Pretraining.

ChatOps Tools

  • errbot errbot 3,047
    star
    updated about 2 months ago GNU General Publi...

    Errbot is a chatbot, a daemon that connects to your favorite chat service and bring your tools and some fun into the conversation.

CMS

  • updated 3 months ago BSD 3-Clause "New...

    A Django-based CMS with a focus on extensibility and concise code

  • indico indico 1,599
    star
    updated about 2 months ago MIT License

    Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.

  • Kotti Kotti 410
    star
    updated 6 months ago Other

    Kotti is a high-level, Pythonic web application framework based on Pyramid and SQLAlchemy. It includes an extensible Content Management System called the Kotti CMS.

  • mezzanine mezzanine 4,701
    star
    updated 3 months ago BSD 2-Clause "Sim...

    CMS framework for Django

Code Analysis

  • updated about 2 months ago Other

    :octocat: A curated awesome list of flake8 extensions. Feel free to contribute! ๐ŸŽ“

  • updated 4 months ago Creative Commons ...

    Collection of awesome Python types, stubs, plugins, and tools to work with them.

  • coala coala 3,509
    star
    updated over 1 year ago GNU Affero Genera...

    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.

  • code2flow code2flow 3,574
    star
    updated 11 months ago MIT License

    Pretty good call graphs for dynamic languages

  • updated 10 months ago Other

    A Python library that generates static type annotations by collecting runtime types

  • updated about 1 month ago GNU General Publi...

    Inspects Python source files and provides information about type and location of classes, methods etc

  • updated over 5 years ago GNU General Publi...

    pycallgraph is a Python module that creates call graphs for Python programs.

  • pylama pylama 1,033
    star
    updated 5 months ago MIT License

    Code audit tool for python.

  • updated about 2 months ago MIT License

    Performant type-checking for python.

  • pytype pytype 4,454
    star
    updated 3 months ago Other

    A static type analyzer for Python code

  • typeshed typeshed 4,002
    star
    updated 12 days ago Other

    Collection of library stubs for Python, with static types

  • vulture vulture 2,930
    star
    updated about 1 month ago MIT License

    Find dead Python code

  • updated 2 months ago MIT License

    The strictest and most opinionated python linter ever!

  • yapf yapf 13,560
    star
    updated 2 months ago Apache License 2.0

    A formatter for Python files

Code Coverage

  • updated 12 months ago Apache License 2.0

    Python report uploader for Codecov

  • updated 10 days ago Apache License 2.0

    The code coverage tool for Python

  • updated about 1 month ago Other

    A Django plugin for pytest.

  • updated about 1 month ago MIT License

    extracted --boxed from pytest-xdist to ensure backward compat

  • updated about 1 month ago MIT License

    Thin-wrapper around the mock package for easier use with pytest

  • updated about 1 month ago MIT License

    ๐ŸŽฒ Pytest plugin to randomly order tests and control random.seed

  • updated about 1 month ago Other

    a pytest plugin that re-runs failed tests up to -n times to eliminate flakey failures

  • updated about 2 months ago MIT License
  • updated about 1 month ago MIT License

    pytest plugin for distributed testing and loop-on-failures testing modes.

  • updated about 1 month ago MIT License

    pytest external process plugin

Command-line Interface Development

  • updated 10 months ago MIT License

    A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

  • updated 4 months ago Apache License 2.0

    A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations

  • updated almost 2 years ago MIT License

    plotting in the terminal

  • python-fire python-fire 26,112
    star
    updated 14 days ago Other

    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

  • rich rich 46,695
    star
    updated 17 days ago MIT License

    Rich is a Python library for rich text and beautiful formatting in the terminal.

  • tqdm tqdm 26,439
    star
    updated 4 months ago Other

    โšก A Fast, Extensible Progress Bar for Python and CLI

Command-line Tools

  • aws-cli aws-cli 14,304
    star
    updated 6 months ago Other

    Universal Command Line Interface for Amazon Web Services

  • doitlive doitlive 3,387
    star
    updated about 1 month ago MIT License

    Because sometimes you need to do it live

  • howdoi howdoi 10,202
    star
    updated 8 months ago MIT License

    instant coding answers via the command line

  • iredis iredis 2,468
    star
    updated about 1 month ago BSD 3-Clause "New...

    Interactive Redis: A Terminal Client for Redis with AutoCompletion and Syntax Highlighting.

  • updated over 1 year ago Apache License 2.0

    Kubernetes shell: An integrated shell for working with the Kubernetes

  • litecli litecli 1,986
    star
    updated 4 months ago BSD 3-Clause "New...

    CLI for SQLite Databases with auto-completion and syntax highlighting

  • mycli mycli 11,182
    star
    updated 2 months ago Other

    A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.

  • updated 3 months ago MIT License

    PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents you with a nice UI to select which files you're interested in. After that you can open them in your favorite editor or execute arbitrary commands.

  • percol percol 3,227
    star
    updated almost 2 years ago

    adds flavor of interactive filtering to the traditional pipe concept of UNIX shell

  • pgcli pgcli 11,553
    star
    updated about 2 months ago BSD 3-Clause "New...

    Postgres CLI with autocompletion and syntax highlighting

  • saws saws 5,178
    star
    updated about 2 years ago Other

    A supercharged AWS command line interface (CLI).

  • thefuck thefuck 80,068
    star
    updated 5 months ago MIT License

    Magnificent app which corrects your previous console command.

  • tmux tmux 32,442
    star
    updated 13 days ago Other

    tmux source code

  • try try 702
    star
    updated almost 2 years ago MIT License

    Dead simple CLI tool to try Python packages - It's never been easier! ๐Ÿ“ฆ

Compatibility

  • updated 5 months ago Other

    Modernizes Python code for eventual Python 3 migration. Built on top of fissix (a fork of lib2to3)

Computer Vision

  • updated 3 months ago MIT License

    The world's simplest facial recognition api for Python and the command line

  • kornia kornia 9,166
    star
    updated 16 days ago Apache License 2.0

    Geometric Computer Vision Library for Spatial AI

  • updated about 1 month ago Apache License 2.0

    A Python wrapper for Google Tesseract

  • SimpleCV SimpleCV 2,653
    star
    updated about 1 year ago BSD 3-Clause "New...

    The Open Source Framework for Machine Vision

  • tesserocr tesserocr 1,898
    star
    updated 6 months ago MIT License

    A Python wrapper for the tesseract-ocr API

Concurrency and Parallelism

  • greenlet greenlet 1,564
    star
    updated 3 months ago Other

    Lightweight in-process concurrent programming

  • scoop scoop 612
    star
    updated about 1 year ago GNU Lesser Genera...

    SCOOP (Scalable COncurrent Operations in Python)

  • uvloop uvloop 9,912
    star
    updated 15 days ago Apache License 2.0

    Ultra fast asyncio event loop.

Configuration

  • updated 2 months ago Other

    Python 3+ compatible port of the configobj library

  • hydra hydra 8,044
    star
    updated 23 days ago MIT License

    Hydra is a framework for elegantly configuring complex applications

  • updated 2 months ago MIT License

    Strict separation of config from code.

Crypto

  • updated 7 months ago MIT License

    Ethereum ABI utilities for python

  • updated about 2 months ago MIT License

    Account abstraction library for web3.py

  • updated 3 months ago MIT License

    The Ethereum hashing function, keccak256, sometimes (erroneously) called sha256 or sha3

  • updated 3 months ago MIT License

    A common API for Ethereum key operations.

  • updated 2 months ago MIT License

    Tool suite for testing ethereum applications.

  • updated 16 days ago MIT License

    Python types for type hinting commonly used ethereum types

  • updated 2 months ago MIT License

    Utility functions for working with ethereum related codebases.

  • py-evm py-evm 2,172
    star
    updated 13 days ago MIT License

    A Python implementation of the Ethereum Virtual Machine

  • web3.py web3.py 4,701
    star
    updated 12 days ago MIT License

    A python interface for interacting with the Ethereum blockchain and ecosystem.

Cryptography

  • pynacl pynacl 989
    star
    updated 7 months ago Apache License 2.0

    Python binding to the Networking and Cryptography (NaCl) library

Data

  • plotly.py plotly.py 15,052
    star
    updated 6 days ago MIT License

    The interactive graphing library for Python โœจ This project now includes Plotly Express!

  • updated 5 months ago Apache License 2.0

    StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and alerting logic you define.

Data Analysis

  • blaze blaze 3,180
    star
    updated 6 months ago BSD 3-Clause "New...

    NumPy and Pandas interface to Big Data

  • mining mining 1,255
    star
    updated over 7 years ago MIT License

    Business Intelligence (BI) in Python, OLAP

Database

  • chroma chroma 10,591
    star
    updated 2 months ago Apache License 2.0

    the AI-native open-source embedding database

  • updated 11 months ago BSD 3-Clause "New...

    pickleDB is an open source key-value store using Python's json module.

  • tinydb tinydb 6,370
    star
    updated 2 months ago MIT License

    TinyDB is a lightweight document oriented database optimized for your happiness :)

  • ZODB ZODB 624
    star
    updated 7 months ago Other

    Python object-oriented database

Database Drivers

  • apsw apsw 689
    star
    updated 11 days ago Other

    Another Python SQLite wrapper

  • updated 4 months ago Creative Commons ...

    A curated list of awesome PostgreSQL software, libraries, tools and resources, inspired by awesome-mysql

  • updated about 1 month ago Creative Commons ...

    A collection of awesome sqlite tools, scripts, books, etc

  • updated about 1 month ago Other

    ClickHouse Python Driver with native interface support

  • updated 2 months ago Apache License 2.0

    PyMongo - the Official MongoDB Python driver

  • motor motor 2,275
    star
    updated 2 months ago Apache License 2.0

    Motor - the async Python driver for MongoDB and Tornado or asyncio

  • PyMySQL PyMySQL 7,495
    star
    updated about 2 months ago MIT License

    MySQL client library for Python

  • updated 3 months ago Apache License 2.0

    DataStax Python Driver for Apache Cassandra

  • updated over 2 years ago BSD 3-Clause "New...

    PostgreSQL database access simplified

  • updated almost 4 years ago MIT License

    A supercharged SQLite library for Python

Data Validation

  • cerberus cerberus 3,058
    star
    updated 5 months ago ISC License

    Lightweight, extensible data validation library for Python

  • updated 7 months ago MIT License

    An implementation of the JSON Schema specification for Python

  • schema schema 2,827
    star
    updated 3 months ago MIT License

    Schema validation just got Pythonic

  • updated 9 months ago Other

    Python Data Structures for Humansโ„ข.

  • updated about 1 year ago MIT License

    Lightweight data validation and adaptation Python library.

  • updated 2 months ago BSD 3-Clause "New...

    CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library.

Data Visualization

  • altair altair 8,827
    star
    updated 12 days ago BSD 3-Clause "New...

    Declarative statistical visualization library for Python

  • bokeh bokeh 18,591
    star
    updated about 1 month ago BSD 3-Clause "New...

    Interactive Data Visualization in the browser, from Python

  • cartopy cartopy 1,329
    star
    updated about 2 months ago BSD 3-Clause "New...

    Cartopy - a cartographic python library with matplotlib support

  • chartify chartify 3,447
    star
    updated 3 months ago Apache License 2.0

    Python library that makes it easy for data scientists to create charts.

  • diagrams diagrams 33,942
    star
    updated 2 months ago MIT License

    ๐ŸŽจ Diagram as Code for prototyping cloud system architectures

  • plotnine plotnine 3,759
    star
    updated 13 days ago MIT License

    A Grammar of Graphics for Python

  • seaborn seaborn 11,646
    star
    updated about 2 months ago BSD 3-Clause "New...

    Statistical data visualization in Python

  • vispy vispy 3,193
    star
    updated about 1 month ago Other

    Main repository for Vispy

Date and Time

  • updated over 4 years ago MIT License

    A Python 3 library for parsing human-written times and dates

  • dateutil dateutil 2,189
    star
    updated 3 months ago Other

    Useful extensions to the standard Python datetime features

  • delorean delorean 1,845
    star
    updated about 1 year ago MIT License

    Delorean: Time Travel Made Easy

  • maya maya 3,399
    star
    updated over 1 year ago MIT License

    Datetimes for Humansโ„ข

  • moment moment 724
    star
    updated 11 months ago

    A Python library for dealing with dates

  • pendulum pendulum 5,961
    star
    updated about 2 months ago MIT License

    Python datetimes made easy

  • PyTime PyTime 155
    star
    updated over 1 year ago MIT License

    PyTime is an easy-use Python module which aims to operate date/time/datetime by string.

  • updated over 4 years ago BSD 3-Clause "New...

    Friendly Python Dates

Debugging

  • updated 4 months ago Apache License 2.0

    Python Cloud Debugger

  • ptvsd ptvsd 536
    star
    updated over 3 years ago Other

    Python debugger package for use with Visual Studio and Visual Studio Code.

  • pytest pytest 11,150
    star
    updated about 1 month ago MIT License

    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

  • updated 10 days ago MIT License

    Coverage plugin for pytest.

  • updated 16 days ago Other

    Plugin for generating HTML reports for pytest results

  • updated about 1 month ago MIT License

    Thin-wrapper around the mock package for easier use with pytest

  • updated about 1 month ago MIT License

    pytest plugin for distributed testing and loop-on-failures testing modes.

  • werkzeug werkzeug 6,518
    star
    updated 19 days ago BSD 3-Clause "New...

    The comprehensive WSGI web application library.

Debugging Tools

  • updated 16 days ago BSD 3-Clause "New...

    A configurable set of panels that display various debug information about the current request/response.

  • updated about 4 years ago BSD 3-Clause "New...

    A drop-in replacement for Django's runserver.

  • icecream icecream 8,227
    star
    updated about 1 month ago MIT License

    ๐Ÿฆ Never use print() to debug again.

  • ipdb ipdb 1,802
    star
    updated 8 months ago BSD 3-Clause "New...

    Integration of IPython pdb

  • updated over 2 years ago Other

    (OLD REPO) Line-by-line profiling for Python - Current repo ->

  • updated over 5 years ago GNU General Publi...

    Trace any Python program, anywhere!

  • pudb pudb 2,778
    star
    updated 5 months ago Other

    Full-screen console debugger for Python

  • py-spy py-spy 10,621
    star
    updated 9 months ago MIT License

    Sampling profiler for Python programs

  • updated 2 months ago Other

    Parsing ELF and DWARF in Python

  • pyringe pyringe 1,638
    star
    updated over 4 years ago Other

    Debugger capable of attaching to and injecting code into python processes.

  • updated 11 months ago BSD 2-Clause "Sim...

    Hunter is a flexible code tracing toolkit.

  • updated over 1 year ago BSD 2-Clause "Sim...

    Debugging manhole for python applications.

  • vprof vprof 3,939
    star
    updated over 1 year ago BSD 2-Clause "Sim...

    Visual profiler for Python

  • wdb wdb 1,571
    star
    updated over 1 year ago GNU General Publi...

    An improbable web debugger through WebSockets

Deep Learning

  • caffe caffe 33,781
    star
    updated about 1 month ago Other

    Caffe: a fast open framework for deep learning.

  • keras keras 60,597
    star
    updated 19 days ago Apache License 2.0

    Deep Learning for humans

  • pytorch pytorch 76,980
    star
    updated 1 day ago Other

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

  • pytorch3d pytorch3d 7,322
    star
    updated 10 months ago Other

    PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

  • SerpentAI SerpentAI 6,661
    star
    updated over 1 year ago MIT License

    Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

  • tensorflow tensorflow 181,486
    star
    updated 6 days ago Apache License 2.0

    An Open Source Machine Learning Framework for Everyone

  • Theano Theano 9,840
    star
    updated 2 months ago Other

    Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor

  • TTS TTS 28,027
    star
    updated 13 days ago Mozilla Public Li...

    ๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

DevOps Tools

  • ansible ansible 58,550
    star
    updated 6 months ago GNU General Publi...

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

  • cuisine cuisine 1,260
    star
    updated about 1 year ago Other

    Chef-like functionality for Fabric

  • fabric fabric 14,474
    star
    updated 2 months ago BSD 2-Clause "Sim...

    Simple, Pythonic remote execution and deployment.

  • fabtools fabtools 1,252
    star
    updated over 4 years ago BSD 2-Clause "Sim...

    Tools for writing awesome Fabric files

  • foreman foreman 5,890
    star
    updated 11 months ago MIT License

    Manage Procfile-based applications

  • honcho honcho 1,569
    star
    updated 2 months ago MIT License

    Honcho: a python clone of Foreman. For managing Procfile-based applications.

  • psutil psutil 9,865
    star
    updated 13 days ago BSD 3-Clause "New...

    Cross-platform lib for process and system monitoring in Python

  • pyinfra pyinfra 2,400
    star
    updated 6 months ago MIT License

    pyinfra automates infrastructure super fast at massive scale. It can be used for ad-hoc command execution, service deployment, configuration management and more.

  • salt salt 13,741
    star
    updated about 1 month ago Apache License 2.0

    Software to automate the management and configuration of any infrastructure or application at scale. Get access to the Salt software package repository here:

  • updated 3 months ago Other

    Supervisor process control system for Unix (supervisord)

Diffusion Text to Image

  • updated about 6 years ago MIT License

    Text to image synthesis using thought vectors

Distributed Computing

  • dask dask 11,912
    star
    updated 4 days ago BSD 3-Clause "New...

    Parallel computing with task scheduling

  • faust faust 6,668
    star
    updated 4 months ago Other

    Python Stream Processing

  • luigi luigi 17,089
    star
    updated 2 months ago Apache License 2.0

    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

  • mrjob mrjob 2,605
    star
    updated about 1 year ago Other

    Run MapReduce jobs on Hadoop or Amazon Web Services

  • ray ray 29,950
    star
    updated about 1 month ago Apache License 2.0

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

  • updated 7 months ago Apache License 2.0

    Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.

Distribution

  • updated 11 months ago GNU General Publi...

    Python virtualenvs in Debian packages

  • pyarmor pyarmor 2,763
    star
    updated about 2 months ago Other

    A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.

  • pyinstaller pyinstaller 10,970
    star
    updated 2 months ago Other

    Freeze (package) Python programs into stand-alone executables

  • shiv shiv 1,667
    star
    updated about 1 month ago BSD 2-Clause "Sim...

    shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.

Documentation

  • updated 7 months ago Other

    A curated list of awesome tools for Sphinx Python Documentation Generator

  • pdoc pdoc 1,624
    star
    updated 7 months ago The Unlicense

    API Documentation for Python Projects

  • pycco pycco 839
    star
    updated 10 months ago Other

    Literate-style documentation generator.

  • sphinx sphinx 5,754
    star
    updated 5 months ago Other

    The Sphinx documentation generator

Downloader

  • s3cmd s3cmd 4,359
    star
    updated 3 months ago GNU General Publi...

    Official s3cmd repo -- Command line tool for managing Amazon S3 and CloudFront services

  • s4cmd s4cmd 1,322
    star
    updated 8 months ago Apache License 2.0

    Super S3 command line tool

E-commerce

  • alipay alipay 326
    star
    updated over 6 years ago

    An Unofficial Alipay API for Python

  • updated about 1 month ago BSD 2-Clause "Sim...

    Ecommerce for Mezzanine

  • updated 5 months ago BSD 3-Clause "New...

    A Django based shop system

  • updated 7 months ago MIT License

    Foreign exchange rates, Bitcoin price index and currency conversion using ratesapi.io

  • merchant merchant 1,012
    star
    updated over 1 year ago BSD 3-Clause "New...

    A Django app to accept payments from various payment processors via Pluggable backends.

  • money money 227
    star
    updated 12 months ago MIT License

    Python money class with optional CLDR-backed locale-aware formatting and an extensible currency exchange solution.

  • updated over 2 years ago GNU General Publi...

    Display money format and its filthy currencies, for all money lovers out there.

Email

  • flanker flanker 1,611
    star
    updated 4 months ago Apache License 2.0

    Python email address and Mime parsing library

  • imbox imbox 1,161
    star
    updated 4 months ago MIT License

    Python IMAP for Human beings

  • mailer mailer 271
    star
    updated 5 months ago MIT License

    A light-weight, modular, message representation and mail delivery framework for Python.

  • modoboa modoboa 2,868
    star
    updated 9 days ago ISC License

    Mail hosting made simple

  • salmon salmon 626
    star
    updated 12 months ago Other

    A Python Mail Server

  • yagmail yagmail 2,579
    star
    updated over 1 year ago MIT License

    Send email in Python conveniently for gmail using yagmail

Enterprise Application Integrations

  • airflow airflow 33,999
    star
    updated 3 days ago Apache License 2.0

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

  • ansible ansible 58,550
    star
    updated 6 months ago GNU General Publi...

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

  • updated 5 months ago Apache License 2.0

    A collection of useful CloudFormation templates

  • updated about 1 year ago Other

    AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

  • updated 10 months ago MIT No Attribution

    A developer toolkit to implement Serverless best practices and increase developer velocity.

  • boto3 boto3 8,584
    star
    updated about 1 month ago Apache License 2.0

    AWS SDK for Python

  • updated 6 months ago Apache License 2.0

    Official Python client for Elasticsearch

  • flask flask 65,996
    star
    updated 20 days ago BSD 3-Clause "New...

    The Python micro framework for building web applications.

  • updated 7 months ago Apache License 2.0

    ๐Ÿ The official Python client library for Google's discovery based APIs.

  • invoke invoke 4,163
    star
    updated 3 months ago BSD 2-Clause "Sim...

    Pythonic task management & command execution.

  • luigi luigi 17,089
    star
    updated 2 months ago Apache License 2.0

    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

  • pika pika 3,400
    star
    updated 8 months ago BSD 3-Clause "New...

    Pure Python RabbitMQ/AMQP 0-9-1 client library

  • requests requests 51,187
    star
    updated 14 days ago Apache License 2.0

    A simple, yet elegant, HTTP library.

Environment Management

  • pyenv pyenv 35,386
    star
    updated about 2 months ago MIT License

    Simple Python version management

  • updated about 1 month ago MIT License

    Virtual Python Environment builder

Fake Data

  • fake2db fake2db 2,256
    star
    updated over 4 years ago GNU General Publi...

    create custom test databases that are populated with fake data

  • faker faker 16,985
    star
    updated 8 days ago MIT License

    Faker is a Python package that generates fake data for you.

  • mimesis mimesis 4,243
    star
    updated 24 days ago MIT License

    Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.

Files

  • updated 2 months ago Other

    A python wrapper for libmagic

  • updated about 2 years ago

    An object-oriented approach to Python file/directory operations.

  • watchdog watchdog 6,195
    star
    updated about 2 months ago Apache License 2.0

    Python library and shell utilities to monitor filesystem events.

Finance

  • arch arch 1,239
    star
    updated about 2 months ago Other

    ARCH models in Python

  • backtrader backtrader 12,665
    star
    updated about 1 month ago GNU General Publi...

    Python Backtesting library for trading strategies

  • empyrical empyrical 1,210
    star
    updated about 2 months ago Apache License 2.0

    Common financial risk and performance metrics. Used by zipline and pyfolio.

  • qgrid qgrid 3,018
    star
    updated 3 months ago Apache License 2.0

    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

  • yfinance yfinance 11,428
    star
    updated 25 days ago Apache License 2.0

    Download market data from Yahoo! Finance's API

  • zipline zipline 16,969
    star
    updated about 1 month ago Apache License 2.0

    Zipline, a Pythonic Algorithmic Trading Library

Foreign Function Interface

  • llvmlite llvmlite 1,774
    star
    updated about 2 months ago BSD 2-Clause "Sim...

    A lightweight LLVM python binding for writing JIT compilers

Forms

  • deform deform 400
    star
    updated 10 months ago Other

    A Python HTML form library.

  • updated 14 days ago BSD 3-Clause "New...

    Bootstrap 4 integration with Django.

  • updated 22 days ago MIT License

    The best way to have DRY Django forms. The app provides a tag and filter that lets you quickly render forms in a div format while providing an enormous amount of capability to configure and control the rendered HTML.

  • updated about 1 year ago MIT License

    A platform independent django form serializer

  • wtforms wtforms 1,452
    star
    updated 18 days ago BSD 3-Clause "New...

    A flexible forms validation and rendering library for Python.

Functional Programming

  • coconut coconut 3,871
    star
    updated 2 months ago Apache License 2.0

    Simple, elegant, Pythonic functional programming.

  • fn.py fn.py 3,302
    star
    updated over 1 year ago Other

    Functional programming in Python: implementation of missing features to enjoy FP

  • funcy funcy 3,143
    star
    updated 11 months ago BSD 3-Clause "New...

    A fancy and practical functional tools

  • returns returns 3,064
    star
    updated 2 months ago BSD 2-Clause "Sim...

    Make your functions return something meaningful, typed, and safe!

  • toolz toolz 4,469
    star
    updated about 2 months ago Other

    A functional standard library for Python.

Game Development

  • updated 11 months ago

    The first level of Super Mario Bros made with Python and Pygame.

  • updated 8 months ago

    This is the code base for the pygame tutorials posted on my YouTube channel.

  • updated 2 months ago
  • updated 10 months ago

    Creating the game Space Invaders using Pygame Module in Python

Geolocation

  • updated about 2 months ago MIT License

    A Django application that provides country choices for use with forms, flag icons static files, and a country field for models.

  • updated about 3 years ago GNU Lesser Genera...

    DEPRECATED GeoIP Legacy Python API

GIS

  • cartopy cartopy 1,329
    star
    updated about 2 months ago BSD 3-Clause "New...

    Cartopy - a cartographic python library with matplotlib support

  • geopandas geopandas 4,072
    star
    updated about 1 month ago BSD 3-Clause "New...

    Python tools for geographic data

  • updated 20 days ago Other

    Core components of Python Spatial Analysis Library

  • pyshp pyshp 1,066
    star
    updated 2 months ago MIT License

    This library reads and writes ESRI Shapefiles in pure Python.

Graph

  • cugraph cugraph 1,552
    star
    updated 9 days ago Apache License 2.0

    cuGraph - RAPIDS Graph Analytics Library

  • dask dask 11,912
    star
    updated 4 days ago BSD 3-Clause "New...

    Parallel computing with task scheduling

  • detectron2 detectron2 25,567
    star
    updated 8 months ago Apache License 2.0

    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

  • hydra hydra 8,044
    star
    updated 23 days ago MIT License

    Hydra is a framework for elegantly configuring complex applications

  • metaflow metaflow 7,474
    star
    updated 8 days ago Apache License 2.0

    ๐Ÿš€ Build and manage real-life ML, AI, and data science projects with ease!

  • updated 28 days ago Other

    Neo4j Bolt driver for Python

  • networkx networkx 13,543
    star
    updated 4 months ago Other

    Network Analysis in Python

  • updated 3 months ago GNU General Publi...

    Python interface for igraph

  • SlowFast SlowFast 5,678
    star
    updated 10 months ago Apache License 2.0

    PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

  • yt yt 429
    star
    updated about 1 month ago Other

    Main yt repository

GraphQL

  • graphene graphene 7,945
    star
    updated about 1 month ago MIT License

    GraphQL framework for Python

  • updated 5 months ago MIT License

    tartiflette-aiohttp is a wrapper of aiohttp which includes the Tartiflette GraphQL Engine, do not hesitate to take a look of the Tartiflette project.

  • updated 9 months ago MIT License

    ASGI support for the Tartiflette GraphQL engine

GUI

  • updated over 1 year ago

    A common base representation of python source code for pylint and other projects

  • bandit bandit 5,900
    star
    updated 10 days ago Apache License 2.0

    Bandit is a tool designed to find common security issues in Python code.

  • dash dash 19,422
    star
    updated 6 months ago MIT License

    Data Apps & Dashboards for Python. No JavaScript Required.

  • flake8 flake8 3,068
    star
    updated 5 months ago Other

    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.

  • isort isort 6,277
    star
    updated 8 days ago MIT License

    A Python utility / library to sort imports.

  • mccabe mccabe 602
    star
    updated 6 months ago Other

    McCabe complexity checker for Python

  • napari napari 2,025
    star
    updated 8 days ago BSD 3-Clause "New...

    napari: a fast, interactive, multi-dimensional image viewer for python

  • updated 6 months ago Other

    Simple Python style checker in one Python file

  • updated 5 months ago MIT License

    docstring style checker

  • pylint pylint 4,246
    star
    updated over 1 year ago

    It's not just a linter that annoys you!

  • PyQt PyQt 5,918
    star
    updated 6 months ago

    PyQt Examples๏ผˆPyQtๅ„็งๆต‹่ฏ•ๅ’Œไพ‹ๅญ๏ผ‰ PyQt4 PyQt5

  • PySide PySide 281
    star
    updated over 7 years ago GNU Lesser Genera...

    ATTENTION: This project is deprecated, please refer to PySide2

  • PySimpleGUI PySimpleGUI 13,058
    star
    updated 12 days ago Other

    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.

  • updated 2 months ago Other

    TraitsUI: Traits-capable windowing framework

GUI Development

  • Eel Eel 4,878
    star
    updated almost 2 years ago MIT License

    A little Python library for making simple Electron-like HTML/JS GUI apps

  • enaml enaml 1,487
    star
    updated 5 months ago Other

    Declarative User Interfaces for Python

  • Gooey Gooey 17,984
    star
    updated 6 months ago MIT License

    Turn (almost) any Python command line program into a full GUI application with one line

  • pyglet pyglet 1,681
    star
    updated 2 months ago BSD 3-Clause "New...

    pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.

  • PySimpleGUI PySimpleGUI 13,058
    star
    updated 12 days ago Other

    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.

  • pywebview pywebview 4,048
    star
    updated about 1 month ago BSD 3-Clause "New...

    Build GUI for your Python program with JavaScript, HTML, and CSS

GUI / Web Testing

  • locust locust 23,382
    star
    updated 9 days ago MIT License

    Write scalable load tests in plain Python ๐Ÿš—๐Ÿ’จ

  • splinter splinter 2,676
    star
    updated about 1 month ago BSD 3-Clause "New...

    splinter - python test framework for web applications

Hardware

  • keyboard keyboard 3,602
    star
    updated 10 months ago MIT License

    Hook and simulate global keyboard events on Windows and Linux.

  • mouse mouse 837
    star
    updated 4 months ago MIT License

    Hook and simulate global mouse events in pure Python

  • updated over 2 years ago GNU General Publi...

    A module for cross-platform control of the mouse and keyboard in python that is simple to install and use.

  • scapy scapy 9,761
    star
    updated about 2 months ago GNU General Publi...

    Scapy: the Python-based interactive packet manipulation program & library. Supports Python 2 & Python 3.

HTML Manipulation

  • bleach bleach 2,590
    star
    updated about 2 months ago Other

    Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes

  • updated 3 months ago MIT License

    Standards-compliant library for parsing and serializing HTML documents and fragments in Python

  • updated 16 days ago BSD 3-Clause "New...

    Safely add untrusted strings to HTML/XML markup.

  • pyquery pyquery 2,230
    star
    updated 8 months ago Other

    A jquery-like library for python

  • updated 9 months ago MIT License

    Converts XML to Python objects

  • xmltodict xmltodict 5,305
    star
    updated 7 months ago MIT License

    Python module that makes working with XML feel like you are working with JSON

HTTP Clients

  • grequests grequests 4,400
    star
    updated 10 months ago BSD 2-Clause "Sim...

    Requests + Gevent = <3

  • updated 10 months ago Other

    Small, fast HTTP client library for Python. Features persistent connections, cache, and Google App Engine support. Originally written by Joe Gregorio, now supported by community.

  • httpx httpx 11,445
    star
    updated 4 months ago BSD 3-Clause "New...

    A next generation HTTP client for Python. ๐Ÿฆ‹

  • requests requests 51,187
    star
    updated 14 days ago Apache License 2.0

    A simple, yet elegant, HTTP library.

  • treq treq 570
    star
    updated about 2 months ago Other

    Python requests like API built on top of Twisted's HTTP client.

Image Processing

  • hmap hmap 208
    star
    updated over 4 years ago Other

    Image histogram remapping

  • updated over 3 years ago MIT License

    Nudity detection with Python

  • pagan pagan 299
    star
    updated over 1 year ago GNU General Publi...

    Python avatar generator for absolute nerds

  • Pillow Pillow 11,308
    star
    updated 4 months ago Other

    Python Imaging Library (Fork)

  • pymatting pymatting 1,662
    star
    updated 2 months ago MIT License

    A Python library for alpha matting

  • updated 9 months ago MIT License

    ใŠ™๏ธ Create standard barcodes with Python. No external dependencies. 100% Organic Python.

  • updated 4 months ago Other

    Python QR Code image generator

  • pyvips pyvips 570
    star
    updated 2 months ago MIT License

    python binding for libvips using cffi

  • pywal pywal 7,866
    star
    updated 2 months ago MIT License

    ๐ŸŽจ Generate and change color-schemes on the fly.

  • Quads Quads 1,160
    star
    updated over 1 year ago

    Computer art based on quadtrees.

  • thumbor thumbor 9,823
    star
    updated 23 days ago MIT License

    thumbor is an open-source photo thumbnail service by globo.com

Implementations

  • updated 5 months ago Other

    An implementation of Python in Common Lisp

  • cpython cpython 55,867
    star
    updated 6 months ago Other

    The Python programming language

  • grumpy grumpy 10,464
    star
    updated almost 2 years ago Apache License 2.0

    Grumpy is a Python to Go source code transcompiler and runtime.

  • updated about 2 months ago Apache License 2.0

    Implementation of Python 3.x for .NET Framework that is built on top of the Dynamic Language Runtime.

  • micropython micropython 17,190
    star
    updated 7 months ago Other

    MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems

  • PeachPy PeachPy 1,946
    star
    updated 6 months ago Other

    x86-64 assembler embedded in Python

Interactive Interpreter

  • updated 3 months ago Creative Commons ...

    A curated list of awesome Jupyter projects, libraries and resources

  • bpython bpython 2,517
    star
    updated about 2 months ago Other

    bpython - A fancy curses interface to the Python interactive interpreter

Internationalization

  • pyicu pyicu 132
    star
    updated almost 3 years ago Other

    The PyICU project repository has moved to https://pyicu.org.

Job Scheduler

  • updated about 4 years ago

    A calendaring app for Django. It is now stable, Please feel free to use it now. Active development has been taken over by bartekgorny.

  • updated almost 2 years ago Apache License 2.0

    Remote task execution tool

  • plan plan 1,166
    star
    updated over 1 year ago Other

    Crontab jobs management in Python

  • prefect prefect 14,162
    star
    updated 25 days ago Apache License 2.0

    Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

  • schedule schedule 11,400
    star
    updated 3 months ago MIT License

    Python job scheduling for humans.

Jupyter

  • binderhub binderhub 2,448
    star
    updated 23 days ago BSD 3-Clause "New...

    Run your code in the cloud, with technology so advanced, it feels like magic!

  • updated about 2 months ago BSD 3-Clause "New...

    Spawns JupyterHub single user servers in Docker containers

  • updated about 2 months ago BSD 3-Clause "New...

    Jupyter extensions for running an RStudio rsession proxy

  • updated about 2 months ago BSD 3-Clause "New...

    Jupyter notebook server extension to proxy web services.

  • updated about 2 months ago Other

    Multi-user server for Jupyter notebooks

  • updated about 2 months ago BSD 3-Clause "New...

    Reference deployment of JupyterHub with docker

  • updated over 4 years ago BSD 3-Clause "New...

    Reference deployment of JupyterHub and nbgrader on a single server

  • updated about 2 months ago BSD 3-Clause "New...

    Kubernetes spawner for JupyterHub

  • updated about 2 months ago BSD 3-Clause "New...

    LDAP Authenticator Plugin for Jupyter

  • updated about 2 months ago BSD 3-Clause "New...

    OAuth + JupyterHub Authenticator = OAuthenticator

  • updated about 2 months ago BSD 3-Clause "New...

    Turn repositories into Jupyter-enabled Docker images

  • updated 4 months ago BSD 3-Clause "New...

    Spawn JupyterHub single-user servers with sudo

  • updated 19 days ago BSD 3-Clause "New...

    Simple JupyterHub distribution for 1-100 users on a single server

  • updated 2 months ago BSD 3-Clause "New...

    Mechanism for runtime configuration of spawners for JupyterHub

  • updated about 2 months ago Other

    Helm Chart & Documentation for deploying JupyterHub on Kubernetes

Logging

  • loguru loguru 17,762
    star
    updated 26 days ago MIT License

    Python logging made (stupidly) simple

  • updated 11 months ago MIT License

    The official Python SDK for Sentry.io

Machine Learning

  • Detectron Detectron 25,771
    star
    updated 10 months ago Apache License 2.0

    FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

  • gym gym 33,715
    star
    updated about 1 month ago Other

    A toolkit for developing and comparing reinforcement learning algorithms.

  • h2o-3 h2o-3 6,658
    star
    updated 24 days ago Apache License 2.0

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • llama llama 44,989
    star
    updated 5 months ago Other

    Inference code for LLaMA models

  • m2cgen m2cgen 2,477
    star
    updated 12 months ago MIT License

    Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

  • Metrics Metrics 1,606
    star
    updated about 1 year ago Other

    Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

  • mindsdb mindsdb 20,259
    star
    updated 6 days ago Other

    The platform for customizing AI from enterprise data

  • updated 7 months ago GNU Affero Genera...

    Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

  • updated about 4 years ago Other

    lightweight python wrapper for vowpal wabbit

  • whisper whisper 57,624
    star
    updated 21 days ago MIT License

    Robust Speech Recognition via Large-Scale Weak Supervision

  • xgboost xgboost 25,402
    star
    updated 15 days ago Apache License 2.0

    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Math

  • ipython ipython 16,090
    star
    updated about 1 month ago BSD 3-Clause "New...

    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

  • matplotlib matplotlib 18,720
    star
    updated 2 months ago

    matplotlib: plotting with Python

  • numba numba 9,225
    star
    updated about 2 months ago BSD 2-Clause "Sim...

    NumPy aware dynamic Python compiler using LLVM

  • numpy numpy 25,420
    star
    updated 3 months ago BSD 3-Clause "New...

    The fundamental package for scientific computing with Python.

  • pandas pandas 41,491
    star
    updated 17 days ago BSD 3-Clause "New...

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • pyro pyro 8,286
    star
    updated 27 days ago Apache License 2.0

    Deep universal probabilistic programming with Python and PyTorch

  • updated 2 months ago BSD 3-Clause "New...

    scikit-learn: machine learning in Python

  • scipy scipy 12,295
    star
    updated 12 days ago BSD 3-Clause "New...

    SciPy library main repository

  • sympy sympy 11,931
    star
    updated 17 days ago Other

    A computer algebra system written in pure Python

  • xarray xarray 3,374
    star
    updated 9 days ago Apache License 2.0

    N-D labeled arrays and datasets in Python

Microsoft Windows

  • pythonnet pythonnet 4,230
    star
    updated about 2 months ago MIT License

    Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and provides a powerful application scripting tool for .NET developers.

  • pywin32 pywin32 4,714
    star
    updated about 1 month ago

    Python for Windows (pywin32) Extensions

Miscellaneous

  • boltons boltons 6,360
    star
    updated about 2 months ago Other

    ๐Ÿ”ฉ Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

  • updated about 2 months ago BSD 3-Clause "New...

    Safely pass trusted data to untrusted environments and back.

  • magenta magenta 18,886
    star
    updated 2 months ago Apache License 2.0

    Magenta: Music and Art Generation with Machine Intelligence

  • updated almost 3 years ago Other

    A simple but flexible plugin system for Python.

ML - Deep Learning

  • bert bert 36,701
    star
    updated 5 months ago Apache License 2.0

    TensorFlow code and pre-trained models for BERT

  • datasets datasets 4,143
    star
    updated 3 days ago Apache License 2.0

    TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

  • detectron2 detectron2 25,567
    star
    updated 8 months ago Apache License 2.0

    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

  • gpt-2 gpt-2 20,844
    star
    updated 2 months ago Other

    Code for the paper "Language Models are Unsupervised Multitask Learners"

  • keras keras 60,597
    star
    updated 19 days ago Apache License 2.0

    Deep Learning for humans

  • updated 10 months ago MIT License

    Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

  • NeMo NeMo 9,773
    star
    updated about 21 hours ago Apache License 2.0

    NeMo: a framework for generative AI

  • updated about 1 year ago Apache License 2.0

    An open source framework for seq2seq models in PyTorch.

  • updated 2 months ago BSD 3-Clause "New...

    scikit-learn: machine learning in Python

  • transformers transformers 121,026
    star
    updated 29 days ago Apache License 2.0

    ๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • yolov5 yolov5 45,908
    star
    updated 18 days ago GNU Affero Genera...

    YOLOv5 ๐Ÿš€ in PyTorch > ONNX > CoreML > TFLite

ML - General

  • updated about 1 year ago Apache License 2.0

    An open source framework for seq2seq models in PyTorch.

  • sonnet sonnet 9,682
    star
    updated 3 months ago Apache License 2.0

    TensorFlow-based neural network library

  • updated 4 months ago Apache License 2.0
  • Theano Theano 9,840
    star
    updated 2 months ago Other

    Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor

ML - Interpretability

  • AIF360 AIF360 2,276
    star
    updated 8 days ago Apache License 2.0

    A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

  • lale lale 321
    star
    updated 8 days ago Apache License 2.0

    Library for Semi-Automated Data Science

ML - Ops

  • updated over 2 years ago Apache License 2.0

    Cloud ML Engine repo. Please visit the new Vertex AI samples repo at https://github.com/GoogleCloudPlatform/vertex-ai-samples

  • dvc dvc 12,992
    star
    updated 13 days ago Apache License 2.0

    ๐Ÿฆ‰ ML Experiments Management with Git

  • mlflow mlflow 17,048
    star
    updated about 22 hours ago Apache License 2.0

    Open source platform for the machine learning lifecycle

  • tfx tfx 2,065
    star
    updated 14 days ago Apache License 2.0

    TFX is an end-to-end platform for deploying production ML pipelines

ML - Reinforcement

  • garage garage 1,774
    star
    updated 11 months ago MIT License

    A toolkit for reproducible reinforcement learning research.

  • gym gym 33,715
    star
    updated about 1 month ago Other

    A toolkit for developing and comparing reinforcement learning algorithms.

  • homework homework 1,520
    star
    updated about 1 year ago MIT License

    Assignments for CS294-112.

  • Metaworld Metaworld 1,066
    star
    updated 3 months ago MIT License

    Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

  • updated almost 2 years ago MIT License

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • updated about 3 years ago MIT License

    Implementation of algorithms for continuous control (DDPG and NAF).

  • updated about 1 year ago MIT License

    Minimal and Clean Reinforcement Learning Examples

  • rllab rllab 2,853
    star
    updated 10 months ago Other

    rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

  • trfl trfl 3,139
    star
    updated over 1 year ago Apache License 2.0

    TensorFlow Reinforcement Learning

ML - Time Series

  • keras-tcn keras-tcn 1,798
    star
    updated 4 months ago MIT License

    Keras Temporal Convolutional Network.

  • orbit orbit 1,793
    star
    updated 4 days ago Other

    A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.

  • prophet prophet 17,624
    star
    updated 10 days ago MIT License

    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

  • updated 3 months ago BSD 3-Clause "New...

    Statsmodels: statistical modeling and econometrics in Python

Mocking

  • freezegun freezegun 3,919
    star
    updated about 1 month ago Apache License 2.0

    Let your Python tests travel through time

  • updated 6 months ago Other

    A mocking library for requests

  • HTTPretty HTTPretty 2,080
    star
    updated about 2 months ago MIT License

    Intercept HTTP requests at the Python socket level. Fakes the whole socket module

  • updated about 1 month ago BSD 3-Clause "New...

    a socket mock framework - for all kinds of socket animals, web-clients included

  • responses responses 3,938
    star
    updated 7 months ago Apache License 2.0

    A utility for mocking out the Python Requests library.

  • vcrpy vcrpy 2,584
    star
    updated 13 days ago MIT License

    Automatically mock your HTTP interactions to simplify and speed up testing

Natural Language Processing

  • funNLP funNLP 52,971
    star
    updated 9 months ago

    ไธญ่‹ฑๆ–‡ๆ•ๆ„Ÿ่ฏใ€่ฏญ่จ€ๆฃ€ๆต‹ใ€ไธญๅค–ๆ‰‹ๆœบ/็”ต่ฏๅฝ’ๅฑžๅœฐ/่ฟ่ฅๅ•†ๆŸฅ่ฏขใ€ๅๅญ—ๆŽจๆ–ญๆ€งๅˆซใ€ๆ‰‹ๆœบๅทๆŠฝๅ–ใ€่บซไปฝ่ฏๆŠฝๅ–ใ€้‚ฎ็ฎฑๆŠฝๅ–ใ€ไธญๆ—ฅๆ–‡ไบบๅๅบ“ใ€ไธญๆ–‡็ผฉๅ†™ๅบ“ใ€ๆ‹†ๅญ—่ฏๅ…ธใ€่ฏๆฑ‡ๆƒ…ๆ„Ÿๅ€ผใ€ๅœ็”จ่ฏใ€ๅๅŠจ่ฏ่กจใ€ๆšดๆ่ฏ่กจใ€็น็ฎ€ไฝ“่ฝฌๆขใ€่‹ฑๆ–‡ๆจกๆ‹Ÿไธญๆ–‡ๅ‘้Ÿณใ€ๆฑชๅณฐๆญŒ่ฏ็”Ÿๆˆๅ™จใ€่Œไธšๅ็งฐ่ฏๅบ“ใ€ๅŒไน‰่ฏๅบ“ใ€ๅไน‰่ฏๅบ“ใ€ๅฆๅฎš่ฏๅบ“ใ€ๆฑฝ่ฝฆๅ“็‰Œ่ฏๅบ“ใ€ๆฑฝ่ฝฆ้›ถไปถ่ฏๅบ“ใ€่ฟž็ปญ่‹ฑๆ–‡ๅˆ‡ๅ‰ฒใ€ๅ„็งไธญๆ–‡่ฏๅ‘้‡ใ€ๅ…ฌๅธๅๅญ—ๅคงๅ…จใ€ๅค่ฏ—่ฏๅบ“ใ€IT่ฏๅบ“ใ€่ดข็ป่ฏๅบ“ใ€ๆˆ่ฏญ่ฏๅบ“ใ€ๅœฐๅ่ฏๅบ“ใ€ๅŽ†ๅฒๅไบบ่ฏๅบ“ใ€่ฏ—่ฏ่ฏๅบ“ใ€ๅŒปๅญฆ่ฏๅบ“ใ€้ฅฎ้ฃŸ่ฏๅบ“ใ€ๆณ•ๅพ‹่ฏๅบ“ใ€ๆฑฝ่ฝฆ่ฏๅบ“ใ€ๅŠจ็‰ฉ่ฏๅบ“ใ€ไธญๆ–‡่Šๅคฉ่ฏญๆ–™ใ€ไธญๆ–‡่ฐฃ่จ€ๆ•ฐๆฎใ€็™พๅบฆไธญๆ–‡้—ฎ็ญ”ๆ•ฐๆฎ้›†ใ€ๅฅๅญ็›ธไผผๅบฆๅŒน้…็ฎ—ๆณ•้›†ๅˆใ€bert่ต„ๆบใ€ๆ–‡ๆœฌ็”Ÿๆˆ&ๆ‘˜่ฆ็›ธๅ…ณๅทฅๅ…ทใ€cocoNLPไฟกๆฏๆŠฝๅ–ๅทฅๅ…ทใ€ๅ›ฝๅ†…็”ต่ฏๅท็ ๆญฃๅˆ™ๅŒน้…ใ€ๆธ…ๅŽๅคงๅญฆXLORE:ไธญ่‹ฑๆ–‡่ทจ่ฏญ่จ€็™พ็ง‘็Ÿฅ่ฏ†ๅ›พ่ฐฑใ€ๆธ…ๅŽๅคงๅญฆไบบๅทฅๆ™บ่ƒฝๆŠ€ๆœฏ็ณปๅˆ—ๆŠฅๅ‘Šใ€่‡ช็„ถ่ฏญ่จ€็”Ÿๆˆใ€NLUๅคช้šพไบ†็ณปๅˆ—ใ€่‡ชๅŠจๅฏน่”ๆ•ฐๆฎๅŠๆœบๅ™จไบบใ€็”จๆˆทๅ้ป‘ๅๅ•ๅˆ—่กจใ€็ฝชๅๆณ•ๅŠกๅ่ฏๅŠๅˆ†็ฑปๆจกๅž‹ใ€ๅพฎไฟกๅ…ฌไผ—ๅท่ฏญๆ–™ใ€cs224nๆทฑๅบฆๅญฆไน ่‡ช็„ถ่ฏญ่จ€ๅค„็†่ฏพ็จ‹ใ€ไธญๆ–‡ๆ‰‹ๅ†™ๆฑ‰ๅญ—่ฏ†ๅˆซใ€ไธญๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็† ่ฏญๆ–™/ๆ•ฐๆฎ้›†ใ€ๅ˜้‡ๅ‘ฝๅ็ฅžๅ™จใ€ๅˆ†่ฏ่ฏญๆ–™ๅบ“+ไปฃ็ ใ€ไปปๅŠกๅž‹ๅฏน่ฏ่‹ฑๆ–‡ๆ•ฐๆฎ้›†ใ€ASR ่ฏญ้Ÿณๆ•ฐๆฎ้›† + ๅŸบไบŽๆทฑๅบฆๅญฆไน ็š„ไธญๆ–‡่ฏญ้Ÿณ่ฏ†ๅˆซ็ณป็ปŸใ€็ฌ‘ๅฃฐๆฃ€ๆต‹ๅ™จใ€Microsoftๅคš่ฏญ่จ€ๆ•ฐๅญ—/ๅ•ไฝ/ๅฆ‚ๆ—ฅๆœŸๆ—ถ้—ด่ฏ†ๅˆซๅŒ…ใ€ไธญๅŽๆ–ฐๅŽๅญ—ๅ…ธๆ•ฐๆฎๅบ“ๅŠapi(ๅŒ…ๆ‹ฌๅธธ็”จๆญ‡ๅŽ่ฏญใ€ๆˆ่ฏญใ€่ฏ่ฏญๅ’Œๆฑ‰ๅญ—)ใ€ๆ–‡ๆกฃๅ›พ่ฐฑ่‡ชๅŠจ็”Ÿๆˆใ€SpaCy ไธญๆ–‡ๆจกๅž‹ใ€Common Voice่ฏญ้Ÿณ่ฏ†ๅˆซๆ•ฐๆฎ้›†ๆ–ฐ็‰ˆใ€็ฅž็ป็ฝ‘็ปœๅ…ณ็ณปๆŠฝๅ–ใ€ๅŸบไบŽbert็š„ๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซใ€ๅ…ณ้”ฎ่ฏ(Keyphrase)ๆŠฝๅ–ๅŒ…pkeใ€ๅŸบไบŽๅŒป็–—้ข†ๅŸŸ็Ÿฅ่ฏ†ๅ›พ่ฐฑ็š„้—ฎ็ญ”็ณป็ปŸใ€ๅŸบไบŽไพๅญ˜ๅฅๆณ•ไธŽ่ฏญไน‰่ง’่‰ฒๆ ‡ๆณจ็š„ไบ‹ไปถไธ‰ๅ…ƒ็ป„ๆŠฝๅ–ใ€ไพๅญ˜ๅฅๆณ•ๅˆ†ๆž4ไธ‡ๅฅ้ซ˜่ดจ้‡ๆ ‡ๆณจๆ•ฐๆฎใ€cnocr๏ผš็”จๆฅๅšไธญๆ–‡OCR็š„Python3ๅŒ…ใ€ไธญๆ–‡ไบบ็‰ฉๅ…ณ็ณป็Ÿฅ่ฏ†ๅ›พ่ฐฑ้กน็›ฎใ€ไธญๆ–‡nlp็ซž่ต›้กน็›ฎๅŠไปฃ็ ๆฑ‡ๆ€ปใ€ไธญๆ–‡ๅญ—็ฌฆๆ•ฐๆฎใ€speech-aligner: ไปŽโ€œไบบๅฃฐ่ฏญ้Ÿณโ€ๅŠๅ…ถโ€œ่ฏญ่จ€ๆ–‡ๆœฌโ€ไบง็”Ÿ้Ÿณ็ด ็บงๅˆซๆ—ถ้—ดๅฏน้ฝๆ ‡ๆณจ็š„ๅทฅๅ…ทใ€AmpliGraph: ็Ÿฅ่ฏ†ๅ›พ่ฐฑ่กจ็คบๅญฆไน (Python)ๅบ“๏ผš็Ÿฅ่ฏ†ๅ›พ่ฐฑๆฆ‚ๅฟต้“พๆŽฅ้ข„ๆต‹ใ€Scattertext ๆ–‡ๆœฌๅฏ่ง†ๅŒ–(python)ใ€่ฏญ่จ€/็Ÿฅ่ฏ†่กจ็คบๅทฅๅ…ท๏ผšBERT & ERNIEใ€ไธญๆ–‡ๅฏนๆฏ”่‹ฑๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็†NLP็š„ๅŒบๅˆซ็ปผ่ฟฐใ€Synonymsไธญๆ–‡่ฟ‘ไน‰่ฏๅทฅๅ…ทๅŒ…ใ€HarvestText้ข†ๅŸŸ่‡ช้€‚ๅบ”ๆ–‡ๆœฌๆŒ–ๆŽ˜ๅทฅๅ…ท๏ผˆๆ–ฐ่ฏๅ‘็Žฐ-ๆƒ…ๆ„Ÿๅˆ†ๆž-ๅฎžไฝ“้“พๆŽฅ็ญ‰๏ผ‰ใ€word2word๏ผš(Python)ๆ–นไพฟๆ˜“็”จ็š„ๅคš่ฏญ่จ€่ฏ-่ฏๅฏน้›†๏ผš62็ง่ฏญ่จ€/3,564ไธชๅคš่ฏญ่จ€ๅฏนใ€่ฏญ้Ÿณ่ฏ†ๅˆซ่ฏญๆ–™็”Ÿๆˆๅทฅๅ…ท๏ผšไปŽๅ…ทๆœ‰้Ÿณ้ข‘/ๅญ—ๅน•็š„ๅœจ็บฟ่ง†้ข‘ๅˆ›ๅปบ่‡ชๅŠจ่ฏญ้Ÿณ่ฏ†ๅˆซ(ASR)่ฏญๆ–™ๅบ“ใ€ๆž„ๅปบๅŒป็–—ๅฎžไฝ“่ฏ†ๅˆซ็š„ๆจกๅž‹๏ผˆๅŒ…ๅซ่ฏๅ…ธๅ’Œ่ฏญๆ–™ๆ ‡ๆณจ๏ผ‰ใ€ๅ•ๆ–‡ๆกฃ้ž็›‘็ฃ็š„ๅ…ณ้”ฎ่ฏๆŠฝๅ–ใ€Kashgariไธญไฝฟ็”จgpt-2่ฏญ่จ€ๆจกๅž‹ใ€ๅผ€ๆบ็š„้‡‘่žๆŠ•่ต„ๆ•ฐๆฎๆๅ–ๅทฅๅ…ทใ€ๆ–‡ๆœฌ่‡ชๅŠจๆ‘˜่ฆๅบ“TextTeaser: ไป…ๆ”ฏๆŒ่‹ฑๆ–‡ใ€ไบบๆฐ‘ๆ—ฅๆŠฅ่ฏญๆ–™ๅค„็†ๅทฅๅ…ท้›†ใ€ไธ€ไบ›ๅ…ณไบŽ่‡ช็„ถ่ฏญ่จ€็š„ๅŸบๆœฌๆจกๅž‹ใ€ๅŸบไบŽ14WๆญŒๆ›ฒ็Ÿฅ่ฏ†ๅบ“็š„้—ฎ็ญ”ๅฐ่ฏ•--ๅŠŸ่ƒฝๅŒ…ๆ‹ฌๆญŒ่ฏๆŽฅ้พ™andๅทฒ็ŸฅๆญŒ่ฏๆ‰พๆญŒๆ›ฒไปฅๅŠๆญŒๆ›ฒๆญŒๆ‰‹ๆญŒ่ฏไธ‰่ง’ๅ…ณ็ณป็š„้—ฎ็ญ”ใ€ๅŸบไบŽSiamese bilstmๆจกๅž‹็š„็›ธไผผๅฅๅญๅˆคๅฎšๆจกๅž‹ๅนถๆไพ›่ฎญ็ปƒๆ•ฐๆฎ้›†ๅ’Œๆต‹่ฏ•ๆ•ฐๆฎ้›†ใ€็”จTransformer็ผ–่งฃ็ ๆจกๅž‹ๅฎž็Žฐ็š„ๆ นๆฎHacker Newsๆ–‡็ซ ๆ ‡้ข˜่‡ชๅŠจ็”Ÿๆˆ่ฏ„่ฎบใ€็”จBERT่ฟ›่กŒๅบๅˆ—ๆ ‡่ฎฐๅ’Œๆ–‡ๆœฌๅˆ†็ฑป็š„ๆจกๆฟไปฃ็ ใ€LitBank๏ผšNLPๆ•ฐๆฎ้›†โ€”โ€”ๆ”ฏๆŒ่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅ’Œ่ฎก็ฎ—ไบบๆ–‡ๅญฆ็ง‘ไปปๅŠก็š„100้ƒจๅธฆๆ ‡่ฎฐ่‹ฑๆ–‡ๅฐ่ฏด่ฏญๆ–™ใ€็™พๅบฆๅผ€ๆบ็š„ๅŸบๅ‡†ไฟกๆฏๆŠฝๅ–็ณป็ปŸใ€่™šๅ‡ๆ–ฐ้—ปๆ•ฐๆฎ้›†ใ€Facebook: LAMA่ฏญ่จ€ๆจกๅž‹ๅˆ†ๆž๏ผŒๆไพ›Transformer-XL/BERT/ELMo/GPT้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹็š„็ปŸไธ€่ฎฟ้—ฎๆŽฅๅฃใ€CommonsenseQA๏ผš้ขๅ‘ๅธธ่ฏ†็š„่‹ฑๆ–‡QAๆŒ‘ๆˆ˜ใ€ไธญๆ–‡็Ÿฅ่ฏ†ๅ›พ่ฐฑ่ต„ๆ–™ใ€ๆ•ฐๆฎๅŠๅทฅๅ…ทใ€ๅ„ๅคงๅ…ฌๅธๅ†…้ƒจ้‡Œๅคง็‰›ๅˆ†ไบซ็š„ๆŠ€ๆœฏๆ–‡ๆกฃ PDF ๆˆ–่€… PPTใ€่‡ช็„ถ่ฏญ่จ€็”ŸๆˆSQL่ฏญๅฅ๏ผˆ่‹ฑๆ–‡๏ผ‰ใ€ไธญๆ–‡NLPๆ•ฐๆฎๅขžๅผบ๏ผˆEDA๏ผ‰ๅทฅๅ…ทใ€่‹ฑๆ–‡NLPๆ•ฐๆฎๅขžๅผบๅทฅๅ…ท ใ€ๅŸบไบŽๅŒป่ฏ็Ÿฅ่ฏ†ๅ›พ่ฐฑ็š„ๆ™บ่ƒฝ้—ฎ็ญ”็ณป็ปŸใ€ไบฌไธœๅ•†ๅ“็Ÿฅ่ฏ†ๅ›พ่ฐฑใ€ๅŸบไบŽmongodbๅญ˜ๅ‚จ็š„ๅ†›ไบ‹้ข†ๅŸŸ็Ÿฅ่ฏ†ๅ›พ่ฐฑ้—ฎ็ญ”้กน็›ฎใ€ๅŸบไบŽ่ฟœ็›‘็ฃ็š„ไธญๆ–‡ๅ…ณ็ณปๆŠฝๅ–ใ€่ฏญ้Ÿณๆƒ…ๆ„Ÿๅˆ†ๆžใ€ไธญๆ–‡ULMFiT-ๆƒ…ๆ„Ÿๅˆ†ๆž-ๆ–‡ๆœฌๅˆ†็ฑป-่ฏญๆ–™ๅŠๆจกๅž‹ใ€ไธ€ไธชๆ‹็…งๅš้ข˜็จ‹ๅบใ€ไธ–็•Œๅ„ๅ›ฝๅคง่ง„ๆจกไบบๅๅบ“ใ€ไธ€ไธชๅˆฉ็”จๆœ‰่ถฃไธญๆ–‡่ฏญๆ–™ๅบ“ qingyun ่ฎญ็ปƒๅ‡บๆฅ็š„ไธญๆ–‡่Šๅคฉๆœบๅ™จไบบใ€ไธญๆ–‡่Šๅคฉๆœบๅ™จไบบseqGANใ€็œๅธ‚ๅŒบ้•‡่กŒๆ”ฟๅŒบๅˆ’ๆ•ฐๆฎๅธฆๆ‹ผ้Ÿณๆ ‡ๆณจใ€ๆ•™่‚ฒ่กŒไธšๆ–ฐ้—ป่ฏญๆ–™ๅบ“ๅŒ…ๅซ่‡ชๅŠจๆ–‡ๆ‘˜ๅŠŸ่ƒฝใ€ๅผ€ๆ”พไบ†ๅฏน่ฏๆœบๅ™จไบบ-็Ÿฅ่ฏ†ๅ›พ่ฐฑ-่ฏญไน‰็†่งฃ-่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅทฅๅ…ทๅŠๆ•ฐๆฎใ€ไธญๆ–‡็Ÿฅ่ฏ†ๅ›พ่ฐฑ๏ผšๅŸบไบŽ็™พๅบฆ็™พ็ง‘ไธญๆ–‡้กต้ข-ๆŠฝๅ–ไธ‰ๅ…ƒ็ป„ไฟกๆฏ-ๆž„ๅปบไธญๆ–‡็Ÿฅ่ฏ†ๅ›พ่ฐฑใ€masr: ไธญๆ–‡่ฏญ้Ÿณ่ฏ†ๅˆซ-ๆไพ›้ข„่ฎญ็ปƒๆจกๅž‹-้ซ˜่ฏ†ๅˆซ็Ž‡ใ€Python้Ÿณ้ข‘ๆ•ฐๆฎๅขžๅนฟๅบ“ใ€ไธญๆ–‡ๅ…จ่ฏ่ฆ†็›–BERTๅŠไธคไปฝ้˜…่ฏป็†่งฃๆ•ฐๆฎใ€ConvLab๏ผšๅผ€ๆบๅคšๅŸŸ็ซฏๅˆฐ็ซฏๅฏน่ฏ็ณป็ปŸๅนณๅฐใ€ไธญๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็†ๆ•ฐๆฎ้›†ใ€ๅŸบไบŽๆœ€ๆ–ฐ็‰ˆๆœฌrasaๆญๅปบ็š„ๅฏน่ฏ็ณป็ปŸใ€ๅŸบไบŽTensorFlowๅ’ŒBERT็š„็ฎก้“ๅผๅฎžไฝ“ๅŠๅ…ณ็ณปๆŠฝๅ–ใ€ไธ€ไธชๅฐๅž‹็š„่ฏๅˆธ็Ÿฅ่ฏ†ๅ›พ่ฐฑ/็Ÿฅ่ฏ†ๅบ“ใ€ๅค็›˜ๆ‰€ๆœ‰NLPๆฏ”่ต›็š„TOPๆ–นๆกˆใ€OpenCLaP๏ผšๅคš้ข†ๅŸŸๅผ€ๆบไธญๆ–‡้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹ไป“ๅบ“ใ€UER๏ผšๅŸบไบŽไธๅŒ่ฏญๆ–™+็ผ–็ ๅ™จ+็›ฎๆ ‡ไปปๅŠก็š„ไธญๆ–‡้ข„่ฎญ็ปƒๆจกๅž‹ไป“ๅบ“ใ€ไธญๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅ‘้‡ๅˆ้›†ใ€ๅŸบไบŽ้‡‘่ž-ๅธๆณ•้ข†ๅŸŸ(ๅ…ผๆœ‰้—ฒ่Šๆ€ง่ดจ)็š„่Šๅคฉๆœบๅ™จไบบใ€g2pC๏ผšๅŸบไบŽไธŠไธ‹ๆ–‡็š„ๆฑ‰่ฏญ่ฏป้Ÿณ่‡ชๅŠจๆ ‡่ฎฐๆจกๅ—ใ€Zincbase ็Ÿฅ่ฏ†ๅ›พ่ฐฑๆž„ๅปบๅทฅๅ…ทๅŒ…ใ€่ฏ—ๆญŒ่ดจ้‡่ฏ„ไปท/็ป†็ฒ’ๅบฆๆƒ…ๆ„Ÿ่ฏ—ๆญŒ่ฏญๆ–™ๅบ“ใ€ๅฟซ้€Ÿ่ฝฌๅŒ–ใ€Œไธญๆ–‡ๆ•ฐๅญ—ใ€ๅ’Œใ€Œ้˜ฟๆ‹‰ไผฏๆ•ฐๅญ—ใ€ใ€็™พๅบฆ็Ÿฅ้“้—ฎ็ญ”่ฏญๆ–™ๅบ“ใ€ๅŸบไบŽ็Ÿฅ่ฏ†ๅ›พ่ฐฑ็š„้—ฎ็ญ”็ณป็ปŸใ€jieba_fast ๅŠ ้€Ÿ็‰ˆ็š„jiebaใ€ๆญฃๅˆ™่กจ่พพๅผๆ•™็จ‹ใ€ไธญๆ–‡้˜…่ฏป็†่งฃๆ•ฐๆฎ้›†ใ€ๅŸบไบŽBERT็ญ‰ๆœ€ๆ–ฐ่ฏญ่จ€ๆจกๅž‹็š„ๆŠฝๅ–ๅผๆ‘˜่ฆๆๅ–ใ€Pythonๅˆฉ็”จๆทฑๅบฆๅญฆไน ่ฟ›่กŒๆ–‡ๆœฌๆ‘˜่ฆ็š„็ปผๅˆๆŒ‡ๅ—ใ€็Ÿฅ่ฏ†ๅ›พ่ฐฑๆทฑๅบฆๅญฆไน ็›ธๅ…ณ่ต„ๆ–™ๆ•ด็†ใ€็ปดๅŸบๅคง่ง„ๆจกๅนณ่กŒๆ–‡ๆœฌ่ฏญๆ–™ใ€StanfordNLP 0.2.0๏ผš็บฏPython็‰ˆ่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅŒ…ใ€NeuralNLP-NeuralClassifier๏ผš่…พ่ฎฏๅผ€ๆบๆทฑๅบฆๅญฆไน ๆ–‡ๆœฌๅˆ†็ฑปๅทฅๅ…ทใ€็ซฏๅˆฐ็ซฏ็š„ๅฐ้—ญๅŸŸๅฏน่ฏ็ณป็ปŸใ€ไธญๆ–‡ๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ๏ผšNeuroNER vs. BertNERใ€ๆ–ฐ้—ปไบ‹ไปถ็บฟ็ดขๆŠฝๅ–ใ€2019ๅนด็™พๅบฆ็š„ไธ‰ๅ…ƒ็ป„ๆŠฝๅ–ๆฏ”่ต›๏ผšโ€œ็ง‘ๅญฆ็ฉบ้—ด้˜Ÿโ€ๆบ็ ใ€ๅŸบไบŽไพๅญ˜ๅฅๆณ•็š„ๅผ€ๆ”พๅŸŸๆ–‡ๆœฌ็Ÿฅ่ฏ†ไธ‰ๅ…ƒ็ป„ๆŠฝๅ–ๅ’Œ็Ÿฅ่ฏ†ๅบ“ๆž„ๅปบใ€ไธญๆ–‡็š„GPT2่ฎญ็ปƒไปฃ็ ใ€ML-NLP - ๆœบๅ™จๅญฆไน (Machine Learning)NLP้ข่ฏ•ไธญๅธธ่€ƒๅˆฐ็š„็Ÿฅ่ฏ†็‚นๅ’Œไปฃ็ ๅฎž็Žฐใ€nlp4han:ไธญๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅทฅๅ…ท้›†(ๆ–ญๅฅ/ๅˆ†่ฏ/่ฏๆ€งๆ ‡ๆณจ/็ป„ๅ—/ๅฅๆณ•ๅˆ†ๆž/่ฏญไน‰ๅˆ†ๆž/NER/Nๅ…ƒ่ฏญๆณ•/HMM/ไปฃ่ฏๆถˆ่งฃ/ๆƒ…ๆ„Ÿๅˆ†ๆž/ๆ‹ผๅ†™ๆฃ€ๆŸฅใ€XLM๏ผšFacebook็š„่ทจ่ฏญ่จ€้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹ใ€็”จๅŸบไบŽBERT็š„ๅพฎ่ฐƒๅ’Œ็‰นๅพๆๅ–ๆ–นๆณ•ๆฅ่ฟ›่กŒ็Ÿฅ่ฏ†ๅ›พ่ฐฑ็™พๅบฆ็™พ็ง‘ไบบ็‰ฉ่ฏๆกๅฑžๆ€งๆŠฝๅ–ใ€ไธญๆ–‡่‡ช็„ถ่ฏญ่จ€ๅค„็†็›ธๅ…ณ็š„ๅผ€ๆ”พไปปๅŠก-ๆ•ฐๆฎ้›†-ๅฝ“ๅ‰ๆœ€ไฝณ็ป“ๆžœใ€CoupletAI - ๅŸบไบŽCNN+Bi-LSTM+Attention ็š„่‡ชๅŠจๅฏนๅฏน่”็ณป็ปŸใ€ๆŠฝ่ฑก็Ÿฅ่ฏ†ๅ›พ่ฐฑใ€MiningZhiDaoQACorpus - 580ไธ‡็™พๅบฆ็Ÿฅ้“้—ฎ็ญ”ๆ•ฐๆฎๆŒ–ๆŽ˜้กน็›ฎใ€brat rapid annotation tool: ๅบๅˆ—ๆ ‡ๆณจๅทฅๅ…ทใ€ๅคง่ง„ๆจกไธญๆ–‡็Ÿฅ่ฏ†ๅ›พ่ฐฑๆ•ฐๆฎ๏ผš1.4ไบฟๅฎžไฝ“ใ€ๆ•ฐๆฎๅขžๅผบๅœจๆœบๅ™จ็ฟป่ฏ‘ๅŠๅ…ถไป–nlpไปปๅŠกไธญ็š„ๅบ”็”จๅŠๆ•ˆๆžœใ€allennlp้˜…่ฏป็†่งฃ:ๆ”ฏๆŒๅคš็งๆ•ฐๆฎๅ’Œๆจกๅž‹ใ€PDF่กจๆ ผๆ•ฐๆฎๆๅ–ๅทฅๅ…ท ใ€ Graphbrain๏ผšAIๅผ€ๆบ่ฝฏไปถๅบ“ๅ’Œ็ง‘็ ”ๅทฅๅ…ท๏ผŒ็›ฎ็š„ๆ˜ฏไฟƒ่ฟ›่‡ชๅŠจๆ„ไน‰ๆๅ–ๅ’Œๆ–‡ๆœฌ็†่งฃไปฅๅŠ็Ÿฅ่ฏ†็š„ๆŽข็ดขๅ’ŒๆŽจๆ–ญใ€็ฎ€ๅŽ†่‡ชๅŠจ็ญ›้€‰็ณป็ปŸใ€ๅŸบไบŽๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ็š„็ฎ€ๅŽ†่‡ชๅŠจๆ‘˜่ฆใ€ไธญๆ–‡่ฏญ่จ€็†่งฃๆต‹่ฏ„ๅŸบๅ‡†๏ผŒๅŒ…ๆ‹ฌไปฃ่กจๆ€ง็š„ๆ•ฐๆฎ้›†&ๅŸบๅ‡†ๆจกๅž‹&่ฏญๆ–™ๅบ“&ๆŽ’่กŒๆฆœใ€ๆ ‘ๆดž OCR ๆ–‡ๅญ—่ฏ†ๅˆซ ใ€ไปŽๅŒ…ๅซ่กจๆ ผ็š„ๆ‰ซๆๅ›พ็‰‡ไธญ่ฏ†ๅˆซ่กจๆ ผๅ’Œๆ–‡ๅญ—ใ€่ฏญๅฃฐ่ฟ็งปใ€Pythonๅฃ่ฏญ่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅทฅๅ…ท้›†(่‹ฑๆ–‡)ใ€ similarity๏ผš็›ธไผผๅบฆ่ฎก็ฎ—ๅทฅๅ…ทๅŒ…๏ผŒjava็ผ–ๅ†™ใ€ๆตท้‡ไธญๆ–‡้ข„่ฎญ็ปƒALBERTๆจกๅž‹ ใ€Transformers 2.0 ใ€ๅŸบไบŽๅคง่ง„ๆจก้Ÿณ้ข‘ๆ•ฐๆฎ้›†Audioset็š„้Ÿณ้ข‘ๅขžๅผบ ใ€Poplar๏ผš็ฝ‘้กต็‰ˆ่‡ช็„ถ่ฏญ่จ€ๆ ‡ๆณจๅทฅๅ…ทใ€ๅ›พ็‰‡ๆ–‡ๅญ—ๅŽป้™ค๏ผŒๅฏ็”จไบŽๆผซ็”ป็ฟป่ฏ‘ ใ€186็ง่ฏญ่จ€็š„ๆ•ฐๅญ—ๅซๆณ•ๅบ“ใ€Amazonๅ‘ๅธƒๅŸบไบŽ็Ÿฅ่ฏ†็š„ไบบ-ไบบๅผ€ๆ”พ้ข†ๅŸŸๅฏน่ฏๆ•ฐๆฎ้›† ใ€ไธญๆ–‡ๆ–‡ๆœฌ็บ ้”™ๆจกๅ—ไปฃ็ ใ€็น็ฎ€ไฝ“่ฝฌๆข ใ€ Pythonๅฎž็Žฐ็š„ๅคš็งๆ–‡ๆœฌๅฏ่ฏปๆ€ง่ฏ„ไปทๆŒ‡ๆ ‡ใ€็ฑปไผผไบŽไบบๅ/ๅœฐๅ/็ป„็ป‡ๆœบๆž„ๅ็š„ๅ‘ฝๅไฝ“่ฏ†ๅˆซๆ•ฐๆฎ้›† ใ€ไธœๅ—ๅคงๅญฆใ€Š็Ÿฅ่ฏ†ๅ›พ่ฐฑใ€‹็ ”็ฉถ็”Ÿ่ฏพ็จ‹(่ต„ๆ–™)ใ€. ่‹ฑๆ–‡ๆ‹ผๅ†™ๆฃ€ๆŸฅๅบ“ ใ€ wwsearchๆ˜ฏไผไธšๅพฎไฟกๅŽๅฐ่‡ช็ ”็š„ๅ…จๆ–‡ๆฃ€็ดขๅผ•ๆ“Žใ€CHAMELEON๏ผšๆทฑๅบฆๅญฆไน ๆ–ฐ้—ปๆŽจ่็ณป็ปŸๅ…ƒๆžถๆž„ ใ€ 8็ฏ‡่ฎบๆ–‡ๆขณ็†BERT็›ธๅ…ณๆจกๅž‹่ฟ›ๅฑ•ไธŽๅๆ€ใ€DocSearch๏ผšๅ…่ดนๆ–‡ๆกฃๆœ็ดขๅผ•ๆ“Žใ€ LIDA๏ผš่ฝป้‡ไบคไบ’ๅผๅฏน่ฏๆ ‡ๆณจๅทฅๅ…ท ใ€aili - the fastest in-memory index in the East ไธœๅŠ็ƒๆœ€ๅฟซๅนถๅ‘็ดขๅผ• ใ€็Ÿฅ่ฏ†ๅ›พ่ฐฑ่ฝฆ้Ÿณๅทฅไฝœ้กน็›ฎใ€่‡ช็„ถ่ฏญ่จ€็”Ÿๆˆ่ต„ๆบๅคงๅ…จ ใ€ไธญๆ—ฅ้Ÿฉๅˆ†่ฏๅบ“mecab็š„PythonๆŽฅๅฃๅบ“ใ€ไธญๆ–‡ๆ–‡ๆœฌๆ‘˜่ฆ/ๅ…ณ้”ฎ่ฏๆๅ–ใ€ๆฑ‰ๅญ—ๅญ—็ฌฆ็‰นๅพๆๅ–ๅ™จ (featurizer)๏ผŒๆๅ–ๆฑ‰ๅญ—็š„็‰นๅพ๏ผˆๅ‘้Ÿณ็‰นๅพใ€ๅญ—ๅฝข็‰นๅพ๏ผ‰็”จๅšๆทฑๅบฆๅญฆไน ็š„็‰นๅพใ€ไธญๆ–‡็”ŸๆˆไปปๅŠกๅŸบๅ‡†ๆต‹่ฏ„ ใ€ไธญๆ–‡็ผฉๅ†™ๆ•ฐๆฎ้›†ใ€ไธญๆ–‡ไปปๅŠกๅŸบๅ‡†ๆต‹่ฏ„ - ไปฃ่กจๆ€ง็š„ๆ•ฐๆฎ้›†-ๅŸบๅ‡†(้ข„่ฎญ็ปƒ)ๆจกๅž‹-่ฏญๆ–™ๅบ“-baseline-ๅทฅๅ…ทๅŒ…-ๆŽ’่กŒๆฆœใ€PySS3๏ผš้ขๅ‘ๅฏ่งฃ้‡ŠAI็š„SS3ๆ–‡ๆœฌๅˆ†็ฑปๅ™จๆœบๅ™จๅฏ่ง†ๅŒ–ๅทฅๅ…ท ใ€ไธญๆ–‡NLPๆ•ฐๆฎ้›†ๅˆ—่กจใ€COPE - ๆ ผๅพ‹่ฏ—็ผ–่พ‘็จ‹ๅบใ€doccano๏ผšๅŸบไบŽ็ฝ‘้กต็š„ๅผ€ๆบๅๅŒๅคš่ฏญ่จ€ๆ–‡ๆœฌๆ ‡ๆณจๅทฅๅ…ท ใ€PreNLP๏ผš่‡ช็„ถ่ฏญ่จ€้ข„ๅค„็†ๅบ“ใ€็ฎ€ๅ•็š„็ฎ€ๅŽ†่งฃๆžๅ™จ๏ผŒ็”จๆฅไปŽ็ฎ€ๅŽ†ไธญๆๅ–ๅ…ณ้”ฎไฟกๆฏใ€็”จไบŽไธญๆ–‡้—ฒ่Š็š„GPT2ๆจกๅž‹๏ผšGPT2-chitchatใ€ๅŸบไบŽๆฃ€็ดข่Šๅคฉๆœบๅ™จไบบๅคš่ฝฎๅ“ๅบ”้€‰ๆ‹ฉ็›ธๅ…ณ่ต„ๆบๅˆ—่กจ(Leaderboardsใ€Datasetsใ€Papers)ใ€(Colab)ๆŠฝ่ฑกๆ–‡ๆœฌๆ‘˜่ฆๅฎž็Žฐ้›†้”ฆ(ๆ•™็จ‹ ใ€่ฏ่ฏญๆ‹ผ้Ÿณๆ•ฐๆฎใ€้ซ˜ๆ•ˆๆจก็ณŠๆœ็ดขๅทฅๅ…ทใ€NLPๆ•ฐๆฎๅขžๅนฟ่ต„ๆบ้›†ใ€ๅพฎ่ฝฏๅฏน่ฏๆœบๅ™จไบบๆก†ๆžถ ใ€ GitHub Typo Corpus๏ผšๅคง่ง„ๆจกGitHubๅคš่ฏญ่จ€ๆ‹ผๅ†™้”™่ฏฏ/่ฏญๆณ•้”™่ฏฏๆ•ฐๆฎ้›†ใ€TextCluster๏ผš็Ÿญๆ–‡ๆœฌ่š็ฑป้ข„ๅค„็†ๆจกๅ— Short text clusterใ€้ขๅ‘่ฏญ้Ÿณ่ฏ†ๅˆซ็š„ไธญๆ–‡ๆ–‡ๆœฌ่ง„่ŒƒๅŒ–ใ€BLINK๏ผšๆœ€ๅ…ˆ่ฟ›็š„ๅฎžไฝ“้“พๆŽฅๅบ“ใ€BertPunc๏ผšๅŸบไบŽBERT็š„ๆœ€ๅ…ˆ่ฟ›ๆ ‡็‚นไฟฎๅคๆจกๅž‹ใ€Tokenizer๏ผšๅฟซ้€Ÿใ€ๅฏๅฎšๅˆถ็š„ๆ–‡ๆœฌ่ฏๆกๅŒ–ๅบ“ใ€ไธญๆ–‡่ฏญ่จ€็†่งฃๆต‹่ฏ„ๅŸบๅ‡†๏ผŒๅŒ…ๆ‹ฌไปฃ่กจๆ€ง็š„ๆ•ฐๆฎ้›†ใ€ๅŸบๅ‡†(้ข„่ฎญ็ปƒ)ๆจกๅž‹ใ€่ฏญๆ–™ๅบ“ใ€ๆŽ’่กŒๆฆœใ€spaCy ๅŒปๅญฆๆ–‡ๆœฌๆŒ–ๆŽ˜ไธŽไฟกๆฏๆๅ– ใ€ NLPไปปๅŠก็คบไพ‹้กน็›ฎไปฃ็ ้›†ใ€ pythonๆ‹ผๅ†™ๆฃ€ๆŸฅๅบ“ใ€chatbot-list - ่กŒไธšๅ†…ๅ…ณไบŽๆ™บ่ƒฝๅฎขๆœใ€่Šๅคฉๆœบๅ™จไบบ็š„ๅบ”็”จๅ’Œๆžถๆž„ใ€็ฎ—ๆณ•ๅˆ†ไบซๅ’Œไป‹็ปใ€่ฏญ้Ÿณ่ดจ้‡่ฏ„ไปทๆŒ‡ๆ ‡(MOSNet, BSSEval, STOI, PESQ, SRMR)ใ€ ็”จ138GB่ฏญๆ–™่ฎญ็ปƒ็š„ๆณ•ๆ–‡RoBERTa้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹ ใ€BERT-NER-Pytorch๏ผšไธ‰็งไธๅŒๆจกๅผ็š„BERTไธญๆ–‡NERๅฎž้ชŒใ€ๆ— ้“่ฏๅ…ธ - ๆœ‰้“่ฏๅ…ธ็š„ๅ‘ฝไปค่กŒ็‰ˆๆœฌ๏ผŒๆ”ฏๆŒ่‹ฑๆฑ‰ไบ’ๆŸฅๅ’Œๅœจ็บฟๆŸฅ่ฏขใ€2019ๅนดNLPไบฎ็‚นๅ›ž้กพใ€ Chinese medical dialogue data ไธญๆ–‡ๅŒป็–—ๅฏน่ฏๆ•ฐๆฎ้›† ใ€ๆœ€ๅฅฝ็š„ๆฑ‰ๅญ—ๆ•ฐๅญ—(ไธญๆ–‡ๆ•ฐๅญ—)-้˜ฟๆ‹‰ไผฏๆ•ฐๅญ—่ฝฌๆขๅทฅๅ…ทใ€ ๅŸบไบŽ็™พ็ง‘็Ÿฅ่ฏ†ๅบ“็š„ไธญๆ–‡่ฏ่ฏญๅคš่ฏไน‰/ไน‰้กน่Žทๅ–ไธŽ็‰นๅฎšๅฅๅญ่ฏ่ฏญ่ฏญไน‰ๆถˆๆญงใ€awesome-nlp-sentiment-analysis - ๆƒ…ๆ„Ÿๅˆ†ๆžใ€ๆƒ…็ปชๅŽŸๅ› ่ฏ†ๅˆซใ€่ฏ„ไปทๅฏน่ฑกๅ’Œ่ฏ„ไปท่ฏๆŠฝๅ–ใ€LineFlow๏ผš้ขๅ‘ๆ‰€ๆœ‰ๆทฑๅบฆๅญฆไน ๆก†ๆžถ็š„NLPๆ•ฐๆฎ้ซ˜ๆ•ˆๅŠ ่ฝฝๅ™จใ€ไธญๆ–‡ๅŒปๅญฆNLPๅ…ฌๅผ€่ต„ๆบๆ•ด็† ใ€MedQuAD๏ผš(่‹ฑๆ–‡)ๅŒปๅญฆ้—ฎ็ญ”ๆ•ฐๆฎ้›†ใ€ๅฐ†่‡ช็„ถ่ฏญ่จ€ๆ•ฐๅญ—ไธฒ่งฃๆž่ฝฌๆขไธบๆ•ดๆ•ฐๅ’Œๆตฎ็‚นๆ•ฐใ€Transfer Learning in Natural Language Processing (NLP) ใ€้ขๅ‘่ฏญ้Ÿณ่ฏ†ๅˆซ็š„ไธญๆ–‡/่‹ฑๆ–‡ๅ‘้Ÿณ่พžๅ…ธใ€Tokenizers๏ผšๆณจ้‡ๆ€ง่ƒฝไธŽๅคšๅŠŸ่ƒฝๆ€ง็š„ๆœ€ๅ…ˆ่ฟ›ๅˆ†่ฏๅ™จใ€CLUENER ็ป†็ฒ’ๅบฆๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ Fine Grained Named Entity Recognitionใ€ ๅŸบไบŽBERT็š„ไธญๆ–‡ๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซใ€ไธญๆ–‡่ฐฃ่จ€ๆ•ฐๆฎๅบ“ใ€NLPๆ•ฐๆฎ้›†/ๅŸบๅ‡†ไปปๅŠกๅคงๅˆ—่กจใ€nlp็›ธๅ…ณ็š„ไธ€ไบ›่ฎบๆ–‡ๅŠไปฃ็ , ๅŒ…ๆ‹ฌไธป้ข˜ๆจกๅž‹ใ€่ฏๅ‘้‡(Word Embedding)ใ€ๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ(NER)ใ€ๆ–‡ๆœฌๅˆ†็ฑป(Text Classificatin)ใ€ๆ–‡ๆœฌ็”Ÿๆˆ(Text Generation)ใ€ๆ–‡ๆœฌ็›ธไผผๆ€ง(Text Similarity)่ฎก็ฎ—็ญ‰๏ผŒๆถ‰ๅŠๅˆฐๅ„็งไธŽnlp็›ธๅ…ณ็š„็ฎ—ๆณ•๏ผŒๅŸบไบŽkerasๅ’Œtensorflow ใ€Pythonๆ–‡ๆœฌๆŒ–ๆŽ˜/NLPๅฎžๆˆ˜็คบไพ‹ใ€ Blackstone๏ผš้ขๅ‘้ž็ป“ๆž„ๅŒ–ๆณ•ๅพ‹ๆ–‡ๆœฌ็š„spaCy pipelineๅ’ŒNLPๆจกๅž‹้€š่ฟ‡ๅŒไน‰่ฏๆ›ฟๆขๅฎž็Žฐๆ–‡ๆœฌโ€œๅ˜่„ธโ€ ใ€ไธญๆ–‡ ้ข„่ฎญ็ปƒ ELECTREA ๆจกๅž‹: ๅŸบไบŽๅฏนๆŠ—ๅญฆไน  pretrain Chinese Model ใ€albert-chinese-ner - ็”จ้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹ALBERTๅšไธญๆ–‡NER ใ€ๅŸบไบŽGPT2็š„็‰นๅฎšไธป้ข˜ๆ–‡ๆœฌ็”Ÿๆˆ/ๆ–‡ๆœฌๅขžๅนฟใ€ๅผ€ๆบ้ข„่ฎญ็ปƒ่ฏญ่จ€ๆจกๅž‹ๅˆ้›†ใ€ๅคš่ฏญ่จ€ๅฅๅ‘้‡ๅŒ…ใ€็ผ–็ ใ€ๆ ‡่ฎฐๅ’Œๅฎž็Žฐ๏ผšไธ€็งๅฏๆŽง้ซ˜ๆ•ˆ็š„ๆ–‡ๆœฌ็”Ÿๆˆๆ–นๆณ•ใ€ ่‹ฑๆ–‡่„่ฏๅคงๅˆ—่กจ ใ€attnvis๏ผšGPT2ใ€BERT็ญ‰transformer่ฏญ่จ€ๆจกๅž‹ๆณจๆ„ๅŠ›ไบคไบ’ๅฏ่ง†ๅŒ–ใ€CoVoST๏ผšFacebookๅ‘ๅธƒ็š„ๅคš่ฏญ็ง่ฏญ้Ÿณ-ๆ–‡ๆœฌ็ฟป่ฏ‘่ฏญๆ–™ๅบ“๏ผŒๅŒ…ๆ‹ฌ11็ง่ฏญ่จ€(ๆณ•่ฏญใ€ๅพท่ฏญใ€่ทๅ…ฐ่ฏญใ€ไฟ„่ฏญใ€่ฅฟ็ญ็‰™่ฏญใ€ๆ„ๅคงๅˆฉ่ฏญใ€ๅœŸ่€ณๅ…ถ่ฏญใ€ๆณขๆ–ฏ่ฏญใ€็‘žๅ…ธ่ฏญใ€่’™ๅค่ฏญๅ’Œไธญๆ–‡)็š„่ฏญ้Ÿณใ€ๆ–‡ๅญ—่ฝฌๅฝ•ๅŠ่‹ฑๆ–‡่ฏ‘ๆ–‡ใ€Jiagu่‡ช็„ถ่ฏญ่จ€ๅค„็†ๅทฅๅ…ท - ไปฅBiLSTM็ญ‰ๆจกๅž‹ไธบๅŸบ็ก€๏ผŒๆไพ›็Ÿฅ่ฏ†ๅ›พ่ฐฑๅ…ณ็ณปๆŠฝๅ– ไธญๆ–‡ๅˆ†่ฏ ่ฏๆ€งๆ ‡ๆณจ ๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ ๆƒ…ๆ„Ÿๅˆ†ๆž ๆ–ฐ่ฏๅ‘็Žฐ ๅ…ณ้”ฎ่ฏ ๆ–‡ๆœฌๆ‘˜่ฆ ๆ–‡ๆœฌ่š็ฑป็ญ‰ๅŠŸ่ƒฝใ€็”จunetๅฎž็Žฐๅฏนๆ–‡ๆกฃ่กจๆ ผ็š„่‡ชๅŠจๆฃ€ๆต‹๏ผŒ่กจๆ ผ้‡ๅปบใ€NLPไบ‹ไปถๆๅ–ๆ–‡็Œฎ่ต„ๆบๅˆ—่กจ ใ€ ้‡‘่ž้ข†ๅŸŸ่‡ช็„ถ่ฏญ่จ€ๅค„็†็ ”็ฉถ่ต„ๆบๅคงๅˆ—่กจใ€CLUEDatasetSearch - ไธญ่‹ฑๆ–‡NLPๆ•ฐๆฎ้›†๏ผšๆœ็ดขๆ‰€ๆœ‰ไธญๆ–‡NLPๆ•ฐๆฎ้›†๏ผŒ้™„ๅธธ็”จ่‹ฑๆ–‡NLPๆ•ฐๆฎ้›† ใ€medical_NER - ไธญๆ–‡ๅŒปๅญฆ็Ÿฅ่ฏ†ๅ›พ่ฐฑๅ‘ฝๅๅฎžไฝ“่ฏ†ๅˆซ ใ€(ๅ“ˆไฝ›)่ฎฒๅ› ๆžœๆŽจ็†็š„ๅ…่ดนไนฆใ€็Ÿฅ่ฏ†ๅ›พ่ฐฑ็›ธๅ…ณๅญฆไน ่ต„ๆ–™/ๆ•ฐๆฎ้›†/ๅทฅๅ…ท่ต„ๆบๅคงๅˆ—่กจใ€Forte๏ผš็ตๆดปๅผบๅคง็š„่‡ช็„ถ่ฏญ่จ€ๅค„็†pipelineๅทฅๅ…ท้›† ใ€Pythonๅญ—็ฌฆไธฒ็›ธไผผๆ€ง็ฎ—ๆณ•ๅบ“ใ€PyLaia๏ผš้ขๅ‘ๆ‰‹ๅ†™ๆ–‡ๆกฃๅˆ†ๆž็š„ๆทฑๅบฆๅญฆไน ๅทฅๅ…ทๅŒ…ใ€TextFooler๏ผš้’ˆๅฏนๆ–‡ๆœฌๅˆ†็ฑป/ๆŽจ็†็š„ๅฏนๆŠ—ๆ–‡ๆœฌ็”Ÿๆˆๆจกๅ—ใ€Haystack๏ผš็ตๆดปใ€ๅผบๅคง็š„ๅฏๆ‰ฉๅฑ•้—ฎ็ญ”(QA)ๆก†ๆžถใ€ไธญๆ–‡ๅ…ณ้”ฎ็Ÿญ่ฏญๆŠฝๅ–ๅทฅๅ…ท

  • gensim gensim 14,969
    star
    updated 3 months ago GNU Lesser Genera...

    Topic Modelling for Humans

  • jieba jieba 31,953
    star
    updated 2 months ago MIT License

    ็ป“ๅทดไธญๆ–‡ๅˆ†่ฏ

  • langid.py langid.py 2,228
    star
    updated about 4 years ago Other

    Stand-alone language identification system

  • pattern pattern 8,643
    star
    updated 9 months ago BSD 3-Clause "New...

    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

  • updated over 1 year ago MIT License

    pkusegๅคš้ข†ๅŸŸไธญๆ–‡ๅˆ†่ฏๅทฅๅ…ท; The pkuseg toolkit for multi-domain Chinese word segmentation

  • polyglot polyglot 2,232
    star
    updated 5 months ago Other

    Multilingual text (NLP) processing toolkit

  • pytext pytext 6,357
    star
    updated 10 months ago Other

    A natural language modeling framework based on PyTorch

  • updated 9 months ago BSD 3-Clause "New...

    Basic Utilities for PyTorch Natural Language Processing (NLP)

  • snownlp snownlp 6,291
    star
    updated about 4 years ago MIT License

    Python library for processing Chinese text

  • stanza stanza 6,998
    star
    updated 12 days ago Other

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Network Virtualization

  • mininet mininet 5,078
    star
    updated 3 months ago BSD 3-Clause "New...

    Emulator for rapid prototyping of Software Defined Networks

  • napalm napalm 2,158
    star
    updated 29 days ago Apache License 2.0

    Network Automation and Programmability Abstraction Layer with Multivendor support

  • pox pox 611
    star
    updated 11 months ago Apache License 2.0

    The POX network software platform

News Feed

  • updated 3 months ago BSD 3-Clause "New...

    Generate generic activity streams from the actions on your site. Users can follow any actors' activities for personalized streams.

  • updated 10 months ago Other

    Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:

NLP

  • allennlp allennlp 11,639
    star
    updated over 1 year ago Apache License 2.0

    An open-source NLP research library, built on PyTorch.

  • bert bert 36,701
    star
    updated 5 months ago Apache License 2.0

    TensorFlow code and pre-trained models for BERT

  • updated over 1 year ago MIT License

    Fixes contractions such as `you're` to `you are`

  • updated about 1 year ago GNU General Publi...

    Fuzzy String Matching in Python

  • gensim gensim 14,969
    star
    updated 3 months ago GNU Lesser Genera...

    Topic Modelling for Humans

  • haystack haystack 12,711
    star
    updated about 1 month ago Apache License 2.0

    ๐Ÿ” LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

  • nltk nltk 12,904
    star
    updated 17 days ago Apache License 2.0

    NLTK Source

  • polyglot polyglot 2,232
    star
    updated 5 months ago Other

    Multilingual text (NLP) processing toolkit

  • updated 3 days ago Apache License 2.0

    Multilingual Sentence & Image Embeddings with BERT

  • spaCy spaCy 28,425
    star
    updated 21 days ago MIT License

    ๐Ÿ’ซ Industrial-strength Natural Language Processing (NLP) in Python

  • stanza stanza 6,998
    star
    updated 12 days ago Other

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

  • transformers transformers 121,026
    star
    updated 29 days ago Apache License 2.0

    ๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • updated about 5 years ago Apache License 2.0

    Lightning Fast Language Prediction ๐Ÿš€

  • updated 3 months ago GNU Affero Genera...

    A tool for extracting plain text from Wikipedia dumps

  • updated 2 months ago MIT License

    A little word cloud generator in Python

Object Factories

  • updated about 2 months ago MIT License

    A test fixtures replacement for Python

  • mixer mixer 927
    star
    updated about 2 months ago Other

    Mixer -- Is a fixtures replacement. Supported Django, Flask, SqlAlchemy and custom python objects.

ORM

  • updated 6 months ago

    A curated list of awesome tools for SQLAlchemy

  • dataset dataset 4,709
    star
    updated 7 months ago MIT License

    Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

  • updated 12 months ago BSD 2-Clause "Sim...

    Rich Python data types for Redis

  • updated 18 days ago MIT License

    A Python Object-Document-Mapper for working with MongoDB

  • orator orator 1,421
    star
    updated over 1 year ago MIT License

    The Orator ORM provides a simple yet beautiful ActiveRecord implementation.

  • orm orm 1,752
    star
    updated over 1 year ago BSD 3-Clause "New...

    An async ORM. ๐Ÿ—ƒ

  • peewee peewee 10,581
    star
    updated 2 months ago MIT License

    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb

  • pony pony 3,368
    star
    updated 6 months ago Apache License 2.0

    Pony Object Relational Mapper

  • pydal pydal 468
    star
    updated 2 months ago BSD 3-Clause "New...

    A pure Python Database Abstraction Layer

Package Management

  • conda conda 5,988
    star
    updated about 1 month ago Other

    A system-level, binary package and environment manager running on all major operating systems and platforms.

  • pip-tools pip-tools 7,398
    star
    updated 15 days ago BSD 3-Clause "New...

    A set of tools to keep your pinned Python dependencies fresh.

Package Repositories

  • updated about 1 month ago Academic Free Lic...

    A PyPI mirror client according to PEP 381 http://www.python.org/dev/peps/pep-0381/

  • devpi devpi 807
    star
    updated 25 days ago

    Python PyPi staging server and packaging, testing, release tool

Pandas

  • missingno missingno 3,634
    star
    updated about 1 year ago MIT License

    Missing data visualization module for Python.

  • modin modin 9,403
    star
    updated 8 days ago Apache License 2.0

    Modin: Scale your Pandas workflows by changing a single line of code

  • pandas pandas 41,491
    star
    updated 17 days ago BSD 3-Clause "New...

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • updated 5 months ago Other

    Extract data from a wide range of Internet sources into a pandas DataFrame.

  • qgrid qgrid 3,018
    star
    updated 3 months ago Apache License 2.0

    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Penetration testing

  • fsociety fsociety 9,996
    star
    updated 4 months ago MIT License

    fsociety Hacking Tools Pack โ€“ A Penetration Testing Framework

  • updated 3 months ago

    The Social-Engineer Toolkit (SET) repository from TrustedSec - All new versions of SET will be deployed here.

  • sqlmap sqlmap 28,686
    star
    updated 5 months ago Other

    Automatic SQL injection and database takeover tool

Performance

  • black black 37,119
    star
    updated 11 days ago MIT License

    The uncompromising Python code formatter

  • cython cython 8,388
    star
    updated 6 months ago Apache License 2.0

    The most widely used Python to C compiler

  • numba numba 9,225
    star
    updated about 2 months ago BSD 2-Clause "Sim...

    NumPy aware dynamic Python compiler using LLVM

  • numpy numpy 25,420
    star
    updated 3 months ago BSD 3-Clause "New...

    The fundamental package for scientific computing with Python.

  • pytest pytest 11,150
    star
    updated about 1 month ago MIT License

    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

  • scipy scipy 12,295
    star
    updated 12 days ago BSD 3-Clause "New...

    SciPy library main repository

Permissions

  • updated 5 months ago MIT License

    Awesome Django authorization, without the database

Processes

  • updated over 1 year ago MIT License

    Subprocesses for Humans 2.0.

  • sh sh 6,822
    star
    updated 3 months ago MIT License

    Python process launching

Profiling

  • profiling profiling 2,965
    star
    updated over 3 years ago BSD 3-Clause "New...

    Was an interactive continuous Python profiler.

  • updated 7 months ago BSD 3-Clause "New...

    ๐Ÿšดย Call stack profiler for Python. Shows you why your code is slow!

  • yappi yappi 1,317
    star
    updated 3 months ago MIT License

    Yet Another Python Profiler, but this time multithreading, asyncio and gevent aware.

Recommender Systems

  • annoy annoy 12,458
    star
    updated about 2 months ago Apache License 2.0

    Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

  • fastFM fastFM 1,059
    star
    updated over 1 year ago Other

    fastFM: A Library for Factorization Machines

  • implicit implicit 3,383
    star
    updated 4 months ago MIT License

    Fast Python Collaborative Filtering for Implicit Feedback Datasets

  • lightfm lightfm 4,568
    star
    updated 3 months ago Apache License 2.0

    A Python implementation of LightFM, a hybrid recommendation algorithm.

  • spotlight spotlight 2,857
    star
    updated 11 months ago MIT License

    Deep recommender models using PyTorch.

  • Surprise Surprise 6,125
    star
    updated 11 months ago BSD 3-Clause "New...

    A Python scikit for building and analyzing recommender systems

  • tensorrec tensorrec 1,249
    star
    updated 10 months ago Apache License 2.0

    A TensorFlow recommendation algorithm and framework in Python.

Refactoring

  • rope rope 1,803
    star
    updated about 1 month ago GNU Lesser Genera...

    a python refactoring library

RESTful API

  • apistar apistar 5,590
    star
    updated about 2 years ago BSD 3-Clause "New...

    The Web API toolkit. ๐Ÿ› 

  • updated about 2 months ago Other

    Build Web Services with Pyramid.

  • eve eve 6,654
    star
    updated 3 months ago Other

    REST API framework designed for human beings

  • falcon falcon 9,323
    star
    updated 2 months ago Apache License 2.0

    The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.

  • fastapi fastapi 69,216
    star
    updated 15 days ago MIT License

    FastAPI framework, high performance, easy to learn, fast to code, ready for production

  • updated 7 months ago BSD 3-Clause "New...

    Simple framework for creating REST APIs

  • hug hug 6,809
    star
    updated 9 months ago MIT License

    Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.

  • sandman2 sandman2 1,981
    star
    updated 17 days ago Apache License 2.0

    Automatically generate a RESTful API service for your legacy database. No code required!

Robotics

  • updated 7 months ago Other

    Python sample codes for robotics algorithms.

RPC Servers

Science

  • bcbb bcbb 582
    star
    updated about 1 year ago

    Incubator for useful bioinformatics code, primarily in Python and R

  • updated 23 days ago GNU General Publi...

    Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

  • updated 3 months ago BSD 3-Clause "New...

    Statsmodels: statistical modeling and econometrics in Python

  • sympy sympy 11,931
    star
    updated 17 days ago Other

    A computer algebra system written in pure Python

  • zipline zipline 16,969
    star
    updated about 1 month ago Apache License 2.0

    Zipline, a Pythonic Algorithmic Trading Library

Security

  • impacket impacket 9,423
    star

    Impacket is a collection of Python classes for working with network protocols.

  • lemur lemur 1,651
    star
    updated 7 months ago Apache License 2.0

    Repository for the Lemur Certificate Manager

  • metaflow metaflow 7,474
    star
    updated 8 days ago Apache License 2.0

    ๐Ÿš€ Build and manage real-life ML, AI, and data science projects with ease!

  • repokid repokid 1,082
    star
    updated 8 months ago Apache License 2.0

    AWS Least Privilege for Distributed, High-Velocity Deployment

  • updated about 3 years ago Apache License 2.0

    Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.

  • w3af w3af 4,309
    star
    updated about 1 year ago

    w3af: web application attack and audit framework, the open source web vulnerability scanner.

Serialization

  • updated 15 days ago MIT License

    A lightweight library for converting complex objects to and from simple Python datatypes.

  • updated about 2 months ago Other

    Python bindings for the simdjson project.

  • updated 10 days ago Other

    Python wrapper around rapidjson

  • rapidjson rapidjson 13,742
    star
    updated 30 days ago Other

    A fast JSON parser/generator for C++ with both SAX/DOM style API

Serverless Frameworks

  • updated 9 months ago ISC License

    A toolkit for developing and deploying serverless Python code in AWS Lambda.

  • Zappa Zappa 2,991
    star
    updated 12 days ago MIT License

    Serverless Python

Shell

  • xonsh xonsh 7,877
    star
    updated 16 days ago Other

    ๐Ÿš Python-powered, cross-platform, Unix-gazing shell.

Simulation

  • PyPSA PyPSA 1,036
    star
    updated about 1 month ago MIT License

    PyPSA: Python for Power System Analysis

  • updated 2 months ago MIT License

    A community based Python library for quantitative economics

  • RLBot RLBot 576
    star
    updated about 1 month ago MIT License

    A framework that allows people to write their own Rocket League bots.

Specific Formats Processing

  • csvkit csvkit 5,780
    star
    updated about 1 month ago MIT License

    A suite of utilities for converting to and working with CSV, the king of tabular file formats.

  • mistune mistune 2,323
    star
    updated 7 months ago BSD 3-Clause "New...

    A fast yet powerful Python Markdown parser with renderers and plugins.

  • pdfminer pdfminer 5,178
    star
    updated over 1 year ago MIT License

    Python PDF Parser (Not actively maintained). Check out pdfminer.six.

  • pyexcel pyexcel 1,162
    star
    updated 3 months ago Other

    Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files

  • updated 2 months ago MIT License

    Create and modify Word documents with Python

  • updated about 2 months ago GNU Lesser Genera...

    Use a docx as a jinja2 template

  • updated 14 days ago MIT License

    Create Open XML PowerPoint documents in Python

  • tablib tablib 4,500
    star
    updated about 1 month ago MIT License

    Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.

  • unoconv unoconv 2,493
    star
    updated 11 months ago GNU General Publi...

    Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.

  • unp unp 414
    star
    updated almost 7 years ago Other

    Unpacks things.

  • xlrd xlrd 2,125
    star
    updated about 2 months ago Other

    Please use openpyxl where you can...

  • updated about 2 months ago BSD 2-Clause "Sim...

    A Python module for creating Excel XLSX files.

  • xlwt xlwt 1,029
    star
    updated about 4 years ago Other

    Library to create spreadsheet files compatible with MS Excel 97/2000/XP/2003 XLS files, on any platform.

Static Site Generator

  • lektor lektor 3,729
    star
    updated 5 months ago BSD 3-Clause "New...

    The lektor static file content management system

  • makesite makesite 1,691
    star
    updated about 1 year ago MIT License

    Simple, lightweight, and magic-free static site/blog generator for Python coders

  • mkdocs mkdocs 18,095
    star
    updated 12 days ago BSD 2-Clause "Sim...

    Project documentation with Markdown.

  • nikola nikola 2,486
    star
    updated 5 months ago MIT License

    A static website and blog generator

  • pelican pelican 12,063
    star
    updated 13 days ago GNU Affero Genera...

    Static site generator that supports Markdown and reST syntax. Powered by Python.

Study

  • updated almost 2 years ago

    Matplotlib tutorial for beginner

  • numpy-100 numpy-100 10,706
    star
    updated 8 months ago MIT License

    100 numpy exercises (with solutions)

  • updated 2 months ago Other

    An open access book on scientific visualization using python and matplotlib

Tagging

  • updated 21 days ago BSD 3-Clause "New...

    Simple tagging for django

Task Queues

  • dramatiq dramatiq 3,640
    star
    updated 9 months ago GNU Lesser Genera...

    A fast and reliable background task processing library for Python 3.

  • huey huey 4,763
    star
    updated 3 months ago MIT License

    a little task queue for python

  • mrq mrq 878
    star
    updated 10 months ago MIT License

    Mr. Queue - A distributed worker task queue in Python using Redis & gevent

  • rq rq 9,417
    star
    updated 19 days ago Other

    Simple job queues for Python

Template

  • updated 3 days ago BSD 3-Clause "New...

    A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

  • PyGithub PyGithub 6,607
    star
    updated 13 days ago GNU Lesser Genera...

    Typed interactions with the GitHub API v3

  • updated about 1 month ago MIT License

    BDD library for the py.test runner

Template Engine

  • jinja jinja 9,852
    star
    updated about 1 month ago BSD 3-Clause "New...

    A very fast and expressive template engine.

Terminal

  • bless bless 2,701
    star
    updated 8 months ago Apache License 2.0

    Repository for BLESS, an SSH Certificate Authority that runs as a AWS Lambda function

  • bpython bpython 2,517
    star
    updated about 2 months ago Other

    bpython - A fancy curses interface to the Python interactive interpreter

  • Gooey Gooey 17,984
    star
    updated 6 months ago MIT License

    Turn (almost) any Python command line program into a full GUI application with one line

  • icdiff icdiff 4,121
    star
    updated 7 months ago Other

    improved colored diff

  • ipython ipython 16,090
    star
    updated about 1 month ago BSD 3-Clause "New...

    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

  • lemur lemur 1,651
    star
    updated 7 months ago Apache License 2.0

    Repository for the Lemur Certificate Manager

  • pexpect pexpect 2,425
    star
    updated 6 months ago Other

    A Python module for controlling interactive programs in a pseudo-terminal

  • repokid repokid 1,082
    star
    updated 8 months ago Apache License 2.0

    AWS Least Privilege for Distributed, High-Velocity Deployment

  • updated about 3 years ago Apache License 2.0

    Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.

  • tmuxp tmuxp 3,847
    star
    updated 3 months ago MIT License

    ๐Ÿ–ฅ๏ธ tmux session manager. built on libtmux

Testing

  • updated about 2 months ago MIT License

    A test fixtures replacement for Python

  • fake2db fake2db 2,256
    star
    updated over 4 years ago GNU General Publi...

    create custom test databases that are populated with fake data

  • faker faker 16,985
    star
    updated 8 days ago MIT License

    Faker is a Python package that generates fake data for you.

  • freezegun freezegun 3,919
    star
    updated about 1 month ago Apache License 2.0

    Let your Python tests travel through time

  • green green 780
    star
    updated about 2 months ago MIT License

    Green is a clean, colorful, fast python test runner.

  • updated 6 months ago Other

    A mocking library for requests

  • HTTPretty HTTPretty 2,080
    star
    updated about 2 months ago MIT License

    Intercept HTTP requests at the Python socket level. Fakes the whole socket module

  • updated 9 days ago Other

    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.

  • locust locust 23,382
    star
    updated 9 days ago MIT License

    Write scalable load tests in plain Python ๐Ÿš—๐Ÿ’จ

  • mimesis mimesis 4,243
    star
    updated 24 days ago MIT License

    Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.

  • mixer mixer 927
    star
    updated about 2 months ago Other

    Mixer -- Is a fixtures replacement. Supported Django, Flask, SqlAlchemy and custom python objects.

  • nose2 nose2 771
    star
    updated 16 days ago Other

    The successor to nose, based on unittest2

  • updated about 1 month ago BSD 3-Clause "New...

    a socket mock framework - for all kinds of socket animals, web-clients included

  • responses responses 3,938
    star
    updated 7 months ago Apache License 2.0

    A utility for mocking out the Python Requests library.

  • updated 3 months ago Apache License 2.0

    Generic automation framework for acceptance testing and RPA

  • splinter splinter 2,676
    star
    updated about 1 month ago BSD 3-Clause "New...

    splinter - python test framework for web applications

  • vcrpy vcrpy 2,584
    star
    updated 13 days ago MIT License

    Automatically mock your HTTP interactions to simplify and speed up testing

Testing Frameworks

  • updated 9 days ago Other

    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.

  • nose2 nose2 771
    star
    updated 16 days ago Other

    The successor to nose, based on unittest2

  • updated 3 months ago Apache License 2.0

    Generic automation framework for acceptance testing and RPA

Test Runners

  • green green 780
    star
    updated about 2 months ago MIT License

    Green is a clean, colorful, fast python test runner.

Text Processing

  • chardet chardet 2,055
    star
    updated 5 months ago GNU Lesser Genera...

    Python character encoding detector

  • updated about 1 year ago GNU General Publi...

    Fuzzy String Matching in Python

  • updated 9 months ago MIT License

    Implementation of hashids (http://hashids.org) in Python. Compatible with Pythonโ€‰2 and Pythonโ€‰3

  • updated 12 months ago MIT License

    Paranoid text spacing in Python

  • ply ply 2,681
    star
    updated 7 months ago

    Python Lex-Yacc

  • pyfiglet pyfiglet 1,283
    star
    updated 2 months ago Other

    An implementation of figlet written in Python

  • pyparsing pyparsing 2,004
    star
    updated 4 months ago MIT License

    Python library for creating PEG parsers

  • updated 4 months ago Other

    A simple Python module for parsing human names into their individual components

  • updated about 1 month ago Apache License 2.0

    Python port of Google's libphonenumber

  • updated 7 months ago MIT License

    ๆฑ‰ๅญ—่ฝฌๆ‹ผ้Ÿณ(pypinyin)

  • updated 5 months ago MIT License

    Returns unicode slugs

  • updated about 1 year ago MIT License

    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

  • shortuuid shortuuid 1,905
    star
    updated about 1 year ago BSD 3-Clause "New...

    A generator library for concise, unambiguous and URL-safe UUIDs.

  • sqlparse sqlparse 3,539
    star
    updated 17 days ago BSD 3-Clause "New...

    A non-validating SQL parser module for Python

  • updated 12 months ago BSD 3-Clause "New...

    A slugifier that works in unicode

Third-party APIs

  • boto3 boto3 8,584
    star
    updated about 1 month ago Apache License 2.0

    AWS SDK for Python

  • updated 5 months ago Apache License 2.0

    Python SDK for Facebook's Graph API

  • gspread gspread 6,789
    star
    updated 26 days ago MIT License

    Google Sheets Python API

  • twython twython 1,849
    star
    updated over 1 year ago MIT License

    Actively maintained, pure Python wrapper for the Twitter API. Supports both normal and streaming Twitter APIs.

Typing

  • mypy mypy 17,388
    star
    updated 6 days ago Other

    Optional static typing for Python

  • typeguard typeguard 1,408
    star
    updated 16 days ago Other

    Run-time type checker for Python

  • updated 6 months ago MIT License

    Runtime inspection utilities for Python typing module

URL Manipulation

  • furl furl 2,542
    star
    updated 11 months ago Other

    ๐ŸŒ URL parsing and manipulation made easy.

  • purl purl 293
    star
    updated 8 months ago MIT License

    A simple, immutable URL class with a clean API for interrogation and manipulation.

  • updated almost 2 years ago GNU General Publi...

    ๐Ÿ”Œ Generating short urls with python has never been easier

  • webargs webargs 1,362
    star
    updated 16 days ago MIT License

    A friendly library for parsing HTTP request arguments, with built-in support for popular web frameworks, including Flask, Django, Bottle, Tornado, Pyramid, webapp2, Falcon, and aiohttp.

Utility

  • updated over 1 year ago

    A common base representation of python source code for pylint and other projects

  • bandit bandit 5,900
    star
    updated 10 days ago Apache License 2.0

    Bandit is a tool designed to find common security issues in Python code.

  • black black 37,119
    star
    updated 11 days ago MIT License

    The uncompromising Python code formatter

  • click click 14,890
    star
    updated 26 days ago BSD 3-Clause "New...

    Python composable command line interface toolkit

  • updated 10 days ago Other

    Web APIs for Django. ๐ŸŽธ

  • flake8 flake8 3,068
    star
    updated 5 months ago Other

    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.

  • flask flask 65,996
    star
    updated 20 days ago BSD 3-Clause "New...

    The Python micro framework for building web applications.

  • httpx httpx 11,445
    star
    updated 4 months ago BSD 3-Clause "New...

    A next generation HTTP client for Python. ๐Ÿฆ‹

  • isort isort 6,277
    star
    updated 8 days ago MIT License

    A Python utility / library to sort imports.

  • mccabe mccabe 602
    star
    updated 6 months ago Other

    McCabe complexity checker for Python

  • PyGithub PyGithub 6,607
    star
    updated 13 days ago GNU Lesser Genera...

    Typed interactions with the GitHub API v3

  • pylint pylint 4,246
    star
    updated over 1 year ago

    It's not just a linter that annoys you!

  • python-fire python-fire 26,112
    star
    updated 14 days ago Other

    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

  • scrapy scrapy 50,143
    star
    updated about 1 month ago BSD 3-Clause "New...

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • updated 11 months ago MIT License

    The official Python SDK for Sentry.io

  • updated 2 days ago MIT License

    The Database Toolkit for Python

Video

  • updated about 4 years ago Other

    Video processing routines for SciPy

  • vidgear vidgear 3,064
    star
    updated 4 months ago Apache License 2.0

    A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features ๐Ÿ”ฅ

Visualization

  • altair altair 8,827
    star
    updated 12 days ago BSD 3-Clause "New...

    Declarative statistical visualization library for Python

  • bokeh bokeh 18,591
    star
    updated about 1 month ago BSD 3-Clause "New...

    Interactive Data Visualization in the browser, from Python

  • updated 7 months ago Other

    Python binding for the freetype library

  • matplotlib matplotlib 18,720
    star
    updated 2 months ago

    matplotlib: plotting with Python

  • updated over 3 years ago BSD 2-Clause "Sim...

    Matplotlib 3.1 cheat sheet.

  • updated almost 2 years ago

    Matplotlib tutorial for beginner

  • numpy-100 numpy-100 10,706
    star
    updated 8 months ago MIT License

    100 numpy exercises (with solutions)

  • plotly.py plotly.py 15,052
    star
    updated 6 days ago MIT License

    The interactive graphing library for Python โœจ This project now includes Plotly Express!

  • updated 2 months ago Other

    An open access book on scientific visualization using python and matplotlib

  • updated 11 months ago

    Ten simple rules for better figures

  • vispy vispy 3,193
    star
    updated about 1 month ago Other

    Main repository for Vispy

Web

  • bottle bottle 8,217
    star
    updated 3 months ago MIT License

    bottle.py is a fast and simple micro-framework for python web-applications.

  • django django 76,089
    star
    updated 14 days ago BSD 3-Clause "New...

    The Web framework for perfectionists with deadlines.

  • updated 10 days ago Other

    Web APIs for Django. ๐ŸŽธ

  • peewee peewee 10,581
    star
    updated 2 months ago MIT License

    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb

  • PyMySQL PyMySQL 7,495
    star
    updated about 2 months ago MIT License

    MySQL client library for Python

  • updated 10 days ago MIT License

    Coverage plugin for pytest.

  • updated about 1 month ago Other

    A Django plugin for pytest.

  • updated about 1 month ago MIT License

    A set of pytest fixtures to test Flask applications

  • updated 16 days ago Other

    Plugin for generating HTML reports for pytest results

  • updated 10 days ago Other

    Plugin for running Selenium with pytest

  • updated over 1 year ago BSD 3-Clause "New...

    Social auth made simple

  • updated 2 days ago MIT License

    THIS IS NOT THE OFFICIAL REPO - PLEASE SUBMIT PRs ETC AT: http://github.com/sqlalchemy/sqlalchemy

  • updated 2 days ago MIT License

    The Database Toolkit for Python

  • starlette starlette 8,881
    star
    updated 5 months ago BSD 3-Clause "New...

    The little ASGI framework that shines. ๐ŸŒŸ

  • uvicorn uvicorn 7,411
    star
    updated 3 months ago BSD 3-Clause "New...

    An ASGI web server, for Python. ๐Ÿฆ„

Web Asset Management

  • updated 2 months ago Other

    Compresses linked and inline javascript or CSS into a single cached file.

  • updated 17 days ago MIT License

    Pipeline is an asset packaging library for Django.

  • updated 2 months ago BSD 3-Clause "New...

    https://django-storages.readthedocs.io/

  • updated 5 months ago BSD 2-Clause "Sim...

    Flask webassets integration.

  • updated over 1 year ago BSD 2-Clause "Sim...

    Asset management for Python web development.

Web Content Extracting

  • lassie lassie 588
    star
    updated over 1 year ago MIT License

    Web Content Retrieval for Humansโ„ข

  • updated 3 months ago MIT License

    a small library for extracting rich content from urls

  • newspaper newspaper 13,586
    star
    updated about 2 months ago MIT License

    newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

  • updated about 1 month ago MIT License

    Pythonic HTML Parsing for Humansโ„ข

  • sumy sumy 3,394
    star
    updated about 1 month ago Apache License 2.0

    Module for automatic summarization of text documents and HTML pages.

  • textract textract 3,730
    star
    updated 24 days ago MIT License

    extract text from any document. no muss. no fuss.

Web Crawling

  • grab grab 2,330
    star
    updated 9 months ago MIT License

    Web Scraping Framework

  • updated 3 months ago MIT License

    A Python library for automating interaction with websites.

  • portia portia 8,991
    star
    updated 7 months ago BSD 3-Clause "New...

    Visual scraping for Scrapy

  • pyspider pyspider 16,220
    star
    updated 9 months ago Apache License 2.0

    A Powerful Spider(Web Crawler) System in Python.

  • updated over 3 years ago BSD 3-Clause "New...

Web Frameworks

  • updated about 2 months ago Other

    The Best Django Resource, Awesome Django for mature packages.

  • updated about 2 months ago Creative Commons ...

    A curated list of awesome things related to Django

  • updated 5 months ago

    A curated list of awesome Flask resources and plugins

  • updated over 2 years ago

    A curated list of awesome Pyramid apps, projects and resources.

  • masonite masonite 2,133
    star
    updated 16 days ago MIT License

    The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ

WebSocket

  • updated 2 months ago MIT License

    WebSocket and WAMP in Python for Twisted and asyncio

  • channels channels 5,886
    star
    updated 20 days ago BSD 3-Clause "New...

    Developer-friendly asynchrony for Django

  • Library for building WebSocket servers and clients in Python

WSGI Servers

  • bjoern bjoern 2,970
    star
    updated 10 months ago Other

    A screamingly fast Python 2/3 WSGI server written in C.

  • gunicorn gunicorn 9,432
    star
    updated about 1 month ago Other

    gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.

  • waitress waitress 1,310
    star
    updated 5 months ago Other

    Waitress - A WSGI server for Python 3

  • werkzeug werkzeug 6,518
    star
    updated 19 days ago BSD 3-Clause "New...

    The comprehensive WSGI web application library.