Awesome Python

Admin Panels

  • ajenti ajenti 7,468
    star
    updated 3 months ago MIT License

    Ajenti Core and stock plugins

  • updated 9 months ago GNU Affero Genera...

    Modern responsive template for the Django admin interface with improved functionality. We are proud to announce a completely new Jet.

  • updated 6 months ago BSD 3-Clause "New...

    Simple and extensible administrative interface framework for Flask

  • flower flower 6,305
    star
    updated 4 months ago Other

    Real-time monitor and web admin for Celery distributed task queue

  • updated over 1 year ago MIT License

    Jet Admin - No-code Business App builder

  • xadmin xadmin 4,757
    star
    updated 11 months ago BSD 3-Clause "New...

    Drop-in replacement of Django admin comes with lots of goodies, fully extensible with plugin support, pretty UI based on Twitter Bootstrap.

Algorithms and Design Patterns

  • algorithms algorithms 24,009
    star
    updated 4 months ago MIT License

    Minimal examples of data structures and algorithms in Python

  • updated 5 months ago

    A curated list of awesome places to learn and/or practice algorithms.

  • PyPattyrn PyPattyrn 2,244
    star
    updated 5 months ago MIT License

    A simple library for implementing common design patterns.

  • Python Python 193,522
    star
    updated 9 days ago MIT License

    All Algorithms implemented in Python

  • python-ds python-ds 2,772
    star
    updated 7 months ago MIT License

    No-nonsense, no-BS repo showing how data structure code should look in Python: simple and elegant.

  • updated 3 months ago

    A collection of design patterns/idioms in Python

  • updated 8 months ago Other

    Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set

  • updated 2 months ago MIT License

    A lightweight, object-oriented finite state machine implementation in Python with many extensions

ASGI Servers

  • daphne daphne 2,377
    star
    updated 23 days ago BSD 3-Clause "New...

    Django Channels HTTP/WebSocket server

  • uvicorn uvicorn 7,747
    star
    updated 7 months ago BSD 3-Clause "New...

    An ASGI web server, for Python. 🦄

Asynchronous Programming

  • updated over 1 year ago Other

    A curated list of awesome Python asyncio frameworks, libraries, software and resources

  • trio trio 6,058
    star
    updated 2 months ago Other

    Trio - a friendly Python library for async concurrency and I/O

  • uvloop uvloop 10,243
    star
    updated 2 months ago Apache License 2.0

    Ultra fast asyncio event loop.

Audio

  • updated 8 months ago MIT License

    cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

  • beets beets 12,835
    star
    updated 12 days ago MIT License

    music library manager and MusicBrainz tagger

  • dejavu dejavu 6,393
    star
    updated 6 months ago MIT License

    Audio fingerprinting and recognition in Python

  • eyeD3 eyeD3 504
    star
    updated about 1 year ago GNU General Publi...

    eyeD3 is a Python module and command line program for processing ID3 tags. Information about mp3 files (e.g. bit rate, sample frequency, play time) is also provided. The formats supported are ID3v1 (1.0/1.1) and ID3v2 (2.3/2.4).

  • kapre kapre 922
    star
    updated about 1 year ago MIT License

    kapre: Keras Audio Preprocessors

  • librosa librosa 6,354
    star
    updated 11 months ago ISC License

    Python library for audio and music analysis

  • updated 9 months ago GNU General Publi...

    🎚️ Open Source Audio Matching and Mastering

  • mutagen mutagen 1,497
    star
    updated 4 months ago GNU General Publi...

    Python module for handling audio metadata

  • updated 7 months ago Apache License 2.0

    Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

  • pydub pydub 8,802
    star
    updated 3 months ago MIT License

    Manipulate audio with a simple and easy high level interface

  • updated 2 months ago GNU Affero Genera...

    scalable audio processing framework and server written in Python

  • updated 11 months ago MIT License

    Read audio and music metadata and duration of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA, Wave and AIFF files with Python 2 or 3

Authentication

  • authlib authlib 3,619
    star
    updated over 1 year ago BSD 3-Clause "New...

    The ultimate Python library for building OAuth and OpenID Connect clients and servers. JWS, JWE, JWK, JWA, and JWT included.

  • updated about 1 year ago MIT License

    Integrated set of Django applications addressing authentication, registration, account management as well as 3rd party (social) account authentication.

  • pyjwt pyjwt 4,883
    star
    updated 7 months ago MIT License

    JSON Web Token implementation in Python

  • updated 4 months ago MIT License

    A JOSE implementation in Python

  • updated 12 months ago MIT License

    Python module for generating and verifying JSON Web Tokens

  • updated over 2 years ago MIT License

    A fully tested, abstract interface to creating OAuth clients and servers.

  • updated over 2 years ago BSD 3-Clause "New...

    Social auth made simple

Build Tools

  • updated 11 months ago Apache License 2.0

    Your Gateway to Embedded Software Development Excellence 👽

  • pybuilder pybuilder 1,616
    star
    updated 9 months ago Apache License 2.0

    Software build automation tool for Python.

Built-in Classes Enhancement

  • attrs attrs 5,217
    star
    updated about 2 months ago MIT License

    Python Classes Without Boilerplate

  • bidict bidict 1,474
    star
    updated 23 days ago Mozilla Public Li...

    The bidirectional mapping library for Python.

  • Box Box 2,267
    star
    updated about 1 year ago MIT License

    Python dictionaries with advanced dot notation access

  • updated over 1 year ago MIT License

    Python library that provides a method of accessing lists and dicts with a dotted path notation.

Caching

  • beaker beaker 517
    star
    updated 4 months ago Other

    WSGI middleware for sessions and caching

  • updated over 1 year ago BSD 3-Clause "New...

    Automatic caching and invalidation for Django models through the ORM.

  • updated 8 months ago BSD 3-Clause "New...

    A slick ORM cache with automatic granular event-driven invalidation.

  • updated 4 months ago BSD 3-Clause "New...

    A Python wrapper around the libmemcached interface from TangentOrg.

ChatGPT and LLMs

  • updated 3 months ago MIT License

    Code for the model presented in the paper: "code2seq: Generating Sequences from Structured Representations of Code"

  • updated almost 3 years ago BSD 3-Clause "New...
  • gpt-2 gpt-2 21,450
    star
    updated 5 months ago Other

    Code for the paper "Language Models are Unsupervised Multitask Learners"

  • updated over 5 years ago MIT License

    Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

  • updated 11 months ago MIT License

    Specify what you want it to build, the AI asks for clarification, and then builds it.

  • pdfGPT pdfGPT 6,838
    star
    updated 9 months ago MIT License

    PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!

  • PPLM PPLM 1,125
    star
    updated 8 months ago Apache License 2.0

    Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.

  • updated 9 days ago Apache License 2.0

    State-of-the-Art Text Embeddings

  • updated over 1 year ago Apache License 2.0

    Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

  • sockeye sockeye 1,181
    star
    updated over 1 year ago Apache License 2.0

    Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

  • texar texar 2,387
    star
    updated about 3 years ago Apache License 2.0

    Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

  • updated 4 months ago Apache License 2.0

    Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

  • XLM XLM 2,763
    star
    updated over 1 year ago Other

    PyTorch original implementation of Cross-lingual Language Model Pretraining.

ChatOps Tools

  • errbot errbot 3,116
    star
    updated 4 months ago GNU General Publi...

    Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.

CMS

  • updated 4 months ago BSD 3-Clause "New...

    A Django-based CMS with a focus on extensibility and concise code

  • indico indico 1,764
    star
    updated 13 days ago MIT License

    Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.

  • Kotti Kotti 410
    star
    updated about 1 year ago Other

    Kotti is a high-level, Pythonic web application framework based on Pyramid and SQLAlchemy. It includes an extensible Content Management System called the Kotti CMS.

  • mezzanine mezzanine 4,757
    star
    updated 5 months ago BSD 2-Clause "Sim...

    CMS framework for Django

Code Analysis

  • updated 7 months ago Other

    A curated awesome list of flake8 extensions. Feel free to contribute! 🎓

  • updated about 2 months ago Creative Commons ...

    Collection of awesome Python types, stubs, plugins, and tools to work with them.

  • coala coala 3,552
    star
    updated 9 months ago GNU Affero Genera...

    coala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use.

  • code2flow code2flow 3,972
    star
    updated 4 months ago MIT License

    Pretty good call graphs for dynamic languages

  • updated over 1 year ago Other

    A Python library that generates static type annotations by collecting runtime types

  • updated 4 months ago GNU General Publi...

    Inspects Python source files and provides information about type and location of classes, methods etc

  • updated 7 months ago GNU General Publi...

    pycallgraph is a Python module that creates call graphs for Python programs.

  • pylama pylama 1,048
    star
    updated 4 months ago MIT License

    Code audit tool for Python.

  • updated 6 months ago MIT License

    Performant type-checking for Python.

  • pytype pytype 4,731
    star
    updated about 1 month ago Other

    A static type analyzer for Python code

  • typeshed typeshed 4,316
    star
    updated 28 days ago Other

    Collection of library stubs for Python, with static types

  • vulture vulture 3,356
    star
    updated 3 months ago MIT License

    Find dead Python code

  • updated 28 days ago MIT License

    The strictest and most opinionated python linter ever!

  • yapf yapf 13,755
    star
    updated 17 days ago Apache License 2.0

    A formatter for Python files

Code Coverage

  • updated over 1 year ago Apache License 2.0

    Python report uploader for Codecov

  • updated about 2 months ago Apache License 2.0

    The code coverage tool for Python

  • updated 3 months ago Other

    A Django plugin for pytest.

  • updated 9 months ago MIT License

    extracted --boxed from pytest-xdist to ensure backward compat

  • updated 3 months ago MIT License

    Thin-wrapper around the mock package for easier use with pytest

  • updated 3 months ago MIT License

    🎲 Pytest plugin to randomly order tests and control random.seed

  • updated 8 months ago Other

    a pytest plugin that re-runs failed tests up to -n times to eliminate flaky failures

  • updated 4 months ago MIT License
  • updated 3 months ago MIT License

    pytest plugin for distributed testing and loop-on-failures testing modes.

  • updated 3 months ago MIT License

    pytest external process plugin

Command-line Interface Development

  • updated over 1 year ago MIT License

    A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

  • updated 4 months ago Apache License 2.0

    A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations

  • updated 4 months ago MIT License

    plotting in the terminal

  • python-fire python-fire 26,842
    star
    updated about 2 months ago Other

    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

  • rich rich 48,734
    star
    updated 2 months ago MIT License

    Rich is a Python library for rich text and beautiful formatting in the terminal.

  • tqdm tqdm 26,439
    star
    updated 11 months ago Other

    ⚡ A Fast, Extensible Progress Bar for Python and CLI

Command-line Tools

  • aws-cli aws-cli 14,304
    star
    updated about 1 year ago Other

    Universal Command Line Interface for Amazon Web Services

  • doitlive doitlive 3,434
    star
    updated 2 months ago MIT License

    Because sometimes you need to do it live

  • howdoi howdoi 10,202
    star
    updated over 1 year ago MIT License

    instant coding answers via the command line

  • iredis iredis 2,541
    star
    updated 7 months ago BSD 3-Clause "New...

    Interactive Redis: A Terminal Client for Redis with AutoCompletion and Syntax Highlighting.

  • updated almost 2 years ago Apache License 2.0

    Kubernetes shell: An integrated shell for working with Kubernetes

  • litecli litecli 2,516
    star
    updated about 1 month ago BSD 3-Clause "New...

    CLI for SQLite Databases with auto-completion and syntax highlighting

  • mycli mycli 11,345
    star
    updated 4 months ago Other

    A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.

  • updated 7 months ago MIT License

    PathPicker accepts a wide range of input -- output from git commands, grep results, searches -- pretty much anything. After parsing the input, PathPicker presents you with a nice UI to select which files you're interested in. After that you can open them in your favorite editor or execute arbitrary commands.

  • percol percol 3,227
    star
    updated over 2 years ago

    adds flavor of interactive filtering to the traditional pipe concept of UNIX shell

  • pgcli pgcli 11,870
    star
    updated 4 months ago BSD 3-Clause "New...

    Postgres CLI with autocompletion and syntax highlighting

  • saws saws 5,241
    star
    updated 7 months ago Other

    A supercharged AWS command line interface (CLI).

  • thefuck thefuck 80,068
    star
    updated 12 months ago MIT License

    Magnificent app which corrects your previous console command.

  • tmux tmux 34,289
    star
    updated 2 months ago Other

    tmux source code

  • try try 721
    star
    updated over 2 years ago MIT License

    Dead simple CLI tool to try Python packages - It's never been easier! 📦

Compatibility

  • updated about 1 year ago Other

    Modernizes Python code for eventual Python 3 migration. Built on top of fissix (a fork of lib2to3)

Computer Vision

  • updated 4 months ago MIT License

    The world's simplest facial recognition api for Python and the command line

  • kornia kornia 9,898
    star
    updated 9 days ago Apache License 2.0

    Geometric Computer Vision Library for Spatial AI

  • updated 15 days ago Apache License 2.0

    A Python wrapper for Google Tesseract

  • SimpleCV SimpleCV 2,691
    star
    updated almost 2 years ago BSD 3-Clause "New...

    The Open Source Framework for Machine Vision

  • tesserocr tesserocr 2,004
    star
    updated 2 months ago MIT License

    A Python wrapper for the tesseract-ocr API

Concurrency and Parallelism

  • greenlet greenlet 1,594
    star
    updated 7 months ago Other

    Lightweight in-process concurrent programming

  • scoop scoop 636
    star
    updated over 1 year ago GNU Lesser Genera...

    SCOOP (Scalable COncurrent Operations in Python)

  • uvloop uvloop 10,243
    star
    updated 2 months ago Apache License 2.0

    Ultra fast asyncio event loop.

Configuration

  • updated about 1 month ago Other

    Python 3+ compatible port of the configobj library

  • hydra hydra 8,550
    star
    updated 3 months ago MIT License

    Hydra is a framework for elegantly configuring complex applications

  • updated 9 months ago MIT License

    Strict separation of config from code.

Crypto

  • updated about 1 year ago MIT License

    Ethereum ABI utilities for python

  • updated about 1 month ago MIT License

    Account abstraction library for web3.py

  • updated 3 months ago MIT License

    The Ethereum hashing function, keccak256, sometimes (erroneously) called sha256 or sha3

  • updated 9 days ago MIT License

    A common API for Ethereum key operations.

  • updated 4 months ago MIT License

    Tool suite for testing ethereum applications.

  • updated 5 months ago MIT License

    Python types for type hinting commonly used ethereum types

  • updated 3 months ago MIT License

    Utility functions for working with ethereum related codebases.

  • py-evm py-evm 2,262
    star
    updated 2 months ago MIT License

    A Python implementation of the Ethereum Virtual Machine

  • web3.py web3.py 4,941
    star
    updated 2 months ago MIT License

    A python interface for interacting with the Ethereum blockchain and ecosystem.

Cryptography

  • pynacl pynacl 1,026
    star
    updated 8 months ago Apache License 2.0

    Python binding to the Networking and Cryptography (NaCl) library

Data

  • plotly.py plotly.py 15,980
    star
    updated about 2 months ago MIT License

    The interactive graphing library for Python ✨ This project now includes Plotly Express!

  • updated about 1 year ago Apache License 2.0

    StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and alerting logic you define.

Data Analysis

  • blaze blaze 3,180
    star
    updated about 1 year ago BSD 3-Clause "New...

    NumPy and Pandas interface to Big Data

  • mining mining 1,277
    star
    updated almost 8 years ago MIT License

    Business Intelligence (BI) in Python, OLAP

Database

  • chroma chroma 14,795
    star
    updated 26 days ago Apache License 2.0

    the AI-native open-source embedding database

  • updated 5 months ago BSD 3-Clause "New...

    pickleDB is an open source key-value store using Python's json module.

  • tinydb tinydb 6,656
    star
    updated 5 months ago MIT License

    TinyDB is a lightweight document oriented database optimized for your happiness :)

  • ZODB ZODB 624
    star
    updated about 1 year ago Other

    Python object-oriented database

Database Drivers

  • apsw apsw 724
    star
    updated 2 months ago Other

    Another Python SQLite wrapper

  • updated about 2 months ago Creative Commons ...

    A curated list of awesome PostgreSQL software, libraries, tools and resources, inspired by awesome-mysql

  • updated 3 months ago Creative Commons ...

    A collection of awesome sqlite tools, scripts, books, etc

  • updated 8 months ago Other

    ClickHouse Python Driver with native interface support

  • updated 27 days ago Apache License 2.0

    PyMongo - the Official MongoDB Python driver

  • motor motor 2,410
    star
    updated about 1 month ago Apache License 2.0

    Motor - the async Python driver for MongoDB and Tornado or asyncio

  • PyMySQL PyMySQL 7,616
    star
    updated 4 months ago MIT License

    MySQL client library for Python

  • updated about 1 month ago Apache License 2.0

    DataStax Python Driver for Apache Cassandra

  • updated almost 3 years ago BSD 3-Clause "New...

    PostgreSQL database access simplified

  • updated over 4 years ago MIT License

    A supercharged SQLite library for Python

Data Validation

  • cerberus cerberus 3,058
    star
    updated about 1 year ago ISC License

    Lightweight, extensible data validation library for Python

  • updated about 1 year ago MIT License

    An implementation of the JSON Schema specification for Python

  • schema schema 2,874
    star
    updated 5 months ago MIT License

    Schema validation just got Pythonic

  • updated over 1 year ago Other

    Python Data Structures for Humans™.

  • updated almost 2 years ago MIT License

    Lightweight data validation and adaptation Python library.

  • updated 9 months ago BSD 3-Clause "New...

    CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library.

Data Visualization

  • altair altair 9,105
    star
    updated 3 months ago BSD 3-Clause "New...

    Declarative statistical visualization library for Python

  • bokeh bokeh 19,117
    star
    updated 3 months ago BSD 3-Clause "New...

    Interactive Data Visualization in the browser, from Python

  • cartopy cartopy 1,385
    star
    updated 4 months ago BSD 3-Clause "New...

    Cartopy - a cartographic python library with matplotlib support

  • chartify chartify 3,510
    star
    updated 7 months ago Apache License 2.0

    Python library that makes it easy for data scientists to create charts.

  • diagrams diagrams 37,551
    star
    updated 5 days ago MIT License

    🎨 Diagram as Code for prototyping cloud system architectures

  • plotnine plotnine 3,957
    star
    updated 3 months ago MIT License

    A Grammar of Graphics for Python

  • seaborn seaborn 12,229
    star
    updated 4 months ago BSD 3-Clause "New...

    Statistical data visualization in Python

  • vispy vispy 3,282
    star
    updated 3 months ago Other

    Main repository for Vispy

Date and Time

  • updated almost 5 years ago MIT License

    A Python 3 library for parsing human-written times and dates

  • dateutil dateutil 2,326
    star
    updated 3 months ago Other

    Useful extensions to the standard Python datetime features

  • delorean delorean 1,841
    star
    updated over 1 year ago MIT License

    Delorean: Time Travel Made Easy

  • maya maya 3,409
    star
    updated 3 months ago MIT License

    Datetimes for Humans™

  • moment moment 724
    star
    updated over 1 year ago

    A Python library for dealing with dates

  • pendulum pendulum 6,137
    star
    updated 5 months ago MIT License

    Python datetimes made easy

  • PyTime PyTime 159
    star
    updated almost 2 years ago MIT License

    PyTime is an easy-to-use Python module for operating on dates, times, and datetimes via strings.

  • updated almost 5 years ago BSD 3-Clause "New...

    Friendly Python Dates

Debugging

  • updated 12 months ago Apache License 2.0

    Python Cloud Debugger

  • ptvsd ptvsd 536
    star
    updated about 4 years ago Other

    Python debugger package for use with Visual Studio and Visual Studio Code.

  • pytest pytest 11,750
    star
    updated 3 months ago MIT License

    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

  • updated 5 months ago MIT License

    Coverage plugin for pytest.

  • updated about 2 months ago Other

    Plugin for generating HTML reports for pytest results

  • updated 3 months ago MIT License

    Thin-wrapper around the mock package for easier use with pytest

  • updated 3 months ago MIT License

    pytest plugin for distributed testing and loop-on-failures testing modes.

  • werkzeug werkzeug 6,616
    star
    updated 2 months ago BSD 3-Clause "New...

    The comprehensive WSGI web application library.

Debugging Tools

  • updated 2 months ago BSD 3-Clause "New...

    A configurable set of panels that display various debug information about the current request/response.

  • updated over 4 years ago BSD 3-Clause "New...

    A drop-in replacement for Django's runserver.

  • icecream icecream 8,799
    star
    updated 4 months ago MIT License

    🍦 Never use print() to debug again.

  • ipdb ipdb 1,846
    star
    updated 3 months ago BSD 3-Clause "New...

    Integration of IPython pdb

  • updated about 3 years ago Other

    (OLD REPO) Line-by-line profiling for Python - Current repo ->

  • updated about 6 years ago GNU General Publi...

    Trace any Python program, anywhere!

  • pudb pudb 2,778
    star
    updated about 1 year ago Other

    Full-screen console debugger for Python

  • py-spy py-spy 10,621
    star
    updated over 1 year ago MIT License

    Sampling profiler for Python programs

  • updated 23 days ago Other

    Parsing ELF and DWARF in Python

  • pyringe pyringe 1,638
    star
    updated almost 5 years ago Other

    Debugger capable of attaching to and injecting code into python processes.

  • updated over 1 year ago BSD 2-Clause "Sim...

    Hunter is a flexible code tracing toolkit.

  • updated almost 2 years ago BSD 2-Clause "Sim...

    Debugging manhole for python applications.

  • vprof vprof 3,951
    star
    updated over 2 years ago BSD 2-Clause "Sim...

    Visual profiler for Python

  • wdb wdb 1,574
    star
    updated almost 2 years ago GNU General Publi...

    An improbable web debugger through WebSockets

Deep Learning

  • caffe caffe 34,002
    star
    updated 3 months ago Other

    Caffe: a fast open framework for deep learning.

  • keras keras 61,706
    star
    updated 26 days ago Apache License 2.0

    Deep Learning for humans

  • pytorch pytorch 83,391
    star
    updated 2 days ago Other

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

  • pytorch3d pytorch3d 7,322
    star
    updated over 1 year ago Other

    PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

  • SerpentAI SerpentAI 6,742
    star
    updated almost 2 years ago MIT License

    Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

  • tensorflow tensorflow 186,123
    star
    updated 10 days ago Apache License 2.0

    An Open Source Machine Learning Framework for Everyone

  • Theano Theano 9,897
    star
    updated 10 months ago Other

    Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor

  • TTS TTS 33,215
    star
    updated 3 months ago Mozilla Public Li...

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

DevOps Tools

  • ansible ansible 58,550
    star
    updated about 1 year ago GNU General Publi...

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

  • cuisine cuisine 1,257
    star
    updated over 1 year ago Other

    Chef-like functionality for Fabric

  • fabric fabric 14,743
    star
    updated 7 months ago BSD 2-Clause "Sim...

    Simple, Pythonic remote execution and deployment.

  • fabtools fabtools 1,253
    star
    updated almost 5 years ago BSD 2-Clause "Sim...

    Tools for writing awesome Fabric files

  • foreman foreman 5,890
    star
    updated over 1 year ago MIT License

    Manage Procfile-based applications

  • honcho honcho 1,586
    star
    updated 10 months ago MIT License

    Honcho: a python clone of Foreman. For managing Procfile-based applications.

  • psutil psutil 10,247
    star
    updated 13 days ago BSD 3-Clause "New...

    Cross-platform lib for process and system monitoring in Python

  • pyinfra pyinfra 2,400
    star
    updated about 1 year ago MIT License

    pyinfra automates infrastructure super fast at massive scale. It can be used for ad-hoc command execution, service deployment, configuration management and more.

  • salt salt 14,092
    star
    updated about 2 months ago Apache License 2.0

    Software to automate the management and configuration of any infrastructure or application at scale.

  • updated 6 months ago Other

    Supervisor process control system for Unix (supervisord)

Diffusion Text to Image

  • updated over 6 years ago MIT License

    Text to image synthesis using thought vectors

Distributed Computing

  • dask dask 12,531
    star
    updated 5 days ago BSD 3-Clause "New...

    Parallel computing with task scheduling

  • faust faust 6,683
    star
    updated 11 months ago Other

    Python Stream Processing

  • luigi luigi 17,796
    star
    updated about 1 month ago Apache License 2.0

    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

  • mrjob mrjob 2,615
    star
    updated over 1 year ago Other

    Run MapReduce jobs on Hadoop or Amazon Web Services

  • ray ray 33,272
    star
    updated 29 days ago Apache License 2.0

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

  • updated about 1 year ago Apache License 2.0

    Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.

Distribution

  • updated 6 months ago GNU General Publi...

    Python virtualenvs in Debian packages

  • pyarmor pyarmor 3,148
    star
    updated 3 months ago Other

    A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or set an expiry on obfuscated scripts.

  • pyinstaller pyinstaller 11,763
    star
    updated about 1 month ago Other

    Freeze (package) Python programs into stand-alone executables

  • shiv shiv 1,729
    star
    updated 6 months ago BSD 2-Clause "Sim...

    shiv is a command line utility for building fully self-contained Python zipapps as outlined in PEP 441, but with all their dependencies included.

Documentation

  • updated 3 months ago Other

    A curated list of awesome tools for Sphinx Python Documentation Generator

  • pdoc pdoc 1,624
    star
    updated about 1 year ago The Unlicense

    API Documentation for Python Projects

  • pycco pycco 842
    star
    updated over 1 year ago Other

    Literate-style documentation generator.

  • sphinx sphinx 5,754
    star
    updated 12 months ago Other

    The Sphinx documentation generator

Downloader

  • s3cmd s3cmd 4,546
    star
    updated about 2 months ago GNU General Publi...

    Official s3cmd repo -- Command line tool for managing S3 compatible storage services (including Amazon S3 and CloudFront).

  • s4cmd s4cmd 1,322
    star
    updated over 1 year ago Apache License 2.0

    Super S3 command line tool

E-commerce

  • alipay alipay 326
    star
    updated almost 7 years ago

    An Unofficial Alipay API for Python

  • updated 8 months ago BSD 2-Clause "Sim...

    Ecommerce for Mezzanine

  • updated about 1 year ago BSD 3-Clause "New...

    A Django based shop system

  • updated about 1 year ago MIT License

    Foreign exchange rates, Bitcoin price index and currency conversion using ratesapi.io

  • merchant merchant 1,012
    star
    updated over 2 years ago BSD 3-Clause "New...

    A Django app to accept payments from various payment processors via Pluggable backends.

  • money money 231
    star
    updated over 1 year ago MIT License

    Python money class with optional CLDR-backed locale-aware formatting and an extensible currency exchange solution.

  • updated about 3 years ago GNU General Publi...

    Display money format and its filthy currencies, for all money lovers out there.

Email

  • flanker flanker 1,633
    star
    updated 6 months ago Apache License 2.0

    Python email address and Mime parsing library

  • imbox imbox 1,185
    star
    updated 7 months ago MIT License

    Python IMAP for Human beings

  • mailer mailer 277
    star
    updated 5 months ago MIT License

    A light-weight, modular, message representation and mail delivery framework for Python.

  • modoboa modoboa 3,039
    star
    updated 28 days ago ISC License

    Mail hosting made simple

  • salmon salmon 644
    star
    updated 3 months ago Other

    A Python Mail Server

  • yagmail yagmail 2,639
    star
    updated about 2 years ago MIT License

    Send email in Python conveniently for gmail using yagmail

Enterprise Application Integrations

  • airflow airflow 36,241
    star
    updated about 2 months ago Apache License 2.0

    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

  • ansible ansible 58,550
    star
    updated about 1 year ago GNU General Publi...

    Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.

  • updated 2 months ago Apache License 2.0

    A collection of useful CloudFormation templates

  • updated over 1 year ago Other

    AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

  • updated over 1 year ago MIT No Attribution

    A developer toolkit to implement Serverless best practices and increase developer velocity.

  • boto3 boto3 8,877
    star
    updated 3 months ago Apache License 2.0

    AWS SDK for Python

  • updated about 1 year ago Apache License 2.0

    Official Python client for Elasticsearch

  • flask flask 67,407
    star
    updated 2 months ago BSD 3-Clause "New...

    The Python micro framework for building web applications.

  • updated about 1 year ago Apache License 2.0

    🐍 The official Python client library for Google's discovery based APIs.

  • invoke invoke 4,363
    star
    updated 5 months ago BSD 2-Clause "Sim...

    Pythonic task management & command execution.

  • luigi luigi 17,796
    star
    updated about 1 month ago Apache License 2.0

    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

  • pika pika 3,606
    star
    updated 2 months ago BSD 3-Clause "New...

    Pure Python RabbitMQ/AMQP 0-9-1 client library

  • requests requests 51,920
    star
    updated 3 months ago Apache License 2.0

    A simple, yet elegant, HTTP library.

Environment Management

  • pyenv pyenv 39,166
    star
    updated 3 days ago MIT License

    Simple Python version management

  • updated 3 months ago MIT License

    Virtual Python Environment builder

Fake Data

  • fake2db fake2db 2,256
    star
    updated almost 5 years ago GNU General Publi...

    create custom test databases that are populated with fake data

  • faker faker 17,554
    star
    updated about 2 months ago MIT License

    Faker is a Python package that generates fake data for you.

  • mimesis mimesis 4,380
    star
    updated 4 months ago MIT License

    Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

Files

  • updated 3 months ago Other

    A python wrapper for libmagic

  • updated almost 3 years ago

    An object-oriented approach to Python file/directory operations.

  • watchdog watchdog 6,585
    star
    updated 5 days ago Apache License 2.0

    Python library and shell utilities to monitor filesystem events.

Finance

  • arch arch 1,294
    star
    updated 3 months ago Other

    ARCH models in Python

  • backtrader backtrader 13,820
    star
    updated 3 months ago GNU General Publi...

    Python Backtesting library for trading strategies

  • empyrical empyrical 1,268
    star
    updated 3 months ago Apache License 2.0

    Common financial risk and performance metrics. Used by zipline and pyfolio.

  • qgrid qgrid 3,034
    star
    updated 10 months ago Apache License 2.0

    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

  • yfinance yfinance 13,238
    star
    updated 18 days ago Apache License 2.0

    Download market data from Yahoo! Finance's API

  • zipline zipline 17,443
    star
    updated 9 months ago Apache License 2.0

    Zipline, a Pythonic Algorithmic Trading Library

Foreign Function Interface

  • llvmlite llvmlite 1,840
    star
    updated 6 months ago BSD 2-Clause "Sim...

    A lightweight LLVM python binding for writing JIT compilers

Forms

  • deform deform 400
    star
    updated over 1 year ago Other

    A Python HTML form library.

  • updated 3 months ago BSD 3-Clause "New...

    Bootstrap 4 integration with Django.

  • updated 3 months ago MIT License

    The best way to have DRY Django forms. The app provides a tag and filter that lets you quickly render forms in a div format while providing an enormous amount of capability to configure and control the rendered HTML.

  • updated over 1 year ago MIT License

    A platform independent django form serializer

  • wtforms wtforms 1,500
    star
    updated 4 months ago BSD 3-Clause "New...

    A flexible forms validation and rendering library for Python.

Functional Programming

  • coconut coconut 4,067
    star
    updated about 2 months ago Apache License 2.0

    Simple, elegant, Pythonic functional programming.

  • fn.py fn.py 3,302
    star
    updated about 2 years ago Other

    Functional programming in Python: implementation of missing features to enjoy FP

  • funcy funcy 3,248
    star
    updated 7 months ago BSD 3-Clause "New...

    Fancy and practical functional tools

  • returns returns 3,495
    star
    updated about 1 month ago BSD 2-Clause "Sim...

    Make your functions return something meaningful, typed, and safe!

  • toolz toolz 4,619
    star
    updated 5 months ago Other

    A functional standard library for Python.

Game Development

  • updated over 1 year ago

    The first level of Super Mario Bros made with Python and Pygame.

  • updated about 1 year ago

    This is the code base for the pygame tutorials posted on my YouTube channel.

  • updated 5 months ago
  • updated over 1 year ago

    Creating the game Space Invaders using Pygame Module in Python

Geolocation

  • updated 4 months ago MIT License

    A Django application that provides country choices for use with forms, flag icons static files, and a country field for models.

  • updated over 3 years ago GNU Lesser Genera...

    DEPRECATED GeoIP Legacy Python API

GIS

  • cartopy cartopy 1,385
    star
    updated 4 months ago BSD 3-Clause "New...

    Cartopy - a cartographic python library with matplotlib support

  • geopandas geopandas 4,390
    star
    updated 3 months ago BSD 3-Clause "New...

    Python tools for geographic data

  • updated 2 months ago Other

    Core components of Python Spatial Analysis Library

  • pyshp pyshp 1,090
    star
    updated 4 months ago MIT License

    This library reads and writes ESRI Shapefiles in pure Python.

Graph

  • cugraph cugraph 1,668
    star
    updated about 2 months ago Apache License 2.0

    cuGraph - RAPIDS Graph Analytics Library

  • dask dask 12,531
    star
    updated 5 days ago BSD 3-Clause "New...

    Parallel computing with task scheduling

  • detectron2 detectron2 25,567
    star
    updated over 1 year ago Apache License 2.0

    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

  • hydra hydra 8,550
    star
    updated 3 months ago MIT License

    Hydra is a framework for elegantly configuring complex applications

  • metaflow metaflow 8,012
    star
    updated about 2 months ago Apache License 2.0

    🚀 Build and manage real-life ML, AI, and data science projects with ease!

  • updated 14 days ago Other

    Neo4j Bolt driver for Python

  • networkx networkx 13,543
    star
    updated 11 months ago Other

    Network Analysis in Python

  • updated about 1 month ago GNU General Publi...

    Python interface for igraph

  • SlowFast SlowFast 5,678
    star
    updated over 1 year ago Apache License 2.0

    PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

  • yt yt 457
    star
    updated 3 months ago Other

    Main yt repository

GraphQL

  • graphene graphene 7,978
    star
    updated 7 months ago MIT License

    GraphQL framework for Python

  • updated 12 months ago MIT License

    tartiflette-aiohttp is a wrapper around aiohttp that includes the Tartiflette GraphQL Engine. Do not hesitate to take a look at the Tartiflette project.

  • updated over 1 year ago MIT License

    ASGI support for the Tartiflette GraphQL engine

GUI

  • updated about 2 years ago

    A common base representation of python source code for pylint and other projects

  • bandit bandit 5,900
    star
    updated 8 months ago Apache License 2.0

    Bandit is a tool designed to find common security issues in Python code.

  • dash dash 19,422
    star
    updated about 1 year ago MIT License

    Data Apps & Dashboards for Python. No JavaScript Required.

  • flake8 flake8 3,068
    star
    updated about 1 year ago Other

    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.

  • isort isort 6,471
    star
    updated 3 months ago MIT License

    A Python utility / library to sort imports.

  • mccabe mccabe 602
    star
    updated about 1 year ago Other

    McCabe complexity checker for Python

  • napari napari 2,169
    star
    updated about 1 month ago BSD 3-Clause "New...

    napari: a fast, interactive, multi-dimensional image viewer for python

  • updated about 1 year ago Other

    Simple Python style checker in one Python file

  • updated 12 months ago MIT License

    docstring style checker

  • pylint pylint 4,246
    star
    updated about 2 years ago

    It's not just a linter that annoys you!

  • PyQt PyQt 6,645
    star
    updated 5 months ago GNU Lesser Genera...

    PyQt Examples (various PyQt tests and examples) PyQt4 PyQt5

  • PySide PySide 281
    star
    updated almost 8 years ago GNU Lesser Genera...

    ATTENTION: This project is deprecated, please refer to PySide2

  • PySimpleGUI PySimpleGUI 13,367
    star
    updated about 2 months ago Other

    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.

  • updated 5 months ago Other

    TraitsUI: Traits-capable windowing framework

GUI Development

  • Eel Eel 4,878
    star
    updated over 2 years ago MIT License

    A little Python library for making simple Electron-like HTML/JS GUI apps

  • enaml enaml 1,487
    star
    updated almost 1 year ago Other

    Declarative User Interfaces for Python

  • Gooey Gooey 20,574
    star
    updated 8 months ago MIT License

    Turn (almost) any Python command line program into a full GUI application with one line

  • pyglet pyglet 1,865
    star
    updated 24 days ago BSD 3-Clause "New...

    pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.

  • PySimpleGUI PySimpleGUI 13,367
    star
    updated about 2 months ago Other

    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.

  • pywebview pywebview 4,529
    star
    updated 3 months ago BSD 3-Clause "New...

    Build GUI for your Python program with JavaScript, HTML, and CSS

GUI / Web Testing

  • locust locust 24,758
    star
    updated 19 days ago MIT License

    Write scalable load tests in plain Python 🚗💨

  • splinter splinter 2,714
    star
    updated about 1 month ago BSD 3-Clause "New...

    splinter - python test framework for web applications

Hardware

  • keyboard keyboard 3,791
    star
    updated 4 months ago MIT License

    Hook and simulate global keyboard events on Windows and Linux.

  • mouse mouse 902
    star
    updated 4 months ago MIT License

    Hook and simulate global mouse events in pure Python

  • updated about 3 years ago GNU General Publi...

    A module for cross-platform control of the mouse and keyboard in python that is simple to install and use.

  • scapy scapy 10,691
    star
    updated 9 days ago GNU General Publi...

    Scapy: the Python-based interactive packet manipulation program & library.

HTML Manipulation

  • bleach bleach 2,651
    star
    updated 2 days ago Other

    Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes

  • updated 8 months ago MIT License

    Standards-compliant library for parsing and serializing HTML documents and fragments in Python

  • updated 2 months ago BSD 3-Clause "New...

    Safely add untrusted strings to HTML/XML markup.

  • pyquery pyquery 2,230
    star
    updated about 1 year ago Other

    A jquery-like library for python

  • updated over 1 year ago MIT License

    Converts XML to Python objects

  • xmltodict xmltodict 5,480
    star
    updated 15 days ago MIT License

    Python module that makes working with XML feel like you are working with JSON

HTTP Clients

  • grequests grequests 4,400
    star
    updated over 1 year ago BSD 2-Clause "Sim...

    Requests + Gevent = <3

  • updated over 1 year ago Other

    Small, fast HTTP client library for Python. Features persistent connections, cache, and Google App Engine support. Originally written by Joe Gregorio, now supported by community.

  • httpx httpx 11,445
    star
    updated 12 months ago BSD 3-Clause "New...

    A next generation HTTP client for Python. 🦋

  • requests requests 51,920
    star
    updated 3 months ago Apache License 2.0

    A simple, yet elegant, HTTP library.

  • treq treq 585
    star
    updated 4 months ago Other

    Python requests like API built on top of Twisted's HTTP client.

Image Processing

  • hmap hmap 212
    star
    updated almost 5 years ago Other

    Image histogram remapping

  • updated almost 4 years ago MIT License

    Nudity detection with Python

  • pagan pagan 299
    star
    updated almost 2 years ago GNU General Publi...

    Python avatar generator for absolute nerds

  • Pillow Pillow 11,308
    star
    updated 11 months ago Other

    Python Imaging Library (Fork)

  • pymatting pymatting 1,773
    star
    updated 24 days ago MIT License

    A Python library for alpha matting

  • updated 3 months ago MIT License

    ㊙️ Create standard barcodes with Python. No external dependencies. 100% Organic Python.

  • updated 5 months ago Other

    Python QR Code image generator

  • pyvips pyvips 638
    star
    updated 18 days ago MIT License

    python binding for libvips using cffi

  • pywal pywal 8,250
    star
    updated 9 months ago MIT License

    🎨 Generate and change color-schemes on the fly.

  • Quads Quads 1,176
    star
    updated almost 2 years ago

    Computer art based on quadtrees.

  • thumbor thumbor 10,042
    star
    updated about 1 month ago MIT License

    thumbor is an open-source photo thumbnail service by globo.com

Implementations

  • updated 5 months ago Other

    An implementation of Python in Common Lisp

  • cpython cpython 55,867
    star
    updated about 1 year ago Other

    The Python programming language

  • grumpy grumpy 10,464
    star
    updated over 2 years ago Apache License 2.0

    Grumpy is a Python to Go source code transcompiler and runtime.

  • updated 3 months ago Apache License 2.0

    Implementation of Python 3.x for .NET Framework that is built on top of the Dynamic Language Runtime.

  • micropython micropython 17,190
    star
    updated about 1 year ago Other

    MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems

  • PeachPy PeachPy 1,967
    star
    updated about 1 year ago Other

    x86-64 assembler embedded in Python

Interactive Interpreter

  • updated about 1 month ago Creative Commons ...

    A curated list of awesome Jupyter projects, libraries and resources

  • bpython bpython 2,598
    star
    updated 4 months ago Other

    bpython - A fancy curses interface to the Python interactive interpreter

Internationalization

  • pyicu pyicu 133
    star
    updated over 3 years ago Other

    The PyICU project repository has moved to https://pyicu.org.

Job Scheduler

  • updated almost 5 years ago

    A calendaring app for Django. It is now stable, so please feel free to use it. Active development has been taken over by bartekgorny.

  • updated over 2 years ago Apache License 2.0

    Remote task execution tool

  • plan plan 1,171
    star
    updated over 2 years ago Other

    Crontab jobs management in Python

  • prefect prefect 16,121
    star
    updated 13 days ago Apache License 2.0

    Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

  • schedule schedule 11,697
    star
    updated 5 months ago MIT License

    Python job scheduling for humans.

Jupyter

  • binderhub binderhub 2,557
    star
    updated 30 days ago BSD 3-Clause "New...

    Run your code in the cloud, with technology so advanced, it feels like magic!

  • updated 4 months ago BSD 3-Clause "New...

    Spawns JupyterHub single user servers in Docker containers

  • updated 9 days ago BSD 3-Clause "New...

    Jupyter extensions for running an RStudio rsession proxy

  • updated 3 months ago BSD 3-Clause "New...

    Jupyter notebook server extension to proxy web services.

  • updated 3 months ago Other

    Multi-user server for Jupyter notebooks

  • updated 3 months ago BSD 3-Clause "New...

    Reference deployment of JupyterHub with docker

  • updated 5 months ago BSD 3-Clause "New...

    Reference deployment of JupyterHub and nbgrader on a single server

  • updated 3 months ago BSD 3-Clause "New...

    Kubernetes spawner for JupyterHub

  • updated 4 months ago BSD 3-Clause "New...

    LDAP Authenticator Plugin for Jupyter

  • updated 3 months ago BSD 3-Clause "New...

    OAuth + JupyterHub Authenticator = OAuthenticator

  • updated about 1 month ago BSD 3-Clause "New...

    Turn repositories into Jupyter-enabled Docker images

  • updated 11 months ago BSD 3-Clause "New...

    Spawn JupyterHub single-user servers with sudo

  • updated 10 days ago BSD 3-Clause "New...

    Simple JupyterHub distribution for 1-100 users on a single server

  • updated 7 months ago BSD 3-Clause "New...

    Mechanism for runtime configuration of spawners for JupyterHub

  • updated 3 months ago Other

    Helm Chart & Documentation for deploying JupyterHub on Kubernetes

Logging

  • loguru loguru 17,762
    star
    updated 8 months ago MIT License

    Python logging made (stupidly) simple

  • updated over 1 year ago MIT License

    The official Python SDK for Sentry.io

Machine Learning

  • Detectron Detectron 25,771
    star
    updated over 1 year ago Apache License 2.0

    FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

  • gym gym 34,442
    star
    updated 6 months ago Other

    A toolkit for developing and comparing reinforcement learning algorithms.

  • h2o-3 h2o-3 6,862
    star
    updated about 1 month ago Apache License 2.0

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • llama llama 44,989
    star
    updated 12 months ago Other

    Inference code for LLaMA models

  • m2cgen m2cgen 2,477
    star
    updated over 1 year ago MIT License

    Transform ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

  • Metrics Metrics 1,606
    star
    updated almost 2 years ago Other

    Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

  • mindsdb mindsdb 26,202
    star
    updated about 2 months ago Other

    The platform for building AI from enterprise data

  • updated about 1 year ago GNU Affero Genera...

    Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

  • updated almost 5 years ago Other

    lightweight python wrapper for vowpal wabbit

  • whisper whisper 62,693
    star
    updated 5 months ago MIT License

    Robust Speech Recognition via Large-Scale Weak Supervision

  • xgboost xgboost 26,028
    star
    updated 2 months ago Apache License 2.0

    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Math

  • ipython ipython 16,302
    star
    updated 6 days ago BSD 3-Clause "New...

    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

  • matplotlib matplotlib 20,025
    star
    updated about 1 month ago

    matplotlib: plotting with Python

  • numba numba 9,480
    star
    updated 6 months ago BSD 2-Clause "Sim...

    NumPy aware dynamic Python compiler using LLVM

  • numpy numpy 26,893
    star
    updated 4 months ago Other

    The fundamental package for scientific computing with Python.

  • pandas pandas 43,074
    star
    updated 2 months ago BSD 3-Clause "New...

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • pyro pyro 8,530
    star
    updated about 1 month ago Apache License 2.0

    Deep universal probabilistic programming with Python and PyTorch

  • updated 27 days ago BSD 3-Clause "New...

    scikit-learn: machine learning in Python

  • scipy scipy 12,872
    star
    updated 2 months ago BSD 3-Clause "New...

    SciPy library main repository

  • sympy sympy 12,875
    star
    updated 16 days ago Other

    A computer algebra system written in pure Python

  • xarray xarray 3,551
    star
    updated about 2 months ago Apache License 2.0

    N-D labeled arrays and datasets in Python

Microsoft Windows

  • pythonnet pythonnet 4,369
    star
    updated 6 months ago MIT License

    Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and provides a powerful application scripting tool for .NET developers.

  • pywin32 pywin32 4,936
    star
    updated 3 months ago

    Python for Windows (pywin32) Extensions

Miscellaneous

  • boltons boltons 6,473
    star
    updated 4 months ago Other

    🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.

  • updated 2 months ago BSD 3-Clause "New...

    Safely pass trusted data to untrusted environments and back.

  • magenta magenta 19,151
    star
    updated 3 months ago Apache License 2.0

    Magenta: Music and Art Generation with Machine Intelligence

  • updated over 3 years ago Other

    A simple but flexible plugin system for Python.

ML - Deep Learning

  • bert bert 37,769
    star
    updated 3 months ago Apache License 2.0

    TensorFlow code and pre-trained models for BERT

  • datasets datasets 4,298
    star
    updated 6 days ago Apache License 2.0

    TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

  • detectron2 detectron2 25,567
    star
    updated over 1 year ago Apache License 2.0

    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

  • gpt-2 gpt-2 21,450
    star
    updated 5 months ago Other

    Code for the paper "Language Models are Unsupervised Multitask Learners"

  • keras keras 61,706
    star
    updated 26 days ago Apache License 2.0

    Deep Learning for humans

  • updated over 1 year ago MIT License

    Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

  • NeMo NeMo 11,967
    star
    updated 1 day ago Apache License 2.0

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • updated over 1 year ago Apache License 2.0

    An open source framework for seq2seq models in PyTorch.

  • updated 27 days ago BSD 3-Clause "New...

    scikit-learn: machine learning in Python

  • transformers transformers 133,705
    star
    updated 9 days ago Apache License 2.0

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • yolov5 yolov5 45,908
    star
    updated 8 months ago GNU Affero Genera...

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

ML - General

  • updated over 1 year ago Apache License 2.0

    An open source framework for seq2seq models in PyTorch.

  • sonnet sonnet 9,741
    star
    updated 3 months ago Apache License 2.0

    TensorFlow-based neural network library

  • updated 3 months ago Apache License 2.0
  • Theano Theano 9,897
    star
    updated 10 months ago Other

    Theano was a Python library that allowed you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor

ML - Interpretability

  • AIF360 AIF360 2,427
    star
    updated 4 months ago Apache License 2.0

    A comprehensive set of fairness metrics for datasets and machine learning models, explanations for these metrics, and algorithms to mitigate bias in datasets and models.

  • lale lale 333
    star
    updated about 1 month ago Apache License 2.0

    Library for Semi-Automated Data Science

ML - Ops

  • updated almost 3 years ago Apache License 2.0

    Cloud ML Engine repo. Please visit the new Vertex AI samples repo at https://github.com/GoogleCloudPlatform/vertex-ai-samples

  • dvc dvc 13,294
    star
    updated 5 months ago Apache License 2.0

    🦉 ML Experiments and Data Management with Git

  • mlflow mlflow 18,500
    star
    updated 16 days ago Apache License 2.0

    Open source platform for the machine learning lifecycle

  • tfx tfx 2,099
    star
    updated 3 months ago Apache License 2.0

    TFX is an end-to-end platform for deploying production ML pipelines

ML - Reinforcement

  • garage garage 1,858
    star
    updated over 1 year ago MIT License

    A toolkit for reproducible reinforcement learning research.

  • gym gym 34,442
    star
    updated 6 months ago Other

    A toolkit for developing and comparing reinforcement learning algorithms.

  • homework homework 1,520
    star
    updated over 1 year ago MIT License

    Assignments for CS294-112.

  • Metaworld Metaworld 1,178
    star
    updated 3 months ago MIT License

    Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

  • updated over 2 years ago MIT License

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • updated over 3 years ago MIT License

    Implementation of algorithms for continuous control (DDPG and NAF).

  • updated over 1 year ago MIT License

    Minimal and Clean Reinforcement Learning Examples

  • rllab rllab 2,892
    star
    updated over 1 year ago Other

    rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

  • trfl trfl 3,136
    star
    updated almost 2 years ago Apache License 2.0

    TensorFlow Reinforcement Learning

ML - Time Series

  • keras-tcn keras-tcn 1,863
    star
    updated 7 months ago MIT License

    Keras Temporal Convolutional Network.

  • orbit orbit 1,872
    star
    updated 4 months ago Other

    A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.

  • prophet prophet 17,943
    star
    updated 5 months ago MIT License

    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

  • updated about 1 month ago BSD 3-Clause "New...

    Statsmodels: statistical modeling and econometrics in Python

Mocking

  • freezegun freezegun 4,169
    star
    updated 3 months ago Apache License 2.0

    Let your Python tests travel through time

  • updated about 1 year ago Other

    A mocking library for requests

  • HTTPretty HTTPretty 2,107
    star
    updated 5 months ago MIT License

    Intercept HTTP requests at the Python socket level. Fakes the whole socket module

  • updated about 1 month ago BSD 3-Clause "New...

    a socket mock framework - for all kinds of socket animals, web-clients included

  • responses responses 4,127
    star
    updated about 2 months ago Apache License 2.0

    A utility for mocking out the Python Requests library.

  • vcrpy vcrpy 2,694
    star
    updated about 1 month ago MIT License

    Automatically mock your HTTP interactions to simplify and speed up testing

Natural Language Processing

  • funNLP funNLP 52,971
    star
    updated over 1 year ago

    A large collection of Chinese and English NLP resources: sensitive-word lists (Chinese and English), language detection, phone-number location and carrier lookup, gender inference from names, extraction of phone numbers, ID numbers, and email addresses, Chinese and Japanese name databases, Chinese abbreviation and character-decomposition dictionaries, word sentiment values, stop words, traditional/simplified Chinese conversion, and domain lexicons (occupations, synonyms, antonyms, car brands and parts, company names, classical poetry, IT, finance, idioms, place names, historical figures, medicine, food, law, animals). It also indexes Chinese word vectors, chat and rumor corpora, the Baidu Chinese question-answering dataset, sentence-similarity algorithms, BERT resources, text generation and summarization tools, information extraction tools such as cocoNLP, knowledge-graph projects and datasets, named entity recognition and relation extraction models, speech recognition datasets and systems, OCR tools such as cnocr, annotation tools (brat, Poplar, doccano, LIDA), dialogue systems and chatbots, pre-trained language models (BERT, ERNIE, ALBERT, ELECTRA, RoBERTa, GPT-2, XLM), Chinese language-understanding benchmarks and CLUENER, and toolkits such as jieba_fast, HarvestText, Synonyms, StanfordNLP, spaCy, Jiagu, Tokenizers, and Haystack.

  • gensim gensim 15,622
    star
    updated about 2 months ago GNU Lesser Genera...

    Topic Modelling for Humans

  • jieba jieba 33,208
    star
    updated 2 months ago MIT License

    Jieba Chinese word segmentation

  • langid.py langid.py 2,292
    star
    updated almost 5 years ago Other

    Stand-alone language identification system

  • pattern pattern 8,736
    star
    updated 5 months ago BSD 3-Clause "New...

    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

  • updated almost 2 years ago MIT License

    The pkuseg toolkit for multi-domain Chinese word segmentation

  • polyglot polyglot 2,312
    star
    updated 12 months ago Other

    Multilingual text (NLP) processing toolkit

  • pytext pytext 6,357
    star
    updated over 1 year ago Other

    A natural language modeling framework based on PyTorch

  • updated over 1 year ago BSD 3-Clause "New...

    Basic Utilities for PyTorch Natural Language Processing (NLP)

  • snownlp snownlp 6,396
    star
    updated almost 5 years ago MIT License

    Python library for processing Chinese text

  • stanza stanza 7,278
    star
    updated 2 days ago Other

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Network Virtualization

  • mininet mininet 5,355
    star
    updated 4 months ago BSD 3-Clause "New...

    Emulator for rapid prototyping of Software Defined Networks

  • napalm napalm 2,221
    star
    updated 2 months ago Apache License 2.0

    Network Automation and Programmability Abstraction Layer with Multivendor support

  • pox pox 618
    star
    updated over 1 year ago Apache License 2.0

    The POX network software platform

News Feed

  • updated about 2 months ago BSD 3-Clause "New...

    Generate generic activity streams from the actions on your site. Users can follow any actors' activities for personalized streams.

  • updated over 1 year ago Other

    Stream Framework is a Python library for building news feeds, activity streams, and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology.

NLP

  • allennlp allennlp 11,751
    star
    updated almost 2 years ago Apache License 2.0

    An open-source NLP research library, built on PyTorch.

  • bert bert 37,769
    star
    updated 3 months ago Apache License 2.0

    TensorFlow code and pre-trained models for BERT

  • updated almost 2 years ago MIT License

    Fixes contractions such as `you're` to `you are`

  • updated over 1 year ago GNU General Publi...

    Fuzzy String Matching in Python

  • gensim gensim 15,622
    star
    updated about 2 months ago GNU Lesser Genera...

    Topic Modelling for Humans

  • haystack haystack 14,953
    star
    updated 3 months ago Apache License 2.0

    🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

  • nltk nltk 13,395
    star
    updated 2 months ago Apache License 2.0

    NLTK Source

  • polyglot polyglot 2,312
    star
    updated 12 months ago Other

    Multilingual text (NLP) processing toolkit

  • updated 9 days ago Apache License 2.0

    State-of-the-Art Text Embeddings

  • spaCy spaCy 29,546
    star
    updated 2 months ago MIT License

    💫 Industrial-strength Natural Language Processing (NLP) in Python

  • stanza stanza 7,278
    star
    updated 2 days ago Other

    Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

  • transformers transformers 133,705
    star
    updated 9 days ago Apache License 2.0

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

  • updated over 5 years ago Apache License 2.0

    Lightning Fast Language Prediction 🚀

  • updated 5 months ago GNU Affero Genera...

    A tool for extracting plain text from Wikipedia dumps

  • word_cloud word_cloud 10,101
    star
    updated about 2 months ago MIT License

    A little word cloud generator in Python

Object Factories

  • updated 4 months ago MIT License

    A test fixtures replacement for Python

  • mixer mixer 942
    star
    updated 8 months ago Other

    Mixer is a fixtures replacement. Supports Django, Flask, SQLAlchemy, and custom Python objects.

ORM

  • updated about 1 year ago

    A curated list of awesome tools for SQLAlchemy

  • dataset dataset 4,766
    star
    updated about 1 year ago MIT License

    Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

  • updated 7 months ago BSD 2-Clause "Sim...

    Rich Python data types for Redis

  • updated 2 months ago MIT License

    A Python Object-Document-Mapper for working with MongoDB

  • orator orator 1,429
    star
    updated almost 2 years ago MIT License

    The Orator ORM provides a simple yet beautiful ActiveRecord implementation.

  • orm orm 1,752
    star
    updated about 2 years ago BSD 3-Clause "New...

    An async ORM. 🗃

  • peewee peewee 11,103
    star
    updated about 1 month ago MIT License

    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb

  • pony pony 3,368
    star
    updated about 1 year ago Apache License 2.0

    Pony Object Relational Mapper

  • pydal pydal 488
    star
    updated 4 months ago BSD 3-Clause "New...

    A pure Python Database Abstraction Layer

Package Management

  • conda conda 6,285
    star
    updated 3 months ago Other

    A system-level, binary package and environment manager running on all major operating systems and platforms.

  • pip-tools pip-tools 7,668
    star
    updated 2 months ago BSD 3-Clause "New...

    A set of tools to keep your pinned Python dependencies fresh.

Package Repositories

  • updated 3 months ago Academic Free Lic...

    A PyPI mirror client according to PEP 381 http://www.python.org/dev/peps/pep-0381/

  • devpi devpi 880
    star
    updated 2 months ago

    Python PyPI staging server and packaging, testing, release tool

Pandas

  • missingno missingno 3,634
    star
    updated over 1 year ago MIT License

    Missing data visualization module for Python.

  • modin modin 9,857
    star
    updated about 1 month ago Apache License 2.0

    Modin: Scale your Pandas workflows by changing a single line of code

  • pandas pandas 43,074
    star
    updated 2 months ago BSD 3-Clause "New...

    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

  • updated about 1 year ago Other

    Extract data from a wide range of Internet sources into a pandas DataFrame.

  • qgrid qgrid 3,034
    star
    updated 10 months ago Apache License 2.0

    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Penetration testing

  • fsociety fsociety 10,412
    star
    updated 3 months ago MIT License

    fsociety Hacking Tools Pack – A Penetration Testing Framework

  • updated 3 months ago

    The Social-Engineer Toolkit (SET) repository from TrustedSec - All new versions of SET will be deployed here.

  • sqlmap sqlmap 28,686
    star
    updated almost 1 year ago Other

    Automatic SQL injection and database takeover tool

Performance

  • black black 38,653
    star
    updated about 1 month ago MIT License

    The uncompromising Python code formatter

  • cython cython 8,388
    star
    updated about 1 year ago Apache License 2.0

    The most widely used Python to C compiler

  • numba numba 9,480
    star
    updated 6 months ago BSD 2-Clause "Sim...

    NumPy aware dynamic Python compiler using LLVM

  • numpy numpy 26,893
    star
    updated 4 months ago Other

    The fundamental package for scientific computing with Python.

  • pytest pytest 11,750
    star
    updated 3 months ago MIT License

    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

  • scipy scipy 12,872
    star
    updated 2 months ago BSD 3-Clause "New...

    SciPy library main repository

Permissions

  • updated about 2 months ago MIT License

    Awesome Django authorization, without the database

Processes

  • updated almost 2 years ago MIT License

    Subprocesses for Humans 2.0.

  • sh sh 6,871
    star
    updated 5 months ago MIT License

    Python process launching

Profiling

  • profiling profiling 2,965
    star
    updated about 4 years ago BSD 3-Clause "New...

    Was an interactive continuous Python profiler.

    updated about 1 year ago BSD 3-Clause "New...

    🚴 Call stack profiler for Python. Shows you why your code is slow!

  • yappi yappi 1,451
    star
    updated 4 months ago MIT License

    Yet Another Python Profiler, but this time multithreading, asyncio and gevent aware.

Recommender Systems

  • annoy annoy 13,197
    star
    updated 3 months ago Apache License 2.0

    Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

  • fastFM fastFM 1,075
    star
    updated over 2 years ago Other

    fastFM: A Library for Factorization Machines

  • implicit implicit 3,508
    star
    updated 4 months ago MIT License

    Fast Python Collaborative Filtering for Implicit Feedback Datasets

  • lightfm lightfm 4,738
    star
    updated 3 months ago Apache License 2.0

    A Python implementation of LightFM, a hybrid recommendation algorithm.

  • spotlight spotlight 2,857
    star
    updated over 1 year ago MIT License

    Deep recommender models using PyTorch.

  • Surprise Surprise 6,296
    star
    updated 5 months ago BSD 3-Clause "New...

    A Python scikit for building and analyzing recommender systems

  • tensorrec tensorrec 1,277
    star
    updated over 1 year ago Apache License 2.0

    A TensorFlow recommendation algorithm and framework in Python.

Refactoring

  • rope rope 1,898
    star
    updated 6 months ago GNU Lesser Genera...

    a python refactoring library

RESTful API

  • apistar apistar 5,578
    star
    updated over 2 years ago BSD 3-Clause "New...

    The Web API toolkit. 🛠

  • updated 4 months ago Other

    Build Web Services with Pyramid.

  • eve eve 6,682
    star
    updated about 2 months ago Other

    REST API framework designed for human beings

  • falcon falcon 9,506
    star
    updated 19 days ago Apache License 2.0

    The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.

  • fastapi fastapi 73,897
    star
    updated 3 months ago MIT License

    FastAPI framework, high performance, easy to learn, fast to code, ready for production

  • updated about 1 year ago BSD 3-Clause "New...

    Simple framework for creating REST APIs

  • hug hug 6,856
    star
    updated 4 months ago MIT License

    Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.

  • sandman2 sandman2 1,997
    star
    updated 2 months ago Apache License 2.0

    Automatically generate a RESTful API service for your legacy database. No code required!

Robotics

  • updated about 1 year ago Other

    Python sample code for robotics algorithms.

RPC Servers

Science

  • bcbb bcbb 582
    star
    updated over 1 year ago

    Incubator for useful bioinformatics code, primarily in Python and R

  • updated 8 months ago GNU General Publi...

    Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

  • updated about 1 month ago BSD 3-Clause "New...

    Statsmodels: statistical modeling and econometrics in Python

  • sympy sympy 12,875
    star
    updated 16 days ago Other

    A computer algebra system written in pure Python

  • zipline zipline 17,443
    star
    updated 9 months ago Apache License 2.0

    Zipline, a Pythonic Algorithmic Trading Library

Security

  • impacket impacket 9,423
    star

    Impacket is a collection of Python classes for working with network protocols.

  • lemur lemur 1,651
    star
    updated about 1 year ago Apache License 2.0

    Repository for the Lemur Certificate Manager

  • metaflow metaflow 8,012
    star
    updated about 2 months ago Apache License 2.0

    🚀 Build and manage real-life ML, AI, and data science projects with ease!

  • repokid repokid 1,104
    star
    updated over 1 year ago Apache License 2.0

    AWS Least Privilege for Distributed, High-Velocity Deployment

  • updated over 3 years ago Apache License 2.0

    Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.

  • w3af w3af 4,309
    star
    updated over 1 year ago

    w3af: web application attack and audit framework, the open source web vulnerability scanner.

Serialization

  • updated about 2 months ago MIT License

    A lightweight library for converting complex objects to and from simple Python datatypes.

  • updated 3 months ago Other

    Python bindings for the simdjson project.

  • updated about 2 months ago Other

    Python wrapper around rapidjson

  • rapidjson rapidjson 14,163
    star
    updated 2 months ago Other

    A fast JSON parser/generator for C++ with both SAX/DOM style API

Serverless Frameworks

  • updated over 1 year ago ISC License

    A toolkit for developing and deploying serverless Python code in AWS Lambda.

  • Zappa Zappa 3,284
    star
    updated about 1 month ago MIT License

    Serverless Python

Shell

  • xonsh xonsh 8,337
    star
    updated about 1 month ago Other

    🐚 Python-powered shell. Full-featured and cross-platform.

Simulation

  • PyPSA PyPSA 1,190
    star
    updated 3 months ago MIT License

    PyPSA: Python for Power System Analysis

  • updated 2 months ago MIT License

    A community based Python library for quantitative economics

  • RLBot RLBot 591
    star
    updated 5 months ago MIT License

    A framework that allows people to write their own Rocket League bots.

Specific Formats Processing

  • csvkit csvkit 6,007
    star
    updated 2 months ago MIT License

    A suite of utilities for converting to and working with CSV, the king of tabular file formats.

  • mistune mistune 2,323
    star
    updated about 1 year ago BSD 3-Clause "New...

    A fast yet powerful Python Markdown parser with renderers and plugins.

  • pdfminer pdfminer 5,237
    star
    updated almost 2 years ago MIT License

    Python PDF Parser (Not actively maintained). Check out pdfminer.six.

  • pyexcel pyexcel 1,209
    star
    updated 12 days ago Other

    Single API for reading, manipulating and writing data in csv, ods, xls, xlsx and xlsm files

  • updated 2 months ago MIT License

    Create and modify Word documents with Python

  • updated 6 months ago GNU Lesser Genera...

    Use a docx as a jinja2 template

  • updated 3 months ago MIT License

    Create Open XML PowerPoint documents in Python

  • tablib tablib 4,586
    star
    updated 4 months ago MIT License

    Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.

  • unoconv unoconv 2,590
    star
    updated over 1 year ago GNU General Publi...

    Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.

  • unp unp 414
    star
    updated over 7 years ago Other

    Unpacks things.

  • xlrd xlrd 2,147
    star
    updated 4 months ago Other

    Please use openpyxl where you can...

  • updated 4 months ago BSD 2-Clause "Sim...

    A Python module for creating Excel XLSX files.

  • xlwt xlwt 1,041
    star
    updated over 4 years ago Other

    Library to create spreadsheet files compatible with MS Excel 97/2000/XP/2003 XLS files, on any platform.

Static Site Generator

  • lektor lektor 3,729
    star
    updated 12 months ago BSD 3-Clause "New...

    The lektor static file content management system

  • makesite makesite 1,691
    star
    updated over 1 year ago MIT License

    Simple, lightweight, and magic-free static site/blog generator for Python coders

  • mkdocs mkdocs 18,095
    star
    updated 8 months ago BSD 2-Clause "Sim...

    Project documentation with Markdown.

  • nikola nikola 2,486
    star
    updated about 1 year ago MIT License

    A static website and blog generator

  • pelican pelican 12,470
    star
    updated 3 months ago GNU Affero Genera...

    Static site generator that supports Markdown and reST syntax. Powered by Python.

Study

  • updated over 2 years ago

    Matplotlib tutorial for beginners

  • numpy-100 numpy-100 10,706
    star
    updated over 1 year ago MIT License

    100 numpy exercises (with solutions)

  • updated 9 months ago Other

    An open access book on scientific visualization using python and matplotlib

Tagging

  • updated 3 months ago BSD 3-Clause "New...

    Simple tagging for django

Task Queues

  • dramatiq dramatiq 3,640
    star
    updated over 1 year ago GNU Lesser Genera...

    A fast and reliable background task processing library for Python 3.

  • huey huey 5,154
    star
    updated about 1 month ago MIT License

    a little task queue for python

  • mrq mrq 877
    star
    updated over 1 year ago MIT License

    Mr. Queue - A distributed worker task queue in Python using Redis & gevent

  • rq rq 9,799
    star
    updated about 1 month ago Other

    Simple job queues for Python

Template

  • updated 9 days ago BSD 3-Clause "New...

    A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

  • PyGithub PyGithub 6,903
    star
    updated about 2 months ago GNU Lesser Genera...

    Typed interactions with the GitHub API v3

  • updated 3 months ago MIT License

    BDD library for the py.test runner

Template Engine

  • jinja jinja 10,210
    star
    updated 2 months ago BSD 3-Clause "New...

    A very fast and expressive template engine.

Terminal

  • bless bless 2,722
    star
    updated about 1 year ago Apache License 2.0

    Repository for BLESS, an SSH Certificate Authority that runs as an AWS Lambda function

  • bpython bpython 2,598
    star
    updated 4 months ago Other

    bpython - A fancy curses interface to the Python interactive interpreter

  • Gooey Gooey 20,574
    star
    updated 8 months ago MIT License

    Turn (almost) any Python command line program into a full GUI application with one line

  • icdiff icdiff 4,177
    star
    updated 6 months ago Other

    improved colored diff

  • ipython ipython 16,302
    star
    updated 6 days ago BSD 3-Clause "New...

    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.

  • lemur lemur 1,651
    star
    updated about 1 year ago Apache License 2.0

    Repository for the Lemur Certificate Manager

  • pexpect pexpect 2,425
    star
    updated about 1 year ago Other

    A Python module for controlling interactive programs in a pseudo-terminal

  • repokid repokid 1,104
    star
    updated over 1 year ago Apache License 2.0

    AWS Least Privilege for Distributed, High-Velocity Deployment

  • updated over 3 years ago Apache License 2.0

    Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.

  • tmuxp tmuxp 3,847
    star
    updated 10 months ago MIT License

    🖥️ tmux session manager. built on libtmux

Testing

  • updated 4 months ago MIT License

    A test fixtures replacement for Python

  • fake2db fake2db 2,256
    star
    updated almost 5 years ago GNU General Publi...

    create custom test databases that are populated with fake data

  • faker faker 17,554
    star
    updated about 2 months ago MIT License

    Faker is a Python package that generates fake data for you.

  • freezegun freezegun 4,169
    star
    updated 3 months ago Apache License 2.0

    Let your Python tests travel through time

  • green green 786
    star
    updated 6 months ago MIT License

    Green is a clean, colorful, fast python test runner.

  • updated about 1 year ago Other

    A mocking library for requests

  • HTTPretty HTTPretty 2,107
    star
    updated 5 months ago MIT License

    Intercept HTTP requests at the Python socket level. Fakes the whole socket module

  • updated about 2 months ago Other

    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.

  • locust locust 24,758
    star
    updated 19 days ago MIT License

    Write scalable load tests in plain Python 🚗💨

  • mimesis mimesis 4,380
    star
    updated 4 months ago MIT License

    Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

  • mixer mixer 942
    star
    updated 8 months ago Other

    Mixer is a fixtures replacement. Supports Django, Flask, SQLAlchemy, and custom Python objects.

  • nose2 nose2 794
    star
    updated about 1 month ago Other

    The successor to nose, based on unittest2

  • updated about 1 month ago BSD 3-Clause "New...

    a socket mock framework - for all kinds of socket animals, web-clients included

  • responses responses 4,127
    star
    updated about 2 months ago Apache License 2.0

    A utility for mocking out the Python Requests library.

  • updated about 1 month ago Apache License 2.0

    Generic automation framework for acceptance testing and RPA

  • splinter splinter 2,714
    star
    updated about 1 month ago BSD 3-Clause "New...

    splinter - python test framework for web applications

  • vcrpy vcrpy 2,694
    star
    updated about 1 month ago MIT License

    Automatically mock your HTTP interactions to simplify and speed up testing

Testing Frameworks

  • updated about 2 months ago Other

    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.

  • nose2 nose2 794
    star
    updated about 1 month ago Other

    The successor to nose, based on unittest2

  • updated about 1 month ago Apache License 2.0

    Generic automation framework for acceptance testing and RPA

Test Runners

  • green green 786
    star
    updated 6 months ago MIT License

    Green is a clean, colorful, fast python test runner.

Text Processing

  • chardet chardet 2,152
    star
    updated 3 months ago GNU Lesser Genera...

    Python character encoding detector

  • updated over 1 year ago GNU General Publi...

    Fuzzy String Matching in Python

  • updated over 1 year ago MIT License

    Implementation of hashids (http://hashids.org) in Python. Compatible with Python 2 and Python 3

  • updated over 1 year ago MIT License

    Paranoid text spacing in Python

  • ply ply 2,717
    star
    updated about 1 year ago

    Python Lex-Yacc

  • pyfiglet pyfiglet 1,348
    star
    updated 3 months ago Other

    An implementation of figlet written in Python

  • pyparsing pyparsing 2,004
    star
    updated 12 months ago MIT License

    Python library for creating PEG parsers

  • updated 11 months ago Other

    A simple Python module for parsing human names into their individual components

  • updated 3 months ago Apache License 2.0

    Python port of Google's libphonenumber

  • updated about 1 year ago MIT License

    Convert Chinese characters to pinyin (pypinyin)

  • updated 8 months ago MIT License

    Returns unicode slugs

  • updated over 1 year ago MIT License

    A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

  • shortuuid shortuuid 1,905
    star
    updated over 1 year ago BSD 3-Clause "New...

    A generator library for concise, unambiguous and URL-safe UUIDs.

  • sqlparse sqlparse 3,682
    star
    updated 2 months ago BSD 3-Clause "New...

    A non-validating SQL parser module for Python

  • updated over 1 year ago BSD 3-Clause "New...

    A slugifier that works in unicode

Third-party APIs

  • boto3 boto3 8,877
    star
    updated 3 months ago Apache License 2.0

    AWS SDK for Python

  • updated 3 months ago Apache License 2.0

    Python SDK for Facebook's Graph API

  • gspread gspread 7,058
    star
    updated 27 days ago MIT License

    Google Sheets Python API

  • twython twython 1,849
    star
    updated over 2 years ago MIT License

    Actively maintained, pure Python wrapper for the Twitter API. Supports both normal and streaming Twitter APIs.

Typing

  • mypy mypy 18,263
    star
    updated about 1 month ago Other

    Optional static typing for Python

  • typeguard typeguard 1,494
    star
    updated 3 months ago Other

    Run-time type checker for Python

  • updated about 1 year ago MIT License

    Runtime inspection utilities for Python typing module

URL Manipulation

  • furl furl 2,615
    star
    updated 3 months ago Other

    🌐 URL parsing and manipulation made easy.

  • purl purl 293
    star
    updated about 1 year ago MIT License

    A simple, immutable URL class with a clean API for interrogation and manipulation.

  • updated over 2 years ago GNU General Publi...

    🔌 Generating short urls with python has never been easier

  • webargs webargs 1,372
    star
    updated 2 months ago MIT License

    A friendly library for parsing HTTP request arguments, with built-in support for popular web frameworks, including Flask, Django, Bottle, Tornado, Pyramid, webapp2, Falcon, and aiohttp.

Utility

  • updated about 2 years ago

    A common base representation of python source code for pylint and other projects

  • bandit bandit 5,900
    star
    updated 8 months ago Apache License 2.0

    Bandit is a tool designed to find common security issues in Python code.

  • black black 38,653
    star
    updated about 1 month ago MIT License

    The uncompromising Python code formatter

  • click click 15,469
    star
    updated 2 months ago BSD 3-Clause "New...

    Python composable command line interface toolkit

  • updated about 1 month ago Other

    Web APIs for Django. 🎸

  • flake8 flake8 3,068
    star
    updated about 1 year ago Other

    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.

  • flask flask 67,407
    star
    updated 2 months ago BSD 3-Clause "New...

    The Python micro framework for building web applications.

  • httpx httpx 11,445
    star
    updated 12 months ago BSD 3-Clause "New...

    A next generation HTTP client for Python. 🦋

  • isort isort 6,471
    star
    updated 3 months ago MIT License

    A Python utility / library to sort imports.

  • mccabe mccabe 602
    star
    updated about 1 year ago Other

    McCabe complexity checker for Python

  • PyGithub PyGithub 6,903
    star
    updated about 2 months ago GNU Lesser Genera...

    Typed interactions with the GitHub API v3

  • pylint pylint 4,246
    star
    updated about 2 years ago

    It's not just a linter that annoys you!

  • python-fire python-fire 26,842
    star
    updated about 2 months ago Other

    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

  • scrapy scrapy 52,666
    star
    updated 22 days ago BSD 3-Clause "New...

    Scrapy, a fast high-level web crawling & scraping framework for Python.

  • updated over 1 year ago MIT License

    The official Python SDK for Sentry.io

  • updated 15 days ago MIT License

    The Database Toolkit for Python

Video

  • updated over 4 years ago Other

    Video processing routines for SciPy

  • vidgear vidgear 3,329
    star
    updated 3 months ago Apache License 2.0

    A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥

Visualization

  • altair altair 9,105
    star
    updated 3 months ago BSD 3-Clause "New...

    Declarative statistical visualization library for Python

  • bokeh bokeh 19,117
    star
    updated 3 months ago BSD 3-Clause "New...

    Interactive Data Visualization in the browser, from Python

  • updated about 1 year ago Other

    Python binding for the freetype library

  • matplotlib matplotlib 20,025
    star
    updated about 1 month ago

    matplotlib: plotting with Python

  • updated about 4 years ago BSD 2-Clause "Sim...

    Matplotlib 3.1 cheat sheet.

  • updated over 2 years ago

    Matplotlib tutorial for beginners

  • numpy-100 numpy-100 10,706
    star
    updated over 1 year ago MIT License

    100 numpy exercises (with solutions)

  • plotly.py plotly.py 15,980
    star
    updated about 2 months ago MIT License

    The interactive graphing library for Python ✨ This project now includes Plotly Express!

  • updated 9 months ago Other

    An open access book on scientific visualization using python and matplotlib

  • updated over 1 year ago

    Ten simple rules for better figures

  • vispy vispy 3,282
    star
    updated 3 months ago Other

    Main repository for Vispy

Web

  • bottle bottle 8,217
    star
    updated 10 months ago MIT License

    bottle.py is a fast and simple micro-framework for python web-applications.

  • django django 78,641
    star
    updated 2 months ago BSD 3-Clause "New...

    The Web framework for perfectionists with deadlines.

  • updated about 1 month ago Other

    Web APIs for Django. 🎸

  • peewee peewee 11,103
    star
    updated about 1 month ago MIT License

    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb

  • PyMySQL PyMySQL 7,616
    star
    updated 4 months ago MIT License

    MySQL client library for Python

  • updated 5 months ago MIT License

    Coverage plugin for pytest.

  • updated 3 months ago Other

    A Django plugin for pytest.

  • updated 3 months ago MIT License

    A set of pytest fixtures to test Flask applications

  • updated about 2 months ago Other

    Plugin for generating HTML reports for pytest results

  • updated 3 months ago Other

    Plugin for running Selenium with pytest

  • updated over 2 years ago BSD 3-Clause "New...

    Social auth made simple

  • updated about 2 months ago MIT License

    THIS IS NOT THE OFFICIAL REPO - PLEASE SUBMIT PRs ETC AT: http://github.com/sqlalchemy/sqlalchemy

  • updated 15 days ago MIT License

    The Database Toolkit for Python

  • starlette starlette 8,881
    star
    updated 12 months ago BSD 3-Clause "New...

    The little ASGI framework that shines. 🌟

  • uvicorn uvicorn 7,747
    star
    updated 7 months ago BSD 3-Clause "New...

    An ASGI web server, for Python. 🦄

Web Asset Management

  • updated 3 months ago Other

    Compresses linked and inline javascript or CSS into a single cached file.

  • updated 2 months ago MIT License

    Pipeline is an asset packaging library for Django.

  • updated 13 days ago BSD 3-Clause "New...

    https://django-storages.readthedocs.io/

  • updated about 1 year ago BSD 2-Clause "Sim...

    Flask webassets integration.

  • updated over 2 years ago BSD 2-Clause "Sim...

    Asset management for Python web development.

Web Content Extracting

  • lassie lassie 588
    star
    updated over 2 years ago MIT License

    Web Content Retrieval for Humans™

  • updated 4 months ago MIT License

    a small library for extracting rich content from urls

  • newspaper newspaper 14,004
    star
    updated 3 months ago MIT License

    newspaper3k: news, full-text, and article metadata extraction in Python 3.

  • updated 7 months ago MIT License

    Pythonic HTML Parsing for Humans™

  • sumy sumy 3,491
    star
    updated 6 months ago Apache License 2.0

    Module for automatic summarization of text documents and HTML pages.

  • textract textract 3,852
    star
    updated 3 months ago MIT License

    extract text from any document. no muss. no fuss.

Web Crawling

  • grab grab 2,330
    star
    updated over 1 year ago MIT License

    Web Scraping Framework

  • updated 10 months ago MIT License

    A Python library for automating interaction with websites.

  • portia portia 8,991
    star
    updated about 1 year ago BSD 3-Clause "New...

    Visual scraping for Scrapy

  • pyspider pyspider 16,434
    star
    updated 6 months ago Apache License 2.0

    A powerful spider (web crawler) system in Python.

  • updated about 4 years ago BSD 3-Clause "New...

Web Frameworks

  • updated 3 months ago Other

    The Best Django Resource, Awesome Django for mature packages.

  • updated 3 months ago Creative Commons ...

    A curated list of awesome things related to Django

  • updated 12 months ago

    A curated list of awesome Flask resources and plugins

  • updated over 3 years ago

    A curated list of awesome Pyramid apps, projects and resources.

  • masonite masonite 2,177
    star
    updated 3 months ago MIT License

    The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ

WebSocket

  • updated 6 months ago MIT License

    WebSocket and WAMP in Python for Twisted and asyncio

  • channels channels 6,086
    star
    updated about 2 months ago BSD 3-Clause "New...

    Developer-friendly asynchrony for Django

  • Library for building WebSocket servers and clients in Python

WSGI Servers

  • bjoern bjoern 2,993
    star
    updated 3 months ago Other

    A screamingly fast Python 2/3 WSGI server written in C.

  • gunicorn gunicorn 9,757
    star
    updated 2 months ago Other

    gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.

  • waitress waitress 1,310
    star
    updated about 1 year ago Other

    Waitress - A WSGI server for Python 3

  • werkzeug werkzeug 6,616
    star
    updated 2 months ago BSD 3-Clause "New...

    The comprehensive WSGI web application library.