Paul O'Leary McCann (@polm)

Top repositories

1

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
C++
384
star
2

cutlet

Japanese to romaji converter in Python
Python
283
star
3

posuto

๐Ÿฃ๐Ÿ“ฎใ€  Japanese postal code data.
Python
201
star
4

unidic-py

Unidic packaged for installation via pip.
Python
74
star
5

ndl-crop

Script for cropping photos from the NDL.
Python
37
star
6

unidic-lite

A small version of UniDic for easy pip installs.
Python
36
star
7

showmemore

SHOW ME MORE OF [-----]
Python
28
star
8

ipadic-py

IPAdic packaged for easy use from Python.
Python
25
star
9

awesome-digital-collections

Publicly accessible digital collections.
19
star
10

palladian-facades

๐Ÿ›๏ธ Palladian Facade Generator for ProcJam2015
LiveScript
19
star
11

multilang-filter

Script for preprocessing multilingual Markdown.
Python
14
star
12

deltos

A magic notepad. δ
LiveScript
13
star
13

gamefaces

Public domain headshots
12
star
14

dupdupdraw

Forthish drawing system with random program generation.
JavaScript
11
star
15

chargen

Random generator taking literature as input.
Python
7
star
16

node-migemo

Japanese search regex generator
LiveScript
7
star
17

ja-tokenizer-benchmark

Compare the speed of various Japanese tokenizers in Python.
Python
7
star
18

philtre

Search objects with a familiar syntax.
LiveScript
6
star
19

jp-ner

[abandoned] Work on generating an NER dataset for Japanese
Python
5
star
20

jumandic-py

JumanDic packaged for use with PyPI.
Python
3
star
21

awesome-gamedev-jp

A collection of links useful for game development.
3
star
22

shesha

Random generator toolkit
JavaScript
3
star
23

bontan.ls

Bontan is a simple scraper primarily intended for articles.
LiveScript
2
star
24

lua-mecab

Lua wrapper for the MeCab Japanese morphological analyzer.
C++
2
star
25

fugashi-streamlit-demo

Streamlit demo for fugashi
Python
2
star
26

gutenjuice

Top books from Project Gutenberg, in raw form and extracted.
2
star
27

bookoff-redirect

Deal with BookOff query parameter nonsense.
HTML
2
star
28

fugashi-sagemaker-demo

A basic introduction to using fugashi for Japanese tokenization.
Jupyter Notebook
2
star
29

github-tasks.vim

GitHub task plugin for Vim
Vim Script
2
star
30

spaCy

💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
Python
1
star
31

mecab-packed

[broken/wip] Bundled mecab & unidic for installing via pip.
Shell
1
star
32

language-disruptor

Randomly replace words in Japanese sentences.
Python
1
star
33

poine-tool

POINE-related tools
Python
1
star
34

mecab-manylinux1-wheel-builder

Build manylinux1 wheels with MeCab installed.
Shell
1
star
35

bontan

Get embed code for a link, using OEmbed as appropriate.
Nim
1
star
36

yuzulabo.works

Yuzu Labo web site
CSS
1
star
37

deltos.vim

A vim plugin for use with Deltos.
Vim Script
1
star
38

kanji

Kanji data package for Python
Python
1
star
39

visidata-conll

CoNLL-U data loader for VisiData.
Python
1
star
40

everybayes

Document classification for everyone.
Python
1
star
41

jfmt.lua

Tool for wrapping Japanese text to natural width
Lua
1
star
42

searchy

[discontinued] Simple interactive search for Node
LiveScript
1
star