• Stars
    star
    7
  • Rank 2,294,772 (Top 46 %)
  • Language
    Python
  • Created about 5 years ago
  • Updated almost 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Compare the speed of various Japanese tokenizers in Python.

More Repositories

1

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
C++
384
star
2

cutlet

Japanese to romaji converter in Python
Python
283
star
3

posuto

๐Ÿฃ๐Ÿ“ฎใ€  Japanese postal code data.
Python
201
star
4

unidic-py

Unidic packaged for installation via pip.
Python
74
star
5

ndl-crop

Script for cropping photos from the NDL.
Python
37
star
6

unidic-lite

A small version of UniDic for easy pip installs.
Python
36
star
7

showmemore

SHOW ME MORE OF [-----]
Python
28
star
8

ipadic-py

IPAdic packaged for easy use from Python.
Python
25
star
9

awesome-digital-collections

Publicly accessible digital collections.
19
star
10

palladian-facades

๐Ÿ›๏ธ Palladian Facade Generator for ProcJam2015
LiveScript
19
star
11

multilang-filter

Script for preprocessing multilingual Markdown.
Python
14
star
12

deltos

A magic notepad. ฮด
LiveScript
13
star
13

gamefaces

Public domain headshots
12
star
14

dupdupdraw

Forthish drawing system with random program generation.
JavaScript
11
star
15

node-migemo

Japanese search regex generator
LiveScript
7
star
16

chargen

Random generator taking literature as input.
Python
7
star
17

philtre

Search objects with a familiar syntax.
LiveScript
6
star
18

jp-ner

[abandoned] Work on generating an NER dataset for Japanese
Python
5
star
19

jumandic-py

JumanDic packaged for use with PyPI.
Python
3
star
20

shesha

Random generator toolkit
JavaScript
3
star
21

awesome-gamedev-jp

ใ‚ฒใƒผใƒ ้–‹็™บใซๅฝน็ซ‹ใคใƒชใƒณใ‚ฏ้›†
3
star
22

bontan.ls

Bontan is a simple scraper primarily intended for articles.
LiveScript
2
star
23

lua-mecab

Lua wrapper for Mecab Japanese morphological analyzer.
C++
2
star
24

fugashi-streamlit-demo

Streamlit demo for fugashi
Python
2
star
25

gutenjuice

Top books from Project Gutenberg, in raw form and extracted.
2
star
26

bookoff-redirect

Deal with BookOff query parameter nonsense.
HTML
2
star
27

fugashi-sagemaker-demo

A basic introduction to using fugashi for Japanese tokenization.
Jupyter Notebook
2
star
28

github-tasks.vim

Github task plugin for vim
Vim Script
2
star
29

mecab-packed

[broken/wip] Bundled mecab & unidic for installing via pip.
Shell
1
star
30

language-disruptor

Randomly replace words in Japanese sentences.
Python
1
star
31

poine-tool

POINE้–ข้€ฃใฎใƒ„ใƒผใƒซ
Python
1
star
32

bontan

Get embed code for a link, using OEmbed as appropriate.
Nim
1
star
33

yuzulabo.works

Yuzu Labo web site
CSS
1
star
34

mecab-manylinux1-wheel-builder

Build manylinux1 wheels with MeCab installed.
Shell
1
star
35

deltos.vim

A vim plugin for use with Deltos.
Vim Script
1
star
36

kanji

Kanji data package for Python
Python
1
star
37

visidata-conll

CoNLL-U data loader for Visidata.
Python
1
star
38

everybayes

Document classification for everyone.
Python
1
star
39

jfmt.lua

Tool for wrapping Japanese text to natural width
Lua
1
star
40

searchy

[discontinued] Simple interactive search for Node
LiveScript
1
star