  • Stars: 1
  • Language: Python
  • Created over 12 years ago
  • Updated over 12 years ago

Repository Details

Document classification for everyone.

More Repositories

1

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
C++
384
star
2

cutlet

Japanese to romaji converter in Python
Python
283
star
3

posuto

🏣📮〠 Japanese postal code data.
Python
201
star
4

unidic-py

Unidic packaged for installation via pip.
Python
74
star
5

ndl-crop

Script for cropping photos from the NDL.
Python
37
star
6

unidic-lite

A small version of UniDic for easy pip installs.
Python
36
star
7

showmemore

SHOW ME MORE OF [-----]
Python
28
star
8

ipadic-py

IPAdic packaged for easy use from Python.
Python
25
star
9

awesome-digital-collections

Publicly accessible digital collections.
19
star
10

palladian-facades

🏛️ Palladian Facade Generator for ProcJam2015
LiveScript
19
star
11

multilang-filter

Script for preprocessing multilingual Markdown.
Python
14
star
12

deltos

A magic notepad. δ
LiveScript
13
star
13

gamefaces

Public domain headshots
12
star
14

dupdupdraw

Forthish drawing system with random program generation.
JavaScript
11
star
15

node-migemo

Japanese search regex generator
LiveScript
7
star
16

chargen

Random generator taking literature as input.
Python
7
star
17

ja-tokenizer-benchmark

Compare the speed of various Japanese tokenizers in Python.
Python
7
star
18

philtre

Search objects with a familiar syntax.
LiveScript
6
star
19

jp-ner

[abandoned] Work on generating an NER dataset for Japanese
Python
5
star
20

jumandic-py

JumanDic packaged for use with PyPI.
Python
3
star
21

shesha

Random generator toolkit
JavaScript
3
star
22

awesome-gamedev-jp

A collection of links useful for game development.
3
star
23

bontan.ls

Bontan is a simple scraper primarily intended for articles.
LiveScript
2
star
24

lua-mecab

Lua wrapper for the MeCab Japanese morphological analyzer.
C++
2
star
25

fugashi-streamlit-demo

Streamlit demo for fugashi
Python
2
star
26

gutenjuice

Top books from Project Gutenberg, in raw form and extracted.
2
star
27

bookoff-redirect

Deal with BookOff query parameter nonsense.
HTML
2
star
28

fugashi-sagemaker-demo

A basic introduction to using fugashi for Japanese tokenization.
Jupyter Notebook
2
star
29

github-tasks.vim

GitHub task plugin for Vim
Vim Script
2
star
30

mecab-packed

[broken/wip] Bundled mecab & unidic for installing via pip.
Shell
1
star
31

language-disruptor

Randomly replace words in Japanese sentences.
Python
1
star
32

poine-tool

Tools related to POINE
Python
1
star
33

bontan

Get embed code for a link, using OEmbed as appropriate.
Nim
1
star
34

yuzulabo.works

Yuzu Labo web site
CSS
1
star
35

mecab-manylinux1-wheel-builder

Build manylinux1 wheels with MeCab installed.
Shell
1
star
36

deltos.vim

A Vim plugin for use with Deltos.
Vim Script
1
star
37

kanji

Kanji data package for Python
Python
1
star
38

visidata-conll

CoNLL-U data loader for VisiData.
Python
1
star
39

jfmt.lua

Tool for wrapping Japanese text to natural width
Lua
1
star
40

searchy

[discontinued] Simple interactive search for Node
LiveScript
1
star