• This repository has been archived on 04/May/2020
  • Stars
    star
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    Python
  • Created over 5 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

[abandoned] Work on generating an NER dataset for Japanese

More Repositories

1

fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
C++
384
star
2

cutlet

Japanese to romaji converter in Python
Python
283
star
3

posuto

๐Ÿฃ๐Ÿ“ฎใ€  Japanese postal code data.
Python
201
star
4

unidic-py

Unidic packaged for installation via pip.
Python
74
star
5

ndl-crop

Script for cropping photos from the NDL.
Python
37
star
6

unidic-lite

A small version of UniDic for easy pip installs.
Python
36
star
7

showmemore

SHOW ME MORE OF [-----]
Python
28
star
8

ipadic-py

IPAdic packaged for easy use from Python.
Python
25
star
9

awesome-digital-collections

Publicly accessible digital collections.
19
star
10

palladian-facades

๐Ÿ›๏ธ Palladian Facade Generator for ProcJam2015
LiveScript
19
star
11

multilang-filter

Script for preprocessing multilingual Markdown.
Python
14
star
12

deltos

A magic notepad. ฮด
LiveScript
13
star
13

gamefaces

Public domain headshots
12
star
14

dupdupdraw

Forthish drawing system with random program generation.
JavaScript
11
star
15

node-migemo

Japanese search regex generator
LiveScript
7
star
16

chargen

Random generator taking literature as input.
Python
7
star
17

ja-tokenizer-benchmark

Compare the speed of various Japanese tokenizers in Python.
Python
7
star
18

philtre

Search objects with a familiar syntax.
LiveScript
6
star
19

jumandic-py

JumanDic packaged for use with PyPI.
Python
3
star
20

shesha

Random generator toolkit
JavaScript
3
star
21

awesome-gamedev-jp

ใ‚ฒใƒผใƒ ้–‹็™บใซๅฝน็ซ‹ใคใƒชใƒณใ‚ฏ้›†
3
star
22

bontan.ls

Bontan is a simple scraper primarily intended for articles.
LiveScript
2
star
23

lua-mecab

Lua wrapper for Mecab Japanese morphological analyzer.
C++
2
star
24

fugashi-streamlit-demo

Streamlit demo for fugashi
Python
2
star
25

gutenjuice

Top books from Project Gutenberg, in raw form and extracted.
2
star
26

bookoff-redirect

Deal with BookOff query parameter nonsense.
HTML
2
star
27

fugashi-sagemaker-demo

A basic introduction to using fugashi for Japanese tokenization.
Jupyter Notebook
2
star
28

github-tasks.vim

Github task plugin for vim
Vim Script
2
star
29

mecab-packed

[broken/wip] Bundled mecab & unidic for installing via pip.
Shell
1
star
30

language-disruptor

Randomly replace words in Japanese sentences.
Python
1
star
31

poine-tool

POINE้–ข้€ฃใฎใƒ„ใƒผใƒซ
Python
1
star
32

bontan

Get embed code for a link, using OEmbed as appropriate.
Nim
1
star
33

yuzulabo.works

Yuzu Labo web site
CSS
1
star
34

mecab-manylinux1-wheel-builder

Build manylinux1 wheels with MeCab installed.
Shell
1
star
35

deltos.vim

A vim plugin for use with Deltos.
Vim Script
1
star
36

kanji

Kanji data package for Python
Python
1
star
37

visidata-conll

CoNLL-U data loader for Visidata.
Python
1
star
38

everybayes

Document classification for everyone.
Python
1
star
39

jfmt.lua

Tool for wrapping Japanese text to natural width
Lua
1
star
40

searchy

[discontinued] Simple interactive search for Node
LiveScript
1
star