• Stars
    4
  • Rank 3,289,661 (Top 66 %)
  • Language
    Java
  • License
    Apache License 2.0
  • Created over 7 years ago
  • Updated almost 3 years ago

Repository Details

Japanese text normalizer for mecab-neologd

More Repositories

1

jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
Python
289
star
2

neologdn

Japanese text normalizer for mecab-neologd
Cython
265
star
3

dataset-list

Lists of text corpora and more (mainly Japanese)
116
star
4

pymlask

Emotion analyzer for Japanese text
Python
111
star
5

oseti

Dictionary based Sentiment Analysis for Japanese
Python
90
star
6

misc

Machine Learning / Randomized Algorithm and more
Jupyter Notebook
35
star
7

mozcpy

Mozc for Python: Kana-Kanji converter
Python
34
star
8

flati

Flatten nested iterable object for Python (Pure-Python implementation)
Python
28
star
9

madoka-python

Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)
C++
25
star
10

oll-python

Online machine learning algorithms (based on OLL C++ library)
C++
22
star
11

shellinford-python

Wavelet Matrix/Tree succinct data structure for full text search (based on shellinford C++ library)
C++
22
star
12

rakutenma-python

Rakuten MA (Python version)
Python
21
star
13

sengiri

Yet another sentence-level tokenizer for Japanese text
Python
21
star
14

python-tr

A Pure-Python implementation of the tr algorithm
Python
14
star
15

asa-python

Japanese Argument Structure Analyzer (ASA) client for Python
Python
11
star
16

mecab-as-kkc

Converting Mozc dictionary to MeCab dictionary for Kana-Kanji conversion (KKC)
Python
10
star
17

coding-tips

ใฉๅฟ˜ใ‚Œใ—ใŸใจใใฎใŸใ‚ใฎใƒกใƒข
10
star
18

zunda-python

Zunda: Japanese Enhanced Modality Analyzer client for Python.
Python
10
star
19

jctconv

Renamed from jctconv to jaconv. Please use jaconv instead.
Python
8
star
20

pytypo

English spelling correction
Python
7
star
21

morris_counter

Memory-efficient probabilistic counter (Morris counter)
Python
5
star
22

udon

Renamed from udon to pytypo. Please use pytypo instead.
Python
4
star
23

dotfiles

Shell
3
star
24

csj-eval

For evaluating speech recognition systems using the Corpus of Spontaneous Japanese (CSJ)
Python
3
star
25

kpy

Keitai (Japanese mobile phone) model name extractor for Python
Python
2
star
26

neologd-diff

Writes a diff (added/removed entries) of mecab-ipadic-neologd between two versions
Python
2
star
27

ikegami-yukino.github.io

Profile of Yukino Ikegami
HTML
1
star
28

yascikit-learn

Yet another scikit-learn
Python
1
star
29

mecab-python-windows

C++
1
star
30

notebooks

Jupyter notebook
Jupyter Notebook
1
star