Language Media Processing Lab, Kyoto University (@ku-nlp)

Top repositories

1. jumanpp (C++, 355 stars): Juman++ (a Morphological Analyzer Toolkit)
2. kwja (Python, 112 stars): An integrated Japanese analyzer based on foundation models
3. pyknp (Python, 86 stars): A Python Module for JUMAN++/KNP
4. KWDLC (Python, 72 stars): Kyoto University Web Document Leads Corpus
5. KyotoCorpus (Perl, 53 stars): Kyoto University Text Corpus
6. bert-based-faqir (Python, 47 stars)
7. ja-vicuna-qa-benchmark (Python, 28 stars)
8. rhoknp (Python, 26 stars): Yet another Python binding for Juman++/KNP/KWJA
9. knp (C, 26 stars): A Japanese Parser
10. JMRD (25 stars): Japanese Movie Recommendation Dialogue dataset
11. steganography-with-masked-lm (Python, 24 stars): Implementation of "Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model"
12. bertknp (Python, 20 stars): A Japanese dependency parser based on BERT
13. AnnotatedFKCCorpus (Python, 17 stars): Annotated Fuman Kaitori Center Corpus
14. text-cleaning (Python, 12 stars): A powerful text cleaner for Japanese web texts
15. WikipediaAnnotatedCorpus (Python, 11 stars)
16. kyoto-reader (Python, 10 stars): A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus
17. pyknp-eventgraph (Python, 9 stars)
18. VISA (9 stars): An ambiguous subtitles dataset for visual scene-aware machine translation
19. JKUSea (Perl, 7 stars): Utility tool for aligning sentences of texts written in two different languages
20. Winograd-Schema-Challenge-Ja (Python, 6 stars): Japanese Translation of Winograd Schema Challenge
21. juman (C, 6 stars)
22. python-textformatting (Python, 6 stars)
23. KyotoCorpusAnnotationTool (JavaScript, 5 stars): An annotation tool for the Kyoto University Corpus
24. TSUBAKI (Perl, 5 stars)
25. jumanpp-jumandic (Makefile, 5 stars): Scripts for training the Jumandic Juman++ model
26. WWW2sf (Perl, 4 stars)
27. covost2NativeJa (3 stars): Corpus for speech-to-text translation in Japanese-English based on CoVoST 2
28. ChatCollectionFramework (Python, 3 stars)
29. speechBSD (3 stars): An extension of the BSD corpus with audio and speaker attribute information
30. dockerfile-jumanpp-knp (Dockerfile, 3 stars): Dockerfile for Juman++, KNP, and KWJA
31. ishi (Python, 2 stars): Ishi: A volition classifier for Japanese
32. video-helpful-MMT (2 stars)
33. jumandic-grammar (Python, 1 star): Grammar files and related scripts
34. normtime (Python, 1 star)
35. JumanDIC (Perl, 1 star)