• Stars
    star
    3
  • Rank 3,863,294 (Top 78 %)
  • Language
  • License
    Other
  • Created about 2 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

An extension of the BSD corpus with audio and speaker attribute information

More Repositories

1

jumanpp

Juman++ (a Morphological Analyzer Toolkit)
C++
355
star
2

kwja

An integrated Japanese analyzer based on foundation models
Python
112
star
3

pyknp

A Python Module for JUMAN++/KNP
Python
86
star
4

KWDLC

Kyoto University Web Document Leads Corpus
Python
72
star
5

KyotoCorpus

Kyoto University Text Corpus
Perl
53
star
6

bert-based-faqir

Python
47
star
7

ja-vicuna-qa-benchmark

Python
28
star
8

rhoknp

Yet another Python binding for Juman++/KNP/KWJA
Python
26
star
9

knp

A Japanese Parser
C
26
star
10

JMRD

Japanese Movie Recommendation Dialogue dataset
25
star
11

steganography-with-masked-lm

Implementation of "Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model"
Python
24
star
12

bertknp

A Japanese dependency parser based on BERT
Python
20
star
13

AnnotatedFKCCorpus

Annotated Fuman Kaitori Center Corpus
Python
17
star
14

text-cleaning

A powerful text cleaner for Japanese web texts
Python
12
star
15

WikipediaAnnotatedCorpus

Python
11
star
16

kyoto-reader

A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus
Python
10
star
17

pyknp-eventgraph

Python
9
star
18

VISA

An ambiguous subtitles dataset for visual scene-aware machine translation
9
star
19

JKUSea

Utilitary tool aligning sentences of texts written in 2 different languages.
Perl
7
star
20

Winograd-Schema-Challenge-Ja

Japanese Translation of Winograd Schema Challenge
Python
6
star
21

juman

C
6
star
22

python-textformatting

Python
6
star
23

KyotoCorpusAnnotationTool

An annotation tool for the Kyoto University Corpus
JavaScript
5
star
24

TSUBAKI

Perl
5
star
25

jumanpp-jumandic

Scripts for training Jumandic Juman++ model
Makefile
5
star
26

WWW2sf

Perl
4
star
27

covost2NativeJa

Corpus for speech-to-text translation in Japanese-English based on CoVoST 2
3
star
28

ChatCollectionFramework

Python
3
star
29

dockerfile-jumanpp-knp

Dockerfile for Juman++, KNP, and KWJA
Dockerfile
3
star
30

ishi

Ishi: A volition classifier for Japanese
Python
2
star
31

video-helpful-MMT

2
star
32

jumandic-grammar

grammar files and related scripts
Python
1
star
33

normtime

Python
1
star
34

JumanDIC

Perl
1
star