There are no reviews yet. Be the first to send feedback to the community and the maintainers!
ginza
A Japanese NLP Library using spaCy as framework based on Universal DependenciesHappyDB
A corpus of 100,000 happy momentsditto
Code for the paper "Deep Entity Matching with Pre-trained Language Models"bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)sato
Code and data for Sato https://arxiv.org/abs/1911.06311.jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)opiniondigest
OpinionDigest: A Simple Framework for Opinion Summarization (ACL 2020)vecscan
SubjQA
A question-answering dataset with a focus on subjective informationt5-japanese
Codes to pre-train Japanese T5 modelsruler
Data Programming by Demonstration (DPBD) for Document Classificationtagruler
Data programming by demonstration for information extraction and span annotationcoop
☘️ Code for Convex Aggregation for Opinion Summarization (Iso et al; Findings of EMNLP 2021)doduo
Annotating Columns with Pre-trained Language Modelsasdc
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)instruction_ja
Japanese instruction data (日本語指示データ)rotom
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond"cocosum
🥥 Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)ebe-dataset
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)ginza-transformers
Use custom tokenizers in spacy-transformersteddy
Code and data for Teddy https://arxiv.org/abs/2001.05171.zett
🙈 Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023)machamp
The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021starmie
Resources for PVLDB 2023 submissionsudowoodo
The source code of the Sudowoodo paper in ICDE 2023explainit
desuwa
Feature annotator to morphemes and phrases based on KNP rule files (pure-Python)react-jupyter-cookiecutter
xatu
🕊️ Code and Data for XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates (Zhang et al; LREC-COLING 2024)magneton
Repository of the Magneton framework for authoring interaction-aware and customizable widgets.emu
Enhancing Multilingual Sentence Embeddings with Semantic Specialization (AAAI '20)learnit
A Tool for Machine Learning Beginnersleam
Source code and demo for Leamminun
Evaluating Counterfactual Explanations for Entity Matchingllm-longeval
💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu, Iso et al; EACL 2024)jrte-corpus_example
Example codes for Japanese Realistic Textual Entailment CorpusTyrogue
qa-summarization
Ting-Yao's intern projectpilota
✈ SCUD generator (解釈文生成器)quasi_japanese_reviews
Quasi Japanese Reviews (擬似レビューデータ)MCR
witqa
Love Open Source and this site? Check out how you can help us