Explore @THU-KEG Open Source projects

@THU-KEG

THU-KEG

Stars
3,396
Global Org. Rank 6,332 (Top 3 %)
Registered almost 8 years ago
Most used languages

Python
84.1 %

Jupyter Notebook
9.1 %

Java
4.5 %

Perl
2.3 %
Location 🇨🇳 China
Country Total Rank 1,866
Country Ranking

Perl
75

Jupyter Notebook
308

Python
459

Entity_Alignment_Papers

Must-read papers on entity alignment published in recent years

EvaluationPapers4ChatGPT

Resource, Evaluation and Detection Papers for ChatGPT

Knowledge_Graph_Reasoning_Papers

Must-read papers on knowledge graph reasoning

OmniEvent

A comprehensive, unified and modular event extraction toolkit.

EAkit

Entity Alignment toolkit (EAkit), a lightweight, easy-to-use and highly extensible PyTorch implementation of many entity alignment algorithms.

KEPLER

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

MAVEN-dataset

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

MetaKGR

Source codes and datasets for EMNLP 2019 paper "Adapting Meta Knowledge Graph Information for Multi-Hop Reasoning over Few-Shot Relations"

ChatLog

⏳ ChatLog: Recording and Analysing ChatGPT Across Time

Jupyter Notebook

MOOCCubeX

A large-scale knowledge repository for adaptive learning, learning analytics, and knowledge discovery in MOOCs, hosted by THU KEG.

CLEVE

Source code for ACL 2021 paper "CLEVE: Contrastive Pre-training for Event Extraction"

KoPL

Knowledge Oriented Programming Language

MAVEN-ERE

Source code and dataset for EMNLP 2022 paper "MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction".

KoLA

[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.

Jupyter Notebook

DacKGR

Source codes and datasets for EMNLP 2020 paper "Dynamic Anticipation and Completion for Multi-Hop Reasoning over Sparse Knowledge Graph"

Jupyter Notebook

PKGC

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach

EDUKG

EDUKG: a Heterogeneous Sustainable K-12 Educational Knowledge Graph

KECG

Source code and datasets for EMNLP 2019 paper "Semi-supervised Entity Alignment via Joint Knowledge Embedding Model and Cross-graph Model".

ADELIE

Aligning Large Language Models on Information Extraction

MOOC-Radar

The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs"

CCL2022_Storyline_Relationship_Classification

CCL2022 新闻脉络关系识别

TWAG

Code and dataset for the ACL 2021 paper "TWAG: A Topic-guided Wikipedia Abstract Generator"

COPEN

The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".

BIMR

Datasets and source codes for paper "Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability"

WaterBench

[ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks

Skill-Neuron

Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".

SeaKR

Awesome_MOOCs

This is a repo listing some must-read papers on *AI-driven MOOCs* or *Intelligent Education* published in recent years, mainly contributed by the MOOC team members at Knowledge Engineering Group ([KEG](http://keg.cs.tsinghua.edu.cn/)) of Tsinghua University.

ProgramTransfer

Official code and data of the ACL 2022 paper "Program Transfer for Complex Question Answering over Knowledge Bases"

Entity-Linking-Trends-and-History

Papers about the trend of Entity Linking in recent years.

ProbTree

Source code for EMNLP 2023 paper "Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions".

Xlore2.0

Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]

MAVEN-Argument

Completing the Puzzle of All-in-One Event Understanding Benchmark with Event Arguments

goal

UPER

Code for the COLING22 paper "UPER: Boosting Multi-Document Summarization with an Unsupervised Prompt-based Extractor"

Event-Level-Knowledge-Editing

KoRC

Baseline for KoRC

MOOC-NER

The code and dataset of ACL'23 paper "Distantly Supervised Course Concept Extraction in MOOCs with Academic Discipline"

KB-Plugin

This is the accompanying code & data for the paper "KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases".

HIF-KAT

DICE

DICE: Detecting In-distribution Data Contamination with LLM's Internal State

Awesome-KBQA

ICLEA

Code and datasets for ICLEA: Interactive Contrastive Learning for Self-supervised Entity Alignment

ijcai13data

ijcai13-dataset-content-alignment

WikiExtrator

extractor for wikipedia dump files

CStory

Data resource of CStory

ARTE

MAVEN-FACT

ClinicNER

ClinicNER experiments

R-Eval

[KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models

VTA

Code, APIs and data for the CIKM23 paper "LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain-of-Teach Prompts"

ConstGCN

XAlias

XAlias: An Unsupervised Bilingual Entity Alias Discovery System with Multiple Sources

IR4KGC

NGS

Source code for AACL-IJCNLP 2020 paper "Neural Gibbs Sampling for Joint Event Argument Extraction".

SQC-Score

LLMAEL

LLM-Augmented Entity Linking

LLM_Reasoning_Papers

Papers on LLM Reasoning and Retrieval-Augmented LLM Reasoning

SafetyNeuron

Data and code for the paper: Finding Safety Neurons in Large Language Models

Jupyter Notebook

KNOT