Discovering New Intents with Deep Aligned Clustering (AAAI 2021)

A deep aligned clustering method to discover new intents.

The proposed method and the baselines are also integrated into the open intent discovery module of our other scalable framework, TEXTOIR.

Introduction

This repository provides the official PyTorch implementation of the research paper Discovering New Intents with Deep Aligned Clustering (accepted at AAAI 2021).

For related work, see the reading list.

Dependencies

We use Anaconda to create the Python environment (replace <env_name> with a name of your choice):

conda create --name <env_name> python=3.6

Install all required libraries:

pip install -r requirements.txt

Model Preparation

Get the pre-trained BERT model and convert it into PyTorch format.

Set the path of the uncased BERT model (parameter "bert_model" in init_parameter.py).

Usage

Run the experiments by:

sh scripts/run.sh

You can change the parameters in the script. The selected parameters are as follows:

dataset: clinc | banking
factor_of_clusters: 1 (default) | 2 | 3 | 4 
known_class_ratio: 0.25 | 0.5 | 0.75 (default)
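
As a rough illustration of what the last two parameters control, the sketch below picks a subset of intents as "known" according to known_class_ratio and derives an initial cluster count from factor_of_clusters. The helper name split_intents, the toy label set, and the exact rounding and scaling rules are assumptions made for this example; they are not taken from scripts/run.sh or the repository code.

```python
import random


def split_intents(all_intents, known_class_ratio, factor_of_clusters, seed=0):
    """Hypothetical helper: choose the known intents and an initial cluster count."""
    rng = random.Random(seed)
    # Assumption: the number of known intents is the rounded fraction of all intents.
    n_known = round(len(all_intents) * known_class_ratio)
    known = sorted(rng.sample(all_intents, n_known))
    known_set = set(known)
    unknown = [name for name in all_intents if name not in known_set]
    # Assumption: the initial number of clusters is the true number of
    # intents scaled by factor_of_clusters.
    num_clusters = len(all_intents) * factor_of_clusters
    return known, unknown, num_clusters


# Toy label set, not one of the real datasets.
intents = [f"intent_{i:02d}" for i in range(20)]
known, unknown, k = split_intents(intents, known_class_ratio=0.75, factor_of_clusters=2)
print(len(known), len(unknown), k)  # prints "15 5 40"
```

With factor_of_clusters set to 1, the initial cluster count in this sketch equals the number of intents in the label set; larger factors start from more clusters than there are true intents.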

Model

The model architecture of DeepAligned: [figure: Model]

Results

The detailed results can be seen in known_intent_ratio_results.csv and k_results.csv.

Main experiments

Method        CLINC                  BANKING
              NMI    ARI    ACC      NMI    ARI    ACC
KM            70.89  26.86  45.06    54.57  12.18  29.55
AG            73.07  27.70  44.03    57.07  13.31  31.58
SAE-KM        73.13  29.95  46.75    63.79  22.85  38.92
DEC           74.83  27.46  46.89    67.78  27.21  41.29
DCN           75.66  31.15  49.29    67.54  26.81  41.99
DAC           78.40  40.49  55.94    47.35  14.24  27.41
DeepCluster   65.58  19.11  35.70    41.77   8.95  20.69
PCK-means     68.70  35.40  54.61    48.22  16.24  32.66
BERT-KCL      86.82  58.79  68.86    75.21  46.72  60.15
BERT-MCL      87.72  59.92  69.66    75.68  47.43  61.14
CDAC+         86.65  54.33  69.89    72.25  40.97  53.83
BERT-DTC      90.54  65.02  74.15    76.55  44.70  56.51
DeepAligned   93.89  79.75  86.49    79.56  53.64  64.90

Ablation study

Method             CLINC                  BANKING
                   NMI    ARI    ACC      NMI    ARI    ACC
w/o Pre + Reinit   57.80   9.63  23.02    34.34   4.49  13.67
w/o Pre + Align    62.53  14.10  28.63    36.91   5.23  15.42
Pre + Reinit       82.90  45.67  55.80    68.12  31.56  41.32
Pre + Align        93.89  79.75  86.49    79.56  53.64  64.90
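
Cluster ids produced by a clustering method are arbitrary, so the ACC column in the tables above only makes sense once predicted clusters have been mapped to ground-truth intents. The sketch below shows one common definition of clustering accuracy: the accuracy under the best one-to-one mapping from cluster ids to labels. It searches all mappings by brute force, which is practical only for a handful of clusters; real evaluations typically solve the same assignment with the Hungarian algorithm (for example scipy.optimize.linear_sum_assignment). The function name and toy data are made up for this example, and whether the repository computes ACC in exactly this way is an assumption.

```python
from itertools import permutations


def clustering_accuracy(y_true, y_pred):
    """Accuracy under the best one-to-one mapping from cluster ids to labels.

    Brute force over permutations, so only suitable for a few clusters.
    This sketch does not handle more clusters than labels.
    """
    labels = sorted(set(y_true))
    clusters = sorted(set(y_pred))
    if len(clusters) > len(labels):
        raise ValueError("more clusters than labels is not handled in this sketch")
    best = 0
    # Try every way of assigning a distinct label to each cluster id.
    for perm in permutations(labels, len(clusters)):
        mapping = dict(zip(clusters, perm))
        correct = sum(mapping[p] == t for t, p in zip(y_true, y_pred))
        best = max(best, correct)
    return best / len(y_true)


# Toy example: cluster ids are a relabeling of the true intents, with one error.
y_true = [0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [2, 2, 1, 0, 0, 1, 1, 1]
print(clustering_accuracy(y_true, y_pred))  # prints 0.875
```

In the toy example, the best mapping sends cluster 2 to label 0, cluster 0 to label 1, and cluster 1 to label 2, which leaves a single misassigned sample out of eight.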

If you are interested in this work and want to use the code or results in this repository, please star this repository and cite:

@article{Zhang_Xu_Lin_Lyu_2021, 
    title={Discovering New Intents with Deep Aligned Clustering}, 
    volume={35}, 
    number={16}, 
    journal={Proceedings of the AAAI Conference on Artificial Intelligence}, 
    author={Zhang, Hanlei and Xu, Hua and Lin, Ting-En and Lyu, Rui}, 
    year={2021}, 
    month={May}, 
    pages={14365-14373}
}

Acknowledgments

This work is funded by the seed fund of the Tsinghua University (Department of Computer Science and Technology)-Siemens Ltd., China Joint Research Center for Industrial Intelligence and Internet of Things.
