• Stars
    star
    105
  • Rank 328,196 (Top 7 %)
  • Language
    Python
  • Created about 4 years ago
  • Updated almost 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

PyTorch Implementation of "Towards K-Means-Friendly Spaces: Simultaneous Deep Learning and Clustering," Bo Yang et al., ICML'2017.

DCN: Deep Clustering Network

I found the official implementation of deep clustering network (DCN) is outdated (https://github.com/boyangumn/DCN-New). This repo is a re-implementation of DCN using PyTorch.

Introduction

An interesting work that jointly performs unsupervised dimension reduction and clustering using a neural network autoencoder.

How to run

Here I offer a demo on training DCN on the MNIST dataset (corresponding to Section 5.2.5 in the raw paper). To run this demo, simply type the following command:

python mnist.py

Acknowledgement

For anyone with interests, you can also refer to the implementation of Gรผnther Eder: https://github.com/guenthereder/Deep-Clustering-Network, which has more details on the reproducibility.

Experiment

I trained the DCN model on MNIST dataset, hyper-parameters like network structure were set as values reported in the paper. The left figure presents the reconstruction error of the autoencoder during the pre-training stage, and the right figure presents changes on NMI and ARI (two metrics employed in the paper) during the training stage. The best NMI result I have got is around 0.65.

MNIST Experiment Result

Package dependency

  • scikit-lean==0.23.1
  • pytorch==1.6.0
  • torchvision==0.7.0
  • joblib==0.16.0

In my practice, this implementation also works fine on PyTorch 0.4.1. Feel free to open an issue if there were incompatibility problems.

Reference