Controllable Multi-Interest Framework for Recommendation
Original implementation for paper Controllable Multi-Interest Framework for Recommendation.
Yukuo Cen, Jianwei Zhang, Xu Zou, Chang Zhou, Hongxia Yang, Jie Tang
Accepted to KDD 2020 ADS Track!
Prerequisites
- Python 3
- TensorFlow-GPU >= 1.8 (< 2.0)
- Faiss-GPU
Getting Started
Installation
-
Install TensorFlow-GPU 1.x
-
Install Faiss-GPU based on the instructions here: https://github.com/facebookresearch/faiss/blob/master/INSTALL.md
-
Clone this repo
git clone https://github.com/THUDM/ComiRec
.
Dataset
-
Original links of datasets are:
-
Two preprocessed datasets can be downloaded through:
-
You can also download the original datasets and preprocess them by yourself. You can run
python preprocess/data.py {dataset_name}
andpython preprocess/category.py {dataset_name}
to preprocess the datasets.
Training
Training on the existing datasets
You can use python src/train.py --dataset {dataset_name} --model_type {model_name}
to train a specific model on a dataset. Other hyperparameters can be found in the code. (If you share the server with others or you want to use the specific GPU(s), you may need to set CUDA_VISIBLE_DEVICES
.)
For example, you can use python src/train.py --dataset book --model_type ComiRec-SA
to train ComiRec-SA model on Book dataset.
When training a ComiRec-DR model, you should set --learning_rate 0.005
.
Training on your own datasets
If you want to train models on your own dataset, you should prepare the following three(or four) files:
- train/valid/test file: Each line represents an interaction, which contains three numbers
<user_id>,<item_id>,<time_stamp>
. - category file (optional): Each line contains two numbers
<item_id>,<cate_id>
used for computing diversity..
Common Issues
The computation of NDCG score.
I'm so sorry that the computation of NDCG score in the original version (now in `paper` branch) is not consistent with the definition in the paper, as mentioned in the issue #6. I have updated the computation of NDCG score in the `master` branch according to the correct definition. For reproducing the NDCG scores reported in the paper, please use the `paper` branch. By the way, I personally recommend to use the reported results of recall and hit rate only.
If you have ANY difficulties to get things working in the above steps, feel free to open an issue. You can expect a reply within 24 hours.
Acknowledgement
The structure of our code is based on MIMN.
Cite
Please cite our paper if you find this code useful for your research:
@inproceedings{cen2020controllable,
title = {Controllable Multi-Interest Framework for Recommendation},
author = {Cen, Yukuo and Zhang, Jianwei and Zou, Xu and Zhou, Chang and Yang, Hongxia and Tang, Jie},
booktitle = {Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining},
year = {2020},
pages = {2942โ2951},
publisher = {ACM},
}