Paper of the source codes released:
Jing Ma, Wei Gao, Kam-Fai Wong. Rumor Detection on Twitter with Tree-structured Recursive Neural Networks. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018.
Datasets:
The datasets used in the experiments were based on the two publicly available Twitter datasets released by Ma et al. (2017):
Jing Ma, Wei Gao, Kam-Fai Wong. Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning. ACL 2017.
In the 'resource' folder we provide the pre-processed data files used for our experiments. The raw datasets can be downloaded from https://www.dropbox.com/s/7ewzdrbelpmrnxu/rumdetect2017.zip?dl=0. For details about the datasets please contact Jing at: majing at se dot cuhk dot edu dot hk.
The datafile is in a tab-sepreted column format, where each row corresponds to a tweet. Consecutive columns correspond to the following pieces of information:
1: root-id -- an unique identifier describing the tree (tweetid of the root);
2: index-of-parent-tweet -- an index number of the parent tweet for the current tweet;
3: index-of-the-current-tweet -- an index number of the current tweet;
4: parent-number -- the total number of the parent node in the tree that the current tweet is belong to;
5: text-length -- the maximum length of all the texts from the tree that the current tweet is belong to;
6: list-of-index-and-counts -- the rest of the line contains space separated index-count pairs, where a index-count pair is in format of "index:count", E.g., "index1:count1 index2:count2" (extracted from the "text" field in the json format from Twitter)
Dependencies:
Please install the following python libraries:
numpy version 1.11.2
theano version 0.8.2
Reproduce the experimental results
Run script "model/Main_BU_RvNN.py" for bottom-up recursive model or "model/Main_TD_RvNN.py" for up-down recursive model.
Alternatively, you can change the "obj" parameter and "fold" parameter to set the dataset and each fold.
#If you find this code useful, please let us know and cite our paper.