Dataset Statistics
ACM-1
(Source: https://github.com/zyz282994112/GraphInception/tree/master/data)
Entity |
#Entity |
Paper |
12.5k |
Author |
/ |
Conf |
/ |
Term (paper feature) |
300 |
Index(paper label) |
11 |
ACM-2
(Source: https://github.com/Jhy1993/HAN)
Entity |
#Entity |
Paper |
3025 |
Author |
5835 |
Subject |
56 |
Term (paper feature) |
1830 |
Research area(paper label) |
3 |
ACM-3
Entity |
#Entity |
Paper |
12k |
Author |
17k |
Af๏ฌliations |
1.8k |
Term |
1.5k |
Subjects |
73 |
MovieLens
(Containing rating and timestamp information)
(Note: We utilize the Pearson's coefficient to measure the similiarities in the KNN algorithm)
(Source : https://grouplens.org/datasets/movielens/)
Entity |
#Entity |
User |
943 |
Age |
8 |
Occupation |
21 |
Movie |
1,682 |
Genre |
18 |
Relation Statistics
Relation |
#Relation |
User - Movie |
100,000 |
User - User (KNN) |
47,150 |
User - Age |
943 |
User - Occupation |
943 |
Movie - Movie (KNN) |
82,798 |
Movie - Genre |
2,861 |
Douban Movie
(Containing rating information)
Entity Statistics
Entity |
#Entity |
User |
13,367 |
Movie |
12,677 |
Group |
2,753 |
Actor |
6,311 |
Director |
2,449 |
Type |
38 |
Relation Statistics
Relation |
#Relation |
User - Movie |
1,068,278 |
User - Group |
570,047 |
User - User |
4,085 |
Movie - Actor |
33,587 |
Movie - Director |
11,276 |
Movie - Type |
27,668 |
Douban Book
(Containing rating information)
Entity Statistics
Entity |
#Entity |
User |
13,024 |
Book |
22,347 |
Group |
2,936 |
Location |
38 |
Author |
10,805 |
Publisher |
1,815 |
Year |
64 |
Relation Statistics
Relation |
#Relation |
User - Book |
792,062 |
User - Group |
1,189,271 |
User - User |
169,150 |
User - Location |
10,592 |
Book - Author |
21,907 |
Book - Publisher |
21,773 |
Book - Year |
21,192 |
Amazon
(Containing rating and timestamp information)
(Source : http://jmcauley.ucsd.edu/data/amazon/)
Entity Statistics
Entity |
#Entity |
User |
6,170 |
Item |
2,753 |
View |
3,857 |
Category |
22 |
Brand |
334 |
Relation Statistics
Relation |
#Relation |
User - Item |
195,791 |
Item - View |
5,694 |
Item - Category |
5,508 |
Item - Brand |
2,753 |
LastFM
(Note: We utilize the Pearson's coefficient to measure the similiarities in the KNN algorithm)
(Source : https://grouplens.org/datasets/hetrec-2011/)
Entity Statistics
Entity |
#Entity |
User |
1,892 |
Artist |
17,632 |
Tag |
11,945 |
Relation Statistics
Relation |
#Relation |
User - Artist |
92834 |
User - User (Original) |
25,434 |
User - User (KNN) |
18,802 |
Artist - Artist (KNN) |
153,399 |
Artist - Tag |
184,941 |
Yelp
(Containing rating information)
Entity Statistics
Entity |
#Entity |
User |
16,239 |
Business |
14,284 |
Compliment |
11 |
Category |
47 |
City |
511 |
Relation Statistics
Relation |
#Relation |
User - Business |
198,397 |
User - User |
158,590 |
User - Compliment |
76,875 |
Business - City |
14,267 |
Business - Category |
40,009 |
Yelp-2
(Containing rating information)
Entity Statistics
Entity |
#Entity |
User |
1,286 |
Business |
2,614 |
Service |
2 |
Star level |
9 |
Reservation |
2 |
Category |
3 |
Relation Statistics
Relation |
#Relation |
User - Business |
30,838 |
Bussiness - Service |
2,614 |
Bussiness - Star level |
2,614 |
Business - Revervation |
2,614 |
Business - Category |
2,614 |
DBLP-1
(Note: author_map_id.dat map the author id to the unique id)
Entity Statistics
Entity |
#Entity |
Author |
14,475 |
Paper |
14,376 |
Author_label |
4 |
Conference |
20 |
Type |
8,920 |
Relation Statistics
Relation |
#Relation |
Author - Label |
4,057 |
Paper - Author |
41,794 |
Paper - Conference |
14,376 |
Paper - Type |
114,624 |
DBLP-2
(Source: https://github.com/Jhy1993/HAN)
Entity |
#Entity |
Paper |
14328 |
Author |
4057 |
Conf |
20 |
Term |
8789 |
Profile(author feature) |
334 |
Research area(author label) |
4 |
Aminer
(Note: author_map_id.dat map the author id to the unique id)
Entity Statistics
Entity |
#Entity |
Author |
164,472 |
Paper |
127,623 |
Papel_label |
10 |
Conference |
101 |
Reference |
147,251 |
Relation Statistics
Relation |
#Relation |
Paper - Label |
127,623 |
Paper - Author |
355,072 |
Paper - Conference |
127,632 |
Paper - Reference |
392,519 |
IMDB
(Source: https://github.com/zyz282994112/GraphInception/tree/master/data)
้พๆฅ:https://pan.baidu.com/s/1pRGfoGrOsOKs-x6o5KgHmg ๅฏ็ :o0ap
Entity |
#Entity |
Movie |
14475 |
Actress |
/ |
Actor |
/ |
Director |
/ |
Plot(movie feature) |
1000 |
Genre(movie label) |
9 |
SLAP
(Source: https://github.com/zyz282994112/GraphInception/tree/master/data)
้พๆฅ:https://pan.baidu.com/s/1Vv6823BaAd2wRPpQHDEWUg ๅฏ็ :dt5p
Entity |
#Entity |
Gene |
20419 |
Ontology(gene feature) |
3000 |
Tissue |
/ |
Pathway |
/ |
Diease |
/ |
Chemical Compound |
/ |
Family(gene label) |
15 |
This repository is based on https://github.com/librahu/HIN-Datasets-for-Recommendation-and-Network-Embedding. Thanks to librahu.