منبع (استناد)
- Yang, J. Leskovec. Temporal Variation in Online Media. ACM International Conference on Web Search and Data Mining (WSDM '11), 2011.
داده ها به درخواست توئیتر حذف شد
آمارهای دیتاست
Number of users |
17,069,982 |
Number of tweets |
476,553,560 |
Number of URLs |
181,611,080 |
Number of Hashtags |
49,293,684 |
Number of re-tweets |
71,835,017 |
96 میلیون الگوی رفتاری از Memetracker
فایل ها
عنوان | حجم |
quotes_2008-08.txt.gz | 1 GB |
quotes_2008-09.txt.gz | 1.3 GB |
quotes_2008-10.txt.gz | 1.3 GB |
quotes_2008-11.txt.gz | 1.1 GB |
quotes_2008-12.txt.gz | 1.1 GB |
quotes_2009-01.txt.gz | 1.2 GB |
quotes_2009-02.txt.gz | 1.7 GB |
quotes_2009-03.txt.gz | 2 GB |
quotes_2009-04.txt.gz | 2.5 GB |
منبع(استناد)
- Leskovec, K. Lang, A. Dasgupta, M. Mahoney. Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters. Internet Mathematics 6(1) 29--123, 2009.
- Google programming contest, 2002
آمارهای دیتاست
Nodes |
875713 |
Edges |
5105039 |
Nodes in largest WCC |
855802 (0.977) |
Edges in largest WCC |
5066842 (0.993) |
Nodes in largest SCC |
434818 (0.497) |
Edges in largest SCC |
3419124 (0.670) |
Average clustering coefficient |
0.5143 |
Number of triangles |
13391903 |
Fraction of closed triangles |
0.01911 |
Diameter (longest shortest path) |
21 |
90-percentile effective diameter |
8.1 |
Volume Time Series of Memetracker Phrases and Twitter Hashtags
فایل ها
عنوان | حجم |
MemePhr.txt | 1.2 MB |
TwtHtag.txt | 1.2 MB |
منبع(استناد)
- Yang, J. Leskovec. Temporal Variation in Online Media. ACM International Conference on Web Search and Data Mining (WSDM '11), 2011.
آمارهای دیتاست
Number of time series |
1,000 |
Length of time series |
128 |
Time unit |
1 hour |
مجموعه داده Higgs Twitter
Reply Network statistics |
|
Nodes |
38918 |
Edges |
32523 |
Nodes in largest WCC |
12839 (0.330) |
Edges in largest WCC |
14944 (0.459) |
Nodes in largest SCC |
322 (0.008) |
Edges in largest SCC |
708 (0.022) |
Average clustering coefficient |
0.0058 |
Number of triangles |
244 |
Fraction of closed triangles |
0.0001561 |
Diameter (longest shortest path) |
29 |
90-percentile effective diameter |
10 |
Mention Network statistics |
|
Nodes |
116408 |
Edges |
150818 |
Nodes in largest WCC |
91606 (0.787) |
Edges in largest WCC |
132068 (0.876) |
Nodes in largest SCC |
1801 (0.015) |
Edges in largest SCC |
7069 (0.047) |
Average clustering coefficient |
0.0825 |
Number of triangles |
23068 |
Fraction of closed triangles |
0.0002417 |
Diameter (longest shortest path) |
18 |
90-percentile effective diameter |
6.5 |
منبع(استناد)
- Leskovec, K. Lang, A. Dasgupta, M. Mahoney. Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters. Internet Mathematics 6(1) 29--123, 2009.
آمارهای دیتاست
آمارهای شبکه اجتماعی |
|
Nodes |
456626 |
Edges |
14855842 |
Nodes in largest WCC |
456290 (0.999) |
Edges in largest WCC |
14855466 (1.000) |
Nodes in largest SCC |
360210 (0.789) |
Edges in largest SCC |
14102605 (0.949) |
Average clustering coefficient |
0.1887 |
Number of triangles |
83023401 |
Fraction of closed triangles |
0.002901 |
Diameter (longest shortest path) |
9 |
90-percentile effective diameter |
3.7 |
آمارهای شبکه Retweet |
|
Nodes |
256491 |
Edges |
328132 |
Nodes in largest WCC |
223833 (0.873) |
Edges in largest WCC |
308596 (0.940) |
Nodes in largest SCC |
984 (0.004) |
Edges in largest SCC |
3850 (0.012) |
Average clustering coefficient |
0.0156 |
Number of triangles |
21172 |
Fraction of closed triangles |
0.0001085 |
Diameter (longest shortest path) |
19 |
90-percentile effective diameter |
6.8 |
فایل ها
عنوان | حجم |
higgs-activity_time.txt.gz | 4 MB |
higgs-mention_network.edgelist.gz | 865 KB |
higgs-reply_network.edgelist.gz | 198 KB |
higgs-retweet_network.edgelist.gz | 1.9 MB |
higgs-social_network.edgelist.gz | 52 MB |