داده های وب : Reddit Pizza Requests
فایل ها
عنوان | حجم |
pizza_request_dataset.tar.gz | 2.6 MB |
منبع (استناد)
- Tim Althoff, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky. How to Ask for a Favor: A Case Study on the Success of Altruistic Requests. ICWSM, 2014.
آمارهای دیتاست
Number of requests |
5,671 |
Timespan |
December 8, 2010 - September 29, 2013 |
Average success rate |
24.6% |
داده های وب:Reddit submissions
فایل ها
عنوان | حجم |
redditHtmlData.tar.gz | 1.8 GB |
redditSubmissions.csv.gz | 7.3 MB |
منبع(استناد)
- Lakkaraju, J. J. McAuley, J. LeskovecWhat's in a name? Understanding the interplay between titles, content, and communities in social media. ICWSM, 2013.
آمارهای دیتاست
Number of submissions |
132,308 |
Number of unique images |
16,736 |
Average number of times an image is resubmitted |
7.9 |
Timespan |
July 2008 - Jan 2013 |
روابط تصویر flickr
فایل ها
عنوان | حجم |
edgeFeaturesFlickr.tar.gz | 215 MB |
flickrEdges.txt.gz | 21 MB |
flickrXml.tar.gz | 1.2 GB |
nodeFeaturesFlickr.tar.gz | 35 MB |
منبع(استناد)
- McAuley and J. Leskovec. Image Labeling on a Network: Using Social-Network Metadata for Image Classification. ECCV, 2012.
آمارهای دیتاست
Nodes |
105938 |
Edges |
2316948 |
Nodes in largest WCC |
105722 (0.998) |
Edges in largest WCC |
2316668 (1.000) |
Nodes in largest SCC |
105722 (0.998) |
Edges in largest SCC |
2316668 (1.000) |
Average clustering coefficient |
0.0891 |
Number of triangles |
107987357 |
Fraction of closed triangles |
0.1828 |
Diameter (longest shortest path) |
9 |
90-percentile effective diameter |
4.8 |