Graphalytics

Gabor Szarnyas edited this page May 30, 2021 · 1 revision

Changes

The following table lists the DATAGEN releases and the Graphalytics-related changes in each.

| release | date | changes |
| --- | --- | --- |
| 0.2.1 | 2015-04-20 | scale factors for Graphalytics |
| 0.2.2 | 2015-10-27 | graphalytics.CSVPersonSerializerWithWeights serializer |
| 0.2.3 | 2015-10-27 | new scale factors for Graphalytics and readjusted existing ones |
| 0.2.4 | 2015-11-02 | tunable clustering coefficients |
| 0.2.5 | 2015-11-03 | |
| 0.2.6 | 2016-06-20 | graphalytics.CSVPersonSerializerExtended serializer |
| 0.2.7b | 2017-07-11 | gscale-based generation (initial implementation) |
| 0.2.7 | 2017-10-17 | gscale-based generation (stable implementation) |
| 0.2.8 | 2018-01-18 | |

Sizes of datasets with "Facebook distribution"

The following table lists the number of vertices (V) and edges (E) of Graphalytics graphs at scale factors 30, 100, and 300 that adhere to a "Facebook distribution". These graphs can be generated automatically with the graphalytics-generate-old.sh script.

| release | D30/V | D30/E | D100/V | D100/E | D300/V | D300/E |
| --- | --- | --- | --- | --- | --- | --- |
| 0.2.1 | 650,000 | 30,845,741 | 1,800,000 | 99,689,585 | 4,800,000 | 299,646,264 |
| 0.2.2 | 592,500 | 29,970,158 | 1,670,000 | 100,291,564 | 4,350,000 | 299,446,981 |
| 0.2.3 | 592,500 | 29,970,158 | 1,670,000 | 100,291,564 | 4,350,000 | 299,446,981 |
| 0.2.4 | 592,500 | 30,361,660 | 1,670,000 | 101,716,179 | 4,350,000 | 304,036,710 |
| 0.2.5 | 592,500 | 30,361,660 | 1,670,000 | 101,716,179 | 4,350,000 | 304,036,710 |
| 0.2.6 | 592,500 | 30,338,404 | 1,670,000 | 101,747,778 | 4,350,000 | 304,035,394 |
| 0.2.7b | 593,750 | 30,374,544 | 1,562,500 | 94,415,527 | 4,281,250 | 299,248,729 |
| 0.2.7 | 593,750 | 30,402,030 | 1,562,500 | 94,488,768 | 4,281,250 | 299,298,877 |
| 0.2.8 | 593,750 | 30,402,030 | 1,562,500 | 94,488,768 | 4,281,250 | 299,298,877 |

Additionally, between versions 0.2.6 and 0.2.7, the code contained a bug that caused it to generate an incorrect distribution. The change was introduced together with the "gscale" option at commit c3da811, but the affected code is executable at commits e0a9f63 and, later, 02c322d. For example, with this bug the Graphalytics D30 dataset had 593,750 vertices (the same value as in all versions from 0.2.7), but 30,374,544 edges instead of 30,402,030. This issue was fixed in e54c8b6.
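
To check whether an existing D30 dataset was produced by an affected build, its vertex and edge counts can be compared against the two values quoted above. The following Python snippet is a minimal sketch of that comparison, not part of DATAGEN: the helper name `classify_d30` and its return labels are hypothetical, and it only recognises the two edge counts mentioned in this section (datasets from releases before 0.2.7b have a different vertex count and are reported as "unknown").

```python
# Counts for the Graphalytics D30 dataset, copied from the text above.
D30_VERTICES = 593_750         # identical for the buggy and the fixed output
BUGGY_D30_EDGES = 30_374_544   # edge count produced with the bug
FIXED_D30_EDGES = 30_402_030   # edge count from 0.2.7 on


def classify_d30(vertices: int, edges: int) -> str:
    """Hypothetical helper: label a D30 graph by its vertex/edge counts."""
    if vertices != D30_VERTICES:
        return "unknown"
    if edges == BUGGY_D30_EDGES:
        return "matches-buggy-count"
    if edges == FIXED_D30_EDGES:
        return "matches-fixed-count"
    return "unknown"


# Size of the discrepancy caused by the bug.
missing_edges = FIXED_D30_EDGES - BUGGY_D30_EDGES
relative_gap = missing_edges / FIXED_D30_EDGES

print(missing_edges)                         # 27486
print(f"{relative_gap:.4%}")                 # 0.0904%
print(classify_d30(593_750, 30_374_544))     # matches-buggy-count
```

The gap is small (27,486 edges, roughly 0.09% of the graph), so the vertex count alone cannot distinguish the two outputs; the edge count has to be compared exactly.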

Converter scripts

Scripts to convert the output files to Graphalytics-compatible vertex and edge files are available in the graph-format-conversion directory.
