Increment of available per-dataset information
#7848 opened on Aug 4, 2023
Description
🚀 The feature, motivation and pitch
This is a request that is related to the amount of information available per dataset in PyG.
I have noticed that some dataset papers have started to increase the amount of information available, and include several graph properties which might be interesting for researchers that seek to benchmark algorithms.
One example of such paper is "A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?", where they include properties such as graph diameter, average degree...
I believe that adding this or similar information can be helpful and adding it bit by bit to the dataset cheatsheet could be good.
Alternatives
Another version of this, could be the the properties from Network Repository, which slightly differ from the ones used in the previous paper.
Additional context
I found myself developing a layer that worked better under specific graph properties, and I believe people can benefit from this information being more readily accesible