pyg-team/pytorch_geometric

Increment of available per-dataset information

Open

#7848 opened on Aug 4, 2023

View on GitHub
 (3 comments) (0 reactions) (0 assignees)Python (19,985 stars) (3,514 forks)batch import
1 - Priority P1datasetdocumentationfeaturehelp wanted

Description

🚀 The feature, motivation and pitch

This is a request that is related to the amount of information available per dataset in PyG. I have noticed that some dataset papers have started to increase the amount of information available, and include several graph properties which might be interesting for researchers that seek to benchmark algorithms. One example of such paper is "A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?", where they include properties such as graph diameter, average degree... image I believe that adding this or similar information can be helpful and adding it bit by bit to the dataset cheatsheet could be good.

Alternatives

Another version of this, could be the the properties from Network Repository, which slightly differ from the ones used in the previous paper. image

Additional context

I found myself developing a layer that worked better under specific graph properties, and I believe people can benefit from this information being more readily accesible

Contributor guide