pyg-team/pytorch_geometric

Increment of available per-dataset information

Open

#7848 opened on August 4, 2023

Labels: 1 - Priority P1, dataset, documentation, feature, help wanted

Description

🚀 The feature, motivation and pitch

This request concerns the amount of information available per dataset in PyG. I have noticed that some dataset papers have started to report more information, including several graph properties that may interest researchers who want to benchmark algorithms. One example is the paper "A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?", which includes properties such as graph diameter and average degree. [image] I believe that this or similar information would be helpful, and that it could be added bit by bit to the dataset cheatsheet.
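To make the request concrete, here is a minimal sketch of how two of the properties mentioned above (average degree and diameter) could be computed from a plain edge list. The function name `graph_properties` and the returned keys are hypothetical and not part of PyG. The sketch assumes an undirected graph, and that for a PyG `Data` object the edge pairs could be obtained with something like `data.edge_index.t().tolist()`. The all-pairs BFS is only practical for small graphs.

```python
from collections import deque


def graph_properties(num_nodes, edges):
    """Summary statistics for an undirected graph (hypothetical helper).

    `edges` is an iterable of (u, v) pairs with node ids in [0, num_nodes).
    """
    # Build undirected adjacency sets; self-loops and duplicate edges are ignored.
    adj = [set() for _ in range(num_nodes)]
    for u, v in edges:
        if u != v:
            adj[u].add(v)
            adj[v].add(u)

    num_edges = sum(len(neighbors) for neighbors in adj) // 2
    avg_degree = 2 * num_edges / num_nodes if num_nodes else 0.0

    def bfs(source):
        # Return (largest hop distance from source, number of nodes reached).
        dist = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for neighbor in adj[node]:
                if neighbor not in dist:
                    dist[neighbor] = dist[node] + 1
                    queue.append(neighbor)
        return max(dist.values()), len(dist)

    # Diameter as the largest finite shortest-path length. For a disconnected
    # graph this is the largest diameter over its connected components.
    diameter = 0
    connected = True
    for source in range(num_nodes):
        eccentricity, reached = bfs(source)
        diameter = max(diameter, eccentricity)
        if reached < num_nodes:
            connected = False

    return {
        "num_nodes": num_nodes,
        "num_edges": num_edges,
        "avg_degree": avg_degree,
        "diameter": diameter,
        "connected": connected,
    }


# Usage on a path graph 0 - 1 - 2 - 3.
props = graph_properties(4, [(0, 1), (1, 2), (2, 3)])
print(props["avg_degree"], props["diameter"])  # 1.5 3
```

Values like these could be precomputed once per dataset and listed as extra columns in the cheatsheet, so users would not need to recompute them.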

Alternatives

Another option would be to use the properties reported by Network Repository, which differ slightly from the ones used in the paper above. [image]

Additional context

I found myself developing a layer that worked better under specific graph properties, and I believe people would benefit from this information being more readily accessible.
