pyg-team/pytorch_geometric

Increment of available per-dataset information

Open

#7.848 aberto em 4 de ago. de 2023

Ver no GitHub
 (3 comments) (0 reactions) (0 assignees)Python (3.514 forks)batch import
1 - Priority P1datasetdocumentationfeaturehelp wanted

Métricas do repositório

Stars
 (19.985 stars)
Métricas de merge de PR
 (Mesclagem média 16d 3h) (13 fundiu PRs em 30d)

Description

🚀 The feature, motivation and pitch

This is a request that is related to the amount of information available per dataset in PyG. I have noticed that some dataset papers have started to increase the amount of information available, and include several graph properties which might be interesting for researchers that seek to benchmark algorithms. One example of such paper is "A Critical Look at the Evaluation of GNNs under Heterophily: Are We Really Making Progress?", where they include properties such as graph diameter, average degree... image I believe that adding this or similar information can be helpful and adding it bit by bit to the dataset cheatsheet could be good.

Alternatives

Another version of this, could be the the properties from Network Repository, which slightly differ from the ones used in the previous paper. image

Additional context

I found myself developing a layer that worked better under specific graph properties, and I believe people can benefit from this information being more readily accesible

Guia do colaborador