Local multi-headed self-attention · pyg-team/pytorch_geometric#8972 | Good First Issue

(3 comments) (1 reaction) (0 assignees)Python (3,514 forks)batch import

featurehelp wanted

Repository metrics

Stars: (19,985 stars)
PR merge metrics: (平均マージ 35d 1h) (30d で 14 merged PRs)

説明

🚀 The feature, motivation and pitch

I am unable to find the clean implementation of local multi-headed self-attention in pytorch geometric. I found three types of multi-head attention, one TransformerConv (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.TransformerConv.html#torch_geometric.nn.conv.TransformerConv). But this one calculates a linear combination of all features with different attention weights as opposed to dividing features into multiple heads and taking their linear combination: another RGATConv in the similar direction (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.RGATConv.html). And finally GPSConv (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.GPSConv.html) that does multi-head attention but is global.

Alternatives

I think it is nice to have the implementation of local self-attention with multiple heads where each head looks into a part of the feature dimension.

Additional context

No response

コントリビューターガイド

調査方針: 特徴次元を複数のヘッドに分割するローカルマルチヘッド自己注意機構を実装します。TransformerConvに似ていますが、ヘッドごとの特徴部分空間を持ちます。PyTorch Geometricの既存の実装（TransformerConv、RGATConv、GPSConvなど）を研究し、パターンを理解して拡張します。
技術スタック: pythonpytorch
領域: machine learning
Issue 種別: 機能
難度: 3
推定時間: 1-2日
活動状況: アクティブ
明確さ: おおむね明確
前提条件: PythonPyTorchGraph Neural Networks basics
初心者向け度: 30