Local multi-headed self-attention · pyg-team/pytorch_geometric#8972 | Good First Issue

(3 留言) (1 反應) (0 負責人)Python (3,514 fork)batch import

featurehelp wanted

倉庫指標

Star: (19,985 star)
PR 合併指標: (平均合併 35天 1小時) (30 天內合併 14 個 PR)

描述

🚀 The feature, motivation and pitch

I am unable to find the clean implementation of local multi-headed self-attention in pytorch geometric. I found three types of multi-head attention, one TransformerConv (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.TransformerConv.html#torch_geometric.nn.conv.TransformerConv). But this one calculates a linear combination of all features with different attention weights as opposed to dividing features into multiple heads and taking their linear combination: another RGATConv in the similar direction (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.RGATConv.html). And finally GPSConv (https://pytorch-geometric.readthedocs.io/en/latest/generated/torch_geometric.nn.conv.GPSConv.html) that does multi-head attention but is global.

Alternatives

I think it is nice to have the implementation of local self-attention with multiple heads where each head looks into a part of the feature dimension.

Additional context

No response

貢獻者指南

研究方向: 實現局部多頭自注意力機制，將特徵維度分割到多個頭上，類似於TransformerConv但具有每個頭的特徵子空間。研究PyTorch Geometric中現有的實現，如TransformerConv、RGATConv和GPSConv，以理解模式並擴展它們。
技術棧: pythonpytorch
領域: machine learning
議題類型: 功能
難度: 3
預計時間: 1-2 天
活動狀態: 活躍
清晰度: 大致清晰
前置要求: PythonPyTorchGraph Neural Networks basics
新手友善度: 30