alibaba/GraphScope

[BUG] Loading from large dataframe/large numpy requires holding all chunks in coordinator

Open

Opened on 23 Dec 2022

Labels: bug, component:coordinator, good first issue

Description

Describe the bug

It looks strange that the coordinator has to accumulate all chunks of the request stream into a list before sending them to the analytical engine. This requires the coordinator pod to have enough available memory to hold the entire dataframe or NumPy array.

https://github.com/alibaba/GraphScope/blob/b80a35599424580325a750e734f8a3b2dead2a5b/coordinator/gscoordinator/dag_manager.py#L77-L107
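
To illustrate the difference in memory behaviour, here is a minimal sketch in plain Python. It does not use GraphScope's actual code or API: `fake_request_stream`, `accumulate_then_send`, `forward_lazily`, and `send_to_engine` are all hypothetical names, and the real coordinator may have reasons (for example, needing to inspect several chunks together) that make a purely lazy forward harder than shown here.

```python
from typing import Iterable, Iterator, List


def fake_request_stream(n_chunks: int, chunk_size: int) -> Iterator[bytes]:
    """Hypothetical stand-in for the incoming request stream."""
    for _ in range(n_chunks):
        yield b"x" * chunk_size


def accumulate_then_send(request_stream: Iterable[bytes]) -> List[bytes]:
    """Buffering pattern described in the issue: all chunks live in memory at once."""
    chunks = []
    for chunk in request_stream:
        chunks.append(chunk)
    return chunks


def forward_lazily(request_stream: Iterable[bytes]) -> Iterator[bytes]:
    """Streaming pattern: each chunk is handed downstream as soon as it arrives,
    so only one chunk needs to be held by this function at a time."""
    for chunk in request_stream:
        yield chunk


def send_to_engine(chunks: Iterable[bytes]) -> int:
    """Hypothetical sink standing in for the analytical engine.
    Consumes the chunks one by one and returns the number of bytes received."""
    total = 0
    for chunk in chunks:
        total += len(chunk)
    return total


if __name__ == "__main__":
    # Buffered: peak memory grows with the whole payload.
    buffered = accumulate_then_send(fake_request_stream(4, 8))
    print(len(buffered), send_to_engine(buffered))

    # Streamed: the generator is consumed chunk by chunk.
    print(send_to_engine(forward_lazily(fake_request_stream(4, 8))))
```

Both paths deliver the same bytes to the sink; the difference is that the buffered path keeps every chunk alive until the list is released, while the generator-based path lets each chunk be freed once the sink has consumed it.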
