alibaba/GraphScope

[BUG] Loading from large dataframe/large numpy requires holding all chunks in coordinator

Open

#2 342 ouverte le 23 déc. 2022

Voir sur GitHub
 (0 commentaires) (1 réaction) (0 assignés)HTML (2 401 stars) (301 forks)batch import
bugcomponent:coordinatorgood first issue

Description

Describe the bug

It looks strange that we need to accumulate all chunks in the request stream into a list in coordinator before sending to analytical engine, that would requires large available memory for the coordinator pod.

https://github.com/alibaba/GraphScope/blob/b80a35599424580325a750e734f8a3b2dead2a5b/coordinator/gscoordinator/dag_manager.py#L77-L107

Guide contributeur