how does `default_response_timeout` work? · pytorch/serve#2452


Labels: documentation, good first issue, triaged

Description

📚 The doc issue

I set `default_response_timeout` to 4, i.e. 4 seconds. When the model starts loading, the following exception is thrown after about 4 seconds:

org.pytorch.serve.wlm.WorkerInitializationException: Backend worker did not respond in given time

My guess is that the worker gets killed because the model takes more than 4 seconds to load. Is there a way to set a larger initial delay, i.e. to differentiate these two scenarios:

1. Account for the initial model load with a timeout different from `default_response_timeout`.
2. If the model doesn't respond within `default_response_timeout` after the initial load, kill the worker.
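
To make the two scenarios concrete, here is a minimal Java sketch of the behaviour being asked for: one time budget for the initial model load and a separate, stricter one for later responses. It is not TorchServe's worker-management code. The class `TwoPhaseTimeout`, the `Phase` enum, and the methods `timeoutFor`, `respondsInTime`, and `simulateWorker` are hypothetical names chosen for illustration, and the worker is simulated with a delayed `CompletableFuture`.

```java
import java.time.Duration;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

/** Illustrative only: all names here are hypothetical, not TorchServe APIs. */
public class TwoPhaseTimeout {

    /** The two situations the question wants to treat differently. */
    public enum Phase { MODEL_LOAD, INFERENCE }

    /** Picks the time budget that applies to the given phase. */
    public static Duration timeoutFor(Phase phase, Duration loadTimeout, Duration responseTimeout) {
        return phase == Phase.MODEL_LOAD ? loadTimeout : responseTimeout;
    }

    /** Returns true if the reply arrives within the limit, false otherwise. */
    public static boolean respondsInTime(CompletableFuture<?> reply, Duration limit) {
        try {
            reply.get(limit.toMillis(), TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException | ExecutionException e) {
            return false; // too slow, or the simulated worker failed
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt flag
            return false;
        }
    }

    /** Stand-in for a backend worker that answers after the given delay. */
    public static CompletableFuture<String> simulateWorker(long delayMillis) {
        return CompletableFuture.supplyAsync(
                () -> "ready",
                CompletableFuture.delayedExecutor(delayMillis, TimeUnit.MILLISECONDS));
    }

    public static void main(String[] args) {
        Duration loadTimeout = Duration.ofSeconds(2);       // generous budget for the initial load
        Duration responseTimeout = Duration.ofMillis(100);  // strict budget after the load

        // A model that needs 300 ms to load exceeds the strict response budget...
        boolean withSingleTimeout = respondsInTime(simulateWorker(300), responseTimeout);
        // ...but fits within the separate load budget.
        boolean withLoadTimeout = respondsInTime(
                simulateWorker(300),
                timeoutFor(Phase.MODEL_LOAD, loadTimeout, responseTimeout));

        System.out.println("single timeout, load succeeds: " + withSingleTimeout);
        System.out.println("separate load timeout, load succeeds: " + withLoadTimeout);
    }
}
```

The sketch only shows the distinction between the two budgets. Whether TorchServe exposes a separate setting for the initial load is exactly what this issue asks the documentation to clarify.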

Suggest a potential alternative/fix

No response

Contributor Guide

Research direction: Investigate how `default_response_timeout` is configured and how it interacts with model loading. Review the source code of WorkerInitializationException to understand when it is thrown.
Tech Stack: Java
Domain: backend
Issue Type: Documentation
Difficulty: 2
Estimated time: 1-3 hours
Activity status: Active
Clarity: Clear
Prerequisites: TorchServe
Beginner friendliness: 75
