how does `default_response_timeout` work? · pytorch/serve#2452

(6 comments) (0 reactions) (0 assignees) Java (790 forks) batch import

documentation, good first issue, triaged

Repository metrics

Stars: 3,844
PR merge metrics: no PRs merged in the last 30 days

Description

📚 The doc issue

I set `default_response_timeout` to 4, i.e. 4 seconds. When the model starts loading, the following exception is thrown after roughly 4 seconds:

org.pytorch.serve.wlm.WorkerInitializationException: Backend worker did not respond in given time

My guess is that the worker gets killed because the model takes longer than 4 seconds to load. Is there a way to set a larger initial delay, i.e. to differentiate these two scenarios:

- allow the initial model load a timeout that is different from `default_response_timeout`
- if the model doesn't respond within `default_response_timeout` after the initial load, kill the worker
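
To make the two scenarios concrete, here is a minimal sketch of the behavior being asked for. It is not TorchServe's implementation: the class `WorkerTimeouts`, the `Phase` enum, and the 120-second startup value are all hypothetical names and numbers chosen for illustration. Only the 4-second response timeout comes from the setup described above.

```java
import java.time.Duration;

/**
 * Hypothetical sketch (not TorchServe code): apply one deadline while the
 * model is loading and a different one once the worker is serving requests.
 */
final class WorkerTimeouts {
    /** Phases a backend worker passes through in this sketch. */
    enum Phase { LOADING, SERVING }

    private final Duration startupTimeout;   // assumed separate setting for model load
    private final Duration responseTimeout;  // plays the role of default_response_timeout

    WorkerTimeouts(Duration startupTimeout, Duration responseTimeout) {
        this.startupTimeout = startupTimeout;
        this.responseTimeout = responseTimeout;
    }

    /** Returns the deadline that applies in the given phase. */
    Duration timeoutFor(Phase phase) {
        return phase == Phase.LOADING ? startupTimeout : responseTimeout;
    }

    /** True when the worker has been silent longer than its phase allows. */
    boolean shouldKill(Phase phase, Duration elapsed) {
        return elapsed.compareTo(timeoutFor(phase)) > 0;
    }

    public static void main(String[] args) {
        // 120 s for loading is an arbitrary illustrative value; 4 s matches the issue.
        WorkerTimeouts timeouts =
                new WorkerTimeouts(Duration.ofSeconds(120), Duration.ofSeconds(4));

        // A 10-second model load is tolerated...
        System.out.println(timeouts.shouldKill(Phase.LOADING, Duration.ofSeconds(10))); // false
        // ...but a 10-second silence after loading is not.
        System.out.println(timeouts.shouldKill(Phase.SERVING, Duration.ofSeconds(10))); // true
    }
}
```

With a single timeout applied to both phases, the first check would also return `true`, which matches the `WorkerInitializationException` reported here.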

Suggest a potential alternative/fix

No response

Contributor guide

Research direction: Examine the `default_response_timeout` configuration and how it interacts with model loading. Check the source code around WorkerInitializationException to understand when it is thrown.
Tech stack: Java
Domain: backend
Issue type: Documentation
Difficulty: 2
Estimated time: 1-3 hours
Activity status: Active
Clarity: Clear
Prerequisites: TorchServe
Beginner accessibility: 75
