unslothai/unsloth

[Feature] Unsloth/ Whisper/Large-v3 - S3 Bucket connection

Open

#4 539 ouverte le 23 mars 2026

Voir sur GitHub
 (3 commentaires) (0 réactions) (0 assignés)Python (5 658 forks)batch import
feature requestgood first issuehelp wanted

Métriques du dépôt

Stars
 (64 271 stars)
Métriques de merge PR
 (Merge moyen 3j 15h) (525 PRs mergées en 30 j)

Description

Hi,

I’m currently using Unsloth Studio to fine-tune unsloth/ Whisper Large v3, and I have a question regarding dataset ingestion. My dataset consists of audio files stored privately on AWS S3 along with their corresponding transcriptions. I’d like to know whether Unsloth Studio supports direct integration with S3 (e.g., via bucket paths, IAM roles, or signed URLs) during the dataset upload step. At the moment, I’m unsure if there is a built-in way to connect an S3 bucket or if all data must first be downloaded locally to the instance. Could you clarify if this feature exists, is planned, or if there is a recommended workaround for securely using private S3-hosted audio datasets within Unsloth Studio?

Thank you for helping..

Guide contributeur