[Feature] Unsloth/ Whisper/Large-v3 - S3 Bucket connection
#4539 opened on Mar 23, 2026
Description
Hi,
I’m currently using Unsloth Studio to fine-tune unsloth/ Whisper Large v3, and I have a question regarding dataset ingestion. My dataset consists of audio files stored privately on AWS S3 along with their corresponding transcriptions. I’d like to know whether Unsloth Studio supports direct integration with S3 (e.g., via bucket paths, IAM roles, or signed URLs) during the dataset upload step. At the moment, I’m unsure if there is a built-in way to connect an S3 bucket or if all data must first be downloaded locally to the instance. Could you clarify if this feature exists, is planned, or if there is a recommended workaround for securely using private S3-hosted audio datasets within Unsloth Studio?
Thank you for helping..