Add video feature · huggingface/datasets#5225

仓库指标

Star: (18,313 star)
PR 合并指标: (平均合并 2天 3小时) (30 天内合并 16 个 PR)

描述

Feature request

Add a Video feature to the library so folks can include videos in their datasets.

Motivation

Being able to load Video data would be quite helpful. However, there are some challenges when it comes to videos:

Videos, unlike images, can end up being extremely large files
Often times when training video models, you need to do some very specific sampling. Videos might end up needing to be broken down into X number of clips used for training/inference
Videos have an additional audio stream, which must be accounted for
The feature needs to be able to encode/decode videos (with right video settings) from bytes.

Your contribution

I did work on this a while back in this (now closed) PR. It used a library I made called encoded_video, which is basically the utils from pytorchvideo, but without the torch dep. It included the ability to read/write from bytes, as we need to do here. We don't want to be using a sketchy library that I made as a dependency in this repo, though.

Would love to use this issue as a place to:

brainstorm ideas on how to do this right
list ways/examples to work around it for now

CC @sayakpaul @mariosasko @fcakyon

贡献者指南

研究方向: 回顾已关闭的 PR #4532 和 encoded video 库，以了解之前的尝试。探索 pytorchvideo 中不依赖 torch 的视频解码工具。考虑如何将视频支持集成到现有的 datasets Feature API 中，可能通过实现一个新的 Video 特征类来处理编码/解码为字节。与维护者沟通以明确设计决策，并确保与当前代码库的兼容性。
技术栈: python
领域: apimachine learning
议题类型: 功能
难度: 3
预计时间: 1-2 天
活动状态: 活跃
清晰度: 基本清晰
前置要求: Python
新手友好度: 40

仓库指标

描述

Feature request

Motivation

Your contribution

贡献者指南

每天在邮箱收到新鲜 Easy issues。