Dépôts
Dépôts de open-mmlab
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
Multimodal-GPT
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型,可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成,只需要一个模型
MIM Installs OpenMMLab Packages
An open-source toolbox for action understanding based on PyTorch
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
OpenMMLab Computer Vision Foundation
OpenMMLab Model Deployment Framework
OpenMMLab Detection Toolbox and Benchmark
OpenMMLab's next-generation platform for general 3D object detection.
OpenMMLab Foundational Library for Training Deep Learning Models