Optimize byte swapping routines in memory_generic.cc · xenia-project/xenia#308

(0 评论) (0 反应) (0 负责人)C++ (7,418 star) (1,077 fork)batch import

cpugood first issue

描述

AVX intrinsics and unrolled loops could help swap large chunks of memory much faster.

技术栈: cpp
领域: performance
议题类型: performance
难度: 3
预计时间: 1-3 hours
活动状态: stale
清晰度: clear
前置要求: C++ programmingSIMD intrinsics (AVX)Understanding of byte swapping
新手友好度: 30
研究方向: Investigate the current byte swapping implementation in memory generic.cc. Research AVX intrinsics for 128/256 bit byte swaps. Look into unrolled loop patterns to improve throughput. Test the optimized version for correctness and performance.