Labels: help wanted, performance
Description
- model: Qwen/Qwen3-235B-A22B
- chip: v6e
- input: [4, 64, 256, 512, 1024, 2048, 4096]
- output: [1, 4, 1024, 4096, 8192, 16384, 24576]
- batch_size: [8, 16, 32, 64, 128, 256, 512]
- mode: McJAX
- hardware: tpu-v6e-32
- Issue type: test
- Research direction: Set up the environment to run Qwen/Qwen3-235B-A22B on TPU v6e (tpu-v6e-32) with JAX in McJAX mode, run the listed input lengths, output lengths, and batch sizes, measure latency and throughput for each configuration, and compare the results against the expected performance baselines.
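
The sweep described above could be driven by a small timing harness along the following lines. This is only a sketch under stated assumptions: `generate_fn` is a hypothetical callable standing in for whatever actually runs the model, the issue does not say whether input and output lengths are paired or crossed (a full Cartesian product is assumed here), and the issue gives no baseline numbers, so `within_baseline` and its 5% tolerance are illustrative.

```python
import itertools
import statistics
import time

# Shapes copied from the issue description.
INPUT_LENS = [4, 64, 256, 512, 1024, 2048, 4096]
OUTPUT_LENS = [1, 4, 1024, 4096, 8192, 16384, 24576]
BATCH_SIZES = [8, 16, 32, 64, 128, 256, 512]


def sweep_configs():
    """Return every (input_len, output_len, batch_size) combination.

    Assumption: the issue does not state how the lists combine, so this
    uses the full Cartesian product.
    """
    return list(itertools.product(INPUT_LENS, OUTPUT_LENS, BATCH_SIZES))


def benchmark(generate_fn, input_len, output_len, batch_size, warmup=1, iters=3):
    """Time a hypothetical generate_fn(input_len, output_len, batch_size).

    generate_fn must return only after the work has finished. With JAX,
    dispatch is asynchronous, so the callable should block on its result
    (for example with jax.block_until_ready) before returning.
    """
    # Warm-up runs absorb compilation time so it is not counted as latency.
    for _ in range(warmup):
        generate_fn(input_len, output_len, batch_size)

    latencies = []
    for _ in range(iters):
        start = time.perf_counter()
        generate_fn(input_len, output_len, batch_size)
        latencies.append(time.perf_counter() - start)

    latency_s = statistics.median(latencies)
    generated = batch_size * output_len  # output tokens per call
    throughput = generated / latency_s if latency_s > 0 else float("inf")
    return {
        "input_len": input_len,
        "output_len": output_len,
        "batch_size": batch_size,
        "latency_s": latency_s,
        "output_tokens_per_s": throughput,
    }


def within_baseline(measured_tps, baseline_tps, tolerance=0.05):
    """True if measured throughput is at most `tolerance` below the baseline.

    Hypothetical check: the baseline values and the tolerance are not
    specified in the issue.
    """
    return measured_tps >= baseline_tps * (1.0 - tolerance)
```

Reporting the median over a few timed iterations, after a warm-up call, keeps one-off compilation or scheduling noise out of the latency figure; the iteration counts are arbitrary defaults and may need adjusting for the longer output lengths.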