Repositories
ApsarasX's repositories
An aggregated music player web app based on Vue.js
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
A network sniffer based on libpcap and Electron
A TypeScript-like language for WebAssembly.
Compiler infrastructure and toolchain library for WebAssembly
Personal blog & study notes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.
LLVM bindings for Node.js/JavaScript/TypeScript
Build LLVM for Windows
"Xiao Chong Ai" (小宠爱): first prize, Northwest Division, University Mini Program Development and Application Competition
Classification of sinks and sources in the Node.js API.
A high-throughput and memory-efficient inference and serving engine for LLMs
Community maintained hardware plugin for vLLM on Ascend
A framework for efficient model inference with omni-modality models
:sparkles: A WebAssembly interpreter written in C for demonstration