ray Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. 开源项目 2周前 0 点赞 0 评论 125 浏览
vllm A high-throughput and memory-efficient inference and serving engine for LLMs 开源项目 2周前 0 点赞 0 评论 96 浏览