Industry Collaboration | MoE Inference Deployment
Applied research with Huawei and HIT Shenzhen on real-world MoE LLM inference deployment. Details withheld under NDA.
Detailed project information is withheld under a non-disclosure agreement.
This collaboration builds on the core techniques of ExpertFlow (DAC 2026) and explores their application in real-world industrial deployment of sparse MoE LLMs.
Partners:
- Huawei Technologies
- Harbin Institute of Technology, Shenzhen (HIT Shenzhen)
Scope: Algorithm-to-production validation of MoE inference optimization under real-world cluster conditions and business workloads.
For collaboration inquiries: he_xin@a-star.edu.sg