Industry Collaboration | MoE Inference Deployment

Applied research with Huawei and HIT Shenzhen on real-world MoE LLM inference deployment. Details withheld under NDA.

English

Detailed project information is withheld under a non-disclosure agreement.

This collaboration builds on the core techniques of ExpertFlow (DAC 2026) and explores their application in real-world industrial deployment of sparse MoE LLMs.

Partners:

  • Huawei Technologies
  • Harbin Institute of Technology, Shenzhen (HIT Shenzhen)

Scope: Algorithm-to-production validation of MoE inference optimization under real-world cluster conditions and business workloads.


中文版本

因保密协议,项目详细内容不便公开。

本合作基于 ExpertFlow(DAC 2026) 的核心技术,探索稀疏 MoE 大语言模型推理优化在工业级真实环境中的落地路径。

合作单位:

  • 华为技术有限公司
  • 哈尔滨工业大学(深圳)

研究方向: 在真实集群环境和业务负载下,验证 MoE 推理优化算法从学术成果到生产部署的可行性。


如有合作意向,欢迎联系 / For collaboration inquiries: he_xin@a-star.edu.sg