Jiayi Weng 翁家翌

trinkle23897 [at] gmail [dot] com

I'm currently a first-year master student at MCDS@CMU. Previously, I earned a bachelor's degree from the Department of Computer Science and Technology, Tsinghua University. I spent a wonderful time at TSAIL, working with Professor Hang Su and Jun Zhu in the field of Reinforcement Learning.

My interest lies in constructing and optimizing machine learning systems. Open-source projects are my favorite. I created a Reinforcement Learning platform Tianshou and got over GitHub stars. My Github has more than GitHub followers. I always aim to use what I have learned to gain more influence and benefit others. Currently, I am looking for Software Engineering internships / Machine Learning Engineering internships / Research Engineering internships in 2021 summer.

Curriculum Vitae  /  GitHub  /  LinkedIn  /  Facebook  /  Zhihu

Research Experience

I'm broadly interested in the problem of creating machines that exhibit intelligence, the hallmarks of which I consider to be adaptability, flexibility, and generality. In my exploration of this interest, I have studied and done research in reinforcement learning, computer vision, and natural language processing. I've had the fortune of participating in a range of interesting research projects with talented and patient collaborators.

For the summer of 2019, I was a visiting student researcher at the Montreal Institute for Learning Algorithms (MILA), where I worked with Professor Yoshua Bengio on the Consciousness Prior based on Transformer architecture. [Photo]

Prior to that, I worked on the reinforcement learning algorithm based on the VizDoom platform with Professor Hang Su and Jun Zhu in TSAIL. As the team leader, we proposed an environment-aware hierarchical reinforcement learning architecture and achieved first place in VizDoom AI Competition 2018 Single Player Track(1). I am also the main contributor of Tianshou, an elegant, flexible, and superfast PyTorch deep Reinforcement Learning platform.

Previously, I had experience in image denoising during my internship at Sensetime Inc., mentored by Hongwei Qin. My first research project was a rule-based escape routing problem working with Professor Hailong Yao.

Publications and/or Submitted Manuscripts

Playing FPS Game with Environment-aware Hierarchical Reinforcement Learning
Shihong Song*, Jiayi Weng*, Hang Su, Dong Yan, Haosheng Zou, Jun Zhu
The 28th International Joint Conferences on Artificial Intelligence (IJCAI 2019). Oral Presentation.

URBER: Ultrafast Rule-Based Escape Routing Method for Large-Scale Sample Delivery Biochips
Jiayi Weng, Tsung-Yi Ho, Weiqing Ji, Peng Liu, Mengdi Bao, Hailong Yao
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2018.

Tianshou: A Fast Lightweight Deep Reinforcement Learning Platform
Jiayi Weng, Minghao Zhang, Alexis Duburcq, Kaichao You, Dong Yan, Hang Su, Jun Zhu
Submitted to Journal of Machine Learning Research (JMLR).

Model-based Credit Assignment for Model-free Deep Reinforcement Learning
Dong Yan, Jiayi Weng, Shiyu Huang, Chongxuan Li, Yichi Zhou, Hang Su, Jun Zhu
Submitted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

I'm also engaged (but an amateur) in Photography / Computer Graphics / Web Security.

, Credits