I am a Research Fellow in the Artificial Intelligence Institute, Te Ipu o te Mahara and a member of the Machine Learning Group at the University of Waikato. Before coming to New Zealand, I worked as a Research Assitant Professor in the Department of Computer Science and Engineering, Shanghai Jiao Tong University (SJTU). I obtained a PhD in Compputer Science from Nanyang Technological University (NTU) in 2020 and a Bachelor degree from Shanghai Jiao Tong University, in 2015. I’m currently working on developing efficient reinforcement learning algorithms for continual learning. Here is my CV.
I am particularly interested in developing statistically and computationally efficient machine learning algorithms which are applicable for real-world systems. To improve the sample efficiency in reinforcement learning, one idea I’ve explored is to bootstrap policy gradient with better/worse actions (AAMAS 2019). This leads to fast and unbiased convergence in challenging environments with large action space and short horizon (e.g. intelligent tutoring system)(User Model User-Adap Inter (2021)). To reduce the computational cost in cluster analysis, I proposed to exploit curvature information of the evaluation graph. This results in a simple yet powerful method for estimating the number of clusters in a dataset (Information Sciences 2017). I’m also interested in applying machine learning to real-world problems. On this note, I’ve employed an interdisciplinary research method, to design and implement practical gaming systems (a multiplayer game, an online game) and conduct online and offline user studies (Computers in Human Behavior 2018).
Research Interests: Reinforcement Learning; Machine Learning; Human-Computer Interaction.
Email: yaqian_zhang at hotmail.com
CS7327-033-M01 Neural Network Theory and Applications (Graduate course), 2021 spring, SJTU
Yaqian Zhang, Wooi-Boon Goh, “Personalized task difficulty adaptation based on reinforcement learning” User Modeling and User-Adapted Interaction (2021).(project page)
Yaqian Zhang, Wooi-Boon Goh, “Bootstrapped Policy Gradient for Difficulty Adaptation in Intelligent Tutoring Systems” Proceedings of the 18th International Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’19). (paper) (code), (project page)
Enmei Tu, Yaqian Zhang, Lin Zhu, Jie Yang, Nikola Kasabov, “A graph-based semi-supervised k nearest-neighbor method for nonlinear manifold distributed data classification”. Information Sciences 367-368 (2016): 673-688.
Enmei Tu, Jie Yang, Nikola Kasabov, Yaqian Zhang, “Posterior Distribution Learning (PDL): A novel supervised learning framework using unlabeled samples to improve classification performance”. Neurocomputing 157(2015): 173-186.
|Reinforcement Learning||Cluster Analysis|
|Enhance sample efficiency with bootstrapped policy gradient with better/worse actions [more]||Curvatue-based method for cluster number determination [more]|
|Difficulty Adaptation||Cooperative Play|
|Robust Dynamic Difficulty Adaptation in intelligent tutoring systems [more]||The influence of peer accountability on attention [more]|