About Me
I’m an NLP Engineer at Aveni. I previously worked at Shanghai AI Lab and Huawei Research Center. I obtained an MSc in Data Science at the University of Edinburgh supervised by Jeff Z.Pan.
I’m broadly interested in the applications of large language models (LLMs) and their foundation capabilities, especially the agentic capabilities and their ability to be vertical domain learners (e.g financial domains, etc).
I currently develop/train agents for financial services and do research. I’m working to become a full-stack LLM Engineer experienced with (continual) pretraining -> post-training -> deployment.
I have worked on the Factual Knowledge Extraction, LLM Swarm Agents and Long-Context Language Models. Feel free to reach out to me if you are interested in the same topic!
Selected Publications
A Controllable Examination for Long-Context Language Models
Yijun Yang*, Zeyu Huang*, Wenhao Zhu, Zihan Qiu, Fei Yuan, Jeff Z. Pan, Ivan Titov.
NeurIPS 2025 Spotlight | code
UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing
Yijun Yang, Jie He, Pinzhen Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan
NAACL 2024 Main | code
Acknowledgement
Since starting my NLP journey, I’ve been lucky to learn so much from those dedicated and inspiring peers: Chenmien Tan@Alibaba, Hanxu Hu@University of Zurich, Pinzhen Chen@UoE, Simon Yu@Northeastern University, Wenhao Zhu@Bytedance Seed, Yifu Qiu@UoE and Zeyu Huang@UoE. Most importantly, I thank Yanran Ni for her company and being my forever home.
Last Update (Sep 2025)
