About Me

I’m an NLP Engineer at Aveni. I previously worked at Shanghai AI Lab and Huawei Research Center. I obtained an MSc in Data Science from the University of Edinburgh, supervised by Jeff Z. Pan.

I’m broadly interested in the applications of large language models (LLMs) and their foundational capabilities, especially their agentic capabilities and their ability to learn vertical domains (e.g., finance).

I currently develop and train agents for financial services and do research. I’m working to become a full-stack LLM engineer with experience across the whole pipeline: (continual) pretraining, post-training, and deployment.

I have worked on factual knowledge extraction, LLM swarm agents, and long-context language models. Feel free to reach out if you are interested in any of these topics!

Selected Publications

Acknowledgement

Since starting my NLP journey, I’ve been lucky to learn so much from these dedicated and inspiring peers: Chenmien Tan@Alibaba, Hanxu Hu@University of Zurich, Pinzhen Chen@UoE, Simon Yu@Northeastern University, Wenhao Zhu@Bytedance Seed, Yifu Qiu@UoE and Zeyu Huang@UoE. Most importantly, I thank Yanran Ni for her company and for being my forever home.


Last Update (Sep 2025)