I am currently working as a Research Scientist in Skywork AI 2050, which is leaded by Prof Shuicheng Yan focuses on accelerating the pace of realization of Artificial General Intelligence (AGI). Before that I obtained my Ph.D dergee from Xidian University in 2021, supervised by Prof Bo Chen and Prof Mingyuan Zhou, and worked as Research Fellow in Nanyang Technological University until 2023, supervised by Prof Bo An.
My research interests mainly lie in generative models (GMs), large language models (LLMs), and reinforcement learning from human feedback (RLHF) technologies.
I am looking for Research Intern who is willing to work on LLMs for math reasoning and code generation. Please send me your CV if you have interests.
Research Highlights
- [Jun, 2024] Our paper "Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning" has been released on arxiv.
- [May, 2021] Our paper "EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering" with Zhibin Duan, Chaojie Wang, Zhengjue Wang, Bo Chen, and Mingyuan Zhou will be published in ACL 2021. This work is about by semantic clustering, building an ensemble language model to alleviate the heterogeneous charateristics of data unsupervisedly. Our idea can be used for language generation (decoder) and understanding (encoder and autoencoder).
- [May, 2021] Our paper "Multimodal Weibull Variational Autoencoder for Jointly Modeling Image-Text Data" with Chaojie Wang, Bo Chen, Sucheng Xiao, Zhengjue Wang, Penghui Wang, Ning Han, and Mingyuan Zhou will be published in IEEE Trans. on Cybernetics. This work is about building an interpretable image-text modalities probability autoencoder.
- [September, 2020] Our paper "Deep Relational Topic Modeling via Graph Poisson Gamma Belief Network" with Chaojie Wang, Zhengjue Wang, Dongsheng Wang, Bo Chen, and Mingyuan Zhou will be published in NeurIPS2020. This work is about using hierarical topic model to explore the graph data for node clustering, node classification and node-relation prediction.
- [September, 2020] Our paper "Bidirectional Convolutional Poisson Gamma Dynamical Systems" with Wenchao Chen, Chaojie Wang, Yicheng Liu, Bo Chen, and Mingyuan Zhou will be published in NeurIPS2020. This work is about building a bidirectional convolutional topic modeling for document classifcation and exploring the relations among sentences in one document.
- [September, 2020] Our paper "Friendly Topic Assistant for Transformer Based Abstractive Summarization" with Zhengjue Wang, Zhibin Duan, Chaojie Wang, Long Tian, Bo Chen, and Mingyuan Zhou will be published in the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP2020). This work is about using topic model to help Transformer based language model for document abstractive summarization.
- [Janurary, 2020] Our paper "Learning Dynamic Hierarchical Topic Graph with Graph Convolutional Network for Document Classication" with Zhengjue Wang, Hao Zhang, Zhibin Duan, Bo Chen, and Mingyuan Zhou will be presented in AISTATS2020. This work is about building a dynamic document graph with the help of a hierarhcical topic model for document classification. Hope to see you in Palermo, Sicily, Italy, in June 2020.
- [2016] The ACM-ICPC China Shaanxi Provincial Programming Contest. individual / team
Honors and Awards
© Chaojie Wang