- I obtained both my Bachelor’s and Master’s degrees in Software Engineering from Harbin Engineering University. During my master’s studies, I conducted research under the guidance of Researcher Shenjun Zhong at Monash University. My research interests include medical multi-modality, medical multi-modal LLMs, medical VQA, and remote sensing VQA.
- I enjoy fitness, playing PC games, and collecting a plethora of papers that I might not thoroughly read. If you are interested in my research, feel free to contact me. I am happy to collaborate on research projects, and I also welcome companions for fitness and gaming.
Recent News
- May 23, 2025, I was admitted to the School of Mechanical and Electrical Engineering at the University of Electronic Science and Technology of China and officially joined the HiLab as a doctoral student under the supervision of Professor Guotai Wang.
- Apr 8, 2025, I got the 2025 Outstanding Graduate of Harbin Engineering University.
- Jan 3, 2025, our paper Parameter-Efficient Fine-Tuning Medical Multimodal Large Language Models for Medical Visual Grounding was accepted by 2025 IEEE 22th International Symposium on Biomedical Imaging(ISBI 2025).[paper]
- Nov 7, 2024, our paper Cross-Modal Self-Supervised Vision Language Pre-training with Multiple Objectives for Medical Visual Question Answering was accepted by Journal of Biomedical Informatics(JBI).[paper]
- Oct 8, 2024, our paper DSS-DocRE: A DenseNet-Based Semantic Segmentation Model for Document-Level Relation Extraction was accepted by 20th International Conference on Mobility, Sensing and Networking(MSN 2024). Congratulations to DongZe.
- Sep 21, 2024, I got the Xiaomi Scholarship (ranked 4/224) - ¥5000 ($690).
- Aug 14, 2024, our paper PERS: Parameter-Efficient Multi-modal Transfer Learning for Remote Sensing Visual Question Answering was accepted by IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing(JStars).[paper]
- Jan 8, 2024, our paper Pefomed: Parameter Efficient Fine-tuning on Multimodal large Language Models for Medical Visual Question Answering was published in arxiv. Although the paper is not officially published, it is still a good work.[paper]
- Dec 14, 2023, our paper PECR: Parameter-Efficient Transfer Learning with Cross-Modal Representation Learning for Remote Sensing Visual Question Answering was accepted by 2024 IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP 2024).[paper]
- Oct 6, 2023, I got the National Scholarship for Postgraduate Students (ranked 7/224) - ¥20000 ($2760).