Posts by Collection

portfolio

publications

SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant

Published in Proceedings of the 18th European Conference on Computer Vision (ECCV), 2024

This work introduces a training method that improves general-purpose vision-language understanding and image-oriented question answering through visual self-questioning.

Recommended citation: Sun, G., Qin, C., Wang, J., Chen, Z., Xu, R., & Tao, Z. (2024). SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant. ECCV.

Latent Chain-of-Thought for Visual Reasoning

Published in NeurIPS, 2025

This work introduces a deep reinforcement learning method that improves the reasoning ability of vision-language models (VLMs).

Recommended citation: Sun, G., Hua, H., Wang, J., Luo, J., Dianat, S., Rabbani, M., & Tao, Z. (2025). Latent Chain-of-Thought for Visual Reasoning. NeurIPS.

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.