Now, I am a 2nd year master at Tsinghua University, supervised by Prof. Yansong Tang. I obtained my bachelor's degree from the School of Software Engineering at Tongji University in 2023. I am interning at Tencent ARC Lab.
- 🔭 I’m currently working on large vision language models!
- 💬 How to reach me: Email.
- 📫 Recent work:
- VoCo-LLaMA. [Preprint] The first approach to compress vision information utilizing the LLMs' understanding paradigm, which can compress hundreds of vision tokens into a single VoCo token with minimal visual information loss.
- LAVT-RS. [TPAMI2024] Pixel-level language-aware early-fusion vision transformer structure for both referring image and video segmentation.