Search Results for author: Junwen He

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

Multimodal Large Language Model (MLLMs) leverages Large Language Models as a cognitive framework for diverse visual-language tasks.

Paper
Add Code

Our method sets the new state of the art for depth-aware panoptic segmentation on both Cityscapes-DVPS and SemKITTI-DVPS datasets.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.