了解最新的模型研究、产品动态与行业洞察。
DINO-XSeek is a referring object detection model based on a multimodal large language model, designed to precisely locate objects based on user-input natural language descriptions.