Abstract: Zero-Shot Composed Image Retrieval (ZS-CIR) involves diverse tasks with a broad range of visual content manipulation intent across domain, scene, object, and attribute. The key challenge for ...
Abstract: With the continuous improvement of high-resolution remote-sensing image-acquisition technologies, image quality and resolution are constantly improved, which greatly promotes the development ...
dots.ocr is a powerful, multilingual document parser that unifies layout detection and content recognition within a single vision-language model while maintaining good reading order. Despite its ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results