News
When prompted with images and text inputs, Florence-2 handles a variety of tasks, including object detection, captioning, visual grounding and visual question answering.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results