News
When prompted with images and text inputs, Florence-2 handles a variety of tasks, including object detection, captioning, visual grounding and visual question answering.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results