Florence-2-large news

arxivMay 13

Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

arXiv:2605.09827v1 Announce Type: cross Abstract: We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, sty

arxivApr 3

A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems

arXiv:2604.01179v1 Announce Type: cross Abstract: Foundation vision-language models are becoming increasingly relevant to robotics because they can provide richer semantic perception than narrow task-specific pipelines. However, their practical adoption in robot software stacks still depends on repr

huggingfaceJun 24

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Florence-2-large news

3 articles mentioning Florence-2-large

arxivMay 13