Model Detail
Florence-2-large
—Florence-2-large is a code generation model released by Microsoft. The model is registered under the image-text-to-text pipeline tag on Hugging Face.
Florence-2-large is published on Hugging Face but our pipeline has not yet captured architecture, license, or parameter-count metadata for this entry. The data is refreshed daily, so these fields typically populate within 24–48 hours of release.
Florence-2-large is best fit for code completion, repository-scale Q&A, and pair-programming integrations. It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.
Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction
arXiv:2605.09827v1 Announce Type: cross Abstract: We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, sty
A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems
arXiv:2604.01179v1 Announce Type: cross Abstract: Foundation vision-language models are becoming increasingly relevant to robotics because they can provide richer semantic perception than narrow task-specific pipelines. However, their practical adoption in robot software stacks still depends on repr