florence-2

Here are 22 public repositories matching this topic...

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL

transformers vqa objectdetection captioning fine-tuning multimodal vision-and-language phi-3-vision paligemma florence-2

Updated Oct 1, 2024
Python

jhc13 / taggui

Star

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava cogvlm florence-2

Updated Aug 4, 2024
Python

autodistill / autodistill-grounded-sam-2

Star

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

grounded-sam autodistill florence-2 segment-anything-2

Updated Aug 7, 2024
Python

Ravi-Teja-konda / Surveillance_Video_Summarizer

Star

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.

video ai summarization gradio vlm vision-and-language huggingface surviellance gpt-4 chatgpt gradio-python-llm florence-2

Updated Sep 17, 2024
Python

autodistill / autodistill-florence-2

Star

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

object-detection zero-shot-object-detection autodistill florence-2

Updated Aug 15, 2024
Python

retkowsky / florence-2

Star

Florence-2

azure florence-2

Updated Jun 21, 2024
Jupyter Notebook

Damarcreative / rem-wm

Sponsor

Star

Rem-WM, a powerful watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

watermark lama-cleaner florence-2

Updated Jul 19, 2024
Python

ANYANTUDRE / Florence-2-Vision-Language-Model

Star

Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

computer-vision deep-learning huggingface vision-language vision-transformer vision-transformer-models vision-language-model florence-2

Updated Jul 3, 2024
Jupyter Notebook

jacobmarks / fiftyone_florence2_plugin

Star

Run SOTA Vision-Language Model Florence-2 on your data!

computer-vision ml transformer datacentric fiftyone-datasets vision-language-model florence-2

Updated Jun 29, 2024
Python

Mithunprb / text2segment_video

Star

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.

video-summarization florence-2 sam2

Updated Aug 5, 2024
Python

D-Ogi / WatermarkRemover-AI

Star

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.

dataset-creation inpainting watermark-remover lama-cleaner florence-2

Updated Aug 26, 2024
Python

sayedmohamedscu / Vision-language-models-VLM

Star

vision language models finetuning notebooks & use cases (paligemma - florence .....)

computer-vision vlm florence finetuning multimodal colab-notebook finetune-llms paligemma florence-2 visionlanguage florence-finetuning

Updated Sep 26, 2024
Jupyter Notebook

Ambruk-chan / DiscordBot

Star

The Ultimate Local LLM Discord Bot!!!

ai discord-bot roleplay llm koboldcpp gbnf florence-2

Updated Jul 2, 2024
Python

Kazuhito00 / Florence-2-Colaboratory-Sample

Star

Microsoft の軽量VLMのFlorence-2のColaboratory上でのサンプル

python vlm colaboratory florence-2

Updated Aug 30, 2024
Jupyter Notebook

sitamgithub-MSIT / TextSnap

Star

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.

artificial-intelligence optical-character-recognition gradio ocr-text-reader huggingface-transformers gradio-interface huggingface-spaces vision-language-model florence-2

Updated Aug 22, 2024
Python

Zuellni / Qt-Caption

Star

Image captioning GUI using Florence-2.

image-captioning qt6 pyside6 florence-2

Updated Sep 23, 2024
Python

Zuellni / Image-Tools

Star

Various image processing scripts.

image-processing image-captioning exllamav2 florence-2

Updated Aug 21, 2024
Python

regiellis / ecko-cli

Star

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX

cli ai image-processing image-classification onnxruntime huggingface-transformers generative-ai ecko florence-2 ecko-cli

Updated Sep 30, 2024
Python

antonio-f / Florence-2-test

Star

Florence-2 quick test

python tutorial jupyter-notebook image-captioning image-to-text colab-notebook visual-grounding referring-expression-comprehension huggingface-transformers multimodal-large-language-models vision-foundation-model florence-2

Updated Aug 15, 2024
Jupyter Notebook

Abdeen-A-AI / Image-Feature-Extraction-Using-GenAI

Star

This project implements an advanced generative AI pipeline for extracting and rating features from images. It combines the power of Florence-2, a state-of-the-art vision-language model, with a fine-tuned version of Mistral-v3, a cutting-edge large language model.

airbnb pyhton genai mistral-7b florence-2

Updated Aug 22, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the florence-2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the florence-2 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

florence-2

Here are 22 public repositories matching this topic...

roboflow / maestro

jhc13 / taggui

autodistill / autodistill-grounded-sam-2

Ravi-Teja-konda / Surveillance_Video_Summarizer

autodistill / autodistill-florence-2

retkowsky / florence-2

Damarcreative / rem-wm

ANYANTUDRE / Florence-2-Vision-Language-Model

jacobmarks / fiftyone_florence2_plugin

Mithunprb / text2segment_video

D-Ogi / WatermarkRemover-AI

sayedmohamedscu / Vision-language-models-VLM

Ambruk-chan / DiscordBot

Kazuhito00 / Florence-2-Colaboratory-Sample

sitamgithub-MSIT / TextSnap

Zuellni / Qt-Caption

Zuellni / Image-Tools

regiellis / ecko-cli

antonio-f / Florence-2-test

Abdeen-A-AI / Image-Feature-Extraction-Using-GenAI

Improve this page

Add this topic to your repo