Ultralytics YOLO ๐
-
Updated
Oct 20, 2025 - Python
Ultralytics YOLO ๐
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Segment Anything in High Quality [NeurIPS 2023]
Segment Anything for Stable Diffusion WebUI
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (ๆฏๆDragGANใChatGPTใImageBindใSAM็ๅจ็บฟDemo็ณป็ป)
Efficient vision foundation models for high-resolution generation and perception.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
Images to inference with no labeling (use foundation models to train supervised models).
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.ไบคไบๅผๅ่ชๅจๅพๅๆ ๆณจๅทฅๅ ท
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API ๐ฅ
Tracking and collecting papers/projects/others related to Segment Anything.
Segment-Anything + 3D. Let's lift anything to 3D.
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
ๆถ้ CVPR ๆๆฐ็ๆๆ๏ผๅ ๆฌ่ฎบๆใไปฃ็ ๅdemo่ง้ข็ญ๏ผๆฌข่ฟๅคงๅฎถๆจ่๏ผCollect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Adapting Meta AI's Segment Anything to Downstream Tasks with Adapters and Prompts
Add a description, image, and links to the segment-anything topic page so that developers can more easily learn about it.
To associate your repository with the segment-anything topic, visit your repo's landing page and select "manage topics."