MacPaw
/

yolov11l-ui-groups-detection

Object Detection

🇪🇺 Region: EU

Model card Files Files and versions Community

hellcaster commited on 27 days ago

Commit

d267531

·

verified ·

1 Parent(s): 74ed2d2

Update README.md

Files changed (1) hide show

README.md +102 -3

README.md CHANGED Viewed

@@ -1,3 +1,102 @@
----
-license: agpl-3.0
----

+---
+base_model:
+- Ultralytics/YOLO11
+pipeline_tag: object-detection
+library_name: ultralytics
+tags:
+- yolov11
+- ultralytics
+- yolo
+- vision
+- object-detection
+- pytorch
+- ui
+datasets:
+- MacPaw/Screen2AX-Group
+license: agpl-3.0
+---
+# 🔍 YOLOv11l — UI Groups Detection
+This model is a fine-tuned version of [`Ultralytics/YOLO11`](https://huggingface.co/Ultralytics/YOLO11), trained to detect **UI groups** (e.g., toolbars, tab groups) in macOS application screenshots.
+It is part of the **Screen2AX** project, a research-driven effort to generate accessibility metadata for macOS applications using vision-based techniques.
+---
+## 🧠 Task Overview
+- **Task:** Object Detection
+- **Target:** macOS UI groups
+- **Supported Label(s):**
+  ```
+  ['AXGroup']
+  ```
+This model detects higher-level UI groupings that are commonly used to structure accessible interfaces (e.g., `AXGroup`, `AXTabGroup`, `AXToolbar`, etc.).
+---
+## 🗂 Dataset
+- **Training data:** [`MacPaw/Screen2AX-Group`](https://huggingface.co/datasets/MacPaw/Screen2AX-Group)
+---
+## 🚀 How to Use
+### 🔧 Install Dependencies
+```bash
+pip install huggingface_hub ultralytics
+```
+### 🧪 Load the Model and Run Predictions
+```python
+from huggingface_hub import hf_hub_download
+from ultralytics import YOLO
+# Download the model from the Hugging Face Hub
+model_path = hf_hub_download(
+    repo_id="MacPaw/yolov11l-ui-groups-detection",
+    filename="ui-groups-detection.pt"
+)
+# Load and run prediction
+model = YOLO(model_path)
+results = model.predict("/path/to/your/image")
+# Visualize or process results
+results[0].show()
+```
+---
+## 📜 License
+This model is licensed under the **GNU Affero General Public License v3.0 (AGPL-3.0)**, inherited from the original YOLOv11 base model.
+---
+## 🔗 Related Projects
+- [Screen2AX Project](https://github.com/MacPaw/Screen2AX)
+- [Screen2AX HuggingFace Collection](https://huggingface.co/collections/MacPaw/screen2ax-687dfe564d50f163020378b8)
+- [YOLOv11l — UI Elements Detection](https://huggingface.co/MacPaw/yolov11l-ui-elements-detection)
+---
+## ✍️ Citation
+If you use this model, please cite the Screen2AX paper:
+```bibtex
+...
+```
+---
+## 🌐 MacPaw Research
+Learn more at [https://research.macpaw.com](https://research.macpaw.com)