add dataset link.
README.md
@@ -9,7 +9,7 @@ pipeline_tag: image-text-to-text
 # GUI-Actor-7B with Qwen2-VL-7B as backbone VLM
 
 This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://huggingface.co/papers/2506.03143).
-It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here
+It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here](https://huggingface.co/datasets/cckevinn/GUI-Actor-Data).
 
 For more details on model design and evaluation, please check: [Project Page](https://microsoft.github.io/GUI-Actor/) | [Github Repo](https://github.com/microsoft/GUI-Actor) | [Paper](https://www.arxiv.org/pdf/2506.03143).
 