AskUI/pta-text-0.1

gitlost-murali committed on Feb 15, 2024

Commit 4077882 · verified · 1 Parent(s): d17e301

Add coordinates method

Files changed (1)

README.md +20 -1

@@ -37,7 +37,7 @@ Download the checkpoint ".pt" model from files in this model card.
 ## Running the model
-### In full precision, on CPU:
+### Get the annotated image
 You can run the model in full precision on CPU:
 ```python
@@ -59,6 +59,25 @@ render_image.show()
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/5f993a63777efc07d7f1e2ce/ZNwjdENJqn-1VpXDcm_Wg.png)
+### Get the coordinates
+```python
+import requests
+from PIL import Image
+from askui_ml_helper.utils.pta_text import PtaTextInference
+pta_text_inference = PtaTextInference("pta-text-v0.1.pt")
+url = "https://docs.askui.com/assets/images/how_askui_works_architecture-363bc8be35bd228e884c83d15acd19f7.png"
+image = Image.open(requests.get(url, stream=True).raw).convert("RGB")
+prompt = 'click on the text "Operating System"'
+coordinates = pta_text_inference.process_image(image, prompt)
+coordinates
+>>> [0.3981265723705292, 0.13768285512924194]
+```
 # Contribution
 An AskUI's open source initiative. This model is contributed and added to the Hugging Face ecosystem by [Murali Manohar @ AskUI](https://huggingface.co/gitlost-murali).
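
Note on the added "Get the coordinates" example: the sample output `[0.3981265723705292, 0.13768285512924194]` looks like an `[x, y]` pair normalized to the image width and height, though the README does not say so explicitly. Under that assumption, a minimal sketch for turning the result into pixel coordinates could look like the following. The helper name `to_pixel_coordinates` is hypothetical and not part of `askui_ml_helper`, and the 1000×500 image size is illustrative only.

```python
from typing import Sequence, Tuple


def to_pixel_coordinates(
    coordinates: Sequence[float], width: int, height: int
) -> Tuple[int, int]:
    """Scale a normalized [x, y] pair to pixel coordinates.

    Assumption: coordinates are [x, y] in the range 0..1, relative to the
    image width and height. The README does not confirm this convention.
    """
    x_norm, y_norm = coordinates
    # Round to the nearest pixel and clamp to the valid index range.
    x = min(max(round(x_norm * width), 0), width - 1)
    y = min(max(round(y_norm * height), 0), height - 1)
    return x, y


# Sample output from the README, scaled to an illustrative 1000x500 image.
point = to_pixel_coordinates(
    [0.3981265723705292, 0.13768285512924194], width=1000, height=500
)
print(point)  # (398, 69)
```

With a PIL image, the real dimensions would come from `image.size`, which returns `(width, height)`.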