pankti0919 commited on
Commit
c41fe9c
·
verified ·
1 Parent(s): 981337a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -12,3 +12,19 @@ short_description: Built a vision-language application
12
  ---
13
 
14
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  ---
13
 
14
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
15
+
16
+
17
+ # UI Screen Describer with Pix2Struct
18
+
19
+ This demo uses Google's `pix2struct-screen2words-large` model to turn UI screenshots into natural language descriptions.
20
+
21
+ ### Use Cases
22
+ - Accessibility
23
+ - UI testing
24
+ - Auto documentation
25
+
26
+ ### How it works
27
+ Upload any screenshot (e.g., app, webpage, dashboard) and the model will describe it in text.
28
+
29
+ Built using Hugging Face Transformers + Gradio.
30
+