Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
prasadt2 's Collections
RAG
Memory
Generative UI
Voice agents
Screen agents
Reasoning
LAMs
Agents
Trained models
Datasets

Screen agents

updated Jan 22
Upvote
-

  • MobA: A Two-Level Agent System for Efficient Mobile Task Automation

    Paper • 2410.13757 • Published Oct 17, 2024 • 33

  • Agent S: An Open Agentic Framework that Uses Computers Like a Human

    Paper • 2410.08164 • Published Oct 10, 2024 • 25

  • WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

    Paper • 2408.15978 • Published Aug 28, 2024

  • Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents

    Paper • 2409.17140 • Published Sep 25, 2024

  • AssistantX: An LLM-Powered Proactive Assistant in Collaborative Human-Populated Environment

    Paper • 2409.17655 • Published Sep 26, 2024

  • Harnessing Webpage UIs for Text-Rich Visual Understanding

    Paper • 2410.13824 • Published Oct 17, 2024 • 32

  • Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

    Paper • 2410.13232 • Published Oct 17, 2024 • 45

  • OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

    Paper • 2410.23218 • Published Oct 30, 2024 • 51

  • UI-TARS: Pioneering Automated GUI Interaction with Native Agents

    Paper • 2501.12326 • Published Jan 21 • 62
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs