A team at Hugging Face has released a freely available, cloud-hosted computer-using AI “agent.” But be forewarned: it’s quite sluggish and occasionally makes mistakes.
Hugging Face’s agent, called Open Computer Agent, is accessible via the web and can use a Linux virtual machine preloaded with several applications, including Firefox. Similar to OpenAI’s Operator, you can prompt Open Computer Agent to complete a task — say, “Use Google Maps to find the Hugging Face HQ in Paris” — and sit back as the agent opens the necessary programs and figures out the required steps.
Open Computer Agent can handle simple requests well enough. But more complicated ones, like searching for flights, tripped it up in TechCrunch’s testing. Open Computer Agent also often runs into CAPTCHA tests that it’s unable to solve.
You’ll also have to wait in a virtual queue to use Open Computer Agent — a queue seconds to minutes long, depending on demand.
We’re launching Computer Use in smolagents! 🥳
-> As vision models become more capable, they become able to power complex agentic workflows. Especially Qwen-VL models, that support built-in grounding, i.e. ability to locate any element in an image by its coordinates, thus to… pic.twitter.com/mI8MuWZkIS
— m_ric (@AymericRoucher) May 6, 2025
Of course, the Hugging Face team’s goal wasn’t to build a state-of-the-art computer-using agent. Rather, they wanted to demonstrate that open AI models are becoming more capable — and cheaper to run on cloud infrastructure.
“As vision models become more capable, they become able to power complex agentic workflows,” Ameryic Roucher, a member of the agents team at Hugging Face, wrote in a post on X. “[Some of these models] support built-in grounding, i.e. [the] ability to locate any element in an image by its coordinates, [and] thus [can] click any item [in a virtual machine].”
While it’s far from perfect, agentic technology is attracting increasing investment as enterprises look to adopt it to boost productivity. According to a recent KPMG survey, 65% of companies are experimenting with AI agents. Markets and Markets projects that the AI agent segment will grow from $7.84 billion in 2025 to $52.62 billion by 2030.
Techcrunch event
Berkeley, CA
|
June 5
A team at Hugging Face has released a freely available, cloud-hosted computer-using AI “agent.” But be forewarned: it’s quite sluggish and occasionally makes mistakes.
Hugging Face’s agent, called Open Computer Agent, is accessible via the web and can use a Linux virtual machine preloaded with several applications, including Firefox. Similar to OpenAI’s Operator, you can prompt Open Computer Agent to complete a task — say, “Use Google Maps to find the Hugging Face HQ in Paris” — and sit back as the agent opens the necessary programs and figures out the required steps.
Open Computer Agent can handle simple requests well enough. But more complicated ones, like searching for flights, tripped it up in TechCrunch’s testing. Open Computer Agent also often runs into CAPTCHA tests that it’s unable to solve.
You’ll also have to wait in a virtual queue to use Open Computer Agent — a queue seconds to minutes long, depending on demand.
We’re launching Computer Use in smolagents! 🥳
-> As vision models become more capable, they become able to power complex agentic workflows. Especially Qwen-VL models, that support built-in grounding, i.e. ability to locate any element in an image by its coordinates, thus to… pic.twitter.com/mI8MuWZkIS
— m_ric (@AymericRoucher) May 6, 2025
Of course, the Hugging Face team’s goal wasn’t to build a state-of-the-art computer-using agent. Rather, they wanted to demonstrate that open AI models are becoming more capable — and cheaper to run on cloud infrastructure.
“As vision models become more capable, they become able to power complex agentic workflows,” Ameryic Roucher, a member of the agents team at Hugging Face, wrote in a post on X. “[Some of these models] support built-in grounding, i.e. [the] ability to locate any element in an image by its coordinates, [and] thus [can] click any item [in a virtual machine].”
While it’s far from perfect, agentic technology is attracting increasing investment as enterprises look to adopt it to boost productivity. According to a recent KPMG survey, 65% of companies are experimenting with AI agents. Markets and Markets projects that the AI agent segment will grow from $7.84 billion in 2025 to $52.62 billion by 2030.
Techcrunch event
Berkeley, CA
|
June 5
It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for ‘lorem ipsum’ will uncover many web sites still in their infancy.
It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for ‘lorem ipsum’ will uncover many web sites still in their infancy.
The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making
The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using ‘Content here, content here’, making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for ‘lorem ipsum’ will uncover many web sites still in their infancy.
It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution
Copyright BlazeThemes. 2023