They made using PowerShell effortless.
A benchmark for evaluating LLM agents on real, self-hosted SaaS applications. Each task asks the agent to drive a browser through a multi-step business workflow (project management, accounting, HR, ...