๐ Amazon has officially introduced Nova Act, a groundbreaking general-purpose AI agent designed to take control of web browsers and execute simple tasks autonomously. This innovative technology is accompanied by the Nova Act SDK, a comprehensive toolkit enabling developers to craft their own agent prototypes using Nova Act’s capabilities.
Developed by Amazon’s AGI lab in San Francisco, Nova Act is set to enhance the upcoming Alexa+ upgrade, integrating generative AI to revolutionize Amazon’s voice assistant. Currently available as a research preview, developers can explore the Nova Act toolkit at nova.amazon.com, a platform also highlighting Amazon’s Nova foundation models.
๐ก How to Get Started with Nova Act SDK:
- Visit nova.amazon.com to access the Nova Act toolkit.
- Utilize the SDK to automate basic user actions, such as ordering food or booking reservations.
- Develop tools for AI agents to navigate web pages, complete forms, and select calendar dates.
Amazon asserts that Nova Act surpasses competitors like OpenAI’s Operator and Anthropic’s Computer Use in internal evaluations, showcasing superior interaction with on-screen text. Despite not being benchmarked against common agent evaluations like WebVoyager, Nova Act’s performance signals Amazon’s ambitious entry into the AI agent space.
๐ Emerging from Amazon’s AGI lab under the guidance of ex-OpenAI researchers, Nova Act represents a pivotal step towards superintelligent AI systems. With the potential to redefine reliability and autonomy in AI agents, Amazon’s foray into this technology could set new standards for the industry.