Operating a Windows program. Automating invoice reconciliation. Booking a flight and hotel.
These are just a few tasks that a new class of large language models (LLMs) could enable for AI agents. Researchers are calling this next phase of LLMs “large action models,” or LAMs.
To date, LLMs have been stateless — unable to act, adapt or interact with tools on their own. But now, LAMs are set to let agents perform increasingly sophisticated actions and even navigate graphical user interfaces (GUIs).
“LAMs are a critical inflection point…