dl4ir-webnav
-
WebNav is a benchmark task for evaluating an agent with abilities to understand natural language and plan on partially observed environments.
In this challenging task, an agent navigates through a web site consisting of web pages and hyperlinks to find a web page in which a query appears.