Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper about 7 hours ago
EvoBrowseComp: Benchmarking Search Agents on Evolving Knowledge upvoted a paper about 7 hours ago
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces upvoted a paper about 7 hours ago
FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents