SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation
Similar articles
arXiv – cs.AI
•
Aligning LLM agents with human learning and adjustment behavior: a dual agent approach
arXiv – cs.AI
•
CATArena: New Benchmark Tool for Learning LLM Agents
arXiv – cs.AI
•
APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training
arXiv – cs.AI
•
AI Simulates Legal Systems: LLM Agents Replicate Crime Patterns
arXiv – cs.AI
•
OutboundEval: A Dual-Dimensional Benchmark for Expert-Level Intelligent Outbound Evaluation of Xbench's Professional-Aligned Series
arXiv – cs.AI
•
EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law