neutrino12
's Collections
Agent
updated
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
•
2508.03680
•
Published
•
122
Training Long-Context, Multi-Turn Software Engineering Agents with
Reinforcement Learning
Paper
•
2508.03501
•
Published
•
59
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from
Experience
Paper
•
2508.04700
•
Published
•
52
RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong
Learning in Physical Embodied Systems
Paper
•
2508.01415
•
Published
•
7
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
•
2508.06471
•
Published
•
195
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
•
2508.07407
•
Published
•
98
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with
Long-Term Memory
Paper
•
2508.09736
•
Published
•
57
Memp: Exploring Agent Procedural Memory
Paper
•
2508.06433
•
Published
•
35
OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks
Paper
•
2508.05614
•
Published
•
20
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent
Foundation Models Training
Paper
•
2508.00414
•
Published
•
93
Tool-integrated Reinforcement Learning for Repo Deep Search
Paper
•
2508.03012
•
Published
•
20
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper
•
2507.23348
•
Published
•
11
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
•
2508.21365
•
Published
•
29
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex
Dynamic Environment? A Study on τ-bench
Paper
•
2508.20931
•
Published
•
15
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
•
2508.16153
•
Published
•
160
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance
for Text-to-Image Generation
Paper
•
2508.18032
•
Published
•
42
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
•
2508.20404
•
Published
•
38
Understanding Tool-Integrated Reasoning
Paper
•
2508.19201
•
Published
•
32
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper
•
2509.01055
•
Published
•
76
EPO: Entropy-regularized Policy Optimization for LLM Agents
Reinforcement Learning
Paper
•
2509.22576
•
Published
•
134