With OpenAI's latest updates to its Responses API — the application programming interface that allows developers on OpenAI's platform to access multiple agentic tools like web search and file search ...
We propose TraceRL, a trajectory-aware reinforcement learning method for diffusion language models, which demonstrates the best performance among RL approaches for DLMs. We also introduce a ...
. ├── TS-Bench/ # Benchmark datasets for guardrail model evaluation ├── benchmark/ # Evaluation benchmark of agent safety&security ├── scripts/ # Shell scripts for training/inference ├── src/ # Source ...