Learning to Code Mods for Java Minecraft

Pioneering Perception Policy with Reinforcement Learning

We present Perception-R1, a scalable RL framework using Group Relative Policy Optimization (GRPO) during MLLM post-training. Key innovations: 🎯 Perceptual Perplexity Analysis: We introduce a novel ...

IEEE

A Source Code Vulnerability Detection Method Based on Positive-Unlabeled Learning

Abstract: As the scale of modern software continues to expand, the risk of software being attacked also increases. Software vulnerabilities are the primary cause of these risks. Traditional detection ...
