Alex Knowledge Base
Search
Search
Dark mode
Light mode
Explorer
Tag: weight-updates
1 item with this tag.
May 15, 2026
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
llm-post-training
supervised-fine-tuning
reinforcement-learning
reward-rectification
weight-updates