Alex Knowledge Base

Tag: weight-updates

1 item with this tag.

  • May 15, 2026

    On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • reward-rectification
    • weight-updates

Created with Quartz v4.5.2 © 2026