Alex Knowledge Base

Tag: llm-post-training

2 items with this tag.

  • May 15, 2026

    On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • reward-rectification
    • weight-updates
  • May 15, 2026

    LLM Post-Training

    • llm-post-training
    • supervised-fine-tuning
    • reinforcement-learning
    • evolution-strategies
