Universal Reasoning Model

Source

Core Claim

URM analyzes Universal Transformer variants for ARC-AGI and Sudoku, then adds short convolution and truncated backpropagation to improve small-model recursive reasoning.

Relevance To This Wiki

It is a direct modern descendant of UT for puzzle-like recursive reasoning, useful for separating recurrent inductive bias from elaborate architecture choices.

Limitations

The tasks are puzzle/reasoning tasks, not time-series state tracking. Reported pass@1 comparisons have a restricted setting that should be preserved when cited.

Foundation TSFM Relevance

Adjacent to latent-state refinement and dynamic compute; not direct TSFM evidence.

Open Questions

  • What matched-budget baseline should this source be compared against: unique-depth Transformer layers, recurrent state, explicit memory, or extra inference steps?
  • Which claims transfer from token-sequence reasoning to multivariate time-series state tracking, event streams, or action-conditioned world models?
  • Which gains come from UT-style recurrence, ConvSwiGLU nonlinearity, truncated backpropagation, or the ARC/Sudoku data protocol?