Tuna-2

Summary

Tuna-2 is a pixel-space unified multimodal model that discards pretrained vision encoder modules.

Role In The Wiki

Tuna-2 is the strongest current counterpoint to semantic-encoder-first multimodal design in the corpus.

Evidence

Tuna-2

Relation To Foundation TSFM Agenda

Use the source-level agenda mapping in tuna-2-2026 rather than duplicating verdict rows here.

At the entity level, this page should stay as the object card: it records Tuna-2's role as the strongest current counterpoint to semantic-encoder-first multimodal design, while source pages carry slot-level verdicts, evidence, and missing pieces.

Related Pages

Foundation Time-Series Model Research Agenda
Unified Multimodal Models
Vision Foundation Models

Alex Open Research Wiki
