Holo2-235B-A22B Preview achieves state-of-the-art GUI grounding for UI localization (Screenspot-Pro, OSWorld G)
AI Impact Summary
H Company's Holo2-235B-A22B Preview is a 235B-parameter UI localization model that achieves state-of-the-art scores on GUI grounding benchmarks: 78.5% on Screenspot-Pro (3 steps) and 79.0% on OSWorld G. It uses agentic localization to iteratively refine predictions, delivering 10-20% relative gains across model sizes. Trained via SkyPilot across multiple cloud providers and deployed with Kubernetes, it demonstrates scalable experimentation for researchers. The model is available as a research release on Hugging Face, signaling accessible evaluation but requiring diligence on production readiness and licensing before operational use.
Affected Systems
- Date
- Date not specified
- Change type
- capability
- Severity
- info