Project Weasel

Project Weasel is a research initiative aimed at making Vision-Language-Action (VLA) models genuinely useful in the real world — focused on agents that take action. The main project, the W series, identifies and tackles the core problems VLAs face today, working toward agents with real autonomy. Sub-projects explore important VLA-adjacent topics.

Github / Maintainer: Kang Minkyu

Weasel logo

Main project

W0: An Open VLA for Long-Horizon Tasks W0: An Open VLA for Long-Horizon Tasks

First entry in the W series, confronting the core problems VLAs face today and building toward agents with real autonomy.

Sub-projects

Mean Collapse is a Serious Bottleneck of VLA Mean Collapse is a Serious Bottleneck of VLA

We test the mean-collapse case on Ant-Maze. For a VLM-based VLA acting on the latest observation only, mean collapse turns out to be a major problem on POMDP inputs — and a small memory-based steering module on the frozen VLA solves it, raising the success rate above the MDP baseline.

MineMate: Multi-Agent Emergence with Humans MineMate: Multi-Agent Emergence with Humans

Studying how cooperative behaviour emerges when humans are part of a multi-agent system.