Project Weasel

Project Weasel is a research initiative aimed at making Vision-Language-Action (VLA) models genuinely useful in the real world, with a focus on agents that take action. The main project, the W series, identifies and tackles the core problems VLAs face today, working toward agents with real autonomy. Sub-projects explore important VLA-adjacent topics.

GitHub / Maintainer: Kang Minkyu

Main project

W0: An Open VLA for Long-Horizon Tasks
VLA · Long-Horizon · Autonomy

First entry in the W series, confronting the core problems VLAs face today and building toward agents with real autonomy.

Sub-projects

Mean Collapse is a Serious Bottleneck of VLA
VLA · POMDP · Memory · Steering

We test the mean-collapse case on Ant-Maze. For a VLM-based VLA acting on the latest observation only, mean collapse turns out to be a major problem on POMDP inputs. A small memory-based steering module on the frozen VLA solves it, raising the success rate above the MDP baseline.

MineMate: Multi-Agent Emergence with Humans
Multi-Agent · Human-in-the-loop

Studying how cooperative behaviour emerges when humans are part of a multi-agent system.
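
As a rough illustration of the mean-collapse effect named in the first sub-project above, here is a toy sketch in plain Python. It is not the project's Ant-Maze setup and not its steering module. The junction scenario, the cue variable, and every function name are assumptions made up for this example. An expert turns left (-1) or right (+1) depending on a cue seen earlier, while the latest observation is identical in both cases. A policy fit by least squares on the latest observation alone can only output the average of the two turns, whereas one that also conditions on the remembered cue recovers the expert action.

```python
import random
from statistics import mean


def collect_demos(n, seed=0):
    """Hypothetical expert demos at a junction: (cue, latest_obs, action)."""
    rng = random.Random(seed)
    demos = []
    for _ in range(n):
        cue = rng.choice([-1.0, 1.0])  # seen only at an earlier time step
        obs = "junction"               # latest observation, same for both cues
        action = cue                   # expert turns toward the remembered cue
        demos.append((cue, obs, action))
    return demos


def fit_memoryless(demos):
    """Least-squares fit on the latest observation: the mean action per observation."""
    by_obs = {}
    for _, obs, action in demos:
        by_obs.setdefault(obs, []).append(action)
    return {obs: mean(actions) for obs, actions in by_obs.items()}


def fit_with_memory(demos):
    """Same fit, but keyed on (remembered cue, latest observation)."""
    by_key = {}
    for cue, obs, action in demos:
        by_key.setdefault((cue, obs), []).append(action)
    return {key: mean(actions) for key, actions in by_key.items()}


def success_rate(policy, demos, tol=0.5):
    """Fraction of demos where the predicted action is within tol of the expert's."""
    hits = sum(abs(policy(cue, obs) - action) < tol for cue, obs, action in demos)
    return hits / len(demos)


demos = collect_demos(1000)
memoryless = fit_memoryless(demos)
with_memory = fit_with_memory(demos)

sr_memoryless = success_rate(lambda cue, obs: memoryless[obs], demos)
sr_memory = success_rate(lambda cue, obs: with_memory[(cue, obs)], demos)

print("memoryless action at junction:", memoryless["junction"])
print("success without memory:", sr_memoryless)
print("success with memory:", sr_memory)
```

The memoryless prediction lands near zero, which matches neither expert turn, while the memory-conditioned table separates the two cases. A learned steering module on a frozen VLA would be far richer than this lookup table. The sketch only shows why conditioning on history removes the averaging.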