News

To build the Lego dataset, the team fed images rendered from 24 different viewpoints into GPT-4o and let that model write captions for each Lego structure, asking it to focus on geometric features ...