object recognition; monocular depth estimation; GPT-V and related approaches; incorporating non-linguistic sensing in GPT; chain-of-thought reasoning
object recognition; monocular depth estimation; GPT-V and related approaches; incorporating non-linguistic sensing in GPT; chain-of-thought reasoning