Boyang Yan

Home

❯

posts

❯

Multimodal AI

Multimodal AI

Apr 15, 20251 min read

object recognition; monocular depth estimation; GPT-V and related approaches; incorporating non-linguistic sensing in GPT; chain-of-thought reasoning

video

Reference List

  1. https://www.aimesoft.com/multimodalai.html

Graph View

Backlinks

  • ollama

Created with Quartz v4.5.0 © 2025