Visual Object Search

May 2, 2025

A CLIP-aided dynamic database that stores live information about the world and retrieves it through question answering with a Vision Language Model.

The system keeps track of objects in dynamic environments. A swarm of robots (without GPS) patrols the environment and regularly sends its observations to a central server.
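To make the idea of a dynamic, embedding-indexed store concrete, here is a minimal sketch of what the central server could keep in memory. The post does not describe the actual implementation, so everything here is an assumption: the `Observation` and `ObservationStore` names, the `max_age` expiry rule, and the three-dimensional toy vectors that stand in for real CLIP embeddings. In a full system the top matches would presumably be passed on to the Vision Language Model to answer the question.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class Observation:
    robot_id: str           # which robot reported this (hypothetical field)
    timestamp: float        # server-side receive time, in seconds
    label: str              # free-text note, used here only for display
    embedding: List[float]  # toy stand-in for a CLIP image embedding


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class ObservationStore:
    max_age: float = 60.0  # assumed expiry window, in seconds
    items: List[Observation] = field(default_factory=list)

    def add(self, obs: Observation) -> None:
        self.items.append(obs)

    def prune(self, now: float) -> None:
        # Drop stale observations so the store reflects the current world.
        self.items = [o for o in self.items if now - o.timestamp <= self.max_age]

    def search(self, query: List[float], now: float, top_k: int = 3) -> List[Observation]:
        # Rank the remaining observations by similarity to the query embedding.
        self.prune(now)
        ranked = sorted(self.items, key=lambda o: cosine(query, o.embedding), reverse=True)
        return ranked[:top_k]


store = ObservationStore(max_age=60.0)
store.add(Observation("robot_a", 0.0, "red mug on desk", [0.9, 0.1, 0.0]))
store.add(Observation("robot_b", 50.0, "red mug on shelf", [0.8, 0.2, 0.1]))
store.add(Observation("robot_a", 55.0, "blue chair", [0.0, 0.1, 0.9]))

# A query embedding close to the "red mug" direction, asked at t = 70 s.
hits = store.search([1.0, 0.0, 0.0], now=70.0, top_k=1)
print(hits[0].label)
```

Because the oldest sighting has aged out by the time of the query, the search returns the more recent shelf observation rather than the desk one, which is the behaviour a store for a changing environment needs.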

