Visual Object Search
May 2, 2025
A CLIP-aided dynamic database to store live information about the world and retrieve it using a Vision Language Model via question answering.
Keeps track of objects in dynamic environments. A swarm of robots (without GPS) patrol around in the environment and regularly send their observations to a central server.