4,629 2 weeks ago

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

vision 1b

13 models