We tried out Google’s new family of multi-modal models with variants compact enough to work on local devices. They work well.
Abstract: Recently, 3D scene representation based on Gaussian primitives are gaining a lot of research interest due to its usability for real-time photorealistic novel view synthesis via 3D Gaussian ...