BVQA is a Python command-line tool that lets you ask a vision language model (VLM) a series of predefined questions about a collection of images. It saves the answers in JSON files (one file per ...
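The workflow described above (a fixed set of questions, asked of every image, with answers serialized as JSON) can be sketched roughly as follows. This is a minimal illustration and not BVQA's actual code or interface: the names `QUESTIONS`, `answer_questions`, and `stub_model` are hypothetical, the stub stands in for a real VLM call, and the sketch only builds a JSON string in memory because the source does not say how BVQA splits its output into files.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Iterable

# Hypothetical question set; BVQA's real questions and their format
# are not shown in the source.
QUESTIONS: Dict[str, str] = {
    "caption": "Describe this image in one sentence.",
    "people": "How many people are visible?",
}


def answer_questions(
    image_paths: Iterable[Path],
    questions: Dict[str, str],
    ask_model: Callable[[Path, str], str],
) -> Dict[str, Dict[str, str]]:
    """Ask every question about every image and collect the answers.

    Returns a mapping: image path -> question key -> answer text.
    """
    answers: Dict[str, Dict[str, str]] = {}
    for path in image_paths:
        answers[str(path)] = {
            key: ask_model(path, text) for key, text in questions.items()
        }
    return answers


def stub_model(image_path: Path, question: str) -> str:
    """Placeholder for a VLM call (assumption: no real model is used here)."""
    return f"(no model) {question}"


if __name__ == "__main__":
    images = [Path("a.jpg"), Path("b.jpg")]  # hypothetical file names
    result = answer_questions(images, QUESTIONS, stub_model)
    # Serialize the collected answers as JSON text.
    print(json.dumps(result, indent=2))
```

Passing the model in as a callable keeps the question loop independent of any particular VLM, so a real model wrapper could replace `stub_model` without changing the rest of the sketch.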
Please read the first article, then proceed to the second and this repository to understand the reasoning behind the current structure.