A Comprehensive Analysis of Image Captioning Models: Evaluating ViT-GPT2, BLIP, and GIT
Benchmarking Vision-Language Models for Automated Image Description Using Quantitative and Qualitative Metrics
computer vision
dataset creation
notebooks