Comparative Analysis · Biraj Koirala

A Comprehensive Analysis of Image Captioning Models - Evaluating ViT-GPT2, BLIP, and GIT

2024 · 12 · 10 · 4 min read

Benchmarking Vision-Language Models for Automated Image Description Using Quantitative and Qualitative Metrics

computer vision · dataset creation · notebooks