Technical University of Munich

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining

by U Sahin, H Li, Q Khan, D Cremers and T Volker

Reference:

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (U Sahin, H Li, Q Khan, D Cremers and T Volker), In IEEE Winter Conference on Applications of Computer Vision (WACV), 2024. ([arXiv][project page][code])

Bibtex Entry:

@inproceedings{compreason2024,
 title = {Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining},
 booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV)},
 author = {U Sahin and H Li and Q Khan and D Cremers and T Volker},
 year = {2024},
 keywords = {neural networks, deep learning, Large Language Models},
}

Computer Vision & Artificial Intelligence
TUM School of Computation, Information and Technology
Technical University of Munich