Direkt zum Inhalt springen
Computer Vision & Artificial Intelligence
TUM School of Computation, Information and Technology
Technical University of Munich

Technical University of Munich

Menu

Links

Informatik IX
Chair of Computer Vision & Artificial Intelligence

Boltzmannstrasse 3
85748 Garching info@vision.in.tum.de

Follow us on:
CVG Group DVL Group SRL Group


Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (bibtex)
Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (bibtex)
by U Sahin, H Li, Q Khan, D Cremers and T Volker
Reference:
Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (U Sahin, H Li, Q Khan, D Cremers and T Volker), In IEEE Winter Conference on Applications of Computer Vision (WACV, 2024. ([arXiv][project page][code])
Bibtex Entry:
@inproceedings{compreason2024,
 title = {Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining},
 booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV},
 author = {U Sahin and H Li and Q Khan and D Cremers and T Volker},
 year = {2024},
 keywords = {neural networks, deep learning, Large Language Models},
}
Powered by bibtexbrowser
Go Back

Rechte Seite

Informatik IX
Chair of Computer Vision & Artificial Intelligence

Boltzmannstrasse 3
85748 Garching info@vision.in.tum.de

Follow us on:
CVG Group DVL Group SRL Group