Posts by Collection
publications
Consistency-Diversity-Realism Pareto Fronts of Conditional Image Generative Models
Pietro Astolfi, Marlene Careil, Melissa Hall, Oscar Mañas, Matthew Muckley, Jakob Verbeek, Adriana Romero-Soriano, Michal Drozdzal
arXiv 2024 | paper

An Introduction to Vision-Language Modeling
Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, et al.
arXiv 2024 | paper

EvalGIM: A Library for Evaluating Generative Image Models
Melissa Hall, Oscar Mañas, Reyhane Askari Hemmat, Mark Ibrahim, Candace Ross, Pietro Astolfi, Tariq Berrada Ifriqi, Marton Havasi, Yohann Benchetrit, Karen Ullrich, Carolina Braga, Abhishek Charnalia, Maeve Ryan, Mike Rabbat, Michal Drozdzal, Jakob Verbeek, Adriana Romero-Soriano
arXiv 2024 | paper | code

SneakPeek: Future-Guided Instructional Streaming Video Generation
Cheeun Hong, German Barquero, Fadime Sener, Markos Georgopoulos, Edgar Schönfeld, Stefan Popov, Yuming Du, Oscar Mañas, Albert Pumarola
arXiv 2025 | paper

LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
Benno Krojer, Shravan Nayak, Oscar Mañas, Vaibhav Adlakha, Desmond Elliott, Siva Reddy, Marius Mosbach
arXiv 2026 | paper

A Weakly Supervised Consistency-based Learning Method for COVID-19 Segmentation in CT Images
Issam Laradji, Pau Rodríguez, Oscar Mañas, Keegan Lensink, Marco Law, Lironne Kurzman, William Parker, David Vázquez, Derek Nowrouzezahrai
WACV 2021 | paper | code

Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data
Oscar Mañas, Alexandre Lacoste, Xavier Giró-i-Nieto, David Vázquez, Pau Rodríguez
ICCV 2021 | first author | paper

Improving Automatic VQA Evaluation Using Large Language Models
Oscar Mañas, Benno Krojer, Aishwarya Agrawal
AAAI 2024 | first author | paper

Improving Text-to-Image Consistency via Automatic Prompt Optimization
Oscar Mañas, Pietro Astolfi, Melissa Hall, Candace Ross, Jack Urbanek, Adina Williams, Aishwarya Agrawal, Adriana Romero-Soriano, Michal Drozdzal
TMLR 2024 | first author | paper

Controlling Multimodal LLMs via Reward-guided Decoding
Oscar Mañas, Pierluca D'Oro, Koustuv Sinha, Adriana Romero-Soriano, Michal Drozdzal, Aishwarya Agrawal
ICCV 2025 | first author | paper