About me

Hi, my name is Oscar Mañas.

I am a PhD candidate at Mila and Université de Montréal, advised by Prof. Aishwarya Agrawal. I am also a visiting researcher at Meta AI, advised by Dr. Michal Drozdzal and Prof. Adriana Romero.

My research interests lie at the intersection of computer vision and natural language processing. I believe that, like humans (and other animals), AI systems should have a holistic understanding of the world around them. This means working with multiple sensory modalities, among which vision and language arise as particularly interesting. On one hand, they are complementary: vision is a low-level perceptual modality, while language is an abstract human construct. On the other hand, they are believed to be two essential modalities for solving AI-complete problems. In particular, I focus on multimodal vision-language generative models, i.e. models capable of generating images and/or text conditioned on multimodal inputs.

Previously, I was a research intern at Element AI in Montreal, advised by Dr. Pau Rodríguez and Dr. David Vázquez. I obtained a M.Sc. in Computer Vision from Universitat Autònoma de Barcelona, and I carried out my master’s thesis at the Image Processing Group advised by Prof. Xavier Giró. Before, I obtained a B.Sc. in Computer Science from Universitat Politècnica de Catalunya, and I carried out my bachelor’s thesis at the Architectures and Compilers Group advised by Prof. Antonio Gonzalez and Dr. Jose-Maria Arnau.