I am a Research Scientist at Google Research in Tel-Aviv, where I work on vision-and-language. I completed my PhD at The Hebrew University of Jerusalem, Israel, where I had the privilege of being advised by Dr. Roy Schwartz and Dr. Gabriel Stanovsky.
The goal of my research is to improve vision-and-language generalization. Specifically, I aim to develop models that are more compositional, less biased, and perform better on real-world examples. My recent work and areas of interest include image-text alignment, improving text-to-image models, and visual instruction tuning. See my publications for more details. A recording of my PhD talk, "Bridging Vision and Language with Data: From Perception to Understanding", is available here.
I did my MSc with Prof. Michael Elhadad and Prof. Eitan Bachmat at Ben Gurion University.
Download my complete CV.
PhD in Computer Science (Vision-and-Language), 2020-2023
The Hebrew University of Jerusalem, Israel
MSc in Computer Science (Natural Language Processing), Magna cum laude, 2018-2019
Ben Gurion University of the Negev, Israel
BSc in Computer Science, 2015-2018
Ben Gurion University of the Negev, Israel
I've had the opportunity to collaborate with several MSc and new PhD students towards their publication goals:
Netta Madvil - Read, Look or Listen? Multimodal models analysis
Nitzan Bitton-Guetta - WHOOPS! - Commonsense-defying images with text-to-image models
Oren Sultan - Analogies research project (in progress)
If you want to work together on vision-and-language research, feel free to shoot me an email.