Yonatan Bitton is Senior Research Scientist at Google Research, where he focuses on advancing multimodal consistency and improving large vision-and-language models. His work includes developing feedback mechanisms for text-to-image and text-to-video generation, with a strong emphasis on enhancing alignment and factual accuracy between textual and visual outputs. Yonatan completed his PhD in Computer Science at The Hebrew University of Jerusalem, where his research bridged vision and language through innovative datasets and models.