About me
Hi, I’m David! I’m a research scientist at Apple. I like training general-purpose multimodal models using simple yet scalable methods :)
Recent work
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities
Roman Bachmann*, Oğuzhan Fatih Kar*, David Mizrahi*, Ali Garjani, Mingfei Gao, David Griffiths, Jiaming Hu, Afshin Dehghan, Amir Zamir
NeurIPS, 2024 • Project Page • Paper • Code
4M: Massively Multimodal Masked Modeling
David Mizrahi*, Roman Bachmann*, Oğuzhan Fatih Kar, Teresa Yeo, Mingfei Gao, Afshin Dehghan, Amir Zamir
NeurIPS, 2023 [Spotlight] • Project Page • Paper • OpenReview • Code
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann*, David Mizrahi*, Andrei Atanov, Amir Zamir
ECCV, 2022 • Project Page • Paper • Code
Composite Relationship Fields with Transformers for Scene Graph Generation
George Adaimi, David Mizrahi, Alexandre Alahi
WACV, 2023 • Paper • Code
[Re] Can gradient clipping mitigate label noise?
David Mizrahi, Oğuz Kaan Yüksel, Aiday Marlen Kyzy
ReScience C, 2021 • Paper • OpenReview • Code
* Equal Contribution