About the job
Your Impact at Lila
As an ML Research Scientist specializing in Multimodal Data Extraction, you will play a pivotal role in advancing Lila's mission of achieving scientific superintelligence. Your work will focus on the development of foundational models capable of autonomously reading, interpreting, and organizing scientific knowledge from diverse formats such as text, images, and experimental data in the physical sciences. Your research will contribute to the unification of global scientific data into a machine-readable format, enhancing reasoning, prediction, and autonomous discovery within materials science and chemistry.
What You Will Be Building
- Innovate and create AI systems that effectively extract and organize knowledge from a variety of scientific resources.
- Design and optimize large language models, multimodal models, and specialized architectures for accurate and interpretable data extraction.
- Develop scalable solutions for managing unstructured and heterogeneous scientific data, integrating various formats including text, tables, and visuals.
- Collaborate with subject matter experts to ensure that the extracted data aligns with real-world research workflows.
- Publish impactful research that propels the field of multimodal understanding and AI-driven knowledge extraction forward.

