Multimodal AI Engineer specializing in Document Understanding

LlamaIndex

Full-time|On-site|San Francisco

Experience Level

Mid to Senior

Qualifications

Responsibilities:

Develop, train, and optimize machine learning models focused on document structure understanding, table extraction, layout analysis, and multimodal content processing. Create efficient data pipelines, evaluation frameworks, and experimentation infrastructures. Design and implement production-level ML systems capable of processing complex, real-world documents at scale. Stay updated on the latest advancements in vision-language models, document AI, and multimodal learning. Collaborate with engineering teams to integrate ML innovations into production APIs. Contribute to our open-source frameworks and enterprise solutions. Make informed technical decisions while balancing research exploration with product delivery.

Required Qualifications:

3-7 years of experience in machine learning engineering or applied research. Strong software engineering fundamentals with experience in production Python (familiarity with tools like uv, ruff, mypy, Pydantic). Hands-on experience in training, fine-tuning, or deploying ML models in production environments. In-depth understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning. Experience with at least one ML framework such as TensorFlow or PyTorch.

About the job

Join our innovative team and help define the future of AI, focusing on the narrative of document understanding.

About the Role:

We are on the lookout for talented AI engineers to become a part of our dedicated document understanding team. In this role, you'll be at the crossroads of computer vision, natural language processing, and production machine learning systems, driving advancements in document parsing and comprehension.

Our team powers LlamaParse, LlamaExtract, and other advanced processing solutions, handling millions of intricate documents such as PDFs, PowerPoints, Word files, and spreadsheets. Your contributions will significantly influence numerous developers who are creating RAG applications and document agents, in addition to enhancing our open-source frameworks that revolutionize industry standards in document processing.

Depending on your expertise and interests, you may concentrate on data curation, model fine-tuning, or ML infrastructure. We are hiring multiple candidates and will collaborate with you to identify the perfect role for your skills.

About LlamaIndex

At LlamaIndex, we are committed to pushing the boundaries of artificial intelligence, particularly in the realm of document understanding. Join us in our mission to innovate and redefine how AI interacts with complex documents.

Similar jobs

Documentation Engineer

Braintrust

Full-time|Remote|San Francisco

Join our innovative team at Braintrust as a Documentation Engineer. In this role, you will be responsible for creating, managing, and optimizing documentation for our cutting-edge technology solutions. You will work closely with cross-functional teams to ensure clarity and accuracy in all technical literature, enabling our users to leverage our products effectively.

Mar 4, 2026

Engineering Manager, Document Platform

Harvey

Full-time|On-site|San Francisco

Join the Revolution at Harvey

At Harvey, we are pioneering a transformation in the legal and professional services industries, moving beyond mere incremental changes to redefine the entire landscape. By leveraging cutting-edge agentic AI, a robust enterprise platform, and extensive domain expertise, we are fundamentally altering the execution of critical knowledge work for decades to come.

This is a unique opportunity to contribute to the foundation of a generational company at a pivotal moment in its evolution. With over 1,000 clients across more than 58 countries, a strong alignment between our product and market needs, and exceptional backing from esteemed investors, we are rapidly scaling and creating a new category in real time. The challenges are ambitious, expectations are high, and the potential for personal, professional, and financial growth is unparalleled.

Our team is composed of sharp, driven individuals who are deeply aligned with our mission. We operate with agility, intensity, and a sense of ownership over the challenges we face, from initial concepts to long-term results. We maintain close relationships with our clients, collaborating across all levels, from leadership to engineering, to address real issues with urgency and care. If you excel in ambiguous situations, strive for excellence, and wish to shape the future of work alongside high-achieving peers, we invite you to join us in this exciting journey.

At Harvey, we are writing the future of professional services today, and this is just the beginning.

Role Overview

Harvey is at the forefront of reimagining legal work, and we are seeking an Engineering Manager to spearhead our initiatives. The Document Platform team is central to our operations, empowering our AI systems to interpret vast quantities of legal documents. Our technology encompasses everything from OCR to vector storage, necessitating systems that are rapid, precise, and scalable to meet the demands of the world’s leading law firms.

In this role, you will lead a team of extraordinary engineers tasked with expanding our capabilities, pushing the limits of document understanding, and ensuring our infrastructure scales to meet user demands.

We emphasize urgency and simplicity, and we expect our leaders to be actively involved as necessary. If you are passionate about ownership, decisive execution, and tackling real user challenges, we would love to connect with you.

Feb 10, 2026

Platform Engineer - Document Intelligence at Hebbia

Hebbia

Full-time|On-site|New York City; San Francisco, CA

About Hebbia

Hebbia is an innovative AI platform designed specifically for investors and bankers, aimed at generating alpha and maximizing returns. Founded in 2020 by George Sivulka, and backed by prominent investors including Peter Thiel and Andreessen Horowitz, we empower investment decisions for industry leaders like BlackRock, KKR, Carlyle, and Centerview, managing over $30 trillion in global assets.

Our flagship product, Matrix, is renowned for providing exceptional accuracy, speed, and transparency in AI-driven analysis, helping finance professionals gain a competitive edge. Our technology uncovers unseen signals, reveals hidden opportunities, and facilitates rapid decision-making, thus transforming capital deployment, risk management, and value creation across markets.

Hebbia isn't just a tool; it's the competitive advantage that enhances performance and market leadership.

The Team

The Document Intelligence team at Hebbia is dedicated to developing state-of-the-art AI solutions that revolutionize how users interact with vast collections of private and public documents. Through our innovative Browse application, we enable intelligent document exploration, advanced search functionality, and profound insights extraction. Our commitment to continuous improvement means we work closely with our customers to address real-world challenges and foster impactful, data-driven decisions.

The Role

As a Platform Engineer at Hebbia, you will be at the forefront of building scalable systems that support billions of tokens across significant assets under management. Your role will involve deploying optimized systems and ensuring high-performance capabilities in our infrastructure.

Feb 11, 2026

Multimodal AI Engineer specializing in Document Understanding

LlamaIndex

Full-time|On-site|San Francisco

Join our innovative team and help define the future of AI, focusing on the narrative of document understanding.

About the Role:

We are on the lookout for talented AI engineers to become a part of our dedicated document understanding team. In this role, you'll be at the crossroads of computer vision, natural language processing, and production machine learning systems, driving advancements in document parsing and comprehension.

Our team powers LlamaParse, LlamaExtract, and other advanced processing solutions, handling millions of intricate documents such as PDFs, PowerPoints, Word files, and spreadsheets. Your contributions will significantly influence numerous developers who are creating RAG applications and document agents, in addition to enhancing our open-source frameworks that revolutionize industry standards in document processing.

Depending on your expertise and interests, you may concentrate on data curation, model fine-tuning, or ML infrastructure. We are hiring multiple candidates and will collaborate with you to identify the perfect role for your skills.

Nov 21, 2025

Technical Writer for Product, API, and SDK Documentation

360itprofessionals1

Contract|On-site|San Francisco

We are seeking a talented and detail-oriented Technical Writer for an urgent 3-month contract role. The successful candidate will be responsible for creating high-quality documentation for our Product, API, and SDK, specifically for Android and iOS platforms. If you are passionate about technical communication and possess a strong ability to convey complex information clearly and effectively, we want to hear from you!

Sep 7, 2017

Document Control Specialist

Luster National

Full-time|On-site|San Francisco, CA

Role overview

Luster National is forming a talent pool of Document Control Specialists in anticipation of future large-scale civil infrastructure projects in San Francisco, CA. These projects may involve highways, roads, bridges, transit systems (bus and rail), international airports, or water sector work. This is not an immediate vacancy. Instead, joining the pool ensures early notification when new project needs arise.

What you will do

Maintain and organize project documentation at every stage, focusing on accuracy, version control, and ensuring documents are accessible when needed.

Application process

Applications are reviewed on a rolling basis. Interviews may occur before specific roles open. Candidates selected for the pool will be contacted when suitable project opportunities become available.

Apr 20, 2026

Developer Relations Specialist (Documentation & YouTube Content)

Firecrawl

Full-time|Hybrid|San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10)

Join Firecrawl as a Developer Relations Specialist focusing on documentation and YouTube content creation. In this dynamic role, you'll engage with our developer community, providing resources and support through comprehensive documentation and engaging video content. Your expertise will help enhance the user experience and foster a vibrant developer ecosystem.

Apr 13, 2026

Solutions Engineer

Pulse

Full-time|On-site|San Francisco

Overview

At Pulse, we are revolutionizing data infrastructure by addressing the critical challenge of extracting precise, structured data from intricate documents on a large scale. Our innovative approach to document understanding melds intelligent schema mapping with finely-tuned extraction models, effectively conquering the limitations faced by traditional OCR and parsing tools.

We are a dynamic and rapidly expanding team of engineers located in San Francisco, serving Fortune 100 companies, Y Combinator startups, public investment firms, and fast-growing enterprises. Our growth is fueled by top-tier investors and a commitment to excellence.

What sets our technology apart is our sophisticated multi-stage architecture:

Expert layout comprehension with specialized component detection models
Low-latency OCR models designed for targeted data extraction
Advanced algorithms for reading-order in complex document structures
Proprietary recognition and parsing of table structures
Fine-tuned vision-language models for interpreting charts, tables, and figures

If you are enthusiastic about the convergence of computer vision, natural language processing, and data infrastructure, your contributions at Pulse will profoundly influence our clients and help shape the future of document intelligence.

What We're Seeking

Availability for in-office work 5 days a week at our San Francisco location
A strong desire to learn and adapt swiftly
Previous experience in a startup or founding role is advantageous

Role Overview

As a Solutions Engineer at Pulse, you will bridge the gap between customer deployments and our core engineering capabilities. Your role will involve designing, deploying, and managing Pulse within live production environments, frequently integrated within customer infrastructures, with a strong emphasis on reliability, performance, and precision.

This hands-on and technical position requires collaboration with customer engineering teams and engagement with Pulse’s platform, ML, and product teams to guarantee successful deployments and continuous enhancements.

Key Responsibilities

Deploy, manage, and troubleshoot Pulse services in Kubernetes-based environments
Collaborate directly with customer engineering teams to ensure optimal implementation

Dec 15, 2025

Senior Manager, Quality Document Control and Compliance

Olema Oncology

Full-time|$150K/yr - $160K/yr|Hybrid|San Francisco, California

At Olema Oncology, we are committed to pioneering innovative treatments for breast cancer and beyond. Our flagship product, palazestrant (OP-1250), stands out as a complete estrogen receptor antagonist (CERAN) currently in clinical trials for metastatic breast cancer, boasting remarkable potential both as a standalone treatment and in combination therapies for ER+/HER2- metastatic breast cancer. Our subsequent candidate, OP-3136, is a leading KAT6 inhibitor with exceptional therapeutic promise.

Our scientific advancements are propelled by a culture that empowers, inspires, and encourages one another. At Olema, we believe that prioritizing our people leads to unparalleled outcomes. If you are eager to be part of something transformative, join us in making a significant difference for our patients, your career, and the future of medicine.

You can view our latest corporate deck and presentations here.

About the Role: Senior Manager, Quality Document Control and Compliance

The Senior Manager, Quality Document Control and Compliance will collaborate closely with the Director of Quality Systems, spearheading daily operations of Veeva Vault QDocs along with the entire document management process and compliance. Responsibilities include the creation and management of document change control (DCC) documentation within Veeva QDocs, overseeing standard operating procedures (SOPs) and other GxP documentation, processing and archiving GxP records, and maintaining document control compliance KPIs while compiling and reporting metrics for management review.

This position is located at our San Francisco, CA or Cambridge, MA office, with approximately 15% travel required.

Your primary responsibilities will include:

Acting as the Veeva Vault QDocs Administrator and document control manager.
Processing document change control (DCC) for GxP controlled documents in Veeva.
Ensuring finalized documents adhere to compliance with effective templates.
Proofreading and finalizing documents for processing in Veeva QDocs.
Developing, updating, managing, and maintaining GxP controlled documents in close collaboration with functional area representatives.

Apr 9, 2026

Documentation Project Manager - Electric Vehicle Construction Management

Turner Townsend

Full-time|On-site|San Francisco

Role overview

Turner Townsend seeks a Documentation Project Manager in San Francisco to support electric vehicle construction management. This position oversees documentation for EV construction projects, helping maintain compliance and uphold high standards throughout each project phase.

What you will do

Manage and organize project documentation for electric vehicle construction initiatives
Coordinate documentation needs and updates with cross-functional teams
Ensure accuracy, completeness, and timely delivery of reports
Support compliance with both internal policies and external requirements

Impact

This role contributes directly to Turner Townsend’s construction management success in the growing electric vehicle sector. Effective documentation management helps keep projects on track and supports the company’s reputation for quality and compliance.

Apr 24, 2026

Machine Learning Engineer

Pulse

Full-time|On-site|San Francisco

Overview

Pulse is revolutionizing data infrastructure by addressing the critical challenge of extracting accurate, structured information from complex documents on a large scale. Our innovative approach to document understanding integrates intelligent schema mapping with advanced extraction models, outperforming traditional OCR and parsing methods.

As a dynamic and rapidly growing team of engineers based in San Francisco, we empower Fortune 100 companies, Y Combinator startups, public investment firms, and growth-oriented businesses. With the backing of top-tier investors, we are on an exciting growth trajectory.

What sets our technology apart is our cutting-edge multi-stage architecture:

Layout comprehension with specialized component detection models
Low-latency OCR models designed for targeted data extraction
Advanced algorithms for determining reading order in complex formats
Proprietary table structure recognition and parsing capabilities
Fine-tuned vision-language models for interpreting charts, tables, and figures

If you are passionate about the convergence of computer vision, natural language processing, and data infrastructure, your contributions at Pulse will directly influence our customers and shape the future of document intelligence.

Jul 30, 2025

Software Engineer

Pulse

Full-time|On-site|San Francisco

Overview

At Pulse, we are revolutionizing data infrastructure by addressing the critical challenge of extracting precise, structured information from intricate documents at scale. Our innovative approach to document understanding integrates cutting-edge schema mapping with advanced extraction models that outshine traditional OCR and parsing tools.

Based in the heart of San Francisco, our dynamic and rapidly expanding team of engineers collaborates with Fortune 100 companies, YC startups, public investment firms, and growth-stage enterprises. With the backing of top-tier investors, we are poised for significant growth.

What sets our technology apart is our sophisticated multi-stage architecture:

Specialized component detection models for layout understanding
Low-latency OCR models designed for targeted extraction
Advanced reading-order algorithms to navigate complex document structures
Proprietary recognition and parsing capabilities for table structures
Fine-tuned vision-language models for interpreting charts, tables, and figures

If you are passionate about the convergence of computer vision, NLP, and data infrastructure, your contributions at Pulse will make a direct impact on our clients and the future of document intelligence.

Jul 30, 2025

Forward Deployed Engineer

Pulse

Full-time|On-site|San Francisco

Overview

Pulse is at the forefront of addressing a significant challenge in data infrastructure: extracting precise, structured information from intricate documents at scale. Our innovative approach to document understanding merges intelligent schema mapping with advanced extraction models, where traditional OCR and parsing tools often fall short.

Based in San Francisco, we are a dynamic, rapidly expanding team of engineers dedicated to empowering Fortune 100 companies, YC startups, public investment firms, and growth-stage businesses. With the backing of top-tier investors, we are on an exciting growth trajectory.

Our technology stands out due to its multi-stage architecture, which includes:

Specialized models for layout understanding and component detection
Low-latency OCR models tailored for precise extraction
Advanced algorithms for reading order in complex structures
Proprietary recognition and parsing of table structures
Fine-tuned vision-language models for interpreting charts, tables, and figures

If you are passionate about the intersection of computer vision, NLP, and data infrastructure, your contributions at Pulse will have a direct impact on our customers and help shape the future of document intelligence.

Jul 30, 2025

Software Engineer, Inference

Pulse

Full-time|On-site|San Francisco

Overview

At Pulse, we are revolutionizing the way data infrastructure operates by addressing the critical challenge of accurately extracting structured information from intricate documents on a large scale. Our innovative document understanding technique merges intelligent schema mapping with advanced extraction models, outperforming traditional OCR and parsing methods.

Located in the heart of San Francisco, we are a dynamic team of engineers dedicated to empowering Fortune 100 enterprises, YC startups, public investment firms, and growth-stage companies. Backed by top-tier investors, we are rapidly expanding our footprint in the industry.

What sets our technology apart is our sophisticated multi-stage architecture, which includes:

Specialized models for layout understanding and component detection
Low-latency OCR models designed for precise extraction
Advanced algorithms for reading-order in complex document structures
Proprietary methods for table structure recognition and parsing
Fine-tuned vision-language models for interpreting charts, tables, and figures

If you possess a strong passion for the convergence of computer vision, natural language processing, and data infrastructure, your contributions at Pulse will significantly impact our clients and help shape the future of document intelligence.

Jul 30, 2025

Remote Document Review Attorney - Work From Home Opportunity

Attorneys

Part-time|$25/hr|Remote|San Francisco, California, United States

Join Our Team as a Remote Document Review Attorney! If you are a licensed attorney eager to provide remote document review services for a prestigious U.S. trial firm, we invite you to apply for this exciting opportunity. We seek candidates who demonstrate a strong work ethic and possess a passion for technology in legal review work. Enjoy the flexibility of working from home with a minimum weekly commitment required.

Dec 1, 2016

AI Content Engineer at LlamaIndex | San Francisco

LlamaIndex

Full-time|On-site|San Francisco

Join our innovative team and play a pivotal role in shaping the future of AI by crafting compelling narratives in the realm of document understanding.

About the Role

We are on the lookout for a technically proficient Machine Learning Engineer who excels at producing authentic and engaging technical content efficiently. Your extensive knowledge in document AI, coupled with strong writing skills, will be essential in creating benchmarks, publishing technical analyses, and solidifying our position as the foremost authority in document understanding.

This role transcends traditional Developer Relations or Marketing. You will engage in coding, develop tangible benchmarks, conduct experiments, and swiftly translate your findings into high-quality published content, significantly accelerating awareness and adoption among developers creating the next generation of document-centric applications.

Responsibilities

Design, construct, and sustain extensive benchmarks for document parsing and understanding.
Regularly publish high-quality technical content, including blog posts, benchmark reports, technical comparisons, and tutorials.
Maintain an up-to-date understanding of the document AI landscape, including new models, research papers, competitors, and techniques.
Execute experiments and translate findings into publishable content rapidly.
Generate technical analyses showcasing our capabilities relative to alternatives.
Contribute to open-source projects, examples, notebooks, and documentation.
Collaborate closely with the core ML team to highlight significant improvements and capabilities.
Engage genuinely with the developer community through technical content rather than conferences or events.

Required Qualifications

Proven experience in software engineering, with a focus on ML engineering and research being an advantage.
Solid foundation in software engineering principles and production-level experience with Python.
Familiarity with modern ML techniques, particularly in computer vision, natural language processing, or multimodal learning.
Ability to convey complex technical topics clearly, rapidly, and authentically through writing.
A strong inclination towards delivering results quickly, comfortable publishing at a blog pace rather than an academic one.
Adept at reading, comprehending, and synthesizing research papers efficiently.
Proactive and self-motivated, capable of identifying significant topics to write about and executing projects end-to-end.

Jan 19, 2026

Senior Software Engineer - Platform Development

Reducto

Full-time|On-site|San Francisco Office

About Reducto

At Reducto, we empower AI teams to seamlessly integrate real-world enterprise data with unmatched precision. Our mission is to unlock the vast potential of enterprise data, whether it’s financial reports or health records, currently trapped in unstructured formats like PDFs and spreadsheets. We develop advanced vision models that interpret these documents as a human would, enabling the creation of innovative products, training of models, and automation of processes at scale.

Having experienced remarkable growth, with a 7x increase in revenue year-over-year, we now collaborate with hundreds of organizations, from prominent AI teams such as Harvey, Vanta, and Scale, to major enterprises including FAANG and top trading firms.

With over $100 million raised from esteemed investors like A16z, Benchmark, and First Round Capital, we are on the lookout for seasoned engineers to join our Platform team.

The Opportunity

As a Senior Software Engineer on our Platform team, you will be pivotal in developing our core API, which facilitates document parsing for numerous companies. Your role will involve integrating cutting-edge large language models (LLMs), optimizing document processing pipelines, and building the foundational platform and infrastructure that democratizes state-of-the-art document understanding at scale. Collaboration with our machine learning engineers will be key, as we work together to deploy in-house models at the forefront of document understanding.

Dec 28, 2025

Conservation Documentation Internship

San Francisco Museum of Modern Art

Internship|Hybrid|San Francisco, CA

The San Francisco Museum of Modern Art (SFMOMA), a leading cultural institution in the United States, invites passionate individuals to apply for the role of Conservation Documentation Intern. Our mission is to inspire and connect people through art, fostering an inclusive environment where every voice is valued and celebrated.

Our Values:

Inclusive: We aim to be a platform for diverse voices in dialogue.
Passionate: Working with art is not just a career; it’s a way of life.
Brave: We approach our work with courage and a sense of adventure, exploring new perspectives.
Empathic: We strive to represent the individual over the institution.

SFMOMA provides a vibrant space for those who are endlessly curious, allowing them to explore and engage with contemporary art. We believe art shapes our understanding of the world, and we are committed to creating a joyful environment that prioritizes belonging and purpose.

Schedule: Full Time, 35 hours/week, from June 15 to August 14, with at least three days onsite.

Department Overview

The Elise S. Haas Conservation Department is responsible for the preservation, conservation, and documentation of modern and contemporary artworks. Our team of eight collaborates in a shared studio, focusing on five conservation disciplines: electronic media, objects, paintings, paper, and photography. We embrace innovative models for the care and preservation of art, addressing challenges presented by unconventional materials and emerging artistic practices.

Dec 24, 2025

Account Executive

Pulse

Full-time|On-site|San Francisco

Overview

At Pulse, we are revolutionizing the way organizations handle data infrastructure by delivering precise, structured insights from complex documents at scale. Our innovative approach to document understanding integrates intelligent schema mapping with advanced extraction models, effectively overcoming the limitations of traditional OCR and parsing technologies.

Based in the vibrant tech hub of San Francisco, we are a dynamic and rapidly expanding team of engineers dedicated to empowering Fortune 100 companies, Y Combinator startups, public investment firms, and growth-stage enterprises. With backing from top-tier investors, our growth trajectory is on the rise.

What sets Pulse's technology apart is our sophisticated multi-stage architecture:

Comprehensive layout understanding through specialized component detection models
Low-latency OCR models designed for targeted extraction
Advanced algorithms for determining reading order in complex document structures
Proprietary recognition and parsing of table structures
Highly refined vision-language models for interpreting charts, tables, and figures

If you are excited about the convergence of computer vision, natural language processing (NLP), and data infrastructure, joining Pulse means your contributions will directly influence customer success and advance the future of document intelligence.

Jul 30, 2025

Business Development Representative (BDR)

Pulse

Full-time|On-site|San Francisco

Overview

At Pulse, we are revolutionizing data infrastructure by addressing one of its most challenging aspects: extracting precise, structured information from intricate documents on a large scale. Our innovative document understanding technology merges intelligent schema mapping with meticulously crafted extraction models, overcoming the limitations of traditional OCR and parsing tools.

Located in the heart of San Francisco, we are a dynamic and rapidly expanding team dedicated to empowering Fortune 100 companies, Y Combinator startups, public investment firms, and high-growth organizations. With the backing of leading investors, we continue to scale our operations.

What sets our technology apart is our multi-faceted architecture, which includes:

Advanced layout comprehension using specialized component detection models
Low-latency OCR models tailored for specific extraction tasks
Cutting-edge reading-order algorithms designed for complex document structures
Proprietary recognition and parsing of table structures
Refined vision-language models for interpreting charts, tables, and figures

If you are passionate about the intersection of computer vision, natural language processing, and data infrastructure, your contributions at Pulse will significantly impact our clients and the future of document intelligence.

Nov 30, 2025
