companyUnstructured Technologies logo

Staff and Principal Software Engineers at Unstructured Technologies

Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

What You Will Own & Drive: Define and enhance the comprehensive architecture for Unstructured’s data transformation and retrieval platform. Develop and scale distributed systems capable of processing vast amounts of unstructured data from various formats and sources. Act as the organization-wide expert on Kubernetes orchestration, cluster design, performance optimization, and reliability. Lead the architecture of Python applications and best practices—ensuring performance, modularity, and maintainability across services. Design and improve Postgres schemas, queries, and indexing strategies to facilitate large-scale metadata and retrieval pipelines. Mentor senior engineers through design reviews and code guidance, elevating the standard of technical excellence across the organization. Collaborate with the infrastructure and product teams to transform research prototypes into production-ready systems.

About the job

At Unstructured Technologies, we are pioneering the future of enterprise data transformation in the era of large language models (LLMs) and generative AI. With over $65 million raised from top-tier investors such as Menlo Ventures, Bain Capital, Databricks, NVIDIA, Microsoft, and IBM within just two years, we are positioned for rapid growth.
 
Our open-source toolkit, downloaded over 52 million times, is utilized by more than 66,000 companies, including nearly half of the Fortune 500. We enable AI workflows across both commercial and federal sectors, converting PDFs, HTML, Word documents, images, emails, and more into scalable AI-ready data pipelines.
 
We are not merely creating tools; we are establishing the critical infrastructure that supports generative AI and drives intelligence across diverse industries.
 
We are seeking Staff and Principal Software Engineers to help shape the architectural framework that allows enterprises to efficiently process and transform unstructured data for LLM applications.
 
This role is perfect for individuals who excel in deep systems work—those who could author a definitive guide on Kubernetes, are fluent in Python, and possess an in-depth understanding of Postgres at the index and transaction-log level. You will take charge of the technical direction for our core platform, overseeing data movement through all stages—from ingestion and transformation to retrieval.
 
You will collaborate closely with the Head of Engineering, Head of Infrastructure, and product leaders to ensure that Unstructured’s systems are high-performing, resilient, and equipped to handle the next generation of AI-driven workloads.

About Unstructured Technologies

Unstructured Technologies is at the forefront of defining enterprise data transformation standards in the age of LLMs and generative AI. With significant investment and a robust toolset utilized by thousands of companies, we are driving innovation in AI workflows.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.