companyThinking Machines Lab logo

Software Engineer, Data Infrastructure

On-site Full-time $350K/yr - $475K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

What You’ll DoDesign, build, and manage scalable, fault-tolerant infrastructure for LLM Research, including distributed computing, data orchestration, and storage across multiple modalities. Develop high-throughput systems for data ingestion, processing, and transformation, which encompass training data catalogs, deduplication, quality checks, and search functionalities. Create systems for traceability, reproducibility, and stringent quality control throughout the data lifecycle. Implement and sustain monitoring and alerting systems to enhance platform reliability and performance. Collaborate with research teams to unlock new features, enrich data quality, and accelerate project timelines.

About the job

At Thinking Machines Lab, our vision is to enhance human potential by advancing collaborative general intelligence. We are dedicated to creating a future where individuals have the resources and knowledge to harness AI for their specific objectives and aspirations.

Our team comprises scientists, engineers, and innovators who have developed some of the most popular AI products, including ChatGPT and Character.ai, as well as influential open-weight models like Mistral, along with highly regarded open-source projects such as PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About the Role

We are seeking a talented engineer to enhance our data infrastructure. You will become part of a dynamic, high-impact team tasked with designing and scaling the foundational infrastructure for distributed training pipelines, multimodal data catalogs, and sophisticated processing systems that manage petabytes of data.

Our infrastructure is pivotal; it serves as the foundation for every groundbreaking achievement. You will collaborate directly with researchers to expedite experiments, develop novel datasets, optimize infrastructure efficiency, and derive essential insights from our data repositories.

If you are passionate about distributed systems, large-scale data mining, and open-source tools such as Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building innovative solutions from scratch, we encourage you to apply.

Note: This is an evergreen role that we keep open continuously for expressions of interest. We receive a high volume of applications, and while there may not always be an immediate position that aligns perfectly with your skills and experience, we encourage you to apply. We regularly review applications and reach out as new opportunities arise. You are welcome to reapply after gaining more experience, but please refrain from applying more than once every six months. We may also post for specific roles for particular projects or team needs, and in those cases, you are welcome to apply directly in addition to this evergreen role.

About Thinking Machines Lab

Thinking Machines Lab is at the forefront of AI innovation, driving advancements that empower individuals and organizations through accessible technology. We pride ourselves on our diverse team of experts who are committed to making AI a tool for everyone. Join us in shaping the future of intelligence and technology.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.