Research Engineer - CUDA Kernel Development

Voltai Technologies | Palo Alto Office

On-site | Full-time

Experience Level

Mid to Senior

Qualifications

Proven experience in CUDA kernel programming and optimization.
Strong understanding of GPU architecture and performance profiling tools.
Familiarity with AI frameworks and their integration for enhanced performance.
Hands-on experience with NVIDIA's latest technology.
Ability to work collaboratively in a fast-paced research environment.

About the job

About Voltai
At Voltai, we are pioneering the development of world models and agents capable of learning, evaluating, planning, experimenting, and interacting with the physical realm. Our journey begins with a focus on hardware, specifically in electronics systems and semiconductors, where we harness AI to design and innovate beyond human cognitive capabilities.

About the Team

Our team boasts extraordinary talent, including esteemed former Stanford professors, SAIL researchers, and medalists from prestigious competitions like IPhO and IOI. We are supported by top-tier investors from Silicon Valley and industry leaders, including CEOs and Presidents from Google, AMD, Broadcom, and Marvell.

About the Role

As a Research Engineer specializing in CUDA Kernel engineering, you will design, integrate, and optimize cutting-edge CUDA kernels that drive AI models, facilitating rapid advancements in semiconductor design and verification. Your contributions will empower extensive model training, inference, and reinforcement learning systems capable of reasoning about circuit layouts, generating and validating RTL, and optimizing chip architectures, all while efficiently utilizing thousands of GPUs.
You will create tools, performance benchmarks, and integration layers that maximize GPU utilization for compute-intensive workloads in AI-driven hardware design. Collaborating closely with fellow researchers and engineers, you will help position Voltai as the foremost organization in AI and semiconductor research. Furthermore, your kernels and tools will be released as valuable contributions to the open-source AI and HPC ecosystems.

You might excel in this position if you possess experience in:

Writing and optimizing CUDA kernels for large-scale AI applications (e.g., attention mechanisms, routing, graph-based operations, and physics-inspired operators).
Profiling and enhancing GPU performance for specialized compute-bound or memory-bound workloads.
Integrating custom kernels into state-of-the-art training and inference frameworks (including PyTorch, Megatron, vLLM, and TorchTitan).
Engaging with the latest NVIDIA hardware and software frameworks (Hopper, Blackwell, NVLink, NCCL, Triton).
Creating GPU-accelerated primitives for graph reasoning, symbolic computation, or hardware simulation tasks.

About Voltai Technologies

Voltai is at the forefront of AI and semiconductor innovation, developing advanced models and agents that redefine hardware capabilities. Our mission is to push the boundaries of technology, transforming how we understand and interact with the world through cutting-edge research and development.

Similar jobs

Research Engineer for Molecule Design Platform

Genbio

Full-time|On-site|Palo Alto, CA

Situated in the heart of Silicon Valley, Genbio is an innovative start-up uniting a diverse team of forward-thinking scientists, engineers, and entrepreneurs. Our mission is to revolutionize biology and medicine through the transformative capabilities of Generative AI. Our collective expertise includes leading minds in AI and Biological Science, dedicated to redefining the boundaries of possibility. As pioneers in the realm of pan-modal Large Biological Models (LBM), we are setting a new standard in biomedicine and healthcare. With our groundbreaking foundation model training, we aim to unlock life-changing solutions and insights into biology. With our main office located in Silicon Valley and an additional office in Paris, we are on a path to making a significant impact globally. Join us as we work to reshape the future of biology and medicine with the power of Generative AI.

Oct 23, 2025

Mid-Level Research Engineer - Semiconductor Design

Voltai

Full-time|On-site|Palo Alto Office

About Voltai
At Voltai, we are pioneering the development of advanced world models and intelligent agents that learn, evaluate, and interact with the physical environment. Our initial focus lies in understanding and enhancing hardware, particularly in electronics systems and semiconductors, where AI surpasses human cognitive capabilities in design and innovation.

About the Team
Our team is comprised of elite professionals backed by top investors in Silicon Valley, Stanford University, and industry leaders including CEOs and Presidents from Google, AMD, Broadcom, and Marvell. We bring together former Stanford professors, SAIL researchers, Olympiad medalists, and high-ranking officials from the U.S. government, all working collaboratively towards groundbreaking advancements.

Mid-Level Training Opportunity
In this role, you will play a crucial part in training cutting-edge models to become experts in semiconductor design and verification, laying the groundwork for reinforcement learning and automated chip development. You will innovate methods for generating and curating synthetic design data, executing model distillation, and facilitating scalable continual learning. Collaboration will be key, as you will partner with hardware engineers, reinforcement learning researchers, and verification specialists to optimize design data quality and enhance model performance. You will also work alongside compute engineers to efficiently scale training across thousands of GPUs and RL environments, developing high-performance tools to analyze how data and simulations influence model-driven design intelligence.

Ideal Candidates Will Have Experience In:
Training large language models or foundation models on semiconductor design and verification datasets (e.g., RTL, netlists, PDKs, simulation logs)
Modeling design scaling laws and optimizing compute budgets for chip-design-specific tasks
Generating extensive synthetic design data (e.g., RTL variations, testbenches, verification traces)
Developing evaluations that correlate with downstream design metrics (e.g., timing closure, power efficiency, area, verification coverage)

Nov 6, 2025

RTL and Design Verification Engineer

Ricursive Intelligence

Full-time|On-site|Palo Alto

Ricursive Intelligence is an innovative AI research laboratory dedicated to revolutionizing the field of self-improving systems, beginning with advanced chip design. Our mission is to transform chip development processes by creating a synergistic link between artificial intelligence and the hardware that supports it, thereby accelerating the journey towards artificial superintelligence.

We are seeking top-tier engineers and researchers with substantial expertise in RTL (Register Transfer Level) and design verification for cutting-edge chip design. Your responsibilities will encompass various verification methodologies, simulation practices, and debug/formal/equivalence flows. As part of an extraordinary team, you will leverage your practical experience in specification, RTL development, testbench architecture, regression testing, waveform analysis, and design closure to influence the algorithms and systems that modern chip design teams depend upon.

Mar 18, 2026

Post-Training Research Engineer

Voltai

Full-time|On-site|Palo Alto Office

About Voltai
At Voltai, we are pioneering advancements in artificial intelligence by developing sophisticated world models and agents that learn, evaluate, plan, and interact with the physical environment. Our primary focus lies in enhancing hardware capabilities, particularly in electronics systems and semiconductors, where AI can surpass traditional human cognitive limitations in design and creation.

About the Team
Our team comprises exceptional talent, including former Stanford professors, acclaimed SAIL researchers, Olympiad medalists, and industry leaders from renowned companies such as Google, AMD, and Broadcom. We are supported by top investors from Silicon Valley and have a diverse group of experts, including former U.S. government officials, committed to driving innovation in AI and hardware design.

Role Overview
As a Post-Training Research Engineer, you will focus on post-training cutting-edge models to autonomously execute intricate tasks within the semiconductor design and verification pipeline. The models you help develop will optimize chip architectures, refine RTL code, conduct simulations, identify verification gaps, and iteratively enhance designs to expedite semiconductor innovation. You will work alongside leading experts in hardware design and verification, crafting comprehensive reinforcement learning environments that encapsulate the complexities of chip design workflows. Your contributions will involve developing structured reward functions, scaling strategies, and evaluation frameworks aimed at enhancing model reliability, efficiency, and creativity in semiconductor reasoning.

Ideal Candidate Profile
You may excel in this role if you possess experience in:
Creating and scaling reinforcement learning environments for large language models or multimodal agents.
Building high-quality evaluation datasets and benchmarks for complex reasoning or design challenges.
Collaborating closely with domain experts in hardware and verification to establish evaluation metrics, constraints, and simulation conditions.
Designing reward functions and feedback pipelines that ensure a balance between correctness, performance, and design efficiency.
Conducting large-scale reinforcement learning fine-tuning or post-training experiments on frontier models.

Nov 6, 2025

Platform Software Engineer

Glean

Full-time|$140K/yr - $265K/yr|On-site|San Francisco Bay Area

About Glean:
Founded in 2019, Glean is a pioneering AI-driven knowledge management platform that empowers organizations to efficiently locate, organize, and disseminate information throughout their teams. By seamlessly integrating with tools such as Google Drive, Slack, and Microsoft Teams, Glean enables employees to access the right knowledge at the right moment, enhancing productivity and collaboration. Our state-of-the-art AI technology streamlines knowledge discovery, allowing teams to leverage their collective intelligence faster and more effectively.

Glean was established by Founder & CEO Arvind Jain, who recognized the challenges employees encounter in sourcing and comprehending information at work. Witnessing the fragmentation of knowledge and the challenges posed by numerous SaaS tools, he aimed to create a better solution: an AI-enabled enterprise search platform that facilitates swift and intuitive access to essential information. Since its inception, Glean has evolved into the leading Work AI platform, merging enterprise-grade search capabilities with an AI assistant and advanced application- and agent-building functionalities, fundamentally transforming how employees operate.

About the Role:
Glean is on the lookout for a skilled Infrastructure Engineer to design and advance the platform that maintains a highly available, efficient, secure, and cost-effective infrastructure across multiple cloud environments. You will be responsible for delivering intricate components featuring refined automation, robust uptime assurances, autoscaling, and dependable alerting/monitoring for infrastructure that operates across various cloud providers. As a member of the Platform team, you will oversee critical aspects of Glean’s infrastructure, collaborating closely with other teams to support their workloads.

Dec 8, 2025

Research and Development Engineer

PsiQuantum

Full-time|$125K/yr - $175K/yr|On-site|Palo Alto, California, United States

At PsiQuantum, we are dedicated to realizing the potential of quantum computing by developing the first practical quantum computers that can achieve the breakthroughs anticipated in this exciting field. Since our inception in 2016, we have been singularly focused on building and deploying million-qubit, fault-tolerant quantum systems.

Our cutting-edge quantum computers leverage the principles of quantum mechanics to tackle problems that are beyond the reach of even the most sophisticated supercomputers and AI systems. The transformative power of quantum technology is expected to revolutionize industries such as energy, pharmaceuticals, finance, agriculture, transportation, and materials science.

Our innovative architecture is grounded in silicon photonics, allowing us to utilize advanced semiconductor manufacturing processes in collaboration with leading partners like GlobalFoundries. This enables us to produce our systems at scale while benefiting from the inherent advantages of photonics, such as immunity to heat and electromagnetic interference, as well as seamless integration with existing cooling and fiber-optic technologies.

In 2024, PsiQuantum announced government-supported initiatives to establish our first utility-scale quantum computers in Brisbane, Australia, and Chicago, Illinois, marking a significant step towards scaling quantum computing in a way that is both economically and strategically viable.

Additionally, we are committed to developing the algorithms and software essential for unlocking the commercial value of quantum systems. Our teams collaborate with Fortune 500 leaders like Lockheed Martin, Mercedes-Benz, Boehringer Ingelheim, and Mitsubishi Chemical to create quantum solutions that will have a real-world impact.

Quantum computing is not merely an extension of classical computing; it represents a paradigm shift that opens pathways to address challenges that are insurmountable through conventional means. The potential is immense, and we have a clear roadmap to transform this potential into reality.

Join us on this revolutionary journey.

Mar 24, 2026

Design Engineer

Parallel

Full-time|On-site|San Francisco or Palo Alto

Parallel is seeking a Design Engineer in San Francisco or Palo Alto. This role centers on creating innovative solutions that align with both industry standards and customer requirements.

Role overview
As a Design Engineer, collaboration sits at the heart of the work. Expect to work closely with a skilled team to design and implement new products. The position calls for both creative thinking and strong technical skills to ensure each design meets high quality standards.

What you will do
Work with team members to develop product designs from concept through implementation
Apply technical expertise to solve design challenges and refine solutions
Ensure all products meet industry benchmarks and customer expectations

Requirements
Experience in product design and engineering
Ability to collaborate effectively with others
Strong creative and technical problem-solving skills

Apr 28, 2026

LLM Modeling and Scaling Researcher

Ricursive Intelligence

Full-time|On-site|Palo Alto

Ricursive Intelligence is at the forefront of AI innovation, dedicated to developing self-enhancing systems with a strong emphasis on chip design. Our mission is to transform chip development and create a seamless connection between artificial intelligence and the hardware that powers it, thereby accelerating the journey towards artificial superintelligence.

We are on the lookout for top-tier researchers to engage in groundbreaking AI research, tackling a diverse array of challenges associated with LLM modeling, training, data management, evaluation, and beyond. As a dynamic startup, our team is highly collaborative and hands-on; researchers are empowered to design and execute large-scale experiments and to build and deploy models in a production environment.

Jan 19, 2026

Research Engineer Intern

genbio

Internship|On-site|Palo Alto, CA

At genbio, a cutting-edge start-up based in Silicon Valley, we unite visionary scientists, engineers, and entrepreneurs who are passionate about reshaping biology and medicine with the innovative potential of generative AI. Our team is comprised of leading experts and trailblazers in AI and biological sciences, continually striving to push the frontiers of what's achievable. We are the dreamers who are re-envisioning the future of biology and medicine.

Our mission is to comprehensively decode biological processes, paving the way for transformative health solutions. As pioneers in pan-modal Large Biological Models (LBM), we are at the forefront of a new era in biomedicine, where our LBM training is catalyzing groundbreaking advancements and reshaping healthcare. With a robust R&D team and a leadership role in LLMs and generative AI, we are well-positioned to make a significant global impact. Join us on this exciting journey as we redefine the future of biology and medicine through the transformative power of Generative AI.

Nov 22, 2024

Research Engineer - Member of Technical Staff

Odyssey

Full-time|On-site|Palo Alto

About Us
Odyssey is an innovative AI laboratory at the forefront of developing general-purpose world models. These advanced multimodal intelligence systems are set to revolutionize consumer, enterprise, and intelligence applications. With models like Odyssey-2 Pro, we are pioneering the next significant leap in AI technology.

Position Overview
We are in search of passionate engineers who excel in the art of building robust systems. You should possess the ability to write elegant, scalable machine learning code, with a strong emphasis on performance and an understanding of the underlying research. You are comfortable navigating the realms of modeling and systems, boldly tackling complex technical challenges while taking pride in constructing the infrastructure and tools that enable groundbreaking advancements.

Your Responsibilities
Develop and scale the training and inference systems that drive Odyssey’s general-purpose world models, encompassing large-scale distributed pipelines and real-time optimization.
Collaborate closely with researchers to prototype novel architectures, enhance model performance, and transition concepts from research to production.
Create high-performance data and computation systems for video generation and control, facilitating rapid iteration and effective resource utilization.
Design tools, metrics, and visualizations that provide insights into model behavior and evolution.
Work hand-in-hand with product engineers to incorporate Odyssey’s models into real-time, interactive user experiences that exemplify new general-purpose world models.
Embrace a fast-paced iterative approach. As part of a tightly-knit team, your experiments will evolve into demos and ultimately into products.
Contribute to shaping Odyssey’s engineering culture, which is pragmatic, research-oriented, and always focused on what is possible next.

Your Profile
A staff-level or senior engineer experienced in large-scale machine learning systems, distributed training, performance optimization, or model deployment.
Hands-on and technically adept: you thrive on writing code, optimizing processes, and enhancing system efficiency.
Proven experience with data structures, algorithms, and coding practices that lead to high-performance outputs.

Mar 11, 2026

Hardware Systems Prototyping / Research Engineer

Full-time|On-site|Palo Alto, California, United States

About 1X
Founded in 2015, 1X is a pioneering company dedicated to innovating advanced humanoid robots for home use. Our vision is to revolutionize labor availability with safe and intelligent humanoids that enhance everyday living.

Position Overview
As a Hardware Systems Prototyping / Research Engineer within the 1X Labs team, your primary responsibility is to redefine the limits of technology by rapidly creating and refining tangible systems that operate under real-world constraints.

You will conceptualize and develop new mechanical systems and architectures, transforming ideas into functional hardware and rigorously testing them. Your prototypes will serve as tools to validate hypotheses, identify constraints, and maximize performance. The goal is not merely to achieve functionality but to approach human-level behavior or fully understand the limitations.

Your expertise spans mechanics, electronics, and software. You will iterate swiftly, with each prototype being meticulously measured, evaluated, and benchmarked against biological or system-level standards. Success means pushing the limits of what works, and failure is an opportunity to learn and advance.

This role is execution-driven; you will generate original ideas and develop concepts from others, favoring practical implementation over theoretical discussions. Prototyping is a means to let the system inform you, not a final destination.

Importantly, you will ensure the transition of successful prototypes into robust designs that can be seamlessly integrated with existing products, accommodating real-world constraints and production processes.

Dec 5, 2025

Software Engineer, Context Platform

Glean

Full-time|$140K/yr - $265K/yr|On-site|Palo Alto

About Glean:
Established in 2019, Glean is a groundbreaking AI-enhanced knowledge management platform that empowers organizations to swiftly locate, organize, and disseminate information across teams. By seamlessly integrating with popular tools such as Google Drive, Slack, and Microsoft Teams, Glean ensures that employees have immediate access to the right knowledge at the right time, thereby enhancing productivity and collaboration. The company's advanced AI technology streamlines knowledge discovery, making it quicker and more efficient for teams to harness their collective intelligence.

Glean was inspired by Founder & CEO Arvind Jain’s deep comprehension of the hurdles employees encounter when trying to find and comprehend information at work. Witnessing the fragmentation of knowledge and the overwhelming array of SaaS tools that hindered productivity, he was motivated to create a better solution - an AI-driven enterprise search platform that facilitates quick and intuitive access to essential information. Since its inception, Glean has transformed into a leading Work AI platform, merging enterprise-grade search capabilities, an AI assistant, and robust application and agent-building features to fundamentally redefine employee workflows.

About the Role:
Glean is developing a horizontal AI platform that revolutionizes how employees leverage AI at work, not only within Glean but across diverse tools such as Copilot, Gemini, ChatGPT, IDEs, and more. We are in search of innovative engineers to construct this context platform and provide rich, reliable experiences across all these applications.

This team is responsible for the complete platform experience: establishing standards and tools for REST APIs, SDKs, client libraries, agent SDKs, integrations, MCP servers, custom contexts like code and memory, as well as the infrastructure and documentation to facilitate building on Glean. In a fast-paced startup environment, this role will involve collaborating across multiple layers of the stack, from backend services to developer tools, SDKs, and documentation.

You will:

Feb 4, 2026

Software Engineer - Data Platform

xAI

Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA

Join xAI as a Software Engineer on our Data Platform team, where you'll design, build, and operate scalable distributed systems that handle vast data processing and transport. Be part of a dynamic environment focused on engineering excellence, working with cutting-edge technologies like Apache Kafka, Spark, and Flink to drive real-time machine learning and analytics at petabyte scale.

Dec 29, 2025

Research Engineer, Machine Learning

Mistral AI

Full-time|On-site|Palo Alto

About Mistral AI
At Mistral AI, we harness the transformative power of artificial intelligence to streamline tasks, save valuable time, and foster enhanced creativity and learning. Our innovative technology is crafted to effortlessly integrate into everyday work environments.

We are committed to democratizing AI by offering high-performance, optimized, open-source models, products, and solutions. Our extensive AI platform caters to both enterprise and individual needs, featuring products like Le Chat, La Plateforme, Mistral Code, and Mistral Compute, creating cutting-edge intelligence accessible to all users.

As a vibrant and collaborative team, we are driven by our passion for AI and its potential to revolutionize society. Our diverse workforce excels in competitive settings and is dedicated to fostering innovation. With teams distributed across France, the USA, the UK, Germany, and Singapore, we pride ourselves on our creativity, humility, and team spirit.

Join us in shaping the future of AI at a pioneering company. Together, we can create a lasting impact. Discover more about our culture at https://mistral.ai/careers.

Role Overview

About the Research Engineering Team
The Research Engineering team operates across Platform (shared infrastructure & clean coding practices) and Embedded (integrated within research squads). Our engineers have the flexibility to navigate the research↔production spectrum as their interests and needs evolve.

As a Machine Learning Research Engineer, you will be responsible for building and optimizing large-scale learning systems that underpin our open-weight models. Collaborating closely with Research Scientists, you may join either:
- Platform RE Team: Focus on enhancing our shared training frameworks, data pipelines, and tools utilized across all teams; or
- Embedded RE Team: Become part of a research squad (Alignment, Pre-training, Multimodal, etc.) to turn innovative ideas into scalable, repeatable code.

Key Responsibilities
• Support researchers by managing the complex aspects of large-scale ML pipelines and developing robust tools.
• Bridge cutting-edge research with production: integrate checkpoints, optimize evaluations, and create accessible APIs.
• Conduct experiments utilizing the latest deep-learning techniques (sparsification on 70B+ models, distributed training across thousands of GPUs).
• Design, implement, and benchmark ML algorithms; produce clear and efficient code in Python.
• Deliver prototypes that evolve into production-grade components for Le Chat and our enterprise API.

Jan 27, 2026

Data Platform Software Engineer

PsiQuantum

Full-time|$140K/yr - $165K/yr|On-site|Palo Alto, California, United States

Join PsiQuantum, a pioneering force dedicated to creating the first practical quantum computers that will revolutionize various industries. Since our inception in 2016, we have been unwavering in our mission to construct and implement million-qubit, fault-tolerant quantum systems.

Our innovative quantum computers leverage the principles of quantum mechanics to tackle problems that are beyond the capabilities of the most sophisticated supercomputers and AI technologies. The transformative potential of these machines will benefit fields such as energy, pharmaceuticals, finance, agriculture, transportation, and materials science.

Utilizing silicon photonics as the foundation of our architecture, we capitalize on advanced semiconductor manufacturing techniques, partnering with leaders like GlobalFoundries. This approach allows us to employ high-volume processes that already produce billions of chips for telecommunications and consumer electronics. The benefits of photonics are clear: they are immune to heat, unaffected by electromagnetic interference, and seamlessly integrate with current cryogenic cooling systems and standard fiber-optic infrastructures.

In 2024, we announced significant government-funded projects aimed at establishing our first utility-scale quantum computers in Brisbane, Australia, and Chicago, Illinois. These initiatives illustrate the growing acknowledgment of quantum computing's strategic and economic significance, emphasizing the urgency to scale our efforts.

At PsiQuantum, we are also focused on developing the algorithms and software necessary for these systems to achieve commercial viability. Our application and software teams collaborate with prestigious Fortune 500 companies, including Lockheed Martin, Mercedes-Benz, Boehringer Ingelheim, and Mitsubishi Chemical, to optimize quantum solutions for real-world applications.

Quantum computing signifies not merely an extension of classical computing but a radical transformation, offering pathways to tackle challenges that cannot be addressed through any other means. The opportunities are vast, and we are committed to making this vision a tangible reality.

We invite you to be a part of this groundbreaking journey.

Mar 31, 2026

Early Career Research Engineer

Parallel

Full-time|On-site|Palo Alto

Join Our Team
At Parallel, we are at the forefront of web infrastructure innovation, empowering businesses in various sectors (sales, marketing, insurance, and technology) to develop sophisticated AI agents equipped with robust programmatic access to the internet.

Having secured $130 million in funding from prestigious investors such as Kleiner Perkins, Index Ventures, Spark Capital, Khosla Ventures, First Round, and Terrain, we are building a premier team of engineers, designers, marketers, sales professionals, researchers, and operational specialists to fulfill our ambitious vision.

Your Profile
We are looking for a researcher who embodies an engineering mindset, or an engineer who approaches problems with curiosity typical of researchers. You may have experience with information retrieval systems, embedding models, or neural ranking at scale, or possess a deep interest in the challenges of training models to comprehend and navigate billions of web pages. You will excel in the intersection of theory and practical application, devising elegant solutions that perform efficiently on real-world infrastructure. You'll be equally comfortable reading the latest papers from SIGIR and RecSys as you are troubleshooting distributed training pipelines.

Position Overview
In this role, you will design and train models that drive Parallel's APIs, the intelligent framework that enables AI agents to extract precise information from the open web. This involves addressing complex research challenges that most labs only encounter at scale: How can we create embedding models that accurately represent semantic intent across various query types? How do we achieve a balance between model expressiveness and sub-second retrieval times? How can we ensure our index remains up-to-date with the constantly evolving web, without the need for complete rebuilds?

Unlike conventional search engines tailored for human queries, you will be developing solutions for AI agents that generate intricate, multi-hop queries, requiring structured, programmatic responses. This is information retrieval redefined for the era of large language models, merging traditional information retrieval methods with cutting-edge deep learning, applied at a scale that necessitates innovative solutions.

Working Environment
Our team collaborates fully in-person at our headquarters in Palo Alto and our San Francisco office. We pride ourselves on being a flat, talent-rich organization committed to tackling both technical and creative challenges.

We are eager to welcome individuals who share our enthusiasm for leveraging science, creativity, and consistency to address large, complex problems with significant impacts. Here are our core values:
Customer Impact Ownership: We take responsibility for delivering tangible results for our clients.

Jan 24, 2026

Research Engineer - CUDA Kernel Development

Voltai Technologies

Full-time|On-site|Palo Alto Office

About Voltai

At Voltai, we are pioneering the development of world models and agents capable of learning, evaluating, planning, experimenting, and interacting with the physical realm. Our journey begins with a focus on hardware, specifically in electronics systems and semiconductors, where we harness AI to design and innovate beyond human cognitive capabilities.

About the Team

Our team boasts extraordinary talent, including esteemed former Stanford professors, SAIL researchers, and medalists from prestigious competitions like IPhO and IOI. We are supported by top-tier investors from Silicon Valley and industry leaders, including CEOs and Presidents from Google, AMD, Broadcom, and Marvell.

About the Role

As a Research Engineer specializing in CUDA Kernel engineering, you will design, integrate, and optimize cutting-edge CUDA kernels that drive AI models, facilitating rapid advancements in semiconductor design and verification. Your contributions will empower extensive model training, inference, and reinforcement learning systems capable of reasoning about circuit layouts, generating and validating RTL, and optimizing chip architectures, all while efficiently utilizing thousands of GPUs.

You will create tools, performance benchmarks, and integration layers that maximize GPU utilization for compute-intensive workloads in AI-driven hardware design. Collaborating closely with fellow researchers and engineers, you will help position Voltai as the foremost organization in AI and semiconductor research. Furthermore, your kernels and tools will be released as valuable contributions to the open-source AI and HPC ecosystems.

You might excel in this position if you possess experience in:

Writing and optimizing CUDA kernels for large-scale AI applications (e.g., attention mechanisms, routing, graph-based operations, and physics-inspired operators).
Profiling and enhancing GPU performance for specialized compute or memory-bound workloads.
Integrating custom kernels into state-of-the-art training and inference frameworks (including PyTorch, Megatron, vLLM, and TorchTitan).
Engaging with the latest NVIDIA hardware and software frameworks (Hopper, Blackwell, NVLink, NCCL, Triton).
Creating GPU-accelerated primitives for graph reasoning, symbolic computation, or hardware simulation tasks.

Nov 6, 2025

Engineering Manager - Cloud Platform

Verdigris Technologies

Full-time|On-site|Palo Alto, CA 94306

As an Engineering Manager for our Cloud Platform team at Verdigris Technologies, you will lead a talented group of engineers in developing cutting-edge solutions that harness the power of cloud technology. Your leadership will drive innovation, enhance performance, and ensure the delivery of reliable, scalable, and secure cloud services.

In this dynamic role, you will collaborate with cross-functional teams to define engineering best practices and drive the successful execution of projects. You will mentor and develop your team while fostering a culture of continuous improvement and technical excellence.

Apr 13, 2026

Staff Data Platform Engineer (Hybrid)

Fiddler AI

Full-time|$190K/yr - $300K/yr|Hybrid|Palo Alto

Our Purpose

At Fiddler AI, we recognize the profound implications of artificial intelligence and its impact on human lives. Our mission is to instill trust in AI technologies. With the emergence of Generative AI and intelligent agents, the potential for generalized intelligence has expanded, but so have the associated risks. Fiddler is dedicated to assisting organizations in navigating these challenges by providing reliable and transparent AI solutions.

We collaborate with AI-centric organizations to establish a sustainable framework for responsible AI practices, fostering trust among their users. AI Engineers, Data Scientists, and business teams leverage Fiddler AI to monitor, evaluate, secure, analyze, and enhance their AI solutions, facilitating improved outcomes. Our platform empowers engineering teams and business stakeholders to comprehend the 'what', 'why', and 'how' behind AI results.

Our Founders

Fiddler AI was co-founded by Krishna Gade, a distinguished engineering leader from Facebook, Pinterest, Twitter, and Microsoft, and Amit Paka, a product visionary with a history at Microsoft, Samsung, PayPal, and as a two-time founder. Our venture is supported by prominent investors including Insight Partners, Lightspeed Venture Partners, and Lux Capital.

Why Join Us

Joining our innovative team means contributing to the mission of embedding trust into AI, thereby helping society harness its transformative power. You will play a crucial role in ensuring that AI applications deployed at scale across various industries maintain operational transparency and security. As an early-stage startup, we are rapidly expanding and proud of our dynamic team composed of intelligent and empathetic doers, thinkers, creators, and builders. The AI and ML sector is characterized by swift innovation, offering monumental learning opportunities. This is your chance to lead the way as a pioneer in this field.

Fiddler is recognized as a trailblazer in AI Observability, having earned numerous accolades such as the 2022 a16z Data50 list, 2021 CB Insights AI 100 most promising startups, 2020 WEF Technology Pioneer, and a 2019 Gartner Cool Vendor in Enterprise AI Governance and Ethical Response. By joining our talented team, you will contribute to shaping the future of AI Observability.

The Mission

As a Staff Data Platform Engineer, you will significantly impact the safety and return on investment of large language models and agentic applications across diverse verticals and domains. You will be at the forefront of designing and developing innovative tools that enhance the performance and reliability of AI systems.

Oct 23, 2025

Senior Software Engineer, Data Platforms

Mudflap

Full-time|$185K/yr - $250K/yr|Hybrid|Palo Alto, CA

Join Mudflap, a pioneering force in the $800 billion trucking industry, where our innovative payment solutions empower truckers to save significantly on fuel costs, their largest operational expense. We connect fuel stop partners with new customers, creating a vibrant marketplace that is rapidly expanding. We are actively seeking a customer-focused Senior Software Engineer to help us shape our exciting future.

As a Senior Software Engineer specializing in Data Platforms, you will be instrumental in constructing a robust data infrastructure that ensures reliable and scalable data flow throughout Mudflap's systems. Your contributions will be vital in designing and managing the frameworks and services that facilitate efficient data ingestion, processing, and accessibility on a large scale.

In this role, you will architect and develop high-performance platform systems that drive data ingestion, orchestrate pipelines, and manage large-scale processing. Your efforts will lay the groundwork for high-availability data systems that empower Mudflap's teams to operate with speed and confidence.

This position is based in the Bay Area and offers a hybrid work model, allowing for a blend of in-office collaboration and remote work.

Mar 17, 2026
