Experience Level
Senior
Qualifications
Proven experience in software engineering with a focus on video performance optimization. Strong proficiency in programming languages such as JavaScript, Python, or C++. Experience with video streaming technologies, codecs, and protocols. Ability to work in a fast-paced environment and deliver high-quality results. Excellent problem-solving skills and a proactive approach to challenges.
About the job
Join our talented team at Canva as a Senior Software Engineer specializing in Video Performance. We are looking for an innovative and solutions-oriented engineer who is passionate about optimizing video experiences for our users. In this role, you will collaborate with cross-functional teams to enhance performance, develop new features, and implement best practices in video engineering.
About Canva
Canva is a leading graphic design platform, empowering millions to create stunning visuals easily and efficiently. With a vibrant culture and a commitment to innovation, Canva is dedicated to making design accessible to everyone.
Join Canva as a Staff Software Engineer specializing in Video Performance. In this role, you will be instrumental in enhancing our video features, ensuring top-notch performance for our users. You will collaborate with cross-functional teams, leveraging your expertise to drive innovation and optimize our video products.
Join Canva as an Engineering Manager specializing in Video Performance, where you'll lead a talented team dedicated to enhancing our video features. You will play a pivotal role in driving innovation, optimizing video processing, and ensuring exceptional performance for our users. This is an exciting opportunity for a motivated leader who thrives in a fast-paced environment and is passionate about delivering high-quality video experiences.
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California
P-97 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world. We achieve this by creating and managing a leading data and AI infrastructure platform that enables our clients to leverage deep data insights for business enhancement. Our commitment to pushing the limits of data and AI technology is matched by our focus on resilience, security, and scalability, which are essential for our customers' success on our platform. Databricks operates one of the largest-scale software platforms, comprising millions of virtual machines that generate terabytes of logs and process exabytes of data daily. Given our scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must adeptly protect our customers from these issues. As a Senior Performance Engineer, you will collaborate with various teams throughout the organization to assess product and feature performance, pinpoint performance bottlenecks, and partner with engineers to address performance and scalability challenges. This includes setting performance goals for different software releases, guiding teams in developing performance benchmarks, conducting competitive benchmark analyses for various Databricks products, and performing in-depth analyses to identify and resolve performance issues.
Join our dynamic team at Canva as a Senior Software Engineer specializing in Video Templates. In this role, you'll leverage your expertise to design and develop innovative video solutions that enhance user engagement and creativity. Collaborate with cross-functional teams, contributing to an agile environment that fosters creativity and excellence in software development.
At ClickUp, we're not just developing software; we're shaping the future of work! In an era dominated by work sprawl, we identified a more efficient way. This led us to create the first truly integrated AI workspace, consolidating tasks, documents, chat, calendar, and enterprise search, all enhanced by context-driven AI. Our mission is to empower millions of teams to escape silos, reclaim their time, and reach unprecedented levels of productivity. At ClickUp, you'll have the chance to learn, innovate, and leverage AI in transformative ways that will not only influence our product but also the broader landscape of work itself. Join a daring, pioneering team that's challenging the limits of what's possible!
We are on the lookout for a technical leader in SaaS client performance who is passionate about enhancing the customer experience through top-tier performance solutions. As a Senior Performance Engineer, you will spearhead comprehensive strategies to optimize application speed, memory utilization, and reliability across our entire platform. You will be empowered to analyze, diagnose, and address performance bottlenecks wherever they arise (front-end, back-end, or infrastructure), ensuring ClickUp remains the fastest and most reliable productivity platform available.
The ideal candidate is a hands-on authority in browser and NodeJS performance, with a thorough understanding of how code influences rendering, memory management, and overall user experience. You excel in solving intricate challenges, collaborating across teams, and establishing new benchmarks for performance excellence. If you're driven to make a significant impact for millions of users, this is your chance to lead at scale.
Your Responsibilities:
Conduct root cause analysis on client performance issues and perform post-mortems.
Profile application code to identify inefficient algorithms, memory leaks, and other issues; propose and implement effective solutions.
Establish performance monitoring, alerting, and dashboards to proactively detect and resolve client performance challenges.
Examine client traffic patterns, load testing outcomes, and other metrics to set benchmarks and drive enhancements.
Champion performance best practices and set performance standards across the engineering organization.
Identify infrastructure upgrades (caching, CDNs, database optimization) to elevate the client experience.
Collaborate with development teams to incorporate performance as a core requirement in the development of new features.
Join Cloudflare as a Senior Software Engineer specializing in Network Performance & Reliability! In this role, you'll be at the forefront of enhancing the performance and stability of our global network, ensuring our customers benefit from unparalleled speed and reliability. You'll collaborate with experts across various teams to design and implement innovative solutions that optimize network operations.
Our Vision
Aquabyte is dedicated to transforming the sustainability and efficiency of aquaculture, a daring and immensely fulfilling endeavor. By enhancing the productivity of fish farming, we aim to support the responsible production of low carbon protein and address one of the most significant contributors to climate change. As the fastest-growing food production sector globally, aquaculture is at a pivotal moment where technology can redefine our approach to ocean harvesting, ensuring its preservation for future generations.
Our diverse and mission-oriented team is eager to collaborate with individuals who share our passion. If our vision resonates with you, we invite you to connect with us.
Our Product
Currently, we empower salmon farmers to gain insights into their fish populations, enabling them to make environmentally responsible choices. Utilizing custom underwater cameras, computer vision, and machine learning, we quantify fish weights, assess health conditions, and formulate optimal feeding strategies in real-time. Our product comprises three integral components: on-site hardware for image capture, cloud-based data processing pipelines, and an interactive web application. This creates a dynamic ecosystem with countless intriguing challenges across all layers of technology.
At Aquabyte, we prioritize customer satisfaction above all. Our product development is driven by the demands of fish farmers, ensuring we delight our customers with every endeavor. We are committed to fostering a global, collaborative team environment.
The Role
We seek a Senior Backend Engineer to design and maintain the systems facilitating real-time video streaming, AI analytics, and secure remote control of industrial machinery. Your role will involve working on cloud and edge-connected systems that interact with physical devices, where reliability, security, and low-latency communication are paramount.
About Us
At Twelve Labs, we are at the forefront of innovation, creating advanced multimodal foundation models that enable machines to understand videos in a human-like manner. Our groundbreaking approach to video-language modeling is setting new benchmarks, enhancing our ability to analyze and interact with diverse media formats.
With an impressive $107 million in funding from leading venture capital firms including NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, as well as visionary AI leaders like Fei-Fei Li and Alexandr Wang, we are strategically positioned to drive global innovation. Our headquarters in San Francisco, along with a significant presence in Seoul, illustrates our commitment to reshaping the future of technology.
We celebrate the unique journeys of every individual and believe that our diverse backgrounds fuel our creativity and innovation. We invite passionate individuals who share our mission to join us in transforming video understanding and multimodal AI.
Role Overview
Unlike typical video engineering roles that focus primarily on enhancing human playback, at Twelve Labs, we engineer video for machine comprehension. This unique approach prioritizes AI model performance over mere perceptual quality. As the Principal Software Engineer for Video Engineering, you will architect and implement our video processing pipelines, overseeing the entire journey from byte ingestion through to playback. Your work will be pivotal in ensuring our systems are efficient, cost-effective, and tailored for AI-driven video intelligence at scale.
Your Responsibilities
End-to-End Ownership: Lead the design and execution of video pipelines, covering ingestion, decoding, chunking, storage, and playback, applicable to both batch and streaming modes.
Codec Expertise: Guide the development of decoding strategies, container format management, and codec implementations, ensuring optimal performance for various video types.
Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.
About Us
At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.
We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.
If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.
The Role
We are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.
This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.
At Genmo, we are at the forefront of advancing artificial intelligence through innovative research in video generation. Our mission is to construct open, cutting-edge models that will ultimately contribute to the realization of Artificial General Intelligence (AGI). As part of our dynamic team, you will play a pivotal role in redefining the future of AI and expanding the horizons of video creation.
We are looking for a skilled GPU Performance Engineer who can extract maximum performance from our H100 infrastructure and fine-tune our model serving stack to achieve unparalleled efficiency. If you are passionate about optimizing performance, particularly at the microsecond level, and thrive on pushing hardware to its limits, this is the perfect opportunity for you.
Key Responsibilities
Utilize advanced profiling tools such as Nsight Systems and nvprof to analyze and enhance GPU workloads.
Develop high-performance CUDA and Triton kernels to optimize essential model functions.
Reduce cold start latency from seconds to mere milliseconds in our serving infrastructure.
Optimize memory access patterns, implement kernel fusion, and maximize GPU utilization.
Collaborate closely with machine learning engineers to optimize model implementations.
Diagnose and resolve performance issues throughout the application and hardware stack.
Implement custom memory pooling and allocation strategies to enhance performance.
Promote performance optimization techniques and foster a culture of excellence across teams.
ABOUT BASETEN
Baseten is at the forefront of AI technology, empowering leading-edge companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer to seamlessly integrate advanced AI models into their operations. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators to bring their most ambitious AI products to life. With our recent $300M Series E funding from top-tier investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we are poised for rapid growth. Join us in shaping the platform that engineers rely on to deploy transformative AI solutions.
THE ROLE
Are you driven by a passion for enhancing artificial intelligence applications? We are seeking a proactive Software Engineer specializing in ML performance to join our energetic team. This position is perfect for backend engineers who thrive in a fast-paced startup environment and are eager to make substantial contributions to the realm of Large Language Model (LLM) Inference. If you're enthusiastic about optimizing open-source ML models, we can't wait to hear from you!
EXAMPLE INITIATIVES
As a member of our Model Performance team, you will have the opportunity to work on exciting projects, including:
Baseten Embeddings Inference: The quickest embeddings solution available
The Baseten Inference Stack
Driving model performance optimization
RESPONSIBILITIES
Develop, refine, and implement advanced techniques (quantization, speculative decoding, kv cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
Conduct thorough investigations into the codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to troubleshoot and resolve ML performance issues.
Scale and apply optimization techniques across a diverse array of ML models, with a focus on large language models.
Role overview
This Software Engineer position at OpenAI focuses on inference and performance optimization. Based in San Francisco, the role centers on increasing the speed and efficiency of advanced AI systems. Collaboration with experienced engineers is a key part of the work, with an emphasis on refining AI performance.
What you will do
Work on optimizing the performance of AI inference systems
Collaborate with other engineers to improve efficiency and speed
Contribute to solutions that enhance AI system capabilities
Location
This role is based in San Francisco.
OpenAI is seeking a Software Engineer in San Francisco to focus on improving productivity by optimizing model performance. This position centers on developing solutions that make machine learning models more efficient and effective.
Role overview
This role involves working closely with teams across different functions to identify and address areas where model performance can be improved. The aim is to deliver changes that have a measurable impact on both systems and workflows.
What you will do
Collaborate with engineers and other specialists to enhance model efficiency
Develop and implement solutions that improve the effectiveness of machine learning systems
Contribute to projects that streamline processes and drive productivity gains
Impact
Your work will help shape improvements in how models operate and how teams at OpenAI achieve their goals. The changes you help deliver will support more effective use of resources and better outcomes for the organization.
Full-time|$180K/yr - $286K/yr|Remote|San Francisco, CA | Remote
About the Role
Join us in creating a cutting-edge AI-driven platform and web application designed for the seamless and rapid generation of audio and video content. Our mission is to revolutionize how users can record, transcribe, edit, and mix multimedia online, presenting us with unique technical challenges that require innovative solutions. We are seeking talented Software Engineers at both the senior and staff levels to fill several positions on our Agent team, dedicated to shaping the future of Agentic Video Editing!
About the Team:
Since launching in Open Beta in July 2025, the Agent team at Descript has been focused on enhancing user experiences for chat-based editing, expanding our agent's capabilities, and refining our infrastructure to support the ongoing advancement of AI models. Our team comprises AI Researchers and Software Engineers who are passionate about making video editing accessible to everyone. Collaborate with peers located in San Francisco, New York, and across the US and Canada.
This role is perfect for you if you're excited about:
Pushing the boundaries of quality in video editing agents through experimentation (e.g., tool design, token/context optimizations, reinforcement learning, multimodal capabilities).
Creating an exceptional product experience for agentic video editing interactions that boosts user retention (e.g., refining prompt templates, addressing user feedback, implementing essential product features like chat history).
Establishing a top-notch developer experience for building agents (e.g., logging systems, evaluation frameworks, online monitoring, and feedback loops).
Join Canva as a Staff Software Engineer specializing in Video Templates! In this pivotal role, you will be responsible for designing and implementing innovative video solutions that enhance user experience. Collaborate with cross-functional teams to create visually appealing and user-friendly video templates that cater to diverse needs. Your expertise in software engineering will contribute to our mission of empowering people to create stunning visual content effortlessly.
Join Canva as a Senior Engineering Manager specializing in Video Templates. In this pivotal role, you will lead a dynamic team of engineers to innovate and enhance our video template offerings, ensuring a seamless and engaging user experience. You will be responsible for setting the technical direction, driving projects from inception to delivery, and collaborating with cross-functional teams to align on product vision and execution. As a key leader, you will mentor and guide your team members, fostering a culture of creativity and excellence. Your expertise will help shape the future of video content creation at Canva, making it accessible and enjoyable for users around the globe.
Join Cloudflare as a Software Engineer dedicated to enhancing our network performance and reliability. In this dynamic role, you will collaborate with cross-functional teams to develop innovative software solutions that optimize our network infrastructure and ensure high availability and performance for our users. Your contributions will directly impact millions of users worldwide, making the internet a safer place for everyone.
Full-time|$180K/yr - $250K/yr|On-site|San Francisco
Join fal in our pursuit to maintain a leading edge in model performance for generative media models. You'll be instrumental in designing and implementing innovative solutions for model serving architecture, built on our proprietary inference engine. Your focus will be on maximizing throughput while minimizing latency and resource consumption. In addition, you will create performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Collaborate closely with our Applied ML team and clients in the media sector to ensure their workloads leverage our accelerator effectively.
Join our talented team at Canva as a Senior Software Engineer specializing in Video Performance. We are looking for an innovative and solutions-oriented engineer who is passionate about optimizing video experiences for our users. In this role, you will collaborate with cross-functional teams to enhance performance, develop new features, and implement best practices in video engineering.
Join Canva as a Staff Software Engineer specializing in Video Performance. In this role, you will be instrumental in enhancing our video features, ensuring top-notch performance for our users. You will collaborate with cross-functional teams, leveraging your expertise to drive innovation and optimize our video products.
Join Canva as an Engineering Manager specializing in Video Performance, where you'll lead a talented team dedicated to enhancing our video features. You will play a pivotal role in driving innovation, optimizing video processing, and ensuring exceptional performance for our users.This is an exciting opportunity for a motivated leader who thrives in a fast-paced environment and is passionate about delivering high-quality video experiences.
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California
P-97 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world. We achieve this by creating and managing a leading data and AI infrastructure platform that enables our clients to leverage deep data insights for business enhancement. Our commitment to pushing the limits of data and AI technology is matched by our focus on resilience, security, and scalability, which are essential for our customers' success on our platform. Databricks operates one of the largest-scale software platforms, comprising millions of virtual machines that generate terabytes of logs and process exabytes of data daily. Given our scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must adeptly protect our customers from these issues. As a Senior Performance Engineer, you will collaborate with various teams throughout the organization to assess product and feature performance, pinpoint performance bottlenecks, and partner with engineers to address performance and scalability challenges. This includes setting performance goals for different software releases, guiding teams in developing performance benchmarks, conducting competitive benchmark analyses for various Databricks products, and performing in-depth analyses to identify and resolve performance issues.
Join our dynamic team at Canva as a Senior Software Engineer specializing in Video Templates. In this role, you'll leverage your expertise to design and develop innovative video solutions that enhance user engagement and creativity. Collaborate with cross-functional teams, contributing to an agile environment that fosters creativity and excellence in software development.
At ClickUp, we're not just developing software; we're shaping the future of work! In an era dominated by work sprawl, we identified a more efficient way. This led us to create the first truly integrated AI workspace, consolidating tasks, documents, chat, calendar, and enterprise search, all enhanced by context-driven AI. Our mission is to empower millions of teams to escape silos, reclaim their time, and reach unprecedented levels of productivity. At ClickUp, you'll have the chance to learn, innovate, and leverage AI in transformative ways that will not only influence our product but also the broader landscape of work itself. Join a daring, pioneering team that's challenging the limits of what's possible! We are on the lookout for a technical leader in SaaS client performance who is passionate about enhancing the customer experience through top-tier performance solutions. As a Senior Performance Engineer, you will spearhead comprehensive strategies to optimize application speed, memory utilization, and reliability across our entire platform. You will be empowered to analyze, diagnose, and address performance bottlenecks wherever they arise—be it front-end, back-end, or infrastructure—ensuring ClickUp remains the fastest and most reliable productivity platform available.The ideal candidate is a hands-on authority in browser and NodeJS performance, with a thorough understanding of how code influences rendering, memory management, and overall user experience. You excel in solving intricate challenges, collaborating across teams, and establishing new benchmarks for performance excellence. 
If you're driven to make a significant impact for millions of users, this is your chance to lead at scale.Your Responsibilities:Conduct root cause analysis on client performance issues and perform post-mortems.Profile application code to identify inefficient algorithms, memory leaks, and other issues; propose and implement effective solutions.Establish performance monitoring, alerting, and dashboards to proactively detect and resolve client performance challenges.Examine client traffic patterns, load testing outcomes, and other metrics to set benchmarks and drive enhancements.Champion performance best practices and set performance standards across the engineering organization.Identify infrastructure upgrades (caching, CDNs, database optimization) to elevate the client experience.Collaborate with development teams to incorporate performance as a core requirement in the development of new features.
Join Cloudflare as a Senior Software Engineer specializing in Network Performance & Reliability! In this role, you'll be at the forefront of enhancing the performance and stability of our global network, ensuring our customers benefit from unparalleled speed and reliability. You'll collaborate with experts across various teams to design and implement innovative solutions that optimize network operations.
Our VisionAquabyte is dedicated to transforming the sustainability and efficiency of aquaculture, a daring and immensely fulfilling endeavor. By enhancing the productivity of fish farming, we aim to support the responsible production of low carbon protein and address one of the most significant contributors to climate change. As the fastest-growing food production sector globally, aquaculture is at a pivotal moment where technology can redefine our approach to ocean harvesting, ensuring its preservation for future generations.Our diverse and mission-oriented team is eager to collaborate with individuals who share our passion. If our vision resonates with you, we invite you to connect with us.Our ProductCurrently, we empower salmon farmers to gain insights into their fish populations, enabling them to make environmentally responsible choices. Utilizing custom underwater cameras, computer vision, and machine learning, we quantify fish weights, assess health conditions, and formulate optimal feeding strategies in real-time. Our product comprises three integral components: on-site hardware for image capture, cloud-based data processing pipelines, and an interactive web application. This creates a dynamic ecosystem with countless intriguing challenges across all layers of technology.At Aquabyte, we prioritize customer satisfaction above all. Our product development is driven by the demands of fish farmers, ensuring we delight our customers with every endeavor. We are committed to fostering a global, collaborative team environment.The RoleWe seek a Senior Backend Engineer to design and maintain the systems facilitating real-time video streaming, AI analytics, and secure remote control of industrial machinery. Your role will involve working on cloud and edge-connected systems that interact with physical devices, where reliability, security, and low-latency communication are paramount.
About UsAt Twelve Labs, we are at the forefront of innovation, creating advanced multimodal foundation models that enable machines to understand videos in a human-like manner. Our groundbreaking approach to video-language modeling is setting new benchmarks, enhancing our ability to analyze and interact with diverse media formats.With an impressive $107 million in funding from leading venture capital firms including NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, as well as visionary AI leaders like Fei-Fei Li and Alexandr Wang, we are strategically positioned to drive global innovation. Our headquarters in San Francisco, along with a significant presence in Seoul, illustrates our commitment to reshaping the future of technology.We celebrate the unique journeys of every individual and believe that our diverse backgrounds fuel our creativity and innovation. We invite passionate individuals who share our mission to join us in transforming video understanding and multimodal AI.Role OverviewUnlike typical video engineering roles that focus primarily on enhancing human playback, at Twelve Labs, we engineer video for machine comprehension. This unique approach prioritizes AI model performance over mere perceptual quality. As the Principal Software Engineer for Video Engineering, you will architect and implement our video processing pipelines, overseeing the entire journey from byte ingestion through to playback. Your work will be pivotal in ensuring our systems are efficient, cost-effective, and tailored for AI-driven video intelligence at scale.Your ResponsibilitiesEnd-to-End Ownership: Lead the design and execution of video pipelines, covering ingestion, decoding, chunking, storage, and playback, applicable to both batch and streaming modes.Codec Expertise: Guide the development of decoding strategies, container format management, and codec implementations, ensuring optimal performance for various video types.
Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.
About Us
At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.
We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.
If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.
The Role
We are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.
This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.
At Genmo, we are at the forefront of advancing artificial intelligence through innovative research in video generation. Our mission is to construct open, cutting-edge models that will ultimately contribute to the realization of Artificial General Intelligence (AGI). As part of our dynamic team, you will play a pivotal role in redefining the future of AI and expanding the horizons of video creation.
We are looking for a skilled GPU Performance Engineer who can extract maximum performance from our H100 infrastructure and fine-tune our model serving stack to achieve unparalleled efficiency. If you are passionate about optimizing performance, particularly at the microsecond level, and thrive on pushing hardware to its limits, this is the perfect opportunity for you.
Key Responsibilities
Utilize advanced profiling tools such as Nsight Systems and nvprof to analyze and enhance GPU workloads.
Develop high-performance CUDA and Triton kernels to optimize essential model functions.
Reduce cold start latency from seconds to mere milliseconds in our serving infrastructure.
Optimize memory access patterns, implement kernel fusion, and maximize GPU utilization.
Collaborate closely with machine learning engineers to optimize model implementations.
Diagnose and resolve performance issues throughout the application and hardware stack.
Implement custom memory pooling and allocation strategies to enhance performance.
Promote performance optimization techniques and foster a culture of excellence across teams.
ABOUT BASETEN
Baseten is at the forefront of AI technology, empowering leading-edge companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer to seamlessly integrate advanced AI models into their operations. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators to bring their most ambitious AI products to life. With our recent $300M Series E funding from top-tier investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we are poised for rapid growth. Join us in shaping the platform that engineers rely on to deploy transformative AI solutions.
THE ROLE
Are you driven by a passion for enhancing artificial intelligence applications? We are seeking a proactive Software Engineer specializing in ML performance to join our energetic team. This position is perfect for backend engineers who thrive in a fast-paced startup environment and are eager to make substantial contributions to the realm of Large Language Model (LLM) Inference. If you're enthusiastic about optimizing open-source ML models, we can't wait to hear from you!
EXAMPLE INITIATIVES
As a member of our Model Performance team, you will have the opportunity to work on exciting projects, including:
Baseten Embeddings Inference: The quickest embeddings solution available
The Baseten Inference Stack
Driving model performance optimization
RESPONSIBILITIES
Develop, refine, and implement advanced techniques (quantization, speculative decoding, kv cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
Conduct thorough investigations into the codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA, and other libraries to troubleshoot and resolve ML performance issues.
Scale and apply optimization techniques across a diverse array of ML models, with a focus on large language models.
Role overview
This Software Engineer position at OpenAI focuses on inference and performance optimization. Based in San Francisco, the role centers on increasing the speed and efficiency of advanced AI systems. Collaboration with experienced engineers is a key part of the work, with an emphasis on refining AI performance.
What you will do
Work on optimizing the performance of AI inference systems
Collaborate with other engineers to improve efficiency and speed
Contribute to solutions that enhance AI system capabilities
Location
This role is based in San Francisco.
OpenAI is seeking a Software Engineer in San Francisco to focus on improving productivity by optimizing model performance. This position centers on developing solutions that make machine learning models more efficient and effective.
Role overview
This role involves working closely with teams across different functions to identify and address areas where model performance can be improved. The aim is to deliver changes that have a measurable impact on both systems and workflows.
What you will do
Collaborate with engineers and other specialists to enhance model efficiency
Develop and implement solutions that improve the effectiveness of machine learning systems
Contribute to projects that streamline processes and drive productivity gains
Impact
Your work will help shape improvements in how models operate and how teams at OpenAI achieve their goals. The changes you help deliver will support more effective use of resources and better outcomes for the organization.
Full-time|$180K/yr - $286K/yr|Remote|San Francisco, CA | Remote
About the Role
Join us in creating a cutting-edge AI-driven platform and web application designed for the seamless and rapid generation of audio and video content. Our mission is to revolutionize how users can record, transcribe, edit, and mix multimedia online, presenting us with unique technical challenges that require innovative solutions. We are seeking talented Software Engineers at both the senior and staff levels to fill several positions on our Agent team, dedicated to shaping the future of Agentic Video Editing!
About the Team:
Since launching in Open Beta in July 2025, the Agent team at Descript has been focused on enhancing user experiences for chat-based editing, expanding our agent's capabilities, and refining our infrastructure to support the ongoing advancement of AI models. Our team comprises AI Researchers and Software Engineers who are passionate about making video editing accessible to everyone. Collaborate with peers located in San Francisco, New York, and across the US and Canada.
This role is perfect for you if you're excited about:
Pushing the boundaries of quality in video editing agents through experimentation (e.g., tool design, token/context optimizations, reinforcement learning, multimodal capabilities).
Creating an exceptional product experience for agentic video editing interactions that boosts user retention (e.g., refining prompt templates, addressing user feedback, implementing essential product features like chat history).
Establishing a top-notch developer experience for building agents (e.g., logging systems, evaluation frameworks, online monitoring, and feedback loops).
Join Canva as a Staff Software Engineer specializing in Video Templates! In this pivotal role, you will be responsible for designing and implementing innovative video solutions that enhance user experience. Collaborate with cross-functional teams to create visually appealing and user-friendly video templates that cater to diverse needs. Your expertise in software engineering will contribute to our mission of empowering people to create stunning visual content effortlessly.
Join Canva as a Senior Engineering Manager specializing in Video Templates. In this pivotal role, you will lead a dynamic team of engineers to innovate and enhance our video template offerings, ensuring a seamless and engaging user experience. You will be responsible for setting the technical direction, driving projects from inception to delivery, and collaborating with cross-functional teams to align on product vision and execution.
As a key leader, you will mentor and guide your team members, fostering a culture of creativity and excellence. Your expertise will help shape the future of video content creation at Canva, making it accessible and enjoyable for users around the globe.
Join Cloudflare as a Software Engineer dedicated to enhancing our network performance and reliability. In this dynamic role, you will collaborate with cross-functional teams to develop innovative software solutions that optimize our network infrastructure and ensure high availability and performance for our users. Your contributions will directly impact millions of users worldwide, making the internet a safer place for everyone.
Full-time|$180K/yr - $250K/yr|On-site|San Francisco
Join fal in our pursuit to maintain a leading edge in model performance for generative media models. You'll be instrumental in designing and implementing innovative solutions for model serving architecture, built on our proprietary inference engine. Your focus will be on maximizing throughput while minimizing latency and resource consumption. In addition, you will create performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Collaborate closely with our Applied ML team and clients in the media sector to ensure their workloads leverage our accelerator effectively.
Dec 16, 2025