Software Engineer - Specializing in Quantization

FuriosaAI | Seoul HQ

On-site Full-time

Responsibilities

- Development of model compression tools
- Acquisition and performance validation of various quantized models
- Development of advanced compression algorithms based on findings

Minimum Qualifications

- Extensive experience in PyTorch development
- Experience in commercial software development
- Minimum of 3 years of practical experience in a related field

Preferred Qualifications

- Knowledge and experience in DevOps and MLOps
- Experience using LLM inference tools such as vLLM and TensorRT-LLM
- Knowledge and experience in deep learning quantization
- Experience working in a company related to deep learning acceleration

About the job

About the Algorithm Team - Model Compression Division

It is widely recognized that LLM quantization can significantly enhance inference efficiency. However, implementing this in real-world applications presents ongoing challenges. The Model Compression Division is dedicated to developing user-friendly model compression tools that address these challenges and empower customers to maximize the efficiency of their NPU.

Model compression tools achieve greater efficiency when they incorporate hardware-specific optimizations. To meet this demand, we have developed proprietary tools with optimization features tailored to our NPU, which lets us provide an essential software stack that maximizes NPU performance.

The FuriosaAI Model Compression tool is continuously evolving, with a focus on increasing automation, scalability, and reliability, leading to a growing demand for enhanced capabilities. As such, we are seeking talented software engineers with substantial software engineering experience who aspire to advance their careers as Model Compression Engineers.

About FuriosaAI

FuriosaAI is at the forefront of developing cutting-edge technologies that enhance model compression and inference efficiency. Our innovative solutions empower businesses to leverage their NPU capabilities to the fullest, ensuring optimal performance in real-world applications.

Similar jobs

Software Engineer - Agent System Developer

Furiosa AI

Full-time|On-site|Seoul HQ

Join Our AI Transformation Team

Our dynamic AI Transformation Team is dedicated to tackling intricate engineering challenges through innovative machine learning techniques, transitioning these solutions into impactful real-world applications. We specialize in creating agent systems designed to automate problem-solving, constantly refining and enhancing our approaches.

Our mission is to seamlessly connect research with practical application by constructing platforms that utilize agentic AI for tangible engineering solutions.

Key Responsibilities

- Architect and develop the foundational infrastructure for our Agent System Framework.
- Implement essential components of the agent runtime, focusing on orchestration, context management, tool execution, and memory abstraction.
- Work alongside cross-functional teams to quickly prototype and scale new agent functionalities into production environments.
- Design and enhance developer-friendly interfaces including CLI tools and SDKs.

Minimum Qualifications

- A Bachelor’s degree in Computer Science, a related technical discipline, or equivalent practical experience
- Proven experience in at least two of the following areas:
  - Designing and managing distributed systems or extensive backend systems.
  - Building and deploying machine learning/AI systems, particularly those based on LLMs, in a production setting.
  - Developing agent systems, workflow engines, or orchestration frameworks.
- Backend development proficiency in Python, Go, or C++.

Preferred Qualifications

- Master’s or PhD degree in Engineering, Computer Science, or a related field
- Demonstrated success in designing and implementing agentic AI system frameworks in real-world or large-scale contexts.
- Hands-on experience in developing LLM-based applications (e.g., for tool utilization, planning, reasoning systems).
- Experience in creating developer tools, such as CLI or SDKs.

Contact Us

For inquiries, reach out to us at: recruit@furiosa.ai

Mar 20, 2026

Senior Software Engineer, AI Agent

Sendbird

Full-time|On-site|Seoul, South Korea

Join us at the forefront of agentic AI, where you'll be part of a team that builds intelligent systems capable of deep understanding. At Sendbird, we are not just developing prototypes; we are deploying AI solutions at scale that genuinely think and adapt.

About Sendbird

Sendbird is committed to shaping the future of AI-driven customer interactions. With over a decade of experience in creating robust infrastructures for chat, voice, video, and messaging APIs, we've established ourselves as the leading CPaaS platform for in-app communications. Trusted by over 4,000 brands and facilitating 7 billion messages monthly, we connect 300 million active users.

Our clients include industry leaders like DoorDash, Match Group, Noom, Yahoo Sports, and Rakuten. Our track record speaks for our expertise and excellence.

Recognizing the transformative potential of AI in customer communication, we pivoted to an AI-first strategy in December 2024, launching our AI agent for enterprise customer experience by February 2025. Our rebranding to Delight.ai in November 2025 encapsulates our mission: to not just enhance efficiency, but to restore the essence of personalized service, creating delightful experiences for customers.

About Delight.ai

Delight.ai acts as the AI concierge for customer interactions, ensuring continuity and understanding across various channels. Unlike other AI agents, our solution retains memory, learns individual preferences, and maintains context, delivering a truly personalized experience.

Why Join as a Senior Software Engineer, AI Agent?

In an industry filled with superficial solutions, Sendbird's AI Agent platform stands out for its robustness, scalability, and genuine intelligence. This is not just maintenance work; you'll shape the core platform, designing orchestration, RAG pipelines, evaluation, and observability that will impact millions of customer interactions for years to come.

Mar 9, 2026

AI Agent Engineering Intern

Sendbird

Internship|Hybrid|Seoul, South Korea

At Sendbird, we merge omnichannel AI with world-class, validated communication APIs to empower businesses in creating AI agents that enable meaningful customer connections at scale.

Sendbird offers solutions with enterprise-level stability, security, and compliance. Over 4,000 leading global apps, including DoorDash, Match Group, Noom, and Yahoo Sports, trust Sendbird, facilitating over 7 billion conversations monthly through our platform.

To date, we have raised a total of $220 million in funding from top-tier Silicon Valley investors, including ICONIQ, SoftBank, Tiger Global, and Y Combinator, and we are the first global unicorn company in Korea with a valuation exceeding 1 trillion won. Our headquarters in San Mateo, California, along with R&D centers and offices in Seoul, APAC, New York, London, Singapore, India, and Canada, reflect our global reach.

Internship Opportunity: AI Agent Engineer

The AI Agent Engineering Intern will collaborate with engineering and product teams to build, integrate, and deploy intelligent agents. Throughout the internship, which lasts 8 to 12 weeks, you will gain hands-on experience from prototyping to production using the latest web frameworks, LLM APIs, and tooling pipelines.

Key Responsibilities:

Agent Prototyping and Development

- Develop micro-APIs for tasks such as hallucination detection and conversation resolution through prompt engineering.
- Test and validate the accuracy of AI agents using internal datasets and tools.
- Integrate backend APIs that support retrieval-augmented generation (RAG), knowledge bases, and custom toolchains.

Prompt Engineering and Tooling

- Create, test, and fine-tune prompts for various use cases.
- Establish CI/CD pipelines for prompt versions and agent deployment.

Integration and Deployment

- Embed agents into web, chat, and API-centric environments.
- Implement observability features like logging and dashboards to monitor real-time performance metrics.

Collaboration and Documentation

- Facilitate close collaboration among team members, including senior engineers, product managers, and UX designers.

Ideal Candidate:

- Proficient in Python with experience in writing clean and maintainable code.
- Hands-on experience in backend development using server-side frameworks like Django, FastAPI, or Express.
- Excellent communication skills and experience in collaborative environments, especially in agile teams.

Preferred Qualifications:

- Experience with prompt engineering frameworks (e.g., LangChain).
- Understanding of cloud platforms (AWS Lambda, Vercel, GCP Cloud Functions).
- Experience in AI product-related UX design or product management.
- Familiarity with Django or similar Python web frameworks.

What You Will Gain:

- Deep insights into cutting-edge agent-based AI development and system deployment.
- Cross-functional collaboration with engineering, product, and design experts.
- Mentoring from senior AI engineers, with opportunities for full-time transition.
- Experience with real customer use cases, performance tuning, and production-grade tool utilization.

Flexible Work Policy

Sendbird supports flexible working schedules. Given the importance of collaboration and teamwork, all team members work together in the office at least three days a week. Some roles may require more frequent office attendance, so please discuss specific requirements with your manager.

Mar 5, 2026

Senior Software Engineer

zoyi

Full-time|On-site|Seoul

Join Us to Create the Future of Communication!

At ChannelTalk, we are dedicated to driving sustainable growth for businesses through our all-in-one AI messenger, which focuses on enhancing customer conversations to guide business directions effectively.

Embracing the philosophy that 'the customer has the answers', we integrate CRM-based support experiences with AI automation to streamline customer service efficiency and improve customer experiences, all within a single product. Our rapid growth, achieving over 20% market share in Japan, sets a solid foundation for our ambitious expansion into the US market as a global SaaS leader.

We collaborate with top talents to develop a 'future classic' product that could symbolize a new generation, much like Google Search and the iPhone.

Aug 5, 2025

System Software Engineer at Furiosa AI | Seoul

Furiosa AI

Full-time|On-site|Seoul HQ

Key Responsibilities

- Develop and implement firmware solutions for Furiosa devices, ensuring optimal performance and reliability.
- Create and maintain device drivers tailored for Furiosa hardware, enhancing system integration.
- Establish testing frameworks to ensure the highest quality standards for firmware and device drivers.

Oct 3, 2025

Software Engineer, Developer Experience at Furiosa AI | Seoul

Furiosa AI

Full-time|On-site|Seoul HQ

Join Furiosa AI as a Software Engineer focused on Developer Experience! In this role, you will collaborate with cross-functional teams to enhance the development workflow, tools, and processes that empower our engineering teams. Your contributions will be pivotal in creating efficient and high-quality software solutions that drive our mission forward.

Apr 10, 2026

Security Software Engineer at Furiosa AI | Seoul HQ

Furiosa AI

Full-time|On-site|Seoul HQ

Key Responsibilities

- Craft and implement a robust end-to-end security architecture tailored for SoC-based products.
- Develop features for Secure Boot and Anti-rollback mechanisms.
- Manage Key Provisioning alongside Hardware Root of Trust.
- Oversee Secure Lifecycle State management (LCS) for firmware, keys, and devices.
- Design and integrate security modules to enhance firmware and system-level defenses.
- Conduct thorough vulnerability assessments, attack surface evaluations, and implement memory protection hardening strategies.
- Integrate, validate, and optimize hardware cryptographic engines including AES/GCM engines, TRNG, PKA, RSA, PQC, and hash accelerators.
- Establish secure firmware update and rollback protocols along with comprehensive product lifecycle controls.

Qualifications

- Deep understanding of embedded security principles, including Secure Boot, key management, and cryptography.
- Demonstrated experience with TLS, X.509, ECC, AES, and various cryptographic or security libraries.
- Proficient in C/C++ or Rust for developing secure system software.
- Experience identifying and analyzing security vulnerabilities in SoC/FPGA-based systems.
- Practical experience with hardware cryptographic engines and security IP blocks (e.g., AES/GCM, SHA accelerators, TRNG/DRBG, PKA).
- Solid understanding of firmware and key lifecycle management.

Preferred Qualifications

- Familiarity with TPM, HSM, TrustZone, OP-TEE, or related hardware security architectures.
- Experience in implementing secure firmware signing, provisioning, and deployment pipelines.
- Background in validating cryptographic engines under standards such as FIPS 140-3, NIST CMVP.
- Security engineering experience specifically for AI accelerators, networking chips, or SoCs.
- Contribute to development practices aligned with standards such as FIPS, PSA Certified, CAVP, and CMVP.

Dec 12, 2025

Software Engineer, Drivers/Runtime (Windows)

Furiosa AI

Full-time|On-site|Seoul HQ

Role Overview

Join Furiosa AI as a Software Engineer specializing in Drivers and Runtime for Windows. In this role, you will be a key player in developing vital components of our AI software stack tailored for Windows environments.

Key Responsibilities

- Lead the development of Drivers and Runtime systems that are integral to our AI software on Windows.
- Collaborate closely with Product Managers and Hardware/Platform developers to ensure seamless integration.
- Implement functionality for transferring and receiving models, data, and inference results to and from hardware.
- Develop features for monitoring hardware status, including temperature and error management.
- Oversee the abstraction of hardware from a cloud platform software perspective.

Qualifications

Minimum Requirements

- Bachelor's degree in Computer Science or a related field.
- Proficiency in C/C++ programming.
- Knowledge of Windows kernel drivers and system programming.
- Strong interest in operating systems and hardware architecture.
- Excellent communication skills.

Preferred Qualifications

- Experience in Windows system programming or driver development.
- Familiarity with WinDbg and Win32 API.
- Experience with reverse engineering.

Oct 3, 2025

Software Engineer - Compiler (AX Engineer)

FuriosaAI

Full-time|On-site|Seoul HQ

Role overview

The Compiler team at FuriosaAI builds the software stack that enables machine learning models to run efficiently on the company’s AI accelerator hardware. This Software Engineer - Compiler (AX Engineer) role is a new position based at the Seoul headquarters. The focus is on developing and refining AI-assisted engineering workflows, identifying bottlenecks, running experiments, and building tools and guides that the team can use repeatedly. While the role is part of the Compiler team, deep compiler expertise is not required. Daily work includes code reviews, debugging, CI and testing, documentation, and project management. This position is a good fit for someone who enjoys shaping new roles and working through open-ended challenges.

What you will do

- Identify challenges and opportunities in engineering workflows, including development, code review, debugging, CI/testing, documentation, and project delivery.
- Research, prototype, and benchmark AI and automation tools, such as coding agents, LLM-assisted reviews, and debugging assistants, using real team workflows.
- Create and maintain prompt libraries, workflow templates, and integration guides tailored to team needs.
- Improve CI/testing feedback and signals with the goal of reducing flaky tests and streamlining failure triage.
- Lead onboarding sessions and regular Q&A meetings to share knowledge about new tools.
- Define and track metrics like PR cycle time, review turnaround, regression triage time, and CI flakiness, iterating based on findings and team feedback.
- Stay current on developments in the AI tooling ecosystem and keep the team informed about relevant updates.

Requirements

- At least 3 years of experience in software engineering.
- Hands-on experience with LLM-based coding tools (such as GitHub Copilot, Cursor, Claude Code, or Codex).
- Strong understanding of software development workflows from an engineer’s perspective.
- Ability to quickly evaluate new tools and provide practical guidance to the team.

Preferred qualifications

- Bachelor’s degree in Computer Science or a related field.

Apr 23, 2026

Software Engineer - Kernel Programming Model

Furiosa AI

Full-time|On-site|Seoul HQ

Responsibilities

- Develop the PyTorch-Native Kernel Programming Model by designing an integration layer and runtime environments that facilitate the execution of custom kernels as native extensions in the PyTorch framework.
- Create a Tensor-Level Kernel Language by establishing high-level abstractions and programming models to support efficient model integration and expressive tensor operations.
- Implement a Virtual ISA to address performance bottlenecks through direct hardware control while preserving high-level programmability.
- Contribute to the Kernel Developer Ecosystem by building essential programming tools, technical specifications, and reference implementations to streamline kernel development and foster community contributions.

Mar 5, 2026

SoC Software Engineer

Furiosa AI

Full-time|On-site|Seoul HQ

Key Responsibilities

- Develop and sustain ROM code along with software for early system initialization.
- Port and tailor Arm Trusted Firmware (TF-A / TF-M) to meet project needs.
- Implement essential system trust primitives including Secure Boot, firmware updates, and RoT components.
- Work closely with hardware teams for effective SoC bring-up, low-level debugging, and silicon validation.
- Design system-level software with a focus on performance, reliability, and security requirements.

Dec 12, 2025

Software Engineer - Low Level Programming Interface & Runtime

Furiosa AI

Full-time|On-site|Seoul HQ

Join Furiosa AI as a Software Engineer specializing in Low Level Programming Interface & Runtime. In this role, you will develop and optimize low-level software systems that enhance the performance and efficiency of our cutting-edge AI technology.

Apr 10, 2026

AI Research Engineer

Furiosa AI

Full-time|On-site|Seoul HQ

Job Responsibilities

- Stay abreast of cutting-edge AI research trends, including LLM, Agentic AI, Diffusion, and Inference acceleration, and explore integration opportunities with our proprietary NPU.
- Lead groundbreaking AI research aimed at publication in top-tier academic conferences.
- Forge research collaborations with prestigious global institutions and manage associated projects.

Oct 3, 2025

NPU Management Interface Software Engineer

Furiosa AI

Full-time|On-site|Seoul HQ

Key Responsibilities

- Design and implement the NPU Management Interface (MI) firmware/software to facilitate seamless communication between Host/BMC and NPU devices.
- Develop and sustain MCTP, PLDM, and custom MI command handling for comprehensive out-of-band NPU management, monitoring, and control.
- Craft device-management features utilizing SMBus/I²C, I3C, PCIe VDM, or custom sideband channels.
- Integrate MI capabilities into the NPU firmware, including:
  - Health and error reporting mechanisms
  - Thermal and power telemetry systems
  - Runtime status, utilization metrics, and intricate debug information
- Ensure adherence to industry standards through thorough spec-driven design, implementation, and validation.
- Assist in bring-up processes, interoperability testing, rack-scale platform integration, and system-level validation.
- Formulate test strategies and validation tools aligned with MCTP and PLDM specifications.
- Conduct protocol compliance testing, regression testing, and interoperability verification.

Dec 12, 2025

Senior Hardware System Engineer

Furiosa AI

Full-time|On-site|Seoul HQ

Key Responsibilities

- Spearhead the design of AI system PCBs.
- Create and refine schematics while providing layout consultation for PCBs.
- Analyze and offer insights on PCB signal integrity and power integrity.

Essential Qualifications

- Bachelor's degree in Electrical or Electronics Engineering.
- 5 to 10 years of relevant industry experience.
- In-depth knowledge of passive components, DC-DC converters, LDOs, NOR-flash, DDR, and logic gates.
- Proficient in Allegro software for PCB design.
- Strong understanding of PCB stack-up configurations.
- Experienced in using measurement equipment effectively.

Preferred Qualifications

- Solid understanding and hands-on experience with various VRM topologies.
- Comprehensive knowledge of passive components.
- Extensive experience in power supply system design.
- Experience with PCB-level power integration.
- Familiarity with circuit simulation tools such as SIMPLIS, Allegro, and HSPICE.
- Knowledge of high-speed circuit design principles.

Contact Information

recruit@furiosa.ai

Oct 3, 2025

Software Engineer - Compiler (Middle-end)

FuriosaAI

Full-time|On-site|Seoul HQ

About the Role

At FuriosaAI, we are dedicated to advancing high-performance and energy-efficient AI systems through our innovative compiler technology. As deep learning models continue to diversify and evolve, the challenges of compilation become more complex. Our mission revolves around transforming these models into optimized executable programs while ensuring the integrity of their meaning and structure.

We believe in establishing robust abstractions and well-organized compiler logic instead of relying solely on heuristics. This foundational approach ensures consistency in correctness, scalability, and performance throughout the compiler.

In your role as a middle-end engineer, you will significantly influence the design of these abstractions and the operational framework of the compiler. This position demands a blend of analytical reasoning and innovative problem-solving to navigate the intricate challenges of compilation. Your contributions will be vital in creating a compiler that is both principled and pragmatic.

Apr 6, 2026

Software Engineer - Compiler Developer Productivity Engineer

Furiosa AI

Full-time|On-site|Seoul HQ

About the Role

At Furiosa AI, we are committed to enhancing the productivity of our compiler team by systematically designing and continuously improving our CI and development workflows.

In this role, you will focus on transforming CI into a more reliable, efficient, and user-friendly system through the development of developer productivity tools, including CI orchestrators.

Our compiler team executes large-scale tests across multiple clusters to optimize model performance in terms of both time and energy. Within this context, CI operates not merely as a pipeline executor but as a sophisticated orchestration system that coordinates complex workloads and resources, demanding high reliability and automated recovery capabilities.

- Aim for engineering that defines and resolves problems rather than simple operations.
- Engineers are encouraged to analyze issues independently and propose improvement directions.
- Important decisions are made collaboratively within the team.
- Improve both development productivity and stability through practical approaches.

Apr 8, 2026

Software Quality Assurance Engineer

furiosa-ai

Full-time|On-site|Seoul HQ

Key Responsibilities

- Design, develop, execute, and manage test cases to ensure the quality of our software solutions.
- Lead the verification of our solutions' functionality, performance, compatibility, and reliability.
- Analyze and reproduce issues that arise during the verification process and collaborate with relevant teams such as development, hardware, and operations to resolve them.
- Analyze and diagnose issues occurring in customer environments, contributing to internal solution improvements and enhancing customer satisfaction.
- Develop test scenarios and establish validation plans based on customer requirements.
- Build and enhance a verification framework based on quality standards throughout the product lifecycle.

Nov 20, 2025

Senior AI Engineer at fieldguide | Seoul, South Korea

fieldguide

Full-time|On-site|Seoul, South Korea

Join fieldguide as a Senior AI Engineer and be at the forefront of innovative artificial intelligence solutions. In this role, you will collaborate with cross-functional teams to design and develop advanced AI models, contribute to cutting-edge projects, and drive AI strategy across the organization. Your expertise will help shape the future of AI at fieldguide.

Mar 26, 2026

Software Engineer - Specializing in Quantization

FuriosaAI

Full-time|On-site|Seoul HQ

Oct 3, 2025
