Research Engineer Large Language Models jobs in New York City – Browse 1,306 openings on RoboApply Jobs

Research Engineer Large Language Models jobs in New York City

Open roles matching “Research Engineer Large Language Models” with location signals for New York City. 1,306 active listings on RoboApply Jobs.

1,306 jobs found

1 - 20 of 1,306 Jobs
Apply
companyMirage logo
Full-time|On-site|Union Square, New York City

About Mirage Mirage builds an AI-powered video platform that connects production and editing through natural language processing. Our models use contextual awareness to mirror the choices of skilled editors, streamlining workflows for experienced teams and making video creation more accessible to a wider audience. Learn More Our Product (Captions by Mirage) Our Research (Seeing Voices, technical white paper) Latest Updates (Mirage on X / Twitter) Mirage has been featured in TechCrunch, Forbes AI 50, and Fast Company. Our Investors Mirage is backed by leading venture firms and entrepreneurs, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, and others. Location Requirement All roles at Mirage require in-person work at our Union Square headquarters in New York City. Role Overview: Research Engineer – Large Language Models Mirage seeks a Research Engineer to design, build, and scale systems for training and deploying large language models, with a focus on multimodal creative applications in video analysis. This role works closely with researchers to turn new ideas into efficient, production-ready systems that strengthen our platform.

Apr 14, 2026
Apply
companyMirage logo
Full-time|On-site|Union Square, New York City

Mirage builds an AI-native platform for video production and editing, centered in Union Square, New York City. The platform uses natural language to guide intelligent orchestration, allowing advanced models to understand context and mimic the creative decisions of experienced editors. This approach aims to boost productivity for professional teams and open up video creation to a wider audience. About the Team The team at Mirage brings together people from a range of backgrounds, blending technical and artistic skills to solve tough challenges in generative media. The work goes beyond routine model development, focusing on problems that remain unsolved across the industry. Role Overview: Research Scientist, Large Language Models This early team role offers the chance to shape the core technology behind Mirage. The position involves tackling foundational questions in generative AI and creative tooling, with the potential to influence how people create and edit video for years to come.

Apr 14, 2026
Apply
companyAnthropic logo
Full-time|Remote|Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY

Anthropic is looking for a Research Engineer focused on model evaluations. This position involves research and development to assess and strengthen the performance of AI models. Teams are based in San Francisco and New York City, and the role supports remote work with required travel. Key responsibilities Design and implement evaluations for Anthropic's AI models Collaborate with team members to enhance model performance Contribute to research that pushes the boundaries of AI systems Location Remote-friendly (travel required) San Francisco, CA New York City, NY

Apr 28, 2026
Apply
companyRogo logo
Full-time|On-site|New York City

Why Join Rogo?At Rogo, we are pioneering the development of Wall Street's first authentic AI analyst. Our mission is to empower finance professionals at leading investment banks, private equity firms, and investment organizations with AI that offers unmatched speed, precision, and insight. We are not merely enhancing financial workflows; we are fundamentally transforming them.This is a rare opportunity to become part of a groundbreaking company at a pivotal moment. With a rapidly expanding client base, proven product-market fit, and support from top-tier investors, we are scaling swiftly and creating a new frontier in enterprise AI.Our team is intelligent, driven, and passionately committed to our mission. We work with vigor, take ownership of complex challenges, and maintain a relentless focus on our users. If you excel in a fast-paced environment, demand excellence, and wish to contribute to shaping the future of finance, we welcome you to join us.Your ResponsibilitiesAs an AI Researcher at Rogo, you will be at the forefront of translating cutting-edge technological advancements into revolutionary financial products. You will utilize the latest innovations in Large Language Models (LLMs) and Reinforcement Learning, technologies that we believe will fundamentally transform knowledge work, placing you at the center of this exciting evolution.Design and implement end-to-end learned systems that integrate the capabilities of LLMs and Reinforcement Learning to automate comprehensive financial workflows.Develop large-scale infrastructure for assessing answer and response quality, and feed this data back into the learning systems as a reward mechanism.Collaborate closely with product managers and engineers to ensure that every solution you create delivers genuine user value and addresses real needs.Elevate standards for code quality, reliability, and product velocity. Together, you will challenge yourself and your colleagues to grow both technically and interpersonally.QualificationsBachelor's degree in Computer Science or a related field.3-6 years of experience in training large language models or post-training (or equivalent PhD experience).Proficient in writing LLM training code using PyTorch or JAX.Experience with a strongly typed programming language (e.g., Rust, C++, Java).Strong programming skills and a solid foundation in Computer Science principles.

Apr 29, 2025
Apply
companyPercepta logo
Full-time|On-site|New York City

About UsAt Percepta, we are on a mission to revolutionize pivotal industries through applied AI. Our focus is on ensuring that the sectors that drive our world, including healthcare, manufacturing, and energy, harness the power of cutting-edge technology. To achieve this, we closely collaborate with top-tier clients to facilitate AI transformation. We unite:Expertise in engineering, product, and research that is deployed at the forefront of innovation.Mosaic, our proprietary toolkit designed for the rapid implementation of intelligent workflows.Strategic alliances with leading firms such as Anthropic, McKinsey, AWS, and others within the General Catalyst portfolio.Our team consists of a dynamic group of Applied AI Engineers, Embedded Product Managers, and Researchers who are passionate about integrating AI to enhance everyday experiences. Percepta is a proud partner of General Catalyst, a global leader in transformation and investment.Role OverviewAs a Research Engineer/Scientist specializing in LLM Modeling at Percepta, you will be at the forefront of developing and deploying large-scale language models. You will engage in pre-training, post-training (including instruction tuning, alignment, and distillation), reinforcement learning, and the crafting of specialized architectures to enhance reasoning, decision-making, and adaptability across critical sectors.Collaboration is key as you will work closely with Embedded Product Managers and engineers to create innovative yet practical decision systems that significantly transform business operations.Identify significant challenges and formulate research strategies encompassing pre-training, post-training, RL, and specialized model development.Prototype and scale training pipelines for large language models, experimenting with architectures, optimization methods, and post-training tactics.Contribute to the infrastructure necessary for high-performance distributed training.Conduct extensive real-world evaluations that yield millions in value.Collaborate with applied AI engineers to transition successful research findings into actionable features within our Mosaic platform.Effectively communicate research outcomes to both technical and non-technical stakeholders, ensuring clarity on the implications of research and its applications.

Oct 2, 2025
Apply
companyMedal logo
Full-time|On-site|New York City

About General IntuitionGeneral Intuition is at the cutting edge of AI research, dedicated to developing foundational models that excel in deep spatial and temporal reasoning. Over the past year, we have advanced the capabilities of AI agents that navigate complex environments, created world models that serve as training grounds for these agents, and innovated video understanding models aimed at real-world application.We are proud to have raised a seed funding round of $133 million from General Catalyst and Khosla, driving our mission to uncover the next generation of intelligence.What We're Looking ForMinimum of 5 years of experience in deep learning research or reinforcement learning, specifically with embodied agents or simulation environments.Solid foundation in representation learning and generative modeling, particularly using architectures like diffusion models, VAEs, and transformers applied to video data.Experience with world models and predictive control — you possess knowledge on training models that simulate dynamics and plan actions within learned environments.Proficiency in reinforcement learning (RL, model-based RL, or imitation learning), coupled with the capability to design and evaluate policy networks.Excellent programming skills in Python and deep learning frameworks such as PyTorch.Strong experimental capabilities — adept in handling large-scale training, evaluation pipelines, and managing intricate datasets or simulations.Publications or contributions to open-source projects in domains such as world modeling, simulation learning, or agent policies are a significant advantage.In-person requirement: We are specifically seeking candidates located in New York City, with a commitment to working in the office five days a week.Ownership & scientific rigor: You are committed to seeing ideas through from conception to proof of concept to deployment. You write clean, reproducible code and uphold high standards for experimental validity.Performance and scaling mindset: You understand how research can be translated into production systems, with a keen awareness of compute efficiency, distributed training, and potential data bottlenecks.Curiosity-driven and results-oriented: You thrive on tackling open-ended problems, while also knowing how to set measurable goals and deliver impactful systems.Passion for gaming & simulation: A strong interest in interactive environments and physics-based simulations is highly desirable.

Oct 5, 2025
Apply
company
Full-time|On-site|New York City

About UsMirror Physics is an innovative AI company based in New York City, pioneering the next generation of scientific simulation technologies. Our mission is to create intelligent systems that grasp the fundamental principles of physics, thereby providing essential acceleration for advanced research and development across various technological fields. We are currently developing a leading-edge AI platform that predicts experimental outcomes in chemistry and materials science, seamlessly integrating physical simulation with high-throughput experimental verification. This endeavor aims to hasten discoveries in biotechnology, energy, manufacturing, and more. Supported by top-tier investors and scientific experts, we are seeking to expand our research team during this crucial period in the industry.The RoleAs the principal AI researcher focusing on physics model development, you will lead efforts in designing innovative architectures, training algorithms, and evaluation processes to transform vast amounts of physical simulation data into scalable, precise, and versatile predictive engines applicable in both scientific and industrial contexts.Key ResponsibilitiesCreate robust, scalable, and universally applicable atomistic models with high fidelity across various chemical domains.Compile diverse and multi-fidelity datasets into cohesive training corpora; innovate new objectives to enhance data efficiency.Produce groundbreaking datasets that encompass an unmatched variety of chemical systems, consistently computed at the highest theoretical levels suitable for general chemistries.Design diagnostic tools for model performance evaluation, failure mode assessment, and uncertainty quantification; propose new benchmarks to rigorously test predictive accuracy, physical consistency, and extrapolation capabilities.Develop downstream tools to improve model precision and processing speed, including model distillation and fine-tuning techniques.Collaborate with the AI-for-science community through research publications and contributions at leading conferences such as NeurIPS, ICML, and ICLR.Mentor junior researchers and work closely with applied science and engineering teams.

Jun 10, 2025
Apply
company
Full-time|On-site|New York City

Role overview Distributed Spectrum is seeking a Machine Learning Research Specialist to advance research on RF Foundation Models. The focus of this position is on developing new machine learning methods that support radio frequency communications. What you will do Lead research into machine learning techniques tailored for RF communications. Create and evaluate new data-driven models to enhance connectivity. Work closely with colleagues who combine expertise in machine learning and radio frequency technology. The team Distributed Spectrum’s team investigates innovative approaches to connectivity and communication. Their projects push the boundaries of what is possible in machine learning and RF signal processing. Location This position is located in New York City.

Apr 24, 2026
Apply
companydoji logo
Full-time|On-site|New York City

The OpportunityAt doji, we are revolutionizing the fashion shopping experience through the use of AI avatars. Our focus is on developing innovative diffusion models and crafting unique, personalized interfaces that enhance user engagement.Based in the vibrant heart of New York City, we are a passionate team of creative innovators with a unique blend of taste and extensive AI knowledge. Our backgrounds include launching successful consumer products at renowned companies like Apple, DeepMind, Meta, and emerging startups. Supported by investors from OpenAI, Cursor, and SKIMS, we stand at the crossroads of AI, technology, and culture.The RoleWe are in search of talented Research Engineers to play a pivotal role in advancing our avatar and virtual try-on models. This position requires in-person collaboration at our NYC office.As a Research Engineer, your responsibilities will include:Developing state-of-the-art generative AI features for our productsConstructing data pipelines to integrate large-scale image and video datasets into model trainingEnhancing model performance based on user feedback and preference tuningOptimizing virtual try-on experiences for various body types, clothing styles, and aesthetic preferencesCollaborating within a shared codebase while maintaining high coding standardsCreating cutting-edge personalized try-on solutions in video and 3D formatsWorking closely with our founders to develop industry-leading technologiesYou are an ideal candidate if you:Possess hands-on experience in training and fine-tuning diffusion models (a must-have)Are knowledgeable in implementing and optimizing diffusion model training pipelinesHave a strong passion for craftsmanship and creating groundbreaking experiencesThrive in dynamic, fast-paced environments (such as startups) or have been a high-performing individual contributor in academia or open-source projectsAre self-motivated, proactive, and quick to learn new concepts deeplyHave experience implementing machine learning papers from scratch using PyTorch or JAXGet excited about exploring new machine learning methodologies and read academic papers for enjoymentPossess problem-solving instincts to navigate challenges effectivelyHave a genuine interest in fashion, avatars, or creative expression

May 6, 2025
Apply
companyAnthropic logo
Full-time|Hybrid|New York City, NY; New York City, NY | Seattle, WA; San Francisco, CA

Join our dynamic team at Anthropic as a Research Engineer or Research Scientist specializing in Tokens. In this role, you will engage in groundbreaking research and development, contributing to the advancement of our token systems and enhancing their capabilities. Your work will be pivotal in shaping the future of our technology and driving innovation in our projects.

Mar 12, 2026
Apply
companyQuantile Health logo
Full-time|$195K/yr - $275K/yr|On-site|New York City

Role: AI Engineer & Data Science Team LeadJoin our innovative team at Quantile Health as we harness the power of artificial intelligence to revolutionize healthcare. We are seeking a skilled AI Engineer & Data Science Team Lead who excels in data-driven problem-solving. In this pivotal role, you will guide our AI and Data Science team in utilizing large language models (LLMs) to convert unstructured data into meaningful ontologies and actionable insights, facilitating drug launches and access strategies.As a leader and mentor for a team of 3-4 data scientists, you will establish technical direction, oversee project management, and remain actively involved in developing scalable, production-ready AI applications at the intersection of AI and healthcare.About Quantile HealthQuantile Health, a seed-stage AI startup based in New York, is dedicated to creating agentic applications aimed at expanding patient access to transformative medications while reducing costs for drug innovators.ResponsibilitiesLead the development and delivery of impactful agentic LLM applications that drive significant business outcomes.Manage AI initiatives from conception to execution, ensuring quality and timeliness while mentoring a talented team of data scientists.Collaborate with company founders to shape and steer the product roadmap and technical strategy.Engage in hands-on research, model development, and the construction of production-ready AI systems.Work in tandem with the Engineering team to design system architecture and scalable data infrastructure.QualificationsStrong conceptual thinker with the ability to translate complex business challenges into structured data problems.Deep passion for LLMs and real-world AI applications, exhibiting a pragmatic and impact-oriented mindset.PhD in a quantitative discipline (Computer Science, Computational Biology, Statistics, etc.) with a minimum of 3 years of industry experience, or a Bachelor’s/Master’s degree with at least 7 years of relevant industry experience in finance, biotech, or tech startups.At least 1 year of team management experience.Hands-on expertise in building AI applications, including prompt engineering and advanced analytics, with a focus on converting large-scale unstructured data into production-ready solutions.

Feb 23, 2026
Apply
companyDecagon logo
Full-time|On-site|New York City

Role overview Decagon seeks a Staff Research Engineer based in New York City. This position helps guide research efforts and supports the development of technical solutions that serve a range of industries. The Staff Research Engineer works alongside a skilled team, taking projects from early ideas through to working implementations. What you will do Advance research initiatives at Decagon through direct engineering contributions Collaborate with team members to design and build new technical solutions Apply technical expertise to projects that impact several sectors Requirements Solid background in both research and engineering Interest in tackling complex technical problems Strong collaborator who values team-based problem solving

Apr 23, 2026
Apply
companyDecagon logo
Full-time|On-site|New York City

Decagon seeks a Senior Research Engineer based in New York City. This role centers on driving research initiatives and developing new technologies that align with the company’s objectives. Role overview The Senior Research Engineer collaborates with team members from various disciplines. The work involves designing experiments, analyzing outcomes, and creating solutions for complex challenges. Key responsibilities Advance research projects that contribute to Decagon’s goals Build and refine technologies to address technical problems Work with colleagues from diverse backgrounds to design experiments and interpret findings

Apr 23, 2026
Apply
companyPercepta logo
Full-time|On-site|New York City

Who We AreAt Percepta, our mission is to revolutionize essential sectors through applied AI. We are dedicated to ensuring that key industries, such as healthcare, manufacturing, and energy, leverage cutting-edge technology. By collaborating with industry leaders, we facilitate AI transformation.Expertise in engineering, product development, and research.Mosaic, our proprietary toolkit for swift deployment of agentic workflows.Strategic alliances with companies such as Anthropic, McKinsey, AWS, and others in the General Catalyst portfolio.Our team is rapidly expanding, consisting of Applied AI Engineers, Embedded Product Managers, and Researchers who are passionate about integrating AI advancements into our daily lives. Percepta is a direct partnership with General Catalyst, a global leader in transformation and investment.About the RoleAs a Research Engineer/Scientist (Optimization) at Percepta, you will play a crucial role at the convergence of AI research and impactful real-world applications. You will push the boundaries of decision-making in vital industries by integrating state-of-the-art machine learning with robust optimization research. Collaborate closely with our Embedded Product Managers (EPMs) and engineers to ensure that our advanced decision systems are both innovative and practically beneficial for transforming organizational operations.ResponsibilitiesEstablish and lead ambitious research initiatives that broaden the horizons of data-driven decision-making.Develop innovative optimization and machine learning methodologies for significant challenges such as planning, scheduling, routing, pricing, and inventory management.Create high-fidelity simulators and comprehensive benchmarks that accurately reflect real-world constraints, uncertainties, and multi-objective trade-offs.Translate research into practical applications by partnering with engineers to swiftly prototype solutions and effectively implement research insights.You May Be a Great Fit If YouHold a degree in Computer Science, Operations Research, Industrial Engineering, or Applied Mathematics (MS/PhD preferred) or possess equivalent research or industry experience.Have demonstrated experience in optimization and machine learning.Exhibit a strong ability to collaborate and communicate effectively across multidisciplinary teams.

Oct 2, 2025
Apply
companyReddit, Inc. logo
Full-time|$124.7K/yr - $199.1K/yr|On-site|New York City, NY

Reddit, the leading platform for community engagement, is seeking a dynamic Senior Client Partner to join our Large Customer Sales team within the Tech vertical. This pivotal role involves cultivating strategic relationships with prominent brands and agencies, leveraging Reddit’s comprehensive advertising solutions to help clients achieve their business goals. Join us in empowering authentic conversations while driving innovative advertising strategies.

Feb 9, 2026
Apply
companyAnthropic logo
Full-time|$350K/yr - $850K/yr|On-site|San Francisco, CA | New York City, NY

About Anthropic Anthropic builds AI systems with a focus on reliability, interpretability, and steerability. The company’s mission centers on making AI safe and beneficial for both users and society. Teams at Anthropic include researchers, engineers, policy specialists, and business leaders who work together to advance AI for the greater good. Safeguards Labs Team Safeguards Labs operates where research meets engineering. The team explores new safety methods to protect Claude and its users. Members design and prototype approaches for model safety, usage controls, and production reliability. Before moving ideas into production, the team tests concepts through offline analysis and controlled traffic trials. Safeguards Labs collaborates with groups focused on account abuse prevention and model behavior safeguards, serving as the research arm that turns complex safety challenges into practical defenses. Role Overview: Research Engineer Research Engineers in Safeguards Labs shape and execute the team’s research agenda. This role involves scoping independent projects, running experiments end to end, and deciding when a concept is ready for production or should be set aside. The team is intentionally small, with about three researchers for every software engineer. This structure gives each person broad autonomy and a strong voice in the direction of the group. Location San Francisco, CA or New York City, NY

Apr 17, 2026
Apply
companyReddit, Inc. logo
Full-time|On-site|New York City, NY

As a Senior Client Partner in Large Customer Sales focusing on the Pharma sector, you will play a pivotal role in driving strategic partnerships and delivering innovative solutions to our clients. You will leverage your deep industry knowledge and sales expertise to engage with key stakeholders, understand their needs, and provide tailored approaches that enhance their business outcomes.This position requires a proactive individual who can navigate complex sales cycles and foster long-term relationships with clients. You will collaborate closely with internal teams to ensure the successful execution of campaigns and initiatives.

Mar 25, 2026
Apply
companyReddit, Inc. logo
Full-time|On-site|New York City, NY

Join Reddit, a dynamic and innovative platform, as a Senior Client Partner in Large Customer Sales (Travel). In this role, you will be at the forefront of driving strategic partnerships with our largest clients in the travel sector. Your expertise will help shape marketing strategies and ensure our clients achieve their business objectives through tailored solutions on the Reddit platform.As a Senior Client Partner, you will leverage your extensive experience in sales and client management to foster relationships, identify opportunities, and deliver exceptional value. You will collaborate closely with cross-functional teams to provide insights and develop campaigns that resonate with our travel audience.

Mar 27, 2026
Apply
company
Full-time|On-site|New York City HQ

At Trunk Tools, we are pioneering the integration of AI in the construction industry, which is the second-largest sector globally. Recently, we successfully secured a $40 million Series B funding round led by Insight Partners, bringing our total funding to $70 million from esteemed investors such as Redpoint and Innovation Endeavors. This investment is propelling us into our next stage of growth as we deploy AI agents across various job sites.Our mission is to revolutionize construction through intelligent automation. Despite being a $13 trillion industry, many processes remain outdated and analog. We are on a mission to transform this landscape by embedding AI directly into field operations.Founded by professionals from the construction and technology sectors, including alumni of Stanford and MIT, our team has developed software utilized by over 140,000 field professionals, impacting millions and contributing to over $10 billion in completed projects. Many team members have firsthand experience in the field, granting us a profound understanding of the industry's unique challenges.After years of development, we are ready to launch production-level AI agents, starting with intelligent document processing and question-answering solutions, and we are rapidly expanding into core operational workflows. With our team having doubled in size over the past year to over 65 employees (including more than 25 engineers), we are entering a phase of hypergrowth. This is a unique opportunity to join us at a pivotal moment.

Jun 30, 2025
Apply
companyOXMAN logo
Full-time|On-site|NYC Office

OverviewAt OXMAN, a pioneering nature-focused research and design firm based in Manhattan, we leverage nature-centric solutions to tackle pressing challenges at the intersection of Nature and Humanity. Our interdisciplinary approach spans various sectors, fostering innovative concepts that redefine the interaction between the built environment and the natural ecosystem.Our initiative, EDEN, is dedicated to applied ecological systems operating on architectural, urban, and landscape scales. We are at the forefront of creating a digital design framework that engineers biodiverse and resilient ecosystems by developing software that empowers designers and engineers to incorporate essential ecosystem services—such as carbon sequestration, thermal buffering, and biodiversity—into their master planning and design processes. Our mission is to elevate ecosystem services from undervalued byproducts to vital infrastructure elements. We invite talented professionals to join us in revolutionizing the future of nature-centric design.We are currently seeking a Machine Learning Research Engineer to integrate into our dynamic design team in New York City. This role will be pivotal in supporting our efforts to deliver exceptional, data-driven design proposals utilizing EDEN’s innovative suite of ecosystem engineering tools.

Feb 24, 2026

Sign in to browse more jobs

Create account — see all 1,306 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.