companyZyphra logo

Research Scientist, Model Architectures

ZyphraSan Francisco
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Qualifications:A strong research acumen and intuition. Proven ability to navigate research projects from initial conception to execution and final write-up. Exceptional implementation and prototyping skills, with the capability to swiftly transform ideas into experimental outcomes. A collaborative spirit and the ability to thrive in a fast-paced research environment. A deep curiosity and enthusiasm for understanding intelligence.

About the job

Zyphra is a cutting-edge artificial intelligence firm headquartered in the vibrant city of San Francisco, California.

Position Overview:

As a Research Scientist specializing in Model Architectures, you will play a pivotal role in Zyphra’s AI Architecture Research Team. Your responsibilities will include the design and thorough evaluation of innovative model architectures and training methodologies aimed at enhancing essential modeling capabilities (e.g., loss per flop or loss per parameter) and tackling core limitations inherent in current models. You will collaborate closely with our pre-training team to ensure that your findings are seamlessly integrated into our next-generation models.

Qualifications:

  • A strong research acumen and intuition.

  • Proven ability to navigate research projects from initial conception to execution and final write-up.

  • Exceptional implementation and prototyping skills, with the capability to swiftly transform ideas into experimental outcomes.

  • A collaborative spirit and the ability to thrive in a fast-paced research environment.

  • A deep curiosity and enthusiasm for understanding intelligence.

Requirements:

  • Experience with long-term memory, RAG/retrieval systems, dynamic/adaptive computation, and alternative credit assignment strategies.

  • Knowledge of reinforcement learning, control theory, and signal processing techniques.

  • A passion for exploring and critically evaluating unconventional ideas, with the ability to maintain a unique perspective.

  • Familiarity with modern training pipelines and the hardware necessities for designing efficient architectures compatible with GPU hardware.

  • Strong understanding of experimental methodologies for conducting rigorous ablations and hypothesis testing.

  • High proficiency in PyTorch and Python programming.

  • Ability to quickly assimilate into large pre-existing codebases and contribute effectively.

  • Prior publication of machine learning research in reputable venues.

  • Postgraduate degree in a scientific discipline (e.g., Computer Science, Electrical Engineering, Mathematics, Physics).

Why Join Zyphra?

  • We emphasize a structured research methodology that systematically addresses ambitious challenges in AI.

About Zyphra

About Zyphra:Zyphra is a cutting-edge artificial intelligence firm headquartered in the vibrant city of San Francisco, California, dedicated to pushing the boundaries of AI research and development.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.