About the job
Join Sciforium, a pioneering AI infrastructure company, as we develop state-of-the-art multimodal AI models along with our proprietary, high-efficiency serving platform. With substantial multi-million-dollar funding and direct support from AMD engineers, our dynamic team is rapidly expanding to build the complete stack that powers cutting-edge AI models and real-time applications.
About the Role
In this role, you will contribute to the core systems behind Sciforium’s multimodal AI models. You will help build the model serving platform, using C++, Python, runtime execution, and distributed infrastructure to deliver a fast and reliable engine for real-time AI applications.
This role offers hands-on experience in performance engineering and insight into how large AI models are optimized and deployed at scale. You will work closely with ML researchers and experienced systems engineers. If you are passionate about creating exceptional developer experiences, are committed to performance, and want exposure to the entire AI stack, this position offers impactful work and significant growth opportunities.
Your Responsibilities
Interactive AI Chat Interface: Design and implement a low-latency, chat-like interface for users to explore our LLMs.
Developer Console: Develop the UI that lets users generate API keys, set budget limits, and view real-time usage graphs.
Payments & Billing: Integrate Stripe or a similar service to manage complex subscription models.
Documentation Portal: Create a dynamic API reference section for users.
CLI/SDK Integration: Develop user-friendly client-side wrappers to facilitate seamless connections to our API endpoints.
Backend APIs: Construct secure API gateways and implement low-latency streaming solutions.
Ideal Candidate Profile
Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
3+ years of software engineering experience, with a focus on front-end development.
Strong proficiency in TypeScript and Python.
Familiarity with responsive design and UX fundamentals.
Excellent collaboration and communication skills, with the ability to work effectively across engineering and ML teams.

