company

Senior Machine Learning Engineer (Immediate Joiner)

prox-worksRemote — India
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Required Qualifications4-10 years of experience in Machine Learning or Applied ML Engineering. Strong fundamentals in deep learning, Transformers, and generative model architectures. Hands-on experience in model training at scale, including fine-tuning large models (LoRA, full fine-tune) on custom datasets. Solid MLOps experience: experiment tracking (MLflow, W&B), CI/CD for ML, model versioning, and serving frameworks (Triton, TorchServe, vLLM, or equivalent). Proficient in Python with fluency in PyTorch and the modern ML stack. Experience deploying and operating ML systems in distributed cloud environments (GCP, AWS, or Azure) focusing on GPU provisioning, autoscaling, and cost management. Ability to tackle ambiguous, high-impact problems while collaborating with cross-functional teams. Preferred QualificationsExperience with video generation, diffusion models, or multimodal architectures (DiT, U-Net, audio-video joint models). Familiarity with LoRA/IC-LoRA fine-tuning workflows for character or identity consistency. Background in media, OTT, sports, or large-scale content platforms. Knowledge of inference optimization techniques: quantization (FP8/INT8), batching, async orchestration, and GPU memory management.

About the job

About the Role

We are seeking a proactive and skilled Senior Machine Learning Engineer ready for an immediate start. This role blends robust machine learning expertise with backend infrastructure knowledge, essential for transitioning models from research to dependable, scalable production systems. You will be at the forefront of innovation in generative video AI while applying MLOps methodologies to ensure efficient operation at scale.

In this position, you will collaborate closely with backend, platform, and content teams to create high-performance machine learning components that meet rigorous standards for quality, latency, and throughput.

Key Responsibilities

  • Train, fine-tune, and assess generative video and multimodal models (including image-to-video, text-to-video, lip-sync, and character consistency).
  • Develop and manage comprehensive ML pipelines encompassing data ingestion, preprocessing, training, evaluation, and versioning.
  • Oversee model deployment and serving infrastructure, focusing on containerization, GPU-optimized inference, model registries, and rollout strategies.
  • Implement MLOps best practices such as experiment tracking, model monitoring, drift detection, A/B testing, and observability.
  • Design and maintain scalable inference systems tailored for low latency, high throughput, and economical GPU utilization.
  • Devise caching and batching strategies to achieve SLA targets in production video generation workflows.
  • Partner with backend engineering teams to integrate ML services into distributed systems.
  • Contribute to the long-term roadmap, including foundational model training strategies, LoRA fine-tuning pipelines, and multi-character generalization.

About prox-works

prox-works is a pioneering company at the forefront of generative video AI technology, dedicated to creating innovative solutions that enhance the production and performance of multimedia content. We pride ourselves on a collaborative work environment that fosters creativity and the development of cutting-edge solutions.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.