We are seeking an AI Researcher to join our team in Pune, India. The successful candidate will have a strong background in AI research or engineering and experience with large language models, cloud compute environments, and model deployment pipelines.
Requirements
- Proven experience in AI research or engineering within academic or industry settings.
- Deep understanding of large language model (LLM) internals — including transformer architecture, attention mechanisms, fine-tuning methods, adapters (LoRA/QLoRA), and inference optimization.
- Proficiency in modern ML programming languages and frameworks such as Python, PyTorch, JAX, or Hugging Face Transformers.
- Hands-on experience developing or fine-tuning retrieval-augmented systems and agent-based models.
- Strong grasp of system architecture involving cloud compute environments, GPUs, and model deployment pipelines.
- Ability to prototype quickly, translating research papers into working demos within days.
- Excellent communication skills with the capability to convey complex research insights to cross-functional teams.