PhD or Equivalent in Computer Science, Machine Learning, or a relevant field (or comparable industry research experience)
2-3+ years experience working on large-scale deep learning, particularly in transformers or multi-modal models
Demonstrated success through top-tier publications or industry AI products showcasing innovative results
Advanced proficiency in PyTorch, TensorFlow, or similar frameworks, with practical knowledge of distributed training
Strong familiarity with data pipeline engineering for large text/image/code datasets, including advanced data augmentation and retrieval-augmented generation
Experience leading or managing small AI research teams, balancing daily implementation with big-picture strategy
Desirable
Proven ability to iterate quickly, incorporating feedback while retaining a clear technical vision
Familiar with AWS
Familiar with Dev Ops and cost-effective training methods
What the job involves
Architect & Develop a Large-Scale Model:
Design, prototype, and refine a multi-modal transformer-based system that handles text, code, and image inputs
Establish robust training and versioning processes to ensure reproducible research outcomes
Drive Research & Innovation:
Investigate advanced techniques in language modelling, code generation, and multi-modal representations
Track recent developments in both academic and industry research, applying relevant innovations to our roadmap
Set Experimental Standards
Define performance metrics, conduct thorough ablation studies, and maintain clear documentation of experiments
Collaborate with DevOps to orchestrate large-scale training on cloud-based GPU clusters
Mentor & Lead:
Supervise and mentor ML Engineers and Data Specialists, advising on research methods, data strategies, and model refinements
Provide technical leadership on code reviews and oversee AI model deployment within a discreet environment
Collaborate Across the Company
Partner with product managers and other stakeholders to align research milestones with strategic objectives
Communicate progress and potential risks proactively, respecting the stealth nature of our operations
Safeguard Quality & Integrity:
Oversee security and ethical aspects of model deployment, ensuring compliance with the latest standards
Monitor and validate model performance, focusing on real-world application and reliability