[Remote] AI Research Lead (Multimodal & Video Foundation Model)
Note: This is a remote position open to candidates in the USA. Tether.io is pioneering a global financial revolution by building cutting-edge blockchain solutions. The AI Research Lead will drive the technical direction and build multimodal foundation models for image, video, and 3D generation, while collaborating with world-class engineers and researchers to advance open-source development and the global AI community.
Responsibilities
• Lead the research, design, and development of state-of-the-art image, video, and 3D generation models, including multimodal foundation models
• Lead high-impact, specialized projects focused on innovative text, image, audio, and video applications
• Define and drive the technical roadmap for multimodal AI initiatives, aligning research goals with business and product objectives
• Provide technical leadership and mentorship to teams of AI researchers and engineers, fostering innovation and skill development
• Oversee the end-to-end lifecycle of multimodal model development, from dataset curation and model training to deployment and performance evaluation
• Lead large-scale multi-node GPU model training, ensuring scalability, efficiency, and reproducibility of experiments
• Collaborate closely with cross-functional teams, including product, design, and engineering, to integrate AI solutions into production systems
• Drive applied research initiatives in image/video/3D generation, editing, animation, and other related domains
• Monitor advancements in AI research and multimodal technologies, and incorporate novel techniques to improve model capabilities and performance
• Contribute to the AI research community, including publications, open-source contributions, and participation in conferences
• Establish best practices and standards for coding, model evaluation, and experimentation within the team
• Lead and manage complex projects, ensuring timely delivery, quality outcomes, and alignment with strategic objectives
• Communicate technical insights and updates effectively to executive leadership, stakeholders, and external collaborators
• Promote a culture of collaboration, innovation, and excellence, maintaining high team morale and accountability
Skills
• PhD, MS or equivalent experience
• Hands-on experience building image, video, and 3D generation models and multimodal foundation models from scratch
• 5+ years of experience managing or leading research and engineering teams of 10+ people
• Excellent communication and interpersonal skills
• Excellent understanding of the AI-based product lifecycle
• Hands-on experience building end-to-end multimodal foundation models on multi-node clusters with thousands of GPUs
• Proficiency in modern deep learning and diffusion frameworks & libraries
• Demonstrated expertise in computer vision, video generation foundation models, and/or multimodal research, especially in building such models from scratch
• Strong track record of delivering innovation in the multimodal and video space
• Ability to develop a long-term vision and execute strategies at scale while maintaining a grasp of technical details for better decision-making
• Experience with VP-level presentations and reporting
• Publications at leading AI conferences such as CVPR, ICCV, ECCV, ICML, ICLR, and NeurIPS
Company Overview
• Tether has evolved to meet global needs with agility and vision. It was founded in 2014 and is headquartered in Seattle, Washington, USA, with a workforce of 201-500 employees. Its website is https://tether.io.