Research Engineer / Scientist, Multimodal

About the Team

Our team is dedicated to shaping the future of artificial intelligence by equipping ChatGPT with the ability to hear, see, speak, and create visually compelling images, transforming how people interact with AI in everyday life.

About the Role

We’re looking for Research Engineers and/or Research Scientists to help shape the frontiers of multimodal research. We seek candidates who will excel in fast-paced environments and deliver results with precision and finesse. The ideal candidate will be someone who’s passionate about exploring new technologies, and navigating complex and impactful problems.

We are looking for people who are passionate about employing scientific methods to study and understand deep learning, neural network architectures, datasets, evaluations, and ML systems. A good candidate will have experience in designing and conducting experiments, navigating complex results, and driving innovations.

Both roles are based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will be:
Building and innovating various components of the multimodal stack, including datasets, modeling, evaluation, applications and more.
Collaborating with team members to deliver cutting-edge multimodal models.
Bringing the benefits of frontier research in perception to billions of users.

You might thrive in this role if you:
Are a team player, willing to do a variety of tasks that move the team forward.
Excel in execution, consistently completing tasks efficiently.
Have experience in advancing the frontier of deep learning with innovations.
Have experience working on large-scale multimodal models in areas such as image, video, or audio.

Location:
San Francisco
