
Full-Time - Audio-Visual AI Research Scientist_ICASSP

Sonyglobal · 13-Tokyo - Osaki Japan · $90k - $100k

Actively hiring Posted about 1 year ago

Technology Field
Computer Vision
Speech/Audio Signal Processing

Position Summary
We are seeking a Research Scientist to join our fundamental and applied research teams at Sony in Tokyo. Our aim is to rapidly advance the process of cinematic content creation. To achieve this, we work together with Sony Pictures Entertainment to develop AI technologies that restore and enhance movie content. With us, you will research and develop innovative computer vision and machine learning technologies for cinematic content creation. You will also have many opportunities to publish your findings and collaborate with a variety of academic institutions worldwide.
See more here: https://sony.github.io/creativeai/

Responsibilities
■ Research and development of novel computer vision technologies in areas including generative methods, audio-visual scene understanding, audio-visual sound separation/localization, and beyond.
■ Implement findings from computer vision research into real products through collaboration.
■ Work with a strong international team of researchers and engineers with various areas of expertise to develop innovative solutions.
■ Collaborate with Sony's various branches, including Sony Pictures Entertainment.
■ Collaborate with academic institutions to drive state-of-the-art research.
■ Contribute to the development of research publications to be published at top-tier conferences and journals.
 
Required qualifications
■ Experience publishing research about machine learning/computer vision at conferences and/or in journals (e.g. CVPR/ICCV/ECCV/NeurIPS/ICLR/ICML/IJCV/PAMI).
■ Experience developing ML/deep learning models for computer vision tasks.
■ Fluency in Python and deep learning frameworks.

Preferred qualifications
■ Ph.D. degree (graduated or currently pursuing) in computer science, machine learning, or electrical engineering, OR equivalent practical experience.
■ Experience developing ML/deep learning models for audio-visual tasks or other multi-modal tasks.
■ Experience developing ML/deep learning-based generative models.
■ Professional proficiency in English.
 
Product, Service
Movie production for Sony Pictures Entertainment.
Development Environment
OS: Windows and Linux

Application Requirements
Essay: Required
Coding test: Not Required

Required Skills:
Audio Signal Processing, Computer Vision, Speech Processing
