IST COLLOQUIUM 2025

Vision at Work: Reflections on Real-World Computer Vision from Robots to Video Understanding and Back

講演者の画像

Austin Myers 🌐

Agility Robotics
講演者経歴 Austin Myers is a research engineer currently working on embodied perception at Agility Robotics. His career has spanned academia and industry, with roles at Waymo, Google Research, and DeepMind, where he developed vision systems for robot manipulation, self-driving vehicles, and large-scale video understanding. Austin received his PhD from the University of Maryland, focusing on understanding the affordances of object parts through geometric reasoning. His broad research interests lie at the intersection of joint video and language representation learning, large multimodal models, and embodied perception for everyday robots.
日時:7月7日(月) 13:15〜14:45

場所:総合研究7号館1階情報2講義室

Over the past decade, I’ve worked on perception systems spanning everyday robot manipulation, self-driving cars, and large-scale video understanding across academia and industry labs like Waymo, Google Research, DeepMind, and now Agility Robotics. In this talk, I’ll share perspectives on developments in the field and industry, and lessons from deploying computer vision systems in the real world. I’ll tell you why construction cones are harder than they look, what it takes to build vision-language models that understand YouTube videos without human supervision, how to transfer fundamental research to products that touch billions of users, and I’ll revisit how we once tried to enable robots to use tools they had never seen before—and how we've come back full circle with new tools at our disposal in my work at Agility Robotics. Beyond the research itself, I’ll reflect on navigating careers between academia and industry, choosing impactful problems, and the exciting challenges ahead in embodied AI.

2024年の講演はこちら>>