日時:7月7日(月) 13:15〜14:45
場所:総合研究7号館1階情報2講義室
Over the past decade, I’ve worked on perception systems spanning everyday robot manipulation, self-driving cars, and large-scale video understanding across academia and industry labs like Waymo, Google Research, DeepMind, and now Agility Robotics. In this talk, I’ll share perspectives on developments in the field and industry, and lessons from deploying computer vision systems in the real world. I’ll tell you why construction cones are harder than they look, what it takes to build vision-language models that understand YouTube videos without human supervision, how to transfer fundamental research to products that touch billions of users, and I’ll revisit how we once tried to enable robots to use tools they had never seen before—and how we've come back full circle with new tools at our disposal in my work at Agility Robotics. Beyond the research itself, I’ll reflect on navigating careers between academia and industry, choosing impactful problems, and the exciting challenges ahead in embodied AI.