2024 Fall Robotics Colloquium: Erdem Biyik (USC)

แชร์
ฝัง
  • เผยแพร่เมื่อ 14 ต.ค. 2024
  • Title: Robot Learning with Minimal Human Feedback
    Speaker: Erdem Biyik (USC)
    Abstract: The lack of large robotics datasets is arguably the most important obstacle in front of robot learning. While large pretrained models and algorithms like reinforcement learning from human feedback led to breakthroughs in other domains like language and vision, robotics has not experienced such a significant influence due to the excessive cost of collecting large datasets. In this talk, I will discuss techniques that enable us to train robots from very little human feedback, as little as one demonstration or one language instruction, or their natural eye gaze. I will dive into reinforcement learning from human feedback, and propose an alternative type of human feedback based on language corrections to improve data-efficiency. I will finalize my talk by presenting how existing large pretrained vision-language models can be used to generate direct supervision for robot learning.
    Biography: Erdem Bıyık is an assistant professor in Thomas Lord Department of Computer Science at the University of Southern California, and in Ming Hsieh Department of Electrical and Computer Engineering by courtesy. He leads the Learning and Interactive Robot Autonomy Lab (Lira Lab). Prior to joining USC, he was a postdoctoral researcher at UC Berkeley's Center for Human-Compatible Artificial Intelligence. He received his Ph.D. and M.Sc. degrees in Electrical Engineering from Stanford University, working at the Stanford Artificial Intelligence Lab (SAIL), and his B.Sc. degree in Electrical and Electronics Engineering from Bilkent University in Ankara, Türkiye. During his studies, he worked at the research departments of Google and Aselsan. Erdem was an HRI 2022 Pioneer and received an honorable mention award for his work at HRI 2020. His works were published at premier robotics and artificial intelligence journals and conferences, such as IJRR, CoRL, RSS, NeurIPS.

ความคิดเห็น •