Talk by Matthew E. Taylor

M. E. Taylor - Reinforcement Learning in the real world: How to “cheat” and still feel good about it

日付:

2020年12月17日

著者:

Hrvoje Stojic

Reinforcement Learning in the Real-world: How to “cheat” and still feel good about it

Abstract

Reinforcement learning has had many successes, but significant amounts of time and/or data can be required to reach acceptable performance. If agents or robots are to be deployed in real-world environments, it is critical that our algorithms take advantage of existing data and human knowledge. This talk will discuss a selection of recent work that improves reinforcement learning by leveraging demonstrations and feedback from imperfect users, with an emphasis on how interactive machine learning can be extended to best leverage the unique abilities of both computers and humans.

Notes

Dr Matthew E. Taylor is an Associate Professor of Computing Science at the University of Alberta and a Fellow and Fellow-in-Residence at the Alberta Machine Intelligence Institute (Amii). He is the Director of the Intelligent Robot Learning (IRL) Lab (irll.ca)and a Principal Investigator at the Reinforcement Learning & Artificial Intelligence (RLAI) Lab, both at the University of Alberta.
His publication record on Google Scholar can be found here, and personal website here

ソーシャルメディアで共有

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

見る

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

見る

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

見る

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

見る

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

見る

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

見る

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

見る

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

見る

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

見る

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

見る

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

見る

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

見る

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

見る

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

見る

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

見る

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

見る

Secondmind

システム設計

適合

資料

資料

研究

Secondmind Labsについて

研究論文

セミナー

会社

お問い合わせ

一般的な問い合わせ

就職希望

プライバシーポリシー

Secondmind

システム設計

適合

資料

研究

Secondmind Labsについて

研究論文

セミナー

会社

お問い合わせ

一般的な問い合わせ

就職希望

プライバシーポリシー

Secondmind

システム設計

適合

資料

研究

Secondmind Labsについて

研究論文

セミナー

会社

お問い合わせ

一般的な問い合わせ

就職希望

プライバシーポリシー

Reinforcement Learning in the Real-world: How to “cheat” and still feel good about it

Abstract

Notes

関連するセミナー

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

Mickael Binois - Leveraging replication in active learning

We were recently joined by Mickael Binois, to talk about 'Leveraging replication in active learning'.

2024/06/24

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

Ilija Bogunovic - From Data to Confident Decisions

We were recently joined by Ilija Bogunovic, to talk about 'Robust and Efficient Algorithmic Decision Making'.

2024/06/13

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

Dario Azzimonti - Preference learning with Gaussian processes

We were recently joined by Dario Azzimonti, to talk about 'Preference learning with Gaussian processes'.

2024/05/23

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

Mojmír Mutný - Optimal Experiment Design in Markov Chains

We were recently joined by Mojmír Mutný (ETH Zurich), to talk about 'Optimal Experiment Design in Markov Chains'.

2024/03/28

Secondmind

資料

研究

会社

お問い合わせ

Secondmind

資料

研究

会社

お問い合わせ

Secondmind

資料

研究

会社

お問い合わせ

Secondmind

資料

研究

会社

お問い合わせ