[B! DRL] caesar_wanyaのブックマーク

Generative Adversarial Imitation Learning

caesar_wanya 2018/07/09

DRL

リンク

深層強化学習ライブラリChainerRL - Preferred Networks Research & Development

Chainerを使った深層強化学習ライブラリChainerRLを公開しました． https://github.com/pfnet/chainerrl PFNエンジニアの藤田です．社内でChainerを使って実装していた深層強化学習アルゴリズムを”ChainerRL”というライブラリとしてまとめて公開しました．RLはReinforcement Learning（強化学習）の略です．以下のような最近の深層強化学習アルゴリズムを共通のインタフェースで使えるよう実装してまとめています． Deep Q-Network (Mnih et al., 2015) Double DQN (Hasselt et al., 2016) Normalized Advantage Function (Gu et al., 2016) (Persistent) Advantage Learning (Bellemar

caesar_wanya 2018/01/19

DQN
DRL

リンク

A Deep Reinforced Model for Abstractive Summarization

Attentional, RNN-based encoder-decoder models for abstractive summarization have achieved good performance on short input and output sequences. For longer documents and summaries however these models often include repetitive and incoherent phrases. We introduce a neural network model with a novel intra-attention that attends over the input and continuously generated output separately, and a new tr

caesar_wanya 2018/01/18

DRL
DNN

リンク

Bayesian Deep Learning Workshop | NeurIPS 2021

Schedule The start and end times are 11am -- 7pm GMT / 12pm -- 8pm CET / 6am -- 2pm EST / 3am - 11am PST / 8pm -- 4am JST. Our friends in the Americas are welcome to join the latter sessions, and our friends in eastern time zones are welcome to join the earlier sessions. The schedule interleaves invited speakers, contributed talks, and gather.town poster presentations to allow for networking and s

caesar_wanya 2018/01/18

リンク

Richard Socher

Hey there, I am the CEO/Founder of you.com, the first chat-search assistant. I also invest in and mentor other startups at AIX Ventures where I’m founder and managing partner. Bio: Richard previously served as the Chief Scientist and EVP at Salesforce, where he lead teams working on fundamental research, applied research, product incubation, search, customer service automation and a cross-product

caesar_wanya 2017/12/19

research
DRL

リンク

はてなブックマーク

タグ

関連タグで絞り込む (3)

DRLに関するcaesar_wanyaのブックマーク (5)

お知らせ

今週のはてなブックマーク数ランキング（2025年1月第1週）

今週のはてなブックマーク数ランキング（2024年12月第4週）

「あとで読む」タグで振り返る2024年〜今年の「あとで読む」、今年のうちに〜

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス

タグ

関連タグで絞り込む (3)

DRLに関するcaesar_wanyaのブックマーク (5)

Generative Adversarial Imitation Learning

深層強化学習ライブラリChainerRL - Preferred Networks Research & Development

A Deep Reinforced Model for Abstractive Summarization

Bayesian Deep Learning Workshop | NeurIPS 2021

Richard Socher

お知らせ

今週のはてなブックマーク数ランキング（2025年1月第1週）

今週のはてなブックマーク数ランキング（2024年12月第4週）

「あとで読む」タグで振り返る2024年 〜今年の「あとで読む」、今年のうちに〜

公式Twitter

キーボードショートカット一覧

はてなブックマーク

公式Twitter

はてなのサービス

「あとで読む」タグで振り返る2024年〜今年の「あとで読む」、今年のうちに〜