Chainerを使った深層強化学習ライブラリChainerRLを公開しました. https://github.com/pfnet/chainerrl PFNエンジニアの藤田です.社内でChainerを使って実装していた深層強化学習アルゴリズムを”ChainerRL”というライブラリとしてまとめて公開しました.RLはReinforcement Learning(強化学習)の略です.以下のような最近の深層強化学習アルゴリズムを共通のインタフェースで使えるよう実装してまとめています. Deep Q-Network (Mnih et al., 2015) Double DQN (Hasselt et al., 2016) Normalized Advantage Function (Gu et al., 2016) (Persistent) Advantage Learning (Bellemar
Attentional, RNN-based encoder-decoder models for abstractive summarization have achieved good performance on short input and output sequences. For longer documents and summaries however these models often include repetitive and incoherent phrases. We introduce a neural network model with a novel intra-attention that attends over the input and continuously generated output separately, and a new tr
Schedule The start and end times are 11am -- 7pm GMT / 12pm -- 8pm CET / 6am -- 2pm EST / 3am - 11am PST / 8pm -- 4am JST. Our friends in the Americas are welcome to join the latter sessions, and our friends in eastern time zones are welcome to join the earlier sessions. The schedule interleaves invited speakers, contributed talks, and gather.town poster presentations to allow for networking and s
Hey there, I am the CEO/Founder of you.com, the first chat-search assistant. I also invest in and mentor other startups at AIX Ventures where I’m founder and managing partner. Bio: Richard previously served as the Chief Scientist and EVP at Salesforce, where he lead teams working on fundamental research, applied research, product incubation, search, customer service automation and a cross-product
リリース、障害情報などのサービスのお知らせ
最新の人気エントリーの配信
処理を実行中です
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く