[Question] A3C Actor Gradient

Board: Prob_Solve (Computational Mathematics / Problem Solving), Author: longlyeagle (長鷹寶寶實驗室), Time: 2017/10/08 10:31, Votes: 0 (0 push, 0 boo, 5 neutral)

5 comments, 1 participant, thread 1/1

I'm working on A3C deep reinforcement learning. Rather than change the last layer of my network to softmax, I apply a softmax filter on top of the linear output layer, so that the linear layer directly targets the softmax output. The algorithm works in my test cases so far, but it might go wrong when the rewards are on a different scale. Could anyone help me check whether my implementation is correct? https://goo.gl/FV8sFu

--
※ Origin: 批踢踢實業坊 (ptt.cc), From: 114.35.245.133
※ Article URL: https://www.ptt.cc/bbs/Prob_Solve/M.1507429910.A.70E.html
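For readers following along, here is a minimal sketch of what a "softmax filter" over a linear output layer could look like. The linked code is not reproduced in this thread, so this is not the poster's implementation; the function names and the `step_size` parameter are hypothetical. It relies only on the standard identity for a softmax policy, $\partial \log \pi(a) / \partial z_i = \mathbb{1}[i = a] - \pi_i$, where $z$ are the linear-layer outputs (logits).

```python
import math

def softmax(logits):
    # Subtract the max logit for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def actor_logit_gradient(logits, action, advantage):
    # Gradient of advantage * log pi(action) with respect to each logit:
    # advantage * (onehot(action) - softmax(logits)).
    probs = softmax(logits)
    return [advantage * ((1.0 if i == action else 0.0) - p)
            for i, p in enumerate(probs)]

def linear_layer_target(logits, action, advantage, step_size=0.1):
    # Hypothetical helper: nudge the logits one ascent step along the
    # policy gradient and use the result as a regression target for the
    # linear layer. step_size is an assumed parameter, not from the post.
    grad = actor_logit_gradient(logits, action, advantage)
    return [z + step_size * g for z, g in zip(logits, grad)]
```

Note that the gradient is linear in `advantage`, so multiplying the rewards by a constant multiplies the step by the same constant. That is one plausible reason a scheme like this could behave differently when the reward scale changes; normalizing or clipping the advantage is a common mitigation.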

→ 11/05 22:07, 1F
→ 11/05 22:08, 2F
→ 11/05 22:08, 3F
→ 11/05 22:09, 4F
→ 11/05 22:09, 5F

Article ID (AID): #1PsOuMSE (Prob_Solve)
