Yasuhiro Osaki
On this page
Home * People * Yasuhiro Osaki
Yasuhiro Osaki [1] Yasuhiro Osaki,
a Japanese software engineer and computer scientist at Sony. Until 2010, Yasuhiro Osaki was affiliated with the laboratory of professor Yoshiyuki Kotani at the Tokyo University of Agriculture and Technology.
Selected Publications
- Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2007). Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method. 12th Game Programming Workshop
- Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2008). An Othello Evaluation Function Based on Temporal Difference Learning using Probability of Winning. CIG'08, pdf
- Yasuhiro Osaki, Yoshiyuki Kotani (2009). A Learning Method of Evaluation Function Based on Selective Simulations. 14th Game Programming Workshop
External Links
References
- ↑ YasuhiroOsaki (Yasuhiro Osaki) · GitHub
- ↑ Yasuhiro Osaki, Kazutomo Shibahara, Yasuhiro Tajima, Yoshiyuki Kotani (2007). Reinforcement Learning of Evaluation Functions Using Temporal Difference-Monte Carlo learning method. 12th Game Programming Workshop
- ↑ TD-Lamda from Wikipedia
- ↑ dblp: Yasuhiro Osaki
Next
Yasuhiro Tajima