Gerald Tesauro
IBM Research
Verified email at us.ibm.com - Homepage
Title
Cited by
Year
Temporal difference learning and TD-Gammon
G Tesauro
Communications of the ACM 38 (3), 58-68, 1995
Cited by 3179, 1995
Practical issues in temporal difference learning
G Tesauro
Advances in neural information processing systems 4, 1991
Cited by 1483, 1991
TD-Gammon, a self-teaching backgammon program, achieves master-level play
G Tesauro
Neural computation 6 (2), 215-219, 1994
Cited by 1356, 1994
Learning to learn without forgetting by maximizing transfer and minimizing interference
M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro
arXiv preprint arXiv:1810.11910, 2018
Cited by 896, 2018
Utility functions in autonomic systems
WE Walsh, G Tesauro, JO Kephart, R Das
International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004
Cited by 621, 2004
A hybrid reinforcement learning approach to autonomic resource allocation
G Tesauro, NK Jong, R Das, MN Bennani
2006 IEEE International Conference on Autonomic Computing, 65-73, 2006
Cited by 489, 2006
R3: Reinforced Ranker-Reader for Open-Domain Question Answering
S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ...
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by 411, 2018
Agent-human interactions in the continuous double auction
R Das, JE Hanson, JO Kephart, G Tesauro
International joint conference on artificial intelligence 17 (1), 1169-1178, 2001
Cited by 382, 2001
On-line policy improvement using Monte-Carlo search
G Tesauro, G Galperin
Advances in neural information processing systems 9, 1996
Cited by 380, 1996
A multi-agent systems approach to autonomic computing
G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ...
Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004
Cited by 369, 2004
Programming backgammon using self-teaching neural nets
G Tesauro
Artificial Intelligence 134 (1-2), 181-199, 2002
Cited by 318, 2002
Diverse few-shot text classification with multiple metrics
M Yu, X Guo, J Yi, S Chang, S Potdar, Y Cheng, G Tesauro, H Wang, ...
arXiv preprint arXiv:1805.07513, 2018
Cited by 308, 2018
Extending Q-learning to general adaptive multi-agent systems
G Tesauro
Advances in neural information processing systems 16, 2003
Cited by 307, 2003
Neural networks for computer virus recognition
GJ Tesauro, JO Kephart, GB Sorkin
IEEE expert 11 (4), 5-6, 1996
Cited by 264, 1996
Multiresolution recurrent neural networks: An application to dialogue response generation
I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ...
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by 253, 2017
Metric learning for kernel regression
KQ Weinberger, G Tesauro
Artificial intelligence and statistics, 612-619, 2007
Cited by 235, 2007
Analyzing complex strategic interactions in multi-agent systems
WE Walsh, R Das, G Tesauro, JO Kephart
AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002
Cited by 231, 2002
Biologically inspired defenses against computer viruses
JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ...
IJCAI (1), 985-996, 1995
Cited by 225, 1995
Pricing in agent economies using multi-agent Q-learning
G Tesauro, JO Kephart
Autonomous agents and multi-agent systems 5, 289-304, 2002
Cited by 222, 2002
Dialog-based interactive image retrieval
X Guo, H Wu, Y Cheng, S Rennie, G Tesauro, R Feris
Advances in neural information processing systems 31, 2018
Cited by 211, 2018