A unified view on multi-class support vector classification

Ü Doǧan, T Glasmachers, C Igel - The Journal of Machine Learning …, 2016 - dl.acm.org
A unified view on multi-class support vector machines (SVMs) is presented, covering most
prominent variants including the one-vs-all approach and the algorithms proposed by …

Emergence of cooperation in two-agent repeated games with reinforcement learning

ZW Ding, GZ Zheng, CR Cai, WR Cai, L Chen… - Chaos, Solitons & …, 2023 - Elsevier
Cooperation is the foundation of ecosystems and the human society, and the reinforcement
learning provides crucial insight into the mechanism for its emergence. However, most …

Addressing environment non-stationarity by repeating Q-learning updates

S Abdallah, M Kaisers - Journal of Machine Learning Research, 2016 - jmlr.org
In this paper, we present a new framework for large scale online kernel learning, making
kernel methods efficient and scalable for large-scale online learning applications. Unlike the …

Multiagent reinforcement learning: spiking and nonspiking agents in the iterated Prisoner's Dilemma

V Vassiliades, A Cleanthous… - IEEE transactions on …, 2011 - ieeexplore.ieee.org
This paper investigates multiagent reinforcement learning (MARL) in a general-sum game
where the payoffs' structure is such that the agents are required to exploit each other in a …

[PDF][PDF] Evolving subjective utilities: Prisoner's Dilemma game examples

K Moriyama, S Kurihara, M Numao - The 10th International Conference …, 2011 - ifaamas.org
We have proposed the utility-based Q-learning concept that supposes an agent internally
has an emotional mechanism that derives subjective utilities from objective rewards and the …

Multiagent reinforcement learning with spiking and non-spiking agents in the iterated prisoner's dilemma

V Vassiliades, A Cleanthous… - … Conference on Artificial …, 2009 - Springer
Abstract This paper investigates Multiagent Reinforcement Learning (MARL) in a general-
sum game where the payoffs' structure is such that the agents are required to exploit each …

Design and verification of parallel multipliers using arithmetic description language: ARITH

K Ishida, N Homma, T Aoki, T Higuchi - … 34th International Symposium …, 2004 - computer.org
The evolution of strategies in n-choice social dilemma game with punishment is studied on
spatial environment. This paper presents and investigates the application of co-evolutionary …

[PDF][PDF] Cooperation-eliciting prisoner's dilemma payoffs for reinforcement learning agents

K Moriyama, S Kurihara… - Proceedings of the 2014 …, 2014 - aamas.csc.liv.ac.uk
This work considers a stateless Q-learning agent in iterated Prisoner's Dilemma (PD). We
have already given a condition of PD payoffs and Q-learning parameters that helps stateless …

Co-evolutionary learning in the n-choice iterated prisoner's dilemma with PSO algorithm in a spatial environment

X Wang, Y Yi, H Chang, Y Lin - 2013 IEEE Symposium on …, 2013 - ieeexplore.ieee.org
The evolution of strategies in n-choice iterated prisoner's dilemma game is studied on
spatial environment. This paper presents and investigates the application of co-evolutionary …

Evolving cooperation in spatial population with punishment by using PSO algorithm

X Wang, L Zhang, X Du, Y Sun - Natural Computing, 2017 - Springer
Understanding the effects of punishment in multiplayer spatial games, however, is a
formidable challenge. In this paper, we present a multiplayer evolutionary game model in …