[CT421]: Week 07 materials & notes

2025-02-28 12:34:18 +00:00
parent 6caec2b47a
commit 9cc71409e1
4 changed files with 199 additions and 0 deletions
--- a/year4/semester2/CT421/materials/03.
+++ b/year4/semester2/CT421/materials/03.
--- a/year4/semester2/CT421/notes/CT421.pdf
+++ b/year4/semester2/CT421/notes/CT421.pdf
--- a/year4/semester2/CT421/notes/CT421.tex
+++ b/year4/semester2/CT421/notes/CT421.tex
@ -535,7 +535,206 @@ This can be represented as a \textbf{state transformation function}:
 For the time being, we will make the simplifying assumption that an agent can make one of two actions: to co-operate $C$ or to defect $D$.
 We say a certain move is \textbf{rational} if the outcomes that arise through the action are better than all outcomes that arise from the alternative action.
 \begin{figure}[H]
    \centering
    \includegraphics[width=\textwidth]{./images/rationalchoice.png}
    \caption{For player $j$, $D$ is the rational choice}
 \end{figure}
 \subsection{Dominant Strategy}
 Given a particular strategy $s$ for agent $i$, there will be a number of possible outcomes.
 We say $s_1$ dominates $s_2$ if every outcome possible by agent $i$ playing $s_1$ is preferred over every possible outcome by agent $i$ playing $s_2$.
 A rational agent will never play a dominated strategy.
 However, there is not usually a unique undominated strategy.
 \subsection{Nash Equilibrium}
 Two strategies $s_1$ and $s_2$ are in \textbf{Nash equilibrium} if:
 \begin{itemize}
    \item   Assuming agent $i$ plays $s_1$, agent $j$ can do no better than play $s_2$; and
    \item   Assuming agent $j$ plays $s_2$, agent $i$ can do no better than play $s_1$.
 \end{itemize}
 In Nash equilibrium, neither agent has any incentive to deviate from their strategy.
 Not all possible interactions have a Nash equilibrium, and some interactoins can have several Nash equilibria.
 \subsection{Prisoner's Dilemma}
 The \textbf{Prisoner's Dilemma} is usually expressed in terms of pay-offs (or rewards) for co-operating or defecting:
 \[
 \begin{array}{c c|c c}
  & & \text{Player } j & \\
  & & \text{C} & \text{D} \\
 \hline
  \text{Player } i & \text{C} & (3, 3) & (0, 5) \\
                  & \text{D} & (5, 0) & (1, 1)
 \end{array}
 \]
 \begin{itemize}
    \item   If both co-operate, they each get a reward of 3.
    \item   If both defect, they each get a reward of 1.
    \item   If one co-operates and the other defects, the com-operators gets 0 (the sucker's payoff) and the other gets 5.
 \end{itemize}
 The individually rational action is to defect:
 it guarantees a payoff of no worse than 1, whereas co-operating guarantees a payoff of no worse than 0.
 So, defection is the best response to all strategies;
 however, common sense indicates that this is not the best response.
 \\\\
 The prisoner's dilemma occurs in many domains and is suitable for modelling large classes of multi-agent interactions.
 There have been many real-world scenarios that are implicitly prisoner's dilemmas (or variations):
 \begin{itemize}
    \item   Arms race;
    \item   Environmental issues;
    \item   Free-rider systems;
    \item   Warfare;
    \item   Behaviour in many biological systems --- bats, guppie fish, etc;
    \item   Competition between nodes in a distributed computer system;
    \item   Modelling competition and collaboration between information providers;
    \item   Sports.
 \end{itemize}
 Variations on the prisoner's dilemma include:
 \begin{itemize}
    \item   \textbf{$N$-player dilemma:} for example, the voter's paradox, where it is true that a particular endeavour would return a benefit to all members where each individual would receive rewards;
            it is also true that any member would receive an even greater reward by contributing nothing.
            Elections, environment actions, and the tragedy of the commons are all examples of this phenomenon.
    \item   \textbf{Spatial organisations:} where agents are placed in some 2-dimensional space and can only interact with neighbours.
    \item   \textbf{Partial co-operation:} acts are no longer co-operative or non-co-operative, but can be in some range.
            If we consider extending the classical IPD to this domain, we can define landscapes using pay-off equations.
    \item   \textbf{Noise:} problems arise if we introduce any degree of noise, which will lead co-operations to be interpreted as defections, etc.
            Consider two TFTs playing witha  degree of noise.
 \end{itemize}
 Summay so far:
 \begin{itemize}
    \item   We need a means to organise \& co-ordinate agents.
            There are underlying problems here with respect to co-operation.
    \item   Game theory \& extensions provides a tool to reason about and to develop multi-agent systems.
    \item   We assume agents have a rational ordering of possible outcomes and a set of actions they may choose to bring about those outcomes.
    \item   We have limited the types of interactions to very simple cases.
 \end{itemize}
 One extension is the \textbf{ultimatum game}.
 We are no longer just discussing outcomes for simple choices:
 \begin{itemize}
    \item   Two players $i$ and $j$.
    \item   The goal is to distribute some resource, e.g., €100.
    \item   Player $i$ picks a number $x$, in a range (0-100).
    \item   Player $j$ must accept or reject the offer.
    \item   If Player $j$ rejects: both get 0.
    \item   If Player $j$ accepts: Player $i$ gets $x$ and Player $j$ gets $100-x$.
 \end{itemize}
 This allows us to reason about more complex scenarios.
 Many extensions are available and have been researched.
 If we wish to reason about two or more agents/systems agreeing on value for some exchange (information, service), we can look to auction theory.
 To reason about more complex scenarios, negotiation \& argumentation theory has been adopted.
 \subsection{Auction Theory}
 \textbf{Auction theory} can be used as a method to allow agents to arrive at an agreement regarding events \& actions when agents are self-interested.
 In some cases, no agreement is possible at all.
 However, in most scenarios, there is the potential to arrive at a mutually beneficial agreement.
 There are several approaches that have been adopted to do this;
 all can bee seen as a form of negotiation or argumentation by the agents.
 Negotiation or argumentation is governed by some protocol or mechanism:
 this protocol defines how the agents are to interact, i.e., the actual rules of encounter.
 Questions that arise include:
 \begin{itemize}
    \item   How to design a protocol such that certain properties exist?
    \item   How to design strategies for agents to use a given set of protocols?
 \end{itemize}
 Desired features from protocols include: guaranteed success, simplicity, maximising social utility, pareto-efficiency, \& individual rationality.
 \textbf{Auctions} represent a class of useful protocols, and are used in many domains.
 An auction takes place between an agent (auctioneer) and a set of other agents (bidders).
 The goal is to allocate the goods to one of the bidders.
 Usually, an auctioneer attempt to maximise the price;
 the bidders desire to minimise the price.
 We can categorise auctions according to a range of features:
 \begin{itemize}
    \item   Bids may be:
            \begin{itemize}
                \item   Open-cry;
                \item   Sealed bid.
            \end{itemize}
    \item   Bidding may be:
            \begin{itemize}
                \item   One shot;
                \item   Ascending;
                \item   Descending.
            \end{itemize}
 \end{itemize}
 Selling goods by auction is more flexible than setting a fixed price and less time-consuming than explicit negotiation (haggling).
 In many domains, the value of an item may vary enough to preclude direct \& absolute pricing.
 It is a pure form of market;
 it is efficient in that auctions usually ensure goods are allocated to those who value them most.
 The price is set, not by the sellers, but by the buyers.
 No one auction protocol is the best;
 some are preferred by sellers, others by buyers.
 Some auctions attempt to prevent cheating, or at least decrease the incentive to cheat;
 others provide several means to cheat.
 People tend to bid in auctions for two reasons:
 \begin{itemize}
    \item   They wish to acquire the goods (bases bid on private evaluation).
    \item   They wish to acquire the goods to re-sell (bases bid on private evaluation and estimates on future valuations).
 \end{itemize}
 \subsubsection{English Auction}
 In an \textbf{English auction}, the auctioneer begins with the lowest acceptable price (reserve), and proceeds to obtain successively higher bids from bidders until no-one will increase the bid.
 It is effectively first-price, open-cry, \& ascending.
 The dominant strategy is to successively bid a small amount more than the current highest id until it reaches their valuation, then withdraw.
 Potential problems with English auctions include:
 \begin{itemize}
    \item   Rings;
    \item   Shills in the bidders;
    \item   Winner's curse.
 \end{itemize}
 In some English auctions, the reserve price is kept secret to attempt to prevent rings from forming.
 \subsubsection{Dutch Auction}
 In a \textbf{Dutch auction}, bidding starts at an artificially high price.
 Lower prices are offered, in descending order, until a bidder equals to the current price.
 Goods are then sold to the bidder for that price
 Dutch auctions are descending, open-cry auctions.
 From a seller's perspective, the key to a successful auction is the effect of competition on the bidders.
 In an English auction, a winner may pay well under their valuation and thus the seller loses out;
 this is not the case in a Dutch auction.
 \subsubsection{First-Price, Sealed Bid}
 \textbf{First-price, sealed bid} auctions are usually one-shot auctions.
 Each bidders submits a sealed bid.
 The goods are sold to the highest bidders.
 Best strategy is to bid to true valuation.
 Interesting variations exist if there are a number of goods to be sold and a number of rounds.
 \subsubsection{Vickrey Auction}
 A \textbf{Vickrey auction} is a sealed-bid, second-price auction.
 The price paid by the winner is that price offered by the second-placed bidder.
 In this type of auction, contrary to initial intuition, sellers make as much, if not more than the first-price auctoins.
 In reality, bidders are not afraid to bid high, knowing that they will have to pay the second price;
 bidders tend to be more competitive.
 \\\\
 Other auction types exist also: reverse auctions, double auctions, haphazard (whisper auction, handshake auction), etc.
 We can use auctions as a means to allow agents to agree on a price for buying goods or services.
 Depending on the type of auction chosen, we will favour buyers or sellers.
 We sill have some problems though:
 \begin{itemize}
    \item   Are auctions the best way?
    \item   What happens following an auction, if upon receiving goods, one doesn't pay?
    \item   What happens following an auction, if upon paying, one realise that the goods are not as expected?
    \item   Is it possible to prevent shills, rings, \& other forms of manipulation?
    \item   In auctions, agents agree on a price; can we deal with more dimensions of negotiation?
 \end{itemize}
 \end{document}
--- a/year4/semester2/CT421/notes/images/rationalchoice.png
+++ b/year4/semester2/CT421/notes/images/rationalchoice.png