Game Theory & Mathematics

The Zhejiang Study: Winning Rock-Paper-Scissors with Win-Stay, Lose-Shift

How a landmark empirical study from China exposed the deep psychological flaws in human randomness, allowing strategic players to systematically defeat the Nash Equilibrium.

👤 By Sanket Sharma

📅 Published: May 26, 2026

⏱️ Reading Time: 10 min

✅ Status: Mathematically Validated

The Paradox of Rock-Paper-Scissors: Nash Equilibrium vs. Human Nature

From a classical game theory perspective, the game of Rock-Paper-Scissors is perfectly solved. Because each of the three choices (Rock, Paper, or Scissors) has an equal probability of winning, losing, or tying against any other choice, the mathematically optimal strategy is to select each item exactly **one-third** of the time in a completely random sequence. This is known as the **Nash Equilibrium (mixed strategy)**. Under this model, no player can gain an advantage by changing their strategy, and the expected win rate over time is exactly 50%.

However, the Nash Equilibrium relies on an impossible assumption: that human beings are capable of generating true randomness. In reality, human decision-making is severely constrained by cognitive biases, memory constraints, and subconscious heuristic patterns. In 2014, researchers at **Zhejiang University** in China conducted a massive empirical study to find out how real people actually play. By tracking 360 students playing 300 rounds of Rock-Paper-Scissors each, they uncovered a startling deviation from the Nash model: players do not select choices at random. Instead, their decisions are dominated by a psychological heuristic known as **Win-Stay, Lose-Shift (WSLS)**.

Decoding the Zhejiang University Empirical Study

The Zhejiang study recorded a total of 108,000 rounds of play. While the overall distribution of Rock, Paper, and Scissors choices remained close to the theoretical 1/3 mark, the *transition states* between rounds revealed highly predictable patterns. The researchers discovered that humans utilize an ancestral reinforcement loop when dealing with immediate gains and losses.

Specifically, the data demonstrated two powerful behaviors:

Win-Stay (The Reinforcement Bias): When a player wins a round, they are statistically far more likely to repeat the same winning choice in the next round. If their Rock defeats their opponent's Scissors, they subconsciously assume that "Rock works," leading them to play Rock again.
Lose-Shift (The Avoidance Bias): When a player loses a round, they almost immediately abandon their losing choice. Crucially, they do not shift randomly; they almost always shift *forward* in the order of the game (Rock → Paper → Scissors) or copy the choice that just defeated them.

By exploiting these subconscious behavioral loops, a strategic player can anticipate their opponent's next move with over 60% accuracy, rendering the classical Nash Equilibrium obsolete in real-world scenarios.

Previous Round Outcome	Subconscious Human Reaction	Predictable Next Move	The Winning Counter-Move
Opponent Wins	"Win-Stay" Bias	Opponent plays the exact same hand that just won.	Play whatever loses to their previous winning hand (which beats their repeat).
Opponent Loses	"Lose-Shift" Bias (Shifts Clockwise)	Opponent shifts "forward" (e.g. Rock → Paper, or Paper → Scissors).	Play whatever beats their projected clockwise shift.
Opponent Ties	Uncertainty / Novelty Search	Opponent tends to shift to the item that wasn't played.	Anticipate the unplayed hand and counter it.

The Mathematics of Win-Stay, Lose-Shift (WSLS) Heuristics

Let's formalize this using probability matrices. In a standard Nash mixed-strategy, the probability of selecting Rock ($R$), Paper ($P$), or Scissors ($S$) given the previous round's play $X_{t-1}$ is represented as:

$P(R_t | X_{t-1}) = P(P_t | X_{t-1}) = P(S_t | X_{t-1}) = \frac{1}{3}$

However, the Zhejiang study proved that the real transition matrix for human players resembles the following conditional state probabilities:

If the player wins with choice $C_i$ at time $t-1$, the probability of playing $C_i$ at time $t$ rises to approximately **45%** (far above the 33.3% Nash prediction).
If the player loses with choice $C_i$ at time $t-1$, they will shift to a different choice with a probability of **80%**. More specifically, they will shift clockwise in the cycle (i.e. $C_{i+1}$) with a probability of **55%**, or counter-clockwise ($C_{i-1}$) with a probability of **25%**.

This massive skew in transition states creates a predictable, readable pattern that a computer algorithm—or a well-trained human brain—can instantly exploit.

🔬 The Algorithmic Edge in Browser Play:

When playing against computer algorithms in digital platforms like Rock Paper Scissors, many advanced AI bots are programmed to scan your history for these exact WSLS transition skews. To defeat a high-level bot, you must actively "anti-loop"—that is, intentionally force yourself to shift after a win, and stay after a loss. This reverses the expected transition matrix, confusing the bot's prediction models.

How to Actively Apply WSLS to Outplay Real Opponents

To implement the Zhejiang findings in your own matches, memorize these three practical, actionable rules:

1. The "Winner's Repeat" Trap

If you lose a round, assume your opponent will use the "Win-Stay" heuristic. They will play the same choice that just beat you. Therefore, you should immediately switch to the choice that defeats their previous selection. For example: If they beat your Scissors with Rock, they are highly likely to play Rock again. You should counter with Paper.

2. The "Loser's Shift" Escalation

If you win a round, assume your opponent will shift using the "Lose-Shift" heuristic. Subconsciously, they will move clockwise in the cycle or copy your winning move. Knowing they will avoid their losing hand, you must play the hand that counters their projected shift. For example: If your Paper beats their Rock, they will avoid Rock. They will likely shift to Scissors. To counter their predicted Scissors, you should play Rock.

3. Exploding the Tie

When a tie occurs, humans experience a sudden cognitive reset. Statistically, they rarely repeat the same choice that caused the tie. Instead, they usually shift to the hand that would have beaten their previous choice. Anticipate this mental pivot and play the hand that counters their prospective upgrade.

Conclusion: Cognitive Flexibility as the Ultimate Game Tool

The Zhejiang study is one of the most brilliant examples of behavioral economics and game theory in modern science. It demonstrates that even in games of pure, theoretical chance, human psychology introduces predictable structure. By understanding the Win-Stay, Lose-Shift phenomenon, you transform a game of luck into a battle of cognitive reading and tactical agility. Put these theories to the test today by practicing your mental reads on yuvamedia's Rock Paper Scissors and watch your win ratios soar!

👨‍💻

Sanket Sharma

Senior Web Architect & Game Theorist

Sanket Sharma is a digital systems designer and cognitive enthusiast. He builds interactive algorithms and publishes detailed breakdowns of game design physics, spatial mechanics, and cognitive performance frameworks.

View Team Profile Editorial Standards