DEMO / 2026 Visual Deck Index

Pongpong

Taiwanese 16-tile Mahjong self-play reinforcement-learning platform — research slides, system walkthroughs, and GA-proposal supplements. All links below open standalone HTML decks.

蘇泓叡 · C112156233 · 大學部專題 · github.com/hray3182/pongpong

01 English · Main Deck

Incident Trajectory
(English edition)

English narrative with Chart.js-rendered trajectory curves (Δscore + CWR vs steps). Primary deck for the course presentation.

ppt_reference_en.html OPEN →

02 System · Visualization

Pongpong System
架構視覺化

以真實麻將牌圖拆解觀測編碼 (34 × 133 tensor)、109 動作空間、模型架構。每張牌都標註 tile index 對照 observe.go。

pongpong_system.html OPEN →

03 GA · Supplement

GA Parameter Space
（麻將版）

GA 搜尋空間以麻將手牌比喻呈現 —— 學習率、entropy、GAE 等超參數做成「候選手牌」。較早期的 proposal 附錄視覺化。

parameters_mahjong.html OPEN →

04 Methodology · Supplement

Valuebench
實驗方法論

條件變異數分解的完整實驗流程：snapshot 擷取、wall reshuffle、Monte-Carlo playout、 Law of Total Variance 推導，以及實測 oracle critic 的交叉驗證（val R² = 0.106 vs 理論 0.084）。

valuebench_methodology.html OPEN →

Pongpong

Incident Trajectory(English edition)

Pongpong System架構視覺化

GA Parameter Space（麻將版）

Valuebench實驗方法論

Incident Trajectory
(English edition)

Pongpong System
架構視覺化

GA Parameter Space
（麻將版）

Valuebench
實驗方法論