Stefan Pohl Computer Chess

Home of famous UHO openings and EAS Ratinglist


SPCC Super 3 Tournament

 

 

Latest update: 2024/11/16 (next updates will follow every 10-12 days...)

Overall tournament runtime: 184 days (on average, 47.2 games are played per day)

Download all played games since start (May 2024) here

 

As of today (2024/10/13), the new Rebel Extreme by Ed Schroeder has been invited to the Super 3 Tournament as a "Guest Star", because I am curious to see how this super-aggressive engine performs here.

From the website of Rebel:

"Rebel Extreme is our flagship when it is about playing unbelievable sharp games, but it comes with a price of an 140 elo loss in comparison with Rebel 16.3. The level of aggressiveness is measured by Stefan Pohl great tool Engines Aggressiveness Statistics, or EAS."

Sadly, Rebel uses only 8 threads (and even this only works when "poweruser" is added to the command line of the Rebel engine), so Rebel runs at only around 60% of the speed it would reach with the 14 threads that the other 3 engines use. But with the long thinking time of the Super 3 Tournament, this should lead to only a small Celo-loss... Speed of Rebel Extreme: around 7-8 MN/s in the middlegame.

 

An endless RoundRobin-tournament with 3 engines that are at the same level of strength (around 3400 Elo) but completely different in their inner structure and their way of thinking.

Why? The strongest engine for more than a decade (Stockfish) is open source, so many, many other engines are (at least) "inspired" by Stockfish... And additionally, a lot of engines (including Stockfish!) use Lc0 training data for building their neural nets: high-end computerchess has become very incestuous... To be clear: this is neither good nor bad, it is just the reality of high-end computerchess these days.

So, IMHO, it is very interesting to run a tournament with engines that play at nearly the same strength but are completely different, not only in their playing style but also in their inner structure and way of thinking.

The Super 3 tournament is not about the results (as you can see below, all 3 engines are at the same level of strength), but about generating interesting engine games.

Engine       | Search    | Evaluation   | Nodes per second (early middlegame)
-------------|-----------|--------------|-------------------------------------
Lc0 CPU      | MCTS      | float-neural |      1.100
-------------|-----------|--------------|-------------------------------------
Revenge 1.0  | AlphaBeta | int-neural   | 11.000.000 (10.000x faster than Lc0)
-------------|-----------|--------------|-------------------------------------
Komodo 14.1  | AlphaBeta | Handcrafted  | 19.000.000 (17.300x faster than Lc0)
-------------|-----------|--------------|-------------------------------------
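
The "x faster than Lc0" factors in the table are simply ratios of the nodes-per-second values. As a quick sanity check, here is a minimal Python sketch that recomputes them. The dictionary and the function name are made up for this illustration; the numbers are copied from the table (where the dot is a thousands separator, so 1.100 means 1100 nodes per second).

```python
# Nodes per second in the early middlegame, copied from the table above.
NPS = {
    "Lc0 CPU": 1_100,
    "Revenge 1.0": 11_000_000,
    "Komodo 14.1": 19_000_000,
}


def speed_factor(engine: str, baseline: str = "Lc0 CPU") -> float:
    """Return how many times more nodes per second `engine` searches than `baseline`."""
    return NPS[engine] / NPS[baseline]


if __name__ == "__main__":
    for name in NPS:
        # Prints one line per engine with its factor relative to Lc0 CPU.
        print(f"{name:12s} {speed_factor(name):>10,.0f}x")
```

The Komodo factor comes out at roughly 17.270, which the table rounds to 17.300.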

As you can see, these 3 engines have nothing in common in their way of thinking. Komodo 14.1 uses a classical handcrafted evaluation, Lc0 CPU uses a (float) neural net, and Revenge 1.0 uses an (integer) NNUE net, like most modern engines these days. Because floating-point calculations are brutally slow on CPUs, Lc0 CPU is way slower than its 2 opponents (this is the reason why Lc0 normally runs on the GPU, not the CPU). If you want to learn more about the neural net of Lc0 and about NNUE nets, I recommend this e-book by Dominik Klein, which can be downloaded for free as a PDF file.

Additionally, Lc0 uses a completely different search (MCTS). And Revenge 1.0 is one of the most aggressive engines of all time (see the EAS-Ratinglist below).

 

Hardware: AMD Ryzen 7840HS 8-core (16 threads) notebook with 32GB RAM. Turboboost off.

Speed: See above. Each engine uses 14 threads when thinking (Lc0 cpu dnnl has the UCI option "Threads" like any normal CPU engine, so it uses the CPU like all other engines). The GPU stays (of course) unused.

Hash: 8 GB per engine (20.000.000 NNCachesize for Lc0 - enough for storing all evaluated positions of a complete game)

GUI: CutechessGUI (GUI ends game, when a 6-piece endgame is on the board, all other games are played until mate or draw by chess-rules (3fold, 50-moves, stalemate))

Tablebases: None for engines, 6 Syzygy for CutechessGUI

Openings: My UHO_2024_8mvs_+085_+094.pgn openings are used (randomly mixed, each opening repeated with reversed colors, of course (=Gamepairs))

Ponder, Large Memory Pages & learning: Off

Thinking time: 10min+5sec per game/engine (average game duration: 30 minutes), so only around 50 games are played in 24 hours = high-quality engine chess
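
The throughput follows directly from the average game duration. Here is a minimal Python sketch of that arithmetic, assuming games are played strictly one after another (my assumption, based on each engine using 14 of the 16 threads). The function names are made up for this illustration; the 8700 games and 184 days are taken from the figures on this page.

```python
def games_per_day(avg_game_minutes: float) -> float:
    """Games that fit into 24 hours when they are played one after another."""
    return 24 * 60 / avg_game_minutes


def observed_games_per_day(total_games: int, days: int) -> float:
    """Average number of games per day over the whole tournament runtime."""
    return total_games / days


if __name__ == "__main__":
    print(f"estimate from 30-minute games: {games_per_day(30):.1f}")
    print(f"observed (8700 games, 184 days): {observed_games_per_day(8700, 184):.1f}")
```

The estimate (48 games) and the observed average (a bit above 47) agree well, so "around 50 games per day" is a fair rule of thumb.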

 

Here you can see the shortest wins with sacrifices, played between the latest two site-updates, filtered by my Interesting Wins Search Tool. Download this cool tool in the "Downloads & Links" section or right here

Many thanks to ChessBase for the pgn-replayer tool, which is very easy to use (only 3 lines of code!) and very powerful. Use the fan (propeller?) icon right next to the arrows below the chessboard to start and stop the online analysis with the Fritz engine! You may have to clear your browser cache to see the latest games, otherwise the pgn-replayer does not update the games correctly. If you cannot see the chessboard, check that JavaScript is activated in your browser and that no AdBlocker is causing the problem.

 

 

 

 

Below are the results: first the normal Celos (by ORDO), then the gamepair-rescored Celos, then the EAS-Ratinglist.

 

     Program                Celo    +    - Games    Score   Av.Op. Draws

   1 Rebel Extreme 1.0    : 3452   13   13   780    51.6%   3441   48.6%
   2 Lc0 791921 CPU       : 3450    6    6  5540    51.8%   3438   49.0%
   3 Komodo 14.1 HCE      : 3441    6    6  5540    49.9%   3442   49.4%
   4 Revenge 1.0 avx2     : 3433    6    6  5540    48.1%   3446   48.5%


Games        : 8700 (finished)

White Wins   : 4185 (48.1 %)
Black Wins   : 256   (2.9 %)
Draws        : 4259 (49.0 %)
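
The percentages above are plain shares of the 8700 finished games. Here is a minimal Python sketch to recompute them; the function name is made up for this illustration, and the counts are copied from the lines above.

```python
def result_shares(white_wins: int, black_wins: int, draws: int) -> dict:
    """Return the share of each result in percent, rounded to one decimal."""
    total = white_wins + black_wins + draws
    counts = {"white": white_wins, "black": black_wins, "draw": draws}
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


if __name__ == "__main__":
    shares = result_shares(white_wins=4185, black_wins=256, draws=4259)
    for name, percent in shares.items():
        print(f"{name:5s} {percent:5.1f} %")
```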


Gamepairs:

   # PLAYER               :    Celo  Error   Pairs    W     D    L   (%)  CFS(%)
   1 Rebel Extreme 1.0    :    3457     26     390  113   189   88  53.2      69
   2 Lc0 791921 CPU       :    3450   ----    2770  776  1395  599  53.2     100
   3 Komodo 14.1 HCE      :    3431     10    2770  687  1351  732  49.2      96
   4 Revenge 1.0 avx2     :    3421     11    2770  622  1369  779  47.2     ---
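
The (%) column of the gamepair list can be reproduced from the W/D/L columns: a won pair counts as 1 point and a drawn pair as half a point. Here is a minimal Python sketch of that calculation. The function name is made up for this illustration, and the sketch does not reproduce how ORDO calculates the Celo values themselves.

```python
def pair_score_percent(wins: int, draws: int, losses: int) -> float:
    """Score in percent over all gamepairs: a win counts 1, a draw counts 0.5."""
    pairs = wins + draws + losses
    return 100 * (wins + 0.5 * draws) / pairs


# W / D / L copied from the gamepair list above.
GAMEPAIRS = {
    "Rebel Extreme 1.0": (113, 189, 88),
    "Lc0 791921 CPU": (776, 1395, 599),
    "Komodo 14.1 HCE": (687, 1351, 732),
    "Revenge 1.0 avx2": (622, 1369, 779),
}

if __name__ == "__main__":
    for engine, (w, d, l) in GAMEPAIRS.items():
        print(f"{engine:18s} {w + d + l:5d} pairs  {pair_score_percent(w, d, l):5.1f} %")
```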

 

------------------------------------------------------------------- 
--- Number of all Gamepairs          : 4350 
--- Number of drawn Gamepairs overall: 2152 (= 49.47%) 
--- Number of 1:1 drawn Gamepairs    : 1061 (= 24.39%) 
--- Number of 2-draws drawn Gamepairs: 1091 (= 25.08%) 
------------------------------------------------------------------- 
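
The breakdown above suggests how a gamepair is rescored: the points of both games of a pair are added up, and a pair is drawn either when each engine wins one game (1:1) or when both games are drawn. Here is a minimal Python sketch under that assumption. It is not the code of my rescoring tool, and the function name and the labels are made up for this illustration.

```python
from collections import Counter


def classify_pair(game1: float, game2: float) -> str:
    """Classify a gamepair from one engine's point of view.

    game1 and game2 are that engine's points in the two games of the pair
    (1 = win, 0.5 = draw, 0 = loss).
    """
    total = game1 + game2
    if total > 1:
        return "pair won"
    if total < 1:
        return "pair lost"
    # total == 1: either both games were drawn, or each engine won one game.
    return "drawn (2 draws)" if game1 == 0.5 else "drawn (1:1)"


if __name__ == "__main__":
    # Toy data, not taken from the tournament.
    toy_pairs = [(1, 0.5), (1, 0), (0.5, 0.5), (0.5, 0), (0, 1)]
    print(Counter(classify_pair(a, b) for a, b in toy_pairs))
```

With this scoring, a win with one color only counts if the same opening is not lost with reversed colors.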

 


Head to head statistics:

 

1) Rebel Extreme 1.0 3457 :    390 (+113,=189,-88),  53.2 %

   vs.                     :  pairs (   +,   =,  -),   (%) :   Diff,   SD, CFS (%)
   Lc0 791921 CPU          :    130 (  34,  64, 32),  50.8 :     +7,   13,   69.1
   Komodo 14.1 HCE         :    130 (  39,  56, 35),  51.5 :    +26,   13,   97.5
   Revenge 1.0 avx2        :    130 (  40,  69, 21),  57.3 :    +35,   13,   99.6

 

2) Lc0 791921 CPU    3450 :   2770 (+776,=1395,-599),  53.2 %

   vs.                     :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Rebel Extreme 1.0       :    130 (  32,   64,  34),  49.2 :     -7,   13,   30.9
   Komodo 14.1 HCE         :   1320 ( 385,  663, 272),  54.3 :    +19,    5,  100.0
   Revenge 1.0 avx2        :   1320 ( 359,  668, 293),  52.5 :    +29,    5,  100.0

 

3) Komodo 14.1 HCE   3431 :   2770 (+687,=1351,-732),  49.2 %

   vs.                     :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Rebel Extreme 1.0       :    130 (  35,   56,  39),  48.5 :    -26,   13,    2.5
   Lc0 791921 CPU          :   1320 ( 272,  663, 385),  45.7 :    -19,    5,    0.0
   Revenge 1.0 avx2        :   1320 ( 380,  632, 308),  52.7 :    +10,    5,   96.4

 

4) Revenge 1.0 avx2  3421 :   2770 (+622,=1369,-779),  47.2 %

   vs.                     :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Rebel Extreme 1.0       :    130 (  21,   69,  40),  42.7 :    -35,   13,    0.4
   Lc0 791921 CPU          :   1320 ( 293,  668, 359),  47.5 :    -29,    5,    0.0
   Komodo 14.1 HCE         :   1320 ( 308,  632, 380),  47.3 :    -10,    5,    3.6


 

Here is the EAS-Ratinglist, calculated by my EAS-Tool:

                                 bad  avg.win 
Rank  EAS-Score  sacs   shorts  draws  moves  Engine/player 
-------------------------------------------------------------------
   1    149157  25.28%  16.35%  13.58%   73   Revenge 1.0 avx2  
   2    145026  20.66%  12.68%  10.55%   78   Rebel Extreme 1.0  
   3     69036  11.24%  14.67%  28.35%   71   Komodo 14.1 HCE  
   4     54773  07.62%  13.51%  22.67%   72   Lc0 791921 CPU  
-------------------------------------------------------------------
*** Average length of all won games:     72 moves

 

 

A: Most high-value sacrifices (3+ pawnunits)         : [1]:05.22% Revenge 1.0 avx2  
                                                       [2]:03.76% Rebel Extreme 1.0  
                                                       [3]:01.93% Komodo 14.1 HCE  
                                                       [4]:00.26% Lc0 791921 CPU  


B: Most sacrifices overall                           : [1]:25.28% Revenge 1.0 avx2  
                                                       [2]:20.66% Rebel Extreme 1.0  
                                                       [3]:11.24% Komodo 14.1 HCE  
                                                       [4]:07.62% Lc0 791921 CPU  


C: Very short wins (40 moves or less)                : [1]:02.35% Rebel Extreme 1.0  
                                                       [2]:01.59% Revenge 1.0 avx2  
                                                       [3]:01.15% Komodo 14.1 HCE  
                                                       [4]:00.26% Lc0 791921 CPU  


D: Most short wins overall                           : [1]:16.35% Revenge 1.0 avx2  
                                                       [2]:14.67% Komodo 14.1 HCE  
                                                       [3]:13.51% Lc0 791921 CPU  
                                                       [4]:12.68% Rebel Extreme 1.0  


E: Average length of all won games                   : [1]:071 Komodo 14.1 HCE  
                                                       [2]:072 Lc0 791921 CPU  
                                                       [3]:073 Revenge 1.0 avx2  
                                                       [4]:078 Rebel Extreme 1.0  


F: Smallest number of bad draws                      : [1]:10.55% Rebel Extreme 1.0  
                                                       [2]:13.58% Revenge 1.0 avx2  
                                                       [3]:22.67% Lc0 791921 CPU  
                                                       [4]:28.35% Komodo 14.1 HCE