Stefan Pohl Computer Chess

Home of famous UHO openings and EAS Ratinglist


SPCC Super 3 Tournament

 

 

Latest update: 2024/09/02 (next updates will follow every 8-12 days...)

Tournament overall runtime (average: 47.2 games per day are played): 111 days

Download all played games since start (2024/05/13) here

 

An endless RoundRobin-tournament with 3 engines, which are at the same level of strength (around 3400 Elo) but are completely different in their inner structure and their way of thinking.

Why? The strongest engine since more a decade (Stockfish) is open source, so many, many other engines are (at least) "inspired" by Stockfish... And additionally, a lot of engines (including Stockfish!) are using Lc0-training-data for building their neural nets: The high-end computerchess has become very incestuous... To say this clear here: This is not good or bad, it is just the reality of high-end computerchess in these days.

So, IMHO, it is very interesting to run a tournament with engines, which are completely different, not only in their playing-style, but also in their inner structure and way of thinking, but on a close level of playing-strength.

The Super 3 tournament is not about the results (as you can see below, all 3 engines are at the same level of stregth), but about generating interesting enginegames.

             | Search    | Evaluation   | nodes per second (early middlegame)
-------------|-----------|--------------|-------------------------------------
Lc0 CPU      | MCTS      | float-neural |      1.100
-------------|-----------|--------------|-------------------------------------
Revenge 1.0  | AlphaBeta | int-neural   | 11.000.000 (10.000x faster than Lc0)
-------------|-----------|--------------|-------------------------------------
Komodo 14.1  | AlphaBeta | Handcrafted  | 19.000.000 (17.300x faster than Lc0)
-------------|-----------|--------------|-------------------------------------

As you can see, these 3 engines have nothing in common, considering their way of thinking. Komodo 14.1 uses a classical handcrafted evaluation, Lc0 CPU uses a (float) neural net, and Revenge 1 uses a (integer) nnue-net, like most modern engines in these days. Because floating-point calculations are brutally slow on CPUs, Lc0 CPU is way slower than the 2 opponents... (this is the reason, why Lc0 normally runs on the GPU, not the CPU). If you want to learn more about the neural-net of Lc0 and about nnue-nets, I recommend this e-book by Dominik Klein, which can be downloaded for free as a PDF-file.

And, additionally, Lc0 uses a complete different search (MCTS). And Revenge 1 is one of the most aggressive playing engines of all time (see the EAS-Ratinglist below).

 

HardwareAMD Ryzen 7840HS 8-core (16 threads) notebook with 32GB RAM. Turboboost off.

Speed: See above. Each engine uses 14 threads, when thinking (Lc0 cpu dnll has the UCI option "Threads" like any normal CPU-engine, so it uses the CPU like all other engines) - the GPU stays (of course) unused.

Hash: 8 GB per engine (20.000.000 NNCachesize for Lc0 - enough for storing all evaluated positions of a complete game)

GUI: CutechessGUI (GUI ends game, when a 6-piece endgame is on the board, all other games are played until mate or draw by chess-rules (3fold, 50-moves, stalemate))

Tablebases: None for engines, 6 Syzygy for CutechessGUI

Openings: My UHO_2024_8mvs_+085_+094.pgn openings are used (randomly mixed, each opening repeated with reversed colors, of course (=Gamepairs))

Ponder, Large Memory Pages & learning: Off

Thinking time: 10min+5sec per game/engine (average game-duration: 30 minutes), so only 50 games are played in 24 hours = high quality enginechess

 

Here you can see the shortest wins with sacrifices, played between the latest two site-updates, filtered by my Interesting Wins Search Tool. Download this cool tool in the "Downloads & Links" section or right here

Many thanks to ChessBase for the pgn-replayer tool, which is very easy to use (only 3 lines of code!) and very powerful - use the fan (propeller?)-icon right near the arrows below the chessboard, to start and stop the online-analyzing with the Fritz-engine! Perhaps you have to clear your browser-cache to see the latest games - otherwise the pgn-replayer does not update the games correctly...if you can not see the chessboard, check, that your browser has Javascript activated or if an AdBlocker is the problem.

 

 

 

 

Below the results (first normal Celos (by ORDO), followed by gamepair-rescored Celos, followed by EAS-Ratinglist).

 

     Program               Celo    +    - Games    Score   Av.Op. Draws

   1 Lc0 791921 CPU      : 3450    5    5  3496    51.4%   3440   48.3%
   2 Komodo 14.1 HCE     : 3443    5    5  3496    50.0%   3443   49.1%
   3 Revenge 1.0 avx2    : 3437    6    6  3496    48.6%   3447   48.6%


Games        : 5244 (finished)

White Wins   : 2539 (48.4 %)
Black Wins   : 153   (2.9 %)
Draws        : 2552 (48.7 %)


Gamepairs:

   # PLAYER              :    Celo  Error   Pairs    W    D    L   (%)  CFS(%)
   1 Lc0 791921 CPU      :    3450   ----    1748  478  880  390  52.5      99
   2 Komodo 14.1 HCE     :    3435     13    1748  428  870  450  49.4      80
   3 Revenge 1.0 avx2    :    3429     13    1748  408  866  474  48.1     ---

 

------------------------------------------------------------------- 
--- Number of all Gamepairs          : 2622 
--- Number of drawn Gamepairs overall: 1308 (= 49.89%) 
--- Number of 1:1 drawn Gamepairs    : 651  (= 24.83%) 
--- Number of 2-draws drawn Gamepairs: 657  (= 25.06%) 
------------------------------------------------------------------- 

 


Head to head statistics:

 

1) Lc0 791921 CPU   3450 :   1748 (+478,=880,-390),  52.5 %

   vs.                    :  pairs (   +,   =,   -),   (%) :   Diff,   SD, CFS (%)
   Komodo 14.1 HCE        :    874 ( 246, 442, 186),  53.4 :    +15,    7,   98.5
   Revenge 1.0 avx2       :    874 ( 232, 438, 204),  51.6 :    +21,    7,   99.9

 

2) Komodo 14.1 HCE  3435 :   1748 (+428,=870,-450),  49.4 %

   vs.                    :  pairs (   +,   =,   -),   (%) :   Diff,   SD, CFS (%)
   Lc0 791921 CPU         :    874 ( 186, 442, 246),  46.6 :    -15,    7,    1.5
   Revenge 1.0 avx2       :    874 ( 242, 428, 204),  52.2 :     +6,    7,   80.5

 

3) Revenge 1.0 avx2 3429 :   1748 (+408,=866,-474),  48.1 %

   vs.                    :  pairs (   +,   =,   -),   (%) :   Diff,   SD, CFS (%)
   Lc0 791921 CPU         :    874 ( 204, 438, 232),  48.4 :    -21,    7,    0.1
   Komodo 14.1 HCE        :    874 ( 204, 428, 242),  47.8 :     -6,    7,   19.5


 

Here the EAS-Ratinglist, calculated by my EAS-Tool:

                                 bad  avg.win 
Rank  EAS-Score  sacs   shorts  draws  moves  Engine/player 
-------------------------------------------------------------------
   1    160509  26.27%  17.43%  12.94%   72   Revenge 1.0 avx2  
   2     76863  11.35%  16.18%  27.76%   71   Komodo 14.1 HCE  
   3     53654  08.29%  12.49%  22.97%   73   Lc0 791921 CPU  
-------------------------------------------------------------------
*** Average length of all won games:     72 moves

 

 

A: Most high-value sacrifices (3+ pawnunits): [1]:05.42% Revenge 1.0 avx2   

                                              [2]:02.25% Komodo 14.1 HCE   

                                              [3]:00.21% Lc0 791921 CPU 


B: Most sacrifices overall                  : [1]:26.27% Revenge 1.0 avx2   

                                              [2]:11.35% Komodo 14.1 HCE   

                                              [3]:08.29% Lc0 791921 CPU 


C: Very short wins (40 moves or less)       : [1]:01.65% Revenge 1.0 avx2   

                                              [2]:00.90% Komodo 14.1 HCE   

                                              [3]:00.31% Lc0 791921 CPU 


D: Most short wins overall                  : [1]:17.43% Revenge 1.0 avx2   

                                              [2]:16.18% Komodo 14.1 HCE   

                                              [3]:12.49% Lc0 791921 CPU  


E: Average length of all won games          : [1]:071 Komodo 14.1 HCE   

                                              [2]:072 Revenge 1.0 avx2   

                                              [3]:073 Lc0 791921 CPU 


F: Smallest number of bad draws             : [1]:12.94% Revenge 1.0 avx2   

                                              [2]:22.97% Lc0 791921 CPU   

                                              [3]:27.76% Komodo 14.1 HCE