Large families of resolution rules

berthier

In my book ("The Hidden Logic of Sudoku") and in the associated solver (SudoRules), I consider 4 and only 4 families of resolution rules.

A) First it may be useful to state formally what I call a resolution rule. It is a logical formula which:
1) is a logical consequence of the 4 constraints defining the game,
2) has the form of a production (or condition-action) rule; the action part can only be the addition of a value or the deletion of a candidate.
Rules of this form are "operational": they tell you what to do, whereas a constraint only specifies a desired final state.
All the rules debated in this forum have this form (or can be rewritten so).

This approach is motivated by my goal: modelling the rules used by human players.

Notice that the 4 constraints defining the game are obviously NOT resolution rules. In order to get a resolution theory (a set of resolution rules), they have to be repalced by some of their logical consequences. The problem is that no complete set of resolution rules equivalent to the initial problem is known. This is probably the reason why we are all talking of Sudoku.

B) Now for my four families. Trying to find a limited number of rule types was motivated by two reasons:
- one practical: if a player has to search a puzzle for hundreds of different patterns, this may make the game a little tedious (but I understand that this might also be part of the fun),
- one theoretical: many rules are proposed, but the relationships between them and their usefulness are far from being clear.

So I have:

1) the family of the elementary constraints propagation rules. The word "constraint" in the name may be misleading, but they are indeed resolution rules in the above sense (they are an obvious operational reformulation of the 4 initial constraints).

2) the family of the four well known interaction rules: block --> row, row --> block, column --> block and block --> column

3) the family of subset rules. Naked Single, … Naked Quadruplets and their Hidden and "Super Hidden" counterparts (known as X-Wing, Swordfish and Jellifish). This family is proven to be strongly closed under all the symmetries of the game: no other resolution rule can be obtained from them by any symmetry. Apart from this result, there is nothing new in these three families.

4) The family of chain rules. This is where things get interesting. This family can be split into three sub-families:
a) xy-chains and their extension xyt-chains,
b) xyzt-chains,
and c) c-chains,
each of these sub-families being completed with the hidden counterparts (hxy, hxyt, hxyzt chains - there are no hidden c-chains). There are no super-hidden chains (more exactly, hidden chains have a symmetry property that makes them equal to their super-hidden counterparts). Chains have a type and a length.

(Although I have some rules for uniqueness and T&E, I do not consider them here and in the following).

C) Families 3 and 4 are stratified according to the number of cells necessary to define the associated patterns (in the proper representation space). This gives the following definition for my levels. Every level is defined in such a way that it is closed under all the symmetries of the problem.

Level 1_0 is only family 1 + Naked Single and Hidden Single
Level 1 adds the interaction rules
Level 2 adds the subset rules for pairs
Level 3 adds the subset rules for triplets and the chain rules for chains of length 3 (equivalent to XY-Wing and XYZ-Wing)
Level 4_0 adds the subset rules for quadruplets
Level 4 adds all the chains rules for chains of length 4
For any n > 4, level n adds to the previous level the chains rules for chains of length n.

D) Finally, what seems interesting here is that with a very limited number of rule types I get the following results (with all the chains being limited to length 13):
- 99.68% of the Royle collection of 17-minimal puzzles are solved;
- 97% of the randomly generated puzzles are solved;
- also, as appears from my "online supplements" to the book, lots of puzzles designed to illustrate additional rules one can find in this forum can be solved using only my set of rules.
Notice that these results could probably still be improved if I allowed combinations of chains of various types (I have some theorems allowing to do this), which I have not (or not yet) implemented in SudoRules.

This is not to say that my set of rules is better than any other (until a complete resolution theory is found, no one can make such claims). On the contrary, I think that having a profusion of rules is very useful (and probably a source of fun - for both the inventor and the player). But I also think that my results prove that it is worth trying to organise these rules into large families.

Finally-finally, how do I analyse these results? The detailed classification results in chapter XXI leave no doubt on the power of the two general types of rules I have introduced: xyt-chains and hidden chains.

gsf

berthier

gsf

gsf

berthier

gsf

berthier