THE CONCEPT OF A RESOLUTION RULE AND ITS APPLICATIONS
I think that the first post of a thread should somehow define what its creator wants and does not want to discuss.
The purpose of this thread is to discuss applications of the concept of a resolution rule to longstanding (real or factitious) problems (e.g. T&E).
More generally, it is also to apply the new global conceptual framework introduced in my book, The Hidden Logic of Sudoku - HLS in the sequel), a framework based on first order logic.
A large excerpt of HLS has been available on my Web pages since the end of May. You may choose to ignore it, but you are supposed to have read it if you participate in this thread.
The basic definitions I'm repeating briefly here have first appeared in HLS and on several occasions on this forum.
RESOLUTION RULES AND RESOLUTION PATHS
Definition: A resolution rule is a formula of first order logic (or predicate calculus), in the language of Sudoku, that satisfies the two conditions:
- it is a logical consequence of the Sudoku axioms,
- it is written in the condition => action form, where "action" can only be the assertion of a value or the negation of a candidate in some cells associated to the pattern (the target cells).
The condition is also called the pattern of the rule.
The language of Sudoku is built on a very limited number of primary predicates and associated atomic formulæ:
n1 = n2
r1 = r2
c1 = c2
b1 = b2
value(n, r, c)
candidate(n, r, c)
where n, r, c,… are variables (and = is the usual equality sign)
Auxiliary predicates can of course be defined. Formally, they are just formulæ with free variables.
For instance, share-a-unit(r1, c1, b1, r2, c2, b2) is defined as: r1 = r2 or c1 = c2 or b& = b2
A resolution rule is proven once and for all, and the way it has been proven, by valid mathematical methods, has no impact of any kind on its logical validity.
Being logically valid doesn't prevent a rule from being more or less easy to apply, but this has nothing to do with the way it has been proven. The problems
- of proving a rule once and for all (generally these proofs are very straightforward)
- and of spotting the occurences (instantiations) of its defining pattern on a real grid
are totally independent.
Definition: A resolution path is a sequence of occurences of resolution rules with their conclusions - with no additional ad hoc pieces of reasoning. A resolution path is thus the mathematical proof of the solution.
Unfortunately, examples given in forums very rarely satisfy this definition, although they often could.
This is mainly due to:
- ambiguous examples with incomplete resolution paths and inadapted notations,
- a generalised confusion, on this forum and others, between the description of a purely factual situation (e.g. the existence of a general link in my sense, i.e. a unit being shared between two cells, or of a c-link or conjugacy link) and the way this can be used in an inference step. In this regard, the notions of strong and weak links are certainly the most confusing ones that can have been introduced - leading e.g. to hyper-realistic debates on whether a strong link is also a weak link.
EXAMPLE 1: CHAINS
Most advanced resolution rules are based on chains of various kinds.
Definition: A (2D) chain is a sequence (i.e. a strictly, linearly ordered list) of cells such that each cell in the chain (but the first) shares a unit with (synonymous: is linked to) the previous one and such that the first and last are different.
As a consequence, a chain has no global loop. It may have internal loops, although I have proven that this is useless for some types of chains (xy- or c- chains).
We shall be interested in specific types of chains, to which resolution rules can be associated. Specific chains will satisfy addditional conditions, but they will always first be chains in the above, quasi "physical" sense.
Generally, the condition (or the pattern) defining a specific kind of chain and the associated rule are written in a graphico-logical form, i.e. not explicitly as a logical formula but in a (much more compact) form that can be translated straightforwardly into a logical formula (the details of this translation are given in HLS). The graphico-logical form is considered as a shorthand for the logical formula.
Example:
xy4-pattern: {1 2} - {2 3} - {3 4} - {4 1}
xy4- rule: * {1 2} - {2 3} - {3 4} - {4 1}* => TC≠1
where:
- curly brackets describe the content of cells,
- numbers are shorthands for variable names (1 for x1, 2 for x2, …),
- "-" denotes a link between cells,
- the starred cells are those that MUST be linked to a target cell; the two endpoints are always starred (for xy-chains, only the two endpoints are starred; but additional cells can be starred in xyzt-chains),
- TC is any target cell, i.e. any cell having a link (possibly a different one) with both endpoints of the chain.
Notice that the conclusion of the rules is almost always "TC≠1", a symbolic notation for negating candidate x1 in celll TC (only very few rules assert values).
A target cell of a chain NEVER belongs to the chain. This is very important for two reasons: 1) all the chain rules that I have introduced in HLS can thus be based on HOMOGENEOUS patterns; 2) it allows a chain to have several targets.
Here homogeneous means that the pattern is a sequence of similar bricks. In particular, a target cell of an xy-chain doesn't have to be linked to its endpoints by xy-links. Homogeneity couldn't be granted if the target cell had to be included in the pattern. This applies to chains of all types, not only xy.
This also allows to discard irrelevant distinctions depending on the type of links a target cell has with the chain (such as Nice Loops being "continuous" or "discontinuous"

I have proven in HLS that all xy-loops or Nice Loops (not considering subset links) can be reduced to such open chains (this doesn't forbid using loops, but you should be aware that they do not lead to more eliminations).
EXAMPLE 2: GENERALISED SYMMETRIES AND HIDDEN CHAINS
A general theorem proven in HLS is the following:
Theorem (informal version): If a resolution-rule is block-free, i.e. doesn't mention blocks, then the two formulæ obtained by permuting the words "column" and "number" (resp. "row" and "number"

This abstract generalised symmetry property led me to introduce the rn- and cn- spaces (or representations).
Theorem: If a formula is block-free, then it is valid for Sudoku if and only if it is valid for Latin Squares.
Informally speaking, all and only the rules of Latin Squares can be applied in the rn- and cn- spaces.
No one should complain about having no blocks in the rn- and cn- spaces. This is not my choice but a consequence of the Sudoku axioms.
Many resolution rules can thus be transposed to the rn- and cn- spaces and I have designed an extended Sudoku Board in order to facilitate their use - so that this abstract symmetry becomes very concrete for the player.
This applies to the above example: xy-chains give rise to hxy-rn- and hxy-cn- chains.
hxy-chains are special case of AICs, but cases that can be looked for (in their 2D "base space" - see below) as if they were simple xy-chains,
Moreover the new types of chains introduced in HLS (xyt- and xyzt-) also have transposed versions, hxyt- or hxyzt- chains, which are also completely new types of chains.
A 2D chain has a "base space": rc, rn or cn, i.e. the cells and the links in the defining pattern must be interpreted as rc-, rn- or cn- cells and links
In rc- space, there are three types of links: row, column and block.
In rn- space, there are two types of links: row and number.
In cn- space, there are two types of links: column and number.
Since the conclusion of a chain rule is always the same (=> TC≠1), it can be omitted from the graphico-logical representation and included in the " logically valid" sign (that I'll write as "/-" because of problems on this forum with special characters). We thus get the graphico-logical representations for the xy4-, the hxy4-rn and the hxy4-cn resolution rules:
rc /- {1 2}* - {2 3} - {3 4} - {4 1}*
rn/- {1 2}* - {2 3} - {3 4} - {4 1}*
cn/- {1 2}* - {2 3} - {3 4} - {4 1}*
EXAMPLE 3: FULLY SUPERSYMMETRIC CHAINS (3D chains)
The above examples of resolution rules can be generalised to "3D chains".
Recently I've introduced the fully supersymmetric ("3D"

This also shows that the two apparently conflicting views of chains (chains of cells as in 2D chains and chains of candidates as in 3D chains) are fully compatible.
For details, see the thread on "fully supersymmetric chains".
RESOLUTION RULE vs RESOLUTION TECHNIQUE vs REPRESENTATION TECHNIQUE
Whereas I have given the phrase "resolution rule" a precise, purely logical definition, it is not the case for the word "technique". Fist, I think two different kinds of techniques should be defined.
By "resolution technique", we generally mean a procedure that can help get closer to a solution.
Definition: T&E is the following procedure: recursively make ad hoc hypotheses on something that you do not know to be true at the time you make them, explore their consequences, with the possibility of retracting each of these hypotheses and their consequences if they lead to a dead end.
T&E is a resolution technique is the above sense.
Some colouring techniques, depending on what one means by this, can be resolution techniques in this sense.
By "representation technique", we generally mean a representation that can be the support for several resolution techniques.
Marking conjugate candidates with upper and lower case letters is a technique in this sense. It can be used for finding Nice Loops or AICs.
The rn- and cn- spaces I've introduced in HLS are representation techniques in this sense.
First applications of these notions:
Theorem: Trial and Error is a resolution technique but not a resolution rule
Proof: since T&E can be applied any time a cell has at least two candidates, if it was expressible as a resolution rule, the condition of this rule could only be:
"if a cell has at least two candidates".
But it is obvious that, from such a condition, no general conclusion can be obtained (no value can be asserted and no candidate can be eliminated).
This should put an end to many factitious debates about T&E and to the absurd assimilation of some types of resolution rules with T&E, e.g. under the pretext that their proof uses reasoning by cases.
Similarly, tabling is a resolution technique, but not a resolution rule. Because it is based on the ad hoc propagation of complementary hypotheses, tabling is a (limited) form of T&E.
A given resolution rule can, in practice, be applied in various ways. Said otherwise, different resolution techniques can sometimes be attached to a single resolution rule
This distinction is important. Forgetting it may lead to the absurd assimilation of an algorithm and a logical formula - two notions that do not belong to the same universe of discourse (CS vs logic).
An example of this has been given in the xyt-thread, where my initial technique of drawing arrows from the right linking candidate of a cell to the left linking candidate of the next cell and John's modified xy-colouring technique can both be used to spot xyt-chains.
As another alternative, within each associated technique, the search for an xyt-chain can start from a target cell (e.g. because one would like to eliminate some candidate in this cell) or from the first cell in the chain or from a previously obtained shorter chain.
Validity of a resolution rule is based on logic. Validity of a technique is based either on its conformity with the underlying resolution rule if there is one OR on the algorithm describing it.
For a technique that is not based on a resolution rule, it may be very difficult to guarantee that it doesn't amount to T&E.
Most, but not all, of the usual techniques could be re-written as resolution rules.
EXAMPLES OF TOPICS THAT COULD BE USEFULLY DEBATED IN THE PRESENT FRAMEWORK
How can we compare the complexities of two resolution rules?
How can we compare the complexities of two resolution techniques associated to the same resolution rule?
Is there a simple relation between the logical syntactic complexity of a rule and the difficulty of spotting its instantiations?
Given a resolution rule, which technique is best fitted to spot its occurrences in a real grid?
What are the relevant properties of a resolution rule? (being based on a homogeneous pattern, being 2D,…)
What are the relevant properties of a resolution technique? (being based on a resolution rule; for a chain: being more or less extensible in both directions, being based on a target cell, …)
This list is obviously non exhaustive.