Why DLX?

redtan · Posted: Sat Nov 22, 2008 4:45 am Post subject: Why DLX?

I'm doing Final Project for my Bachelor Degree. I'm creating application that can solve sudoku puzzle. From this forum I discovered that DLX is the most popular algorithm to do that (since there are so many use it in this forum.

But my problem is that find proof that DLX is better than the others (such as GA, ACO, etc) since my supervisor want me to write the reason why I use DLX instead of others in my project report.

Anyone can help?

Adak · Posted: Sat Nov 22, 2008 8:45 am Post subject:

I don't believe DLX is the most popular algorithm used to solve Sudoku puzzles.

Probably because most people have not heard of DLX, and don't know what it's especially good for, etc.

IMO the most popular way for a program to solve a Sudoku puzzle is to first, run the puzzle through some functions that mimic the way a human would start to solve the puzzle - forced singles in rows, columns, or boxes.

Afterward, a variety of other "human" type logic functions are used to try and decrease further the number of possibles remaining.

At some point, if the puzzle isn't solved yet, the logic functions stop, and the brute force "guessing" function, starts. Here, the remaining possibles (and the cells they came from), are sorted from least to most, and then
searched by trial and error.

The most efficient way to solve Soduku, imo, is by using bitboards. Check out Brit's blazing bit-based solver, as an excellent example, in this forum. It is not only the fastest solver I've tested, but it is fairly new, and still being optimized.

Bitboards are "different" to work with, and even for experienced programmers, take a while to really get comfortable with, and use well.

The advantage of course, is that in just one or two machine cycles, you can do work, with multiple squares, all at the same time.

It's chess, but check out : www.talkchess.com/ particularly Robert Hyatt's posts on the matter, in the Programming forum at talkchess. His chess program "Crafty", is a famous bitboard based, and open source, program.

I'm a hobby programmer, and have enough trouble with mail-box chess programs, but you have to admire the beauty of bitboards for tasks like
Sudoku.

DLX is a better algorithm than what I use in my solver (described above), but bitboards are better still.

m_b_metcalf · Posted: Sat Nov 22, 2008 8:52 am Post subject: Re: Why DLX?

Lunatic · Posted: Sat Nov 22, 2008 11:57 am Post subject:

Adak · Posted: Sun Nov 23, 2008 2:32 pm Post subject:

Did we ever get a chance to see what Merri's code was doing?

It sounded great, but I didn't see any code posted for it. Is an executable for Windows available for Merri's program?

Lunatic · Posted: Sun Nov 23, 2008 10:33 pm Post subject:

zerothbase · Posted: Sat Nov 29, 2008 6:06 pm Post subject:

Ok, I'll bite: What are "GA" and "ACO"? Genetic Algorithm or Greedy Algorithm? Ant Colony optimization?

All of these don't really work for the case of Sudoku. All 3 of are meant for problems where "grey area" solutions are allowed. For example, the travelling saleman problem allows for sub-optimal solutions (a slightly longer path), as long as you get close to the optimal solution via the above heuristics. There is, of course, one "right" answer - the most optimal path, but to find it may require a very long time (depending on the problem size), even with these heuristics.

Sudoku is an exact cover problem, in that either you have a solution or you don't. There is no grey area. This is similar to pentominoes and the n-queens problem. The question is how many ways are there to exactly cover a given problem space. That is, if you find a "sub-optimal solution" - you don't have a solution, you have a conflict (e.g. two 5's on a single row etc).

DLX is designed to count (and iterate through) all of the solutions in a very fast manner. This goal of DLX (fast counting of exact solutions) seems to fit well with what most programmers want out of a solver - give me the number of solutions, and give it to me fast (assuming you aren't creating a human-logic based solver). Can you guarantee that GA or ACO would find all possible solutions to a given problem? Or might it just get stuck in some local optima and return the wrong number of solutions (e.g. return 0 when there is a solution, or 1 when there are multiple)? If you add code to get out of local optima, how much would that slow that whole algorithm down?

The standard 9x9 (or 3x3 if you prefer) sudoku is what Knuth describes as a small problem. You can use specialty algorithms that have been tuned (bitboards) to be faster than DLX, but DLX definitely holds its own. On larger problems (16x16, 25x25...) DLX should close that gap and eventually overtake bitboards (though I have yet to see absolute proof of when that happens).

If you have other algorithms that compete with DLX for performance and precision, I would love to see them. A genetic algorithm for sudoku solving could be very cool - though I expect it would be slow. If you can break 1000 puzzles/sec/GHz, you are doing well, and if you can hit 5000 puzzles/sec/GHz, you are competitive with bitboards and DLX.

--Zerothbase

Adak · Posted: Sat Nov 29, 2008 6:50 pm Post subject:

Looking at the contest times he achieved, his solver is still fast. I'm very surprised at the speed his test showed on a file of grids.

He obviously has some very efficient optimizations.

I would very much like to ask you to make up the executable for his program!

Thanks much for the offer, Lunatic. I thought he was pretty much boasting about his own program, but the test showed it was no boast, at all. I'll have to study his description of his program.

Lunatic · Posted: Sat Nov 29, 2008 8:11 pm Post subject:

zerothbase · Posted: Sat Nov 29, 2008 9:19 pm Post subject:

Lunatic · Posted: Sat Nov 29, 2008 10:51 pm Post subject:

JasonLion · Posted: Sun Nov 30, 2008 12:59 am Post subject: