The reference C code was written by Ted Krovetz. It is intended to clarify the algorithm and is not optimized for speed.
Other code (e.g., Java, assembly, or optimized C) will be made available if contributed; if you've written nice code, feel welcome to send it in!