Awesome module, an answer to the question I asked several YEARS ago: http://drupal.org/node/599404

So, an extension of my earlier question:

What if I have variant characters that can't be classified as cn, tw, sg, or hk? For example, classical or out-of-use forms, or Korean and Japanese variants. The most complete list I've seen is here:

http://hkiug.ln.edu.hk/unicode/hkiug_tsvcc_table-UnicodeVersion-1.0.html