I have a screenscraping thing in place that targets an ASCII only environment (LambdaMOO mud). Manually replacing unicode stuff like this has been a pain in the butt. Luckily that was all just for fun and didn't need to be perfect.
Is there a good library out there (in any language) that does good unicode --> ASCII substitutions for major languages?