- http://ta.wikipedia.org/wiki/%E0%AE%B5%E0%AE%BF%E0%AE%95%E0%AF%8D%E0%AE%95%E0%AE%BF%E0%AE%AA%E0%AF%8D%E0%AE%AA%E0%AF%80%E0%AE%9F%E0%AE%BF%E0%AE%AF%E0%AE%BE:%E0%AE%86%E0%AE%B2%E0%AE%AE%E0%AE%B0%E0%AE%A4%E0%AF%8D%E0%AE%A4%E0%AE%9F%E0%AE%BF
- http://ta.wikipedia.org/wiki/விக்கிப்பீடியா:ஆலமரத்தடி
- http://tawp.in/r/262
In a modern browser, the URL shows like the second option, it is sad that it reverts to gobbledygook when it is copy pasted. There must be a reason that once was valid, the question is if this reason is still valid.
Thanks,
GerardM
I believe it's for compatibility. The browser doesn't know whether the eventual target of the cut-n-paste can deal with Unicode IRIs (the much more attractive version with literal characters) or will need traditional encoded ASCII URLs, so following the principle of least surprise the fully-compatible ASCII URL is used for cut-n-paste.
ReplyDeleteSmart software (including wikis, blog posting systems etc) should be able to detect the encoded UTF-8 and, like the browsers themselves do, transform it back into the pretty Unicode IRI form when showing it again, but it's not done as widely as it could be.