Prof Abimbola Olowofoyeku wrote:
You could look at the Info-ZIP code to see if/how it handles the issue under Unix, or try some zip file with problematic characters and see if unzip under Linux does any conversion and if the result is correct. (If you don't have a Linux machine handy, you can send me such a file to try it.)
unzip fails most miserably under Linux. Of course, "unzip -l" displays the supposed contents of the zip file, but the names are totally wrong (truncated in most cases on encountering the first "foreign" character). "unzip -d" makes a brave attempt to extract the files, but all the "foreign" characters in the filenames are replaced by the "?" character. So the filenames are totally wrong as well. Windows tries to make some sense of the characters - but unzip under Linux doesn't even try (which is what I assume the question marks to mean).
I can send you a sample zip file if you want.
Yes, please.
So the question of how to convert these "foreign" characters to something that the OS can understand remains. Like I said before, there is a simple WinAPI routine that does that, but I am trying to find a portable solution from gcc or other (L)GPL libraries.
I can't tell as long as I don't even know what character set this is in the first place.
Frank