Re: utf8 filenames in html browser

From: Fumitoshi UKAI <ukai_at_debian.or.jp>
Date: 2003-03-11 16:28:11 CET

Hi,

At Tue, 11 Mar 2003 13:26:06 +0000,
Chris Croome wrote:

> On Tue 11-Mar-2003 at 05:09:12AM -0800, solo turn wrote:
> > svn now even works with utf8 filenames. one little thing seems to
> > be left there: if you browse it maybe with internet explorer 6, it
> > is not displayed correctly. i.e. "?" (e with accent) becomes a
> > square on the display.
> >
> > is there a solution for this, or would this be an issue for svn?
>
> This is not an issue for svn; it's a user agent / OS issue.

Well, there is still an issue in svn's support for non-Latin characters.
With LANG=ja_JP.eucJP, I can commit a file with a Japanese filename in EUC-JP
encoding and handle it with the svn command line, but the HTML pages served
via mod_dav_svn show a broken filename.

For example,

 % echo $LANG
 ja_JP.eucJP
 % echo test > テスト.txt
 % svn add テスト.txt
 A テスト.txt
 % svn commit -m 'add Japanese filename'
 Adding tmp/テスト.txt
 Transmitting file data .
 Committed revision 81.

So handling a Japanese filename with the svn command works fine.

But the HTML page for this repository looks like this:

 HTTP/1.1 200 OK
 Date: Tue, 11 Mar 2003 15:13:05 GMT
 Server: Apache/2.0.44 (Debian GNU/Linux) DAV/2 SVN/0.18.1
 ETag: "81//tmp"
 Accept-Ranges: bytes
 Content-Length: 707
 Connection: close
 Content-Type: text/html; charset=UTF-8

 <html><head><title>Revision 81: /tmp</title></head>
 <body>
  <h2>Revision 81: /tmp</h2>
  <ul>
   <li>..</li>
   <li>????????????.txt</li>
  </ul>
  <hr noshade><em>Powered by Subversion version 0.18.1 (r5118).</em>
 </body></html>

where the list item line for テスト.txt, dumped byte by byte, is:
0000000           <   l   i   >   <   a       h   r   e   f   =   "   %
0000020   c   2   %   a   5   %   c   3   %   8   6   %   c   2   %   a
0000040   5   %   c   2   %   b   9   %   c   2   %   a   5   %   c   3
0000060   %   8   8   .   t   x   t   "   > 302 245 303 206 302 245 302
0000100 271 302 245 303 210   .   t   x   t   <   /   a   >   <   /   l
0000120   i   >  \n
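
As a cross-check on the dump, here is a minimal Python sketch (not part of the original session; the variable names are only illustrative) that percent-decodes the href value and compares it with the raw bytes of the link text. Both strings are copied from the dump; nothing else is assumed.

```python
from urllib.parse import unquote_to_bytes

# href value and link text bytes, copied from the dump
href = "%c2%a5%c3%86%c2%a5%c2%b9%c2%a5%c3%88.txt"
link_text_octal = "302 245 303 206 302 245 302 271 302 245 303 210"

# Percent-decode the href into raw bytes
href_bytes = unquote_to_bytes(href)

# Rebuild the link text from its octal byte values, plus the ASCII suffix
text_bytes = bytes(int(o, 8) for o in link_text_octal.split()) + b".txt"

# The href and the visible link text carry the same 12-byte name
print(href_bytes == text_bytes)  # True
```

So the URL and the displayed name agree with each other; whatever goes wrong happens before both are written out.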

Note that テスト is
 EUC-JP  245 306 245 271 245 310              (octal)
         a5 c6 a5 b9 a5 c8                    (hexadecimal)
 UTF-8   343 203 206 343 202 271 343 203 210  (octal)
         e3 83 86 e3 82 b9 e3 83 88           (hexadecimal)
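
These byte values can be reproduced with a short Python sketch. It assumes Python's standard `euc_jp` and `utf-8` codecs; the helper name `dump` is hypothetical and only there for illustration.

```python
def dump(name, codec):
    """Return the encoded bytes of name as (octal, hexadecimal) strings."""
    raw = name.encode(codec)
    octal = " ".join(f"{b:03o}" for b in raw)
    hexa = " ".join(f"{b:02x}" for b in raw)
    return octal, hexa

name = "テスト"
for codec in ("euc_jp", "utf-8"):
    octal, hexa = dump(name, codec)
    print(codec, octal, "(octal)")
    print(codec, hexa, "(hexadecimal)")
```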

Why does mod_dav_svn output the following instead?
        302 245 303 206 302 245 302 271 302 245 303 210 (octal)
        c2 a5 c3 86 c2 a5 c2 b9 c2 a5 c3 88 (hexadecimal)
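
One reading that fits these bytes, offered here only as a hypothesis and not as a diagnosis of mod_dav_svn: the EUC-JP bytes are treated as ISO-8859-1 characters and then converted to UTF-8. The Python sketch below checks just the byte arithmetic of that assumption; it says nothing about where in svn such a conversion would happen.

```python
name = "テスト"

# Bytes observed in the HTML output (hexadecimal, from the listing above)
observed = bytes.fromhex("c2a5c386c2a5c2b9c2a5c388")

# Assumption: the EUC-JP bytes are misread as ISO-8859-1 (Latin-1) text,
# and that text is then encoded as UTF-8.
euc_bytes = name.encode("euc_jp")
double_encoded = euc_bytes.decode("latin-1").encode("utf-8")

print(double_encoded == observed)        # True: the hypothesis matches
print(name.encode("utf-8") == observed)  # False: not the correct UTF-8
```

Each EUC-JP byte here is in the range 0x80 to 0xff, which is why every one of them turns into a two-byte UTF-8 sequence starting with c2 or c3.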

Regards,
Fumitoshi UKAI

Received on Tue Mar 11 16:29:03 2003

This is an archived mail posted to the Subversion Dev mailing list.
