[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Weird characters in log, WAS: svn commit: rev 5250 - in trunk/subversion/bindings: java/org/tigris/subversion swig

From: Kai Nacke <kai.nacke_at_redstar.de>
Date: 2003-03-09 22:04:46 CET

On Sunday 09 March 2003 21:36, Philip Martin wrote:
> Branko ÄŒibej <brane@xbc.nu> writes:
> > >0xA0 is ISO-8859-1's non-breaking space.
> >
> > Hmm, I wonder if it's represented as a Unicode unbreakable space
> > (U+000A, encoded in UTF-8, of course) in the repository.
>
> What I see in gdb is
>
> Breakpoint 3, svn_subst_detranslate_string (new_value=0xbffff504,
> value=0x8083d90, pool=0x8083540)
> at ../svn/subversion/libsvn_subr/subst.c:860
> 860 if (value == NULL)
> (gdb) x/16c value->data+69
> 0x8083a8d: 106 'j' 97 'a' 118 'v' 97 'a' 10 '\n' -62 'Â' -96 ' ' 32
> ' ' 0x8083a95: 65 'A' 100 'd' 100 'd' 32 ' ' 97 'a' 116 't' 116 't'
> 114 'r' (gdb)fin
> Run till exit from #0 svn_subst_detranslate_string (new_value=0xbffff514,
> value=0x8083d90, pool=0x8083540)
> at ../svn/subversion/libsvn_subr/subst.c:860
> log_message_receiver (baton=0xbffff8cc, changed_paths=0x0, rev=5250,
> author=0x80839f8 "2003-03-08 21:09:02 +0000 (Sat, 08 Mar 2003)",
> date=0x8083580 "2003-03-08T21:09:02.623645Z",
> msg=0x80835a0 "*
> subversion/bindings/java/org/tigris/subversion/SubversionException.java\nÂ
> Add attributes status, file, line and public getter methods. Changed
> constructors\n to reflect new attributes. Added Throw"...,
> pool=0x8083540) at ../svn/subversion/clients/cmdline/log-cmd.c:222 222
> msg_native = logmsg->data;
> Value returned is $2 = (struct svn_error *) 0x0
> (gdb) x/16c logmsg->data+69
> 0x8085195: 106 'j' 97 'a' 118 'v' 97 'a' 10 '\n' -96 ' ' 32 ' ' 65
> 'A' 0x808519d: 100 'd' 100 'd' 32 ' ' 97 'a' 116 't' 116 't' 114 'r'
> 105 'i'
>
>
> So the repository appears to contain 0xC2 0xA0 which gets converted to
> 0xA0 in my C locale.

I looked with emacs in the file I used for the log message. There is the sequence:

 'j' 'a' 'v' 'a' '\n' ' ' ' ' 'A' 'd'
0x6a 0x61 0x76 0x61 0x0a 0xa0 0x20 0x41 0x64

The 0xC2 is a UTF-8 prefix, right?

Kai

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sun Mar 9 22:02:38 2003

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.