Re: UnicodeDecodeError getting revision log with Python

From: C. Michael Pilato <cmpilato_at_collab.net>
Date: Mon, 01 Jun 2009 13:01:00 -0400

Jack Bates wrote:
>> David James wrote on Thu, 28 May 2009 at 15:58 -0700:
>>> Subversion standardizes on UTF-8 internally, but I don't
>>> think we verify that log messages are UTF-8 -- I think that users can
>>> encode their log messages using any encoding they want.
>> As of 1.6, we do. See validate_prop() in libsvn_repos/fs-wrap.c.
>
> May I file a feature request for,
>
> svn.fs.revision_prop(..., svn.core.SVN_PROP_REVISION_LOG)
>
> - to return a unicode object instead of a bytestream?

No.

Subversion doesn't make any promises about revision properties except
that they are bytestreams. Some of them (the ones Subversion itself
manages, such as svn:log and svn:author) are "supposed to be" UTF-8
strings in order for Subversion to function properly. But those are
requirements imposed by API functions at levels higher than the svn.fs
module (or the underlying svn_fs_* C functions it wraps).
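
Since svn.fs only hands back bytes, any decoding has to happen on the
caller's side. A minimal sketch of that, assuming the property is usually
UTF-8 but may not be: the helper name decode_revprop and the choice to
fall back to replacement characters are illustrative assumptions, not
part of the Subversion bindings.

```python
def decode_revprop(raw, encoding="utf-8"):
    """Decode a revision property value that arrived as raw bytes.

    Hypothetical helper: the bindings return a bytestream, so the caller
    decides how to interpret it.
    """
    if raw is None:
        # The property is not set on this revision.
        return None
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError:
        # The stored bytes are not valid in the assumed encoding.
        # Substitute U+FFFD for the offending bytes instead of raising.
        return raw.decode(encoding, errors="replace")


# Intended use (arguments elided as in the original question):
#   raw = svn.fs.revision_prop(..., svn.core.SVN_PROP_REVISION_LOG)
#   message = decode_revprop(raw)
```

Replacing invalid bytes loses information, so a caller that knows the
repository's historical encoding could pass it as the encoding argument
instead.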

-- 
C. Michael Pilato <cmpilato_at_collab.net>
CollabNet   <>   www.collab.net   <>   Distributed Development On Demand
------------------------------------------------------
http://subversion.tigris.org/ds/viewMessage.do?dsForumId=462&dsMessageId=2357592

Received on 2009-06-01 19:01:33 CEST

This is an archived mail posted to the Subversion Dev mailing list.
