On Wed, Jan 17, 2007 at 03:17:01PM -0800, Ross Boylan wrote:
> I have GNU/Linux system with LANG=en_US and another with
> LANG=en_US.UTF-8.
> 
> A program produces some output, which is compared to an expected
> output file (e.g., test1.Rout.save).  The latter is kept in
> subversion. 
> 
> The output differs between the two systems because of the encoding
> differences.
> 
> Is there anything I can do to make this work more smoothly?  In
> particular, it would be nice if the file (which is text) were created
> with the correct encoding by the svn client.
> 
> The docs indicate that the server stores all filenames and logs as
> UTF-8, but files appear to be kept as unmodified binaries.  mime-type
> can affect comparisons and eol-style can affect end of lines, but I
> don't see anything dealing with the encoding per se (even if it is
> part of the MIME type, and I'm not sure it is).
The MIME specification provides a charset parameter for text files
that seems to cover the encoding.  Given subversion's current
approach, I assume that parameter is ignored, and it might even
confuse the code that decides whether a file is text or binary.
Ross
> 
> I know that historically version control systems attempt to be clever
> about their files has often led to problems, but it seems to me
> conversions between encoding schemes are fairly well-defined, and
> would be useful.  (Yes, I know some conversions are impossible!).
> 
> Thanks.
> Ross Boylan
> 
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Fri Jan 19 20:17:30 2007