
Unicode characters in filenames on windows

From: Варфоломеев Игорь <i3v_at_mail.ru>
Date: Wed, 12 Jun 2013 04:58:18 +0400

Hi all,
I'm still not sure whether this is a bug or whether I'm doing something wrong, but I'm unable to get the
command-line svn client shipped with TortoiseSVN to work with files that have non-ASCII Unicode characters in their names.
(1.7.10 r1485443, part of TortoiseSVN 1.7.13, on Win7 x64)

I originally posted the following message here
( http://tortoisesvn.tigris.org/ds/viewMessage.do?dsForumId=4061&dsMessageId=3057388 )
but was advised
( http://tortoisesvn.tigris.org/ds/viewMessage.do?dsForumId=4061&dsMessageId=3057494 )
to redirect it to users_at_subversion.apache.org:

---------------------------------------------------------------------------------------------------------------------------------------
*** THE ISSUE ***

Workflow:

1. Create "c:\temp\UNCtest\R_UNCtest\" folder
2. Create a repository with default file structure in it
3. Checkout "trunk" dir to "c:\temp\WC\trunk"
4. Create file "c:\temp\WC\trunk\1‐2.txt".
Note that the file name (without the extension) consists of 3 characters, and the one in the middle is U+2010 "HYPHEN", or &#8208; (see http://www.fileformat.info/info/unicode/char/2010/index.htm )

5. Add and commit this file with Tortoise GUI.
(this works OK)
6. Start Windows cmd.
7. Make sure cmd is set to use a font with Unicode coverage, for example "Consolas" (see http://stackoverflow.com/questions/10764920/utf-16-on-cmd-exe/10765469#10765469 ).
8. Navigate to "c:\temp\WC\trunk".
9. Type "dir" - you should see the listing displayed correctly, including the "1‐2.txt" file.
10. Type "mkdir 1‐2" - this should correctly create a directory.
11. Type "svn info 1‐2.txt"
Result:
--------------------------------------------------
svn: warning: W155010: The node 'C:\TEMP\UNCtest\WC\trunk\1?2.txt' was not found
.

svn: E200009: Could not display info for all targets because some targets don't
exist
--------------------------------------------------
* Note that the U+2010 character (decimal 8208) was replaced with a question mark.
* Changing cmd's code page to 65001 (see http://stackoverflow.com/questions/388490/unicode-characters-in-windows-command-line-how/388500#388500 ) does not help.
* Launching cmd with the '/u' switch (see http://stackoverflow.com/questions/2182568/batch-script-is-not-executed-if-chcp-was-called/2462138#2462138 ) does not help either.
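
To illustrate the symptom, here is a minimal Python sketch. It is not svn code, and it only assumes (I have not confirmed this) that the file name passes through a legacy 8-bit code page somewhere between cmd and svn. The code pages listed are my own guesses for illustration and are not taken from the output above. It shows that U+2010 has no slot in those code pages, so a lossy conversion produces exactly the "1?2.txt" seen in the warning, while UTF-8 keeps the name intact.

```python
import unicodedata

# File name from step 4: '1', U+2010 HYPHEN, '2', then the extension.
name = "1\u20102.txt"

# Identify the middle character of the stem.
hyphen = name[1]
print(ord(hyphen), hex(ord(hyphen)), unicodedata.name(hyphen))

# Hypothetical legacy code pages (assumed for illustration only):
# Western ANSI, Cyrillic ANSI, and Cyrillic OEM.
legacy_codepages = ("cp1252", "cp1251", "cp866")

# Encode with a lossy fallback: unmappable characters become '?'.
lossy = {cp: name.encode(cp, errors="replace") for cp in legacy_codepages}
for cp, encoded in lossy.items():
    print(cp, encoded)

# UTF-8 can represent the name without loss and round-trips cleanly.
utf8 = name.encode("utf-8")
print("utf-8", utf8, utf8.decode("utf-8") == name)
```

If something like this is what happens, the question mark would be introduced before svn ever looks up the path, which would also explain why only the command-line client is affected and the Tortoise GUI is not.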

---------------------------------------------------------------------------------------------------------------------------------------
*** EXISTING ISSUES ***

I wasn’t able to find a related issue in the list
( http://subversion.tigris.org/issues/buglist.cgi )

The closest issue I’ve found is
( http://subversion.tigris.org/issues/show_bug.cgi?id=4075 )

---------------------------------------------------------------------------------------------------------------------------------------
*** EXISTING POSTS ***

Most posts I've found on this are quite outdated, though.
Whenever someone complains about Unicode support,
they are usually advised either to change their LANG setting
( http://article.gmane.org/gmane.comp.version-control.subversion.user/80036 )
, to rename the file
( http://www.svnforum.org/threads/37439-Checking-out-files-with-Unicode-characters-in-file-names )
, or to install some "+unicode_path" port
( http://superuser.com/questions/176243/problem-with-svn-filename-encoding-on-mac-os-x )

---------------------------------------------------------------------------------------------------------------------------------------
*** So... ***

* Am I doing something wrong?
* Or could this situation be treated as a bug?
* Or, maybe a “feature request”?

Thank you for your attention!

Best regards,
Igor Varfolomeev
Received on 2013-06-12 03:08:39 CEST

This is an archived mail posted to the Subversion Users mailing list.
