
Re: UTF-8 (was: Re: property names)

From: Greg Hudson <ghudson_at_MIT.EDU>
Date: 2000-12-22 00:44:38 CET

> Forget about the C code, what about the memory? A 1000 byte file
> requires 2000 bytes of memory in a Unicode representation. If each
> character required 32 bits of memory, a 1 meg file would require 4
> megs of system memory. That is just crazy!

We never read whole files into memory.
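
To illustrate the streaming point, here is a minimal sketch (not Subversion's actual code; the name `checksum_stream` and the 64 KiB buffer size are assumptions made for this example). It processes a file in fixed-size chunks, so memory use stays bounded by the buffer size rather than the file size:

```python
import hashlib
import io

CHUNK_SIZE = 64 * 1024  # hypothetical buffer size, chosen for illustration


def checksum_stream(stream, chunk_size=CHUNK_SIZE):
    """Return the SHA-256 hex digest of a binary stream, read chunk by chunk."""
    digest = hashlib.sha256()
    while True:
        chunk = stream.read(chunk_size)  # at most chunk_size bytes in memory
        if not chunk:                    # empty bytes object means end of stream
            break
        digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # An in-memory stream stands in for a large file on disk.
    data = io.BytesIO(b"opaque octets " * 100_000)
    print(checksum_stream(data))
```

The same loop works for a file opened with `open(path, "rb")`; only one chunk is held at a time.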

And anyway, file contents will always be opaque octets for us. UTF-8
applies to the other bits of data we pass around.
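
The size arithmetic in the quoted message can be checked with a short sketch (the helper name `encoded_sizes` is hypothetical, and this is only an illustration of the encodings, not of anything in Subversion). It encodes a string under three encodings and counts the bytes, while file contents stay as raw `bytes` that are never decoded:

```python
def encoded_sizes(text):
    """Return the encoded length in bytes of `text` under three encodings."""
    return {
        "utf-8": len(text.encode("utf-8")),
        # The "-le" variants skip the byte-order mark, so only the
        # characters themselves are counted.
        "utf-16": len(text.encode("utf-16-le")),
        "utf-32": len(text.encode("utf-32-le")),
    }


if __name__ == "__main__":
    # 1000 ASCII characters: 1000 bytes in UTF-8, 2000 in UTF-16, 4000 in UTF-32.
    print(encoded_sizes("a" * 1000))

    # File contents are treated as opaque octets: no decoding, no size change.
    contents = bytes([0x00, 0xFF, 0x80, 0x7F])
    print(len(contents))
```

For ASCII-only metadata such as a typical property name, UTF-8 costs nothing extra over one byte per character.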
Received on Sat Oct 21 14:36:18 2006

This is an archived mail posted to the Subversion Dev mailing list.
