Bedo Sandor #1
utf8 -> latin2
Hi,
How can I convert utf-8 encoded strings to latin-2?
I have tried it using libuconv with little success:
require 'uconv'
class String
  def un_utf8
    Uconv.u8tou16(self).gsub(/\000/, '')
  end
  def to_utf8
    tmp = ""
    self.each_byte { |b|
      tmp += b.chr + "\000"
    }
    Uconv.u16tou8(tmp)
  end
end
This program is ugly, and does not do exactly what I want.
u8tou16 generates a string of 16-bit characters,
for example "test".un_utf8 == "t\000e\000s\000t\000"
before the gsub. gsub removes the unnecessary "\000"
bytes from the string. But there are Hungarian characters
that have a non-zero second byte in the output of
u8tou16, so they fail to convert. Anyway, this is an
ugly hack.
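To make the failure concrete, here is a minimal sketch of the byte layout involved. It assumes Ruby 1.9 or later and uses String#encode rather than uconv, so it only illustrates the UTF-16 representation, not the uconv calls above. The helper name utf16le_bytes is made up for this example.

```ruby
# Sketch: why stripping "\000" bytes from UTF-16 output breaks Hungarian text.
# Assumption: String#encode (Ruby 1.9+) stands in for Uconv.u8tou16 here.

# Hypothetical helper: return the UTF-16 (little-endian) bytes of a string.
def utf16le_bytes(str)
  str.encode("UTF-16LE").bytes
end

ascii_bytes     = utf16le_bytes("t")       # second byte is zero
hungarian_bytes = utf16le_bytes("\u0151")  # U+0151, second byte is non-zero

puts ascii_bytes.inspect
puts hungarian_bytes.inspect
```

For "t" the second byte is zero, so deleting it happens to leave the right Latin-2 byte. For U+0151 (o with double acute) the second byte is 0x01, so it survives the gsub and the result is two bytes that are not the Latin-2 encoding of that character.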
How is it done nicely?
--
bSanyI
Wesley J Landaker #2
Re: utf8 -> latin2
On Friday 14 November 2003 7:46 am, Bedo Sandor wrote:
> How can I convert utf-8 encoded strings to latin-2?
> [...]
> How is it done nicely?
I think the iconv module handles this nicely:
require 'iconv'
# Iconv.conv takes the target encoding first, then the source encoding.
Iconv.conv("latin2", "utf-8", "this is a test")
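For readers on Ruby 1.9 or later (iconv was dropped from the standard library in Ruby 2.0), the same conversion can be sketched with the built-in String#encode. This is an alternative to the iconv call above, not part of the original answer, and the helper names utf8_to_latin2 and latin2_to_utf8 are made up for illustration.

```ruby
# Sketch: UTF-8 <-> Latin-2 (ISO-8859-2) using String#encode (Ruby 1.9+).
# The helper names below are hypothetical, chosen for this example only.

# Convert a UTF-8 string to ISO-8859-2.
# Raises Encoding::UndefinedConversionError for characters Latin-2 lacks.
def utf8_to_latin2(str)
  str.encode("ISO-8859-2", "UTF-8")
end

# Convert an ISO-8859-2 string back to UTF-8.
def latin2_to_utf8(str)
  str.encode("UTF-8", "ISO-8859-2")
end

hungarian  = "\u0151r"                 # U+0151 followed by "r"
latin2     = utf8_to_latin2(hungarian)
round_trip = latin2_to_utf8(latin2)

puts latin2.bytes.inspect              # one byte per character in Latin-2
puts round_trip == hungarian
```

If the input may contain characters outside Latin-2, passing undef: :replace to encode substitutes a replacement character instead of raising.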
--
Wesley J. Landaker - wjl@icecavern.net