Groups > WordPerfect Office > WordPerfect for Linux > Re: Unicode in WP - not, and the reasons




Re: Unicode in WP - not, and the reasons

Re: Unicode in WP - not, and the reasons
Fri, 04 Jan 2008 10:12:32 -050
Andreas von Heydwolff wrote:
> 
>
http://www.wpuniverse.com/vb/showthread.php?s=2ced1a86231c7867e9f20d4ea93e2792&a
mp;threadid=16431 
> 
> 
> Dear all,
> 
> as an occasional lurker using WP9 in VMware now and then I have found 
> what is for me part of the answer why the Linux product was dropped. I 
> haven't seen it mentioned here but please excuse me if I am repeating 
> the obvious.
> 
> The thread at the URL above explains nicely why WP probably will never 
> get unicode support. Therefore there will be no easy compatibility with 
> modern Linux distros, as far as international usage is concerned.
> 
> My current issue is that the mail merge feature has always been so much 
> better than the OO for Linux solution which I find tedious to use. But 
> at least for me needing German and other special characters on a daily 
> basis exporting the kaddressbook entries to csv and importing this file 
> into WP just doesn't work well. The umlauts get garbled or omitted.
> 
> BTW, does WP X3 have a feature to define masks for importing address 
> data from other sources into its address book? Not having to re-define 
> import filters (field names etc.) every time when an updated data set is 
> being imported would be helpful. A script could do the charset 
> conversion in the Linux csv file and importing it might then just be a 
> few clicks away.
> 
> Greetings,
> 
> Andreas
> 
Andreas: thanks for posting this and the informative link.

I taught German for ten years and used WP 6.x DOS extensively in 
preparing my materials.  It was easy for me to insert umlauts and esszet
by using alt + ascii code on the numeric key pad, e.g. umlaut o = alt 
148.  Moving to WP8/Linux slowed things down by having to use the insert 
character menu.
-- 
Leon A. Goldstein

Powered by Libranet 2.8.1 Debian Linux
System G2

Post Reply
Unicode in WP - not, and the reasons
Fri, 04 Jan 2008 15:24:29 +010
http://www.wpuniverse.com/vb/showthread.php?s=2ced1a86231c7867e9f20d4ea93e2792&a
mp;threadid=16431

Dear all,

as an occasional lurker using WP9 in VMware now and then I have found 
what is for me part of the answer why the Linux product was dropped. I 
haven't seen it mentioned here but please excuse me if I am repeating 
the obvious.

The thread at the URL above explains nicely why WP probably will never 
get unicode support. Therefore there will be no easy compatibility with 
modern Linux distros, as far as international usage is concerned.

My current issue is that the mail merge feature has always been so much 
better than the OO for Linux solution which I find tedious to use. But 
at least for me needing German and other special characters on a daily 
basis exporting the kaddressbook entries to csv and importing this file 
into WP just doesn't work well. The umlauts get garbled or omitted.

BTW, does WP X3 have a feature to define masks for importing address 
data from other sources into its address book? Not having to re-define 
import filters (field names etc.) every time when an updated data set is 
being imported would be helpful. A script could do the charset 
conversion in the Linux csv file and importing it might then just be a 
few clicks away.

Greetings,

Andreas
Post Reply
Re: Unicode in WP - not, and the reasons
Mon, 14 Jan 2008 17:13:35 -070
Leon A. Goldstein wrote:
> 
> Andreas von Heydwolff wrote:
>>
>>
http://www.wpuniverse.com/vb/showthread.php?s=2ced1a86231c7867e9f20d4ea93e2792&a
mp;threadid=16431 
>>
>>
>> Dear all,
>>
>> as an occasional lurker using WP9 in VMware now and then I have found 
>> what is for me part of the answer why the Linux product was dropped. I

>> haven't seen it mentioned here but please excuse me if I am repeating 
>> the obvious.
>>
>> The thread at the URL above explains nicely why WP probably will never

>> get unicode support. Therefore there will be no easy compatibility 
>> with modern Linux distros, as far as international usage is concerned.
>>
>> My current issue is that the mail merge feature has always been so 
>> much better than the OO for Linux solution which I find tedious to 
>> use. But at least for me needing German and other special characters 
>> on a daily basis exporting the kaddressbook entries to csv and 
>> importing this file into WP just doesn't work well. The umlauts get 
>> garbled or omitted.
>>
>> BTW, does WP X3 have a feature to define masks for importing address 
>> data from other sources into its address book? Not having to re-define

>> import filters (field names etc.) every time when an updated data set 
>> is being imported would be helpful. A script could do the charset 
>> conversion in the Linux csv file and importing it might then just be a

>> few clicks away.
>>
>> Greetings,
>>
>> Andreas
>>
> Andreas: thanks for posting this and the informative link.
> 
> I taught German for ten years and used WP 6.x DOS extensively in 
> preparing my materials.  It was easy for me to insert umlauts and esszet
> by using alt + ascii code on the numeric key pad, e.g. umlaut o = alt 
> 148.  Moving to WP8/Linux slowed things down by having to use the insert 
> character menu.

It would be nice if it supported a compose key and dead keys.  This 
would actually mean that it used X for the keyboard decoding and 
wouldn't really care how X managed to produce the key codes.

AFA Unicode is concerned.  I found the argument useless.  They presumed 
that it would have to be based on 16 bits.  This is simply wrong. 
Unicode usually means UTF-8 which is NOT 16 bits per character.  I think 
that it would be easy to convert WP to UTF-8 and UT16.  You have a first 
bite indicating WP coding and a code for the "Code Page" to use, then

the second byte is the character code.  If there is room for two more 
Code Page codes, it should work.  The only modification would be that it 
would have to accept a variable number of bytes (instead of one) for the 
character code.  IIRC, this would be necessary for UTF-8 which uses a 
variable number of bytes.  It uses only one byte for the first 128 
characters of ISO 8859-1 and then uses a Huffman code like system to use 
various lengths of byte strings for other codes.  UT16 is similar except 
that it uses 2 byte words (16 bits) and can represent *all* Unicode 
characters.  But in both cases, WP would have to use the character 
decoding algorithm and use as many bytes or words as necessary.

The problem here is that both types of Unicode Transformation Formats 
only work with a stream.  If WP doesn't read text objects as streams, 
then there would be major problems.

They are going to have to do a major rewrite anyway if they are going to 
be able to use one of the XML formats.

-- 
JRT
Post Reply
about | contact