Apache OpenOffice (AOO) Bugzilla – Full Text Issue Listing |
Summary: | i18npool: OUString's are really utf16 strings. Attached .odt continually loops due to mismatch | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Internationalization | Reporter: | caolanm | ||||||
Component: | i18npool | Assignee: | stefan.baltzer | ||||||
Status: | CLOSED FIXED | QA Contact: | issues@l10n <issues> | ||||||
Severity: | Trivial | ||||||||
Priority: | P3 | CC: | issues, kamataki | ||||||
Version: | OOo 3.1 | ||||||||
Target Milestone: | --- | ||||||||
Hardware: | All | ||||||||
OS: | All | ||||||||
Issue Type: | PATCH | Latest Confirmation in: | --- | ||||||
Developer Difficulty: | --- | ||||||||
Issue Depends on: | |||||||||
Issue Blocks: | 102943 | ||||||||
Attachments: |
|
Description
caolanm
2009-06-18 17:13:41 UTC
Created attachment 63074 [details]
sample problematic character
Created attachment 63075 [details]
possible patch, or at least helpfully indicative
Ayay.. thanks! I don't see anything wrong with this patch at a first glance. There just may be more places where iterateCodePoints() should be used. Yeah what's changed here is probably good, just not sure if more should be changed, especially around seeing if or if not such a char should be considered in the gendict lookup table Reassigning to spare time account. In cws locales32: revision 275072 i18npool/inc/xdictionary.hxx i18npool/source/breakiterator/breakiteratorImpl.cxx i18npool/source/breakiterator/breakiterator_cjk.cxx i18npool/source/breakiterator/xdictionary.cxx Also adapted local iterateCodePoints() in breakiteratorImpl.cxx to cope with surrogates at text end. Use OUString::iterateCodePoints() in BreakIterator_CJK::getLineBreak() I actually have no idea if and how surrogates could be handled with the gendict dictionary. Reassigning to QA for verification. Verified in CWS locales32. closed, seen m60 |