Bug 60608 - Improve charset handling in hwmf
Summary: Improve charset handling in hwmf
Status: RESOLVED FIXED
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall (show other bugs)
Version: 3.16-dev
Hardware: PC All
: P2 minor (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-01-19 19:06 UTC by Tim Allison
Modified: 2017-01-19 20:20 UTC (History)
0 users



Attachments
test file from common crawl (4.56 KB, image/x-wmf)
2017-01-19 19:06 UTC, Tim Allison
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Tim Allison 2017-01-19 19:06:58 UTC
Created attachment 34649 [details]
test file from common crawl

hwmf currently assumes CHARSET_1252 for all text.  Let's update hwmf to include the charset information made available by the fonts.
Comment 1 Tim Allison 2017-01-19 19:13:43 UTC
Unit test for attached could include, e.g.:
тона
Общо предлагане
9 278
9 636
9 935
Начална наличност
2 740
2 096
1 400
Производство
6 500
7 500
8 500
Внос
38
40
35
Общо потребление
7 182
8 236
9 500
Консумация от населението
3 200
3 628
3 800
Храна за пчелите
600
650
700
Износ
3 382
3 958
5 000
Крайна наличност
2 096
1 400
435
* - прогноза
Баланс на пчелен мед
2001
2002
2003 *
Comment 2 Tim Allison 2017-01-19 20:20:52 UTC
r1779519

small bit of refactoring to store text as bytes and convert to String based on encodings stored in Fonts during rendering.