bzr branch http://suren.me/webbzr/rusxmms/librcd
14 by Suren A. Chilingaryan
GPL disclaimers are added to all source files
Library for autodetection of the charset of Russian text

LibRCD is used by the RusXMMS project for encoding auto-detection. It is optimized
to handle very short titles, such as ID3 tags and file names, and provides
very high accuracy even for short 3-4 letter words. The current version supports
the Russian and Ukrainian languages and is able to distinguish UTF-8, KOI8-R, CP1251,
CP866, and ISO8859-1. Compared with Enca, LibRCD provides better detection
accuracy on short titles and is able to detect the ISO8859-1 (non-Cyrillic)
encoding, which allows correct ID3v1 titles to be displayed properly.
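
To give a feel for the problem of telling these encodings apart on a short title, here is a minimal Python sketch. It is not LibRCD's algorithm or API: the function name `guess_charset`, the scoring rule, and the thresholds are all assumptions made for illustration. It relies on the fact that lowercase Russian letters occupy different byte ranges in KOI8-R, CP1251, and CP866, so decoding with the wrong charset tends to produce uppercase letters or box-drawing symbols instead.

```python
# Hypothetical illustration only; this is NOT LibRCD's algorithm or API.
CYRILLIC_CANDIDATES = ("koi8-r", "cp1251", "cp866")
LOWER_RUSSIAN = set("абвгдеёжзийклмнопрстуфхцчшщъыьэюя")


def guess_charset(data: bytes) -> str:
    """Guess the encoding of a short title using crude heuristics."""
    # Step 1: bytes that decode as strict UTF-8 are taken at face value.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        pass

    # Step 2 (assumed rule): if ASCII letters outnumber high bytes, treat
    # the text as mostly Latin with a few accented letters.
    high = sum(1 for b in data if b >= 0x80)
    ascii_letters = sum(1 for b in data if b < 0x80 and chr(b).isalpha())
    if ascii_letters > high:
        return "iso8859-1"

    # Step 3: decode with each Cyrillic candidate; reward lowercase Russian
    # letters and penalize every other non-ASCII character.
    def score(encoding: str) -> int:
        text = data.decode(encoding, errors="replace")
        good = sum(1 for ch in text if ch in LOWER_RUSSIAN)
        bad = sum(1 for ch in text
                  if ord(ch) >= 0x80 and ch not in LOWER_RUSSIAN)
        return good - bad

    return max(CYRILLIC_CANDIDATES, key=score)


if __name__ == "__main__":
    for enc in ("utf-8", "koi8-r", "cp1251", "cp866"):
        print(enc, "->", guess_charset("привет".encode(enc)))
```

A heuristic this simple is easily fooled by all-uppercase titles or by Ukrainian-specific letters, which hints at why a dedicated detector is useful for short strings.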