Open
Labels
enhancement, good first issue
Description
This is a running list of all the (mildly to extremely) cursed encodings, and whether or not we should implement them. More can be suggested on Twitter. Here goes:
- UNIVAC encoding (Section 1, part 2, PDF page 9 http://www.bitsavers.org/pdf/univac/418/UP-7599r1_rtosAssemb_Jul70.pdf)
- ISO 8859-1
- ISO 8859-15
- ISO/IEC 2022 Encodings (https://en.wikipedia.org/wiki/ISO/IEC_2022)
- ISO/IEC 646 Encodings (https://en.wikipedia.org/wiki/ISO/IEC_646)
- DOS Codepages (https://www.aivosto.com/articles/charsets-codepages-dos.html#codepage861)
- MULE_INTERNAL (Multilanguage Emacs internal encoding): Garbage encoding for an even more garbage text editor.
- PETSCII (with state for lower/upper mapping based on literal "SHIFT" button state)
- ATASCII (with state for lower/upper mapping based on literal "SHIFT" button state)
- SHIFT-JIS (already implemented in example code)
- Tatar (Missing encodings: Tatar #15)
- UTF-EBCDIC: This may be patent-encumbered or license-checked, and therefore cannot be implemented.
- UTF-7: This may be patent-encumbered or license-prohibited, and therefore cannot be implemented.
- UTF-7-IMAP: This may be patent-encumbered or license-prohibited, and therefore cannot be implemented.
- UTF-1: Not a good encoding.
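
For the PETSCII and ATASCII entries, the "with state" remark means the same byte can decode to different characters depending on a shift flag carried between calls. A minimal sketch of that idea follows, in Python for brevity. The control bytes `SHIFT_ON` and `SHIFT_OFF`, the `ShiftState` type, and the letter mapping are all placeholders invented for illustration; they are not the real PETSCII or ATASCII tables, and this is not this library's API.

```python
from dataclasses import dataclass

# Hypothetical control bytes for this sketch only; not from any real character set.
SHIFT_ON = 0x0E
SHIFT_OFF = 0x0F


@dataclass
class ShiftState:
    """State carried across calls: whether the (hypothetical) shift mode is active."""
    shifted: bool = False


def decode_one(byte: int, state: ShiftState) -> str:
    """Decode a single byte, updating the state. Control bytes produce no output."""
    if byte == SHIFT_ON:
        state.shifted = True
        return ""
    if byte == SHIFT_OFF:
        state.shifted = False
        return ""
    if 0x41 <= byte <= 0x5A:
        # Placeholder mapping: letters come out lowercase while shifted.
        ch = chr(byte)
        return ch.lower() if state.shifted else ch
    if byte < 0x80:
        return chr(byte)
    # Anything outside the toy table becomes the replacement character.
    return "\ufffd"


def decode(data: bytes) -> str:
    """Decode a whole byte string by threading one state object through every step."""
    state = ShiftState()
    return "".join(decode_one(b, state) for b in data)
```

Calling `decode(b"HI\x0eHI\x0fHI")` shows the same two bytes decoding differently before, during, and after the shifted span. The point of the sketch is only that the state object has to outlive each single-step call.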
Some that might not be possible within the framework of this library:
- Early Cangjie input method translation: this is more so a system of input that is then converted to characters, rather than a character set itself. It also seems to have a (potentially?) unbounded set of inputs that can produce an equally wild amount of outputs, making the `encode_one`/`decode_one` limitations potentially useless? Needs more research.
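
To make the concern concrete, here is a rough sketch of a single-step decoder over key sequences, assuming each step may consume at most a fixed number of input units. The table `TOY_TABLE`, the bound `MAX_INPUT_UNITS`, and both function signatures are hypothetical and exist only for this illustration; none of them come from this library or from a real input-method table.

```python
from typing import Tuple

# Toy key-sequence table (illustrative only, not a real input-method table).
TOY_TABLE = {"a": "日", "b": "月", "ab": "明"}

# Hypothetical per-call bound on how many input units one step may consume.
MAX_INPUT_UNITS = 2


def decode_one(keys: str, pos: int) -> Tuple[str, int]:
    """Match the longest key sequence starting at pos, within the bound.

    Returns (character, new position); raises ValueError when nothing matches.
    """
    longest = min(MAX_INPUT_UNITS, len(keys) - pos)
    for length in range(longest, 0, -1):
        chunk = keys[pos:pos + length]
        if chunk in TOY_TABLE:
            return TOY_TABLE[chunk], pos + length
    raise ValueError(f"no bounded match at position {pos}")


def decode(keys: str) -> str:
    """Decode a whole key string by repeating the single-step function."""
    out, pos = [], 0
    while pos < len(keys):
        ch, pos = decode_one(keys, pos)
        out.append(ch)
    return "".join(out)
```

This works only because every entry in the toy table fits inside `MAX_INPUT_UNITS`. If one output could require an arbitrarily long key sequence, no fixed bound would be large enough, which is the open question above.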