Everipedia Logo
Everipedia is now IQ.wiki - Join the IQ Brainlist and our Discord for early access to editing on the new platform and to participate in the beta testing.
Optical Character Recognition (Unicode block)

Optical Character Recognition (Unicode block)

Optical Character Recognition is a Unicode block containing signal characters for OCR standards.

Optical Character Recognition
RangeU+2440..U+245F
(32 code points)
PlaneBMP
ScriptsCommon
Symbol setsOCR controls
Assigned11 code points
Unused21 reserved code points
Unicode version history
1.0.011 (+11)
Note: [2][3]

Block

Optical Character Recognition[1][2]
Official Unicode Consortium code chart [8] (PDF)
0123456789ABCDEF
U+244x
U+245x
Notes
^As of Unicode version 12.0
^Grey areas indicate non-assigned code points

Subheadings

The Optical Character Recognition block has three informal subheadings (groupings) within its character collection: OCR-A, MICR, and OCR.[4]

OCR-A

The OCR-A subheading contains six characters taken from the OCR-A font described in the ISO 1073-1:1976 standard: U+2440 ⑀ OCR HOOK, U+2441 ⑁ OCR CHAIR, U+2442 ⑂ OCR FORK, U+2443 ⑃ OCR INVERTED FORK, U+2444 ⑄ OCR BELT BUCKLE, and U+2445 ⑅ OCR BOW TIE. The OCR bow tie is given the informative alias "unique asterisk".

MICR

The MICR subheading contains four punctuation characters for bank cheque identifiers, taken from the magnetic ink character recognition E-13B font (codified in the ISO 1004:1995 standard): U+2446 ⑆ OCR BRANCH BANK IDENTIFICATION, U+2447 ⑇ OCR AMOUNT OF CHECK, U+2448 ⑈ OCR DASH, and U+2449 ⑉ OCR CUSTOMER ACCOUNT NUMBER.

The latter two characters are misnamed (their names were inadvertently switched when they were named in ISO/IEC 10646:1993).[5] Although their formal names remain unchanged due to the Unicode stability policy, they both have corrected normative aliases: U+2448 ⑈ is MICR ON US SYMBOL, and U+2449 ⑉ is MICR DASH SYMBOL[6] (the standard notes that "the Unicode character names include several misnomers").

These symbols had previously been encoded by the ISO-IR-98 encoding defined by ISO 2033:1983, in which they were simply named SYMBOL ONE through SYMBOL FOUR.[7] All four characters have informative aliases in the Unicode charts: "transit", "amount", "on us", and "dash" respectively.

OCR

The OCR subheading consists of a single character: U+244A ⑊ OCR DOUBLE BACKSLASH.

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Optical Character Recognition block:

VersionFinal code points[1]CountL2 IDWG2 IDDocument
1.0.0U+2440..244A11(to be determined)
L2/10-416R [9]Moore, Lisa (2010-11-09), "Consensus 125-C39", UTC #125 / L2 #222 Minutes,Create two formal aliases, U+2448 MICR ON US SYMBOL and U+2449 MICR DASH SYMBOL for Unicode 6.1.
N4103 [10]"T.3. Optical Character Recognition", Unconfirmed minutes of WG 2 meeting 58, 2012-01-03

References

[1]
Citation Linkopenlibrary.orgProposed code points and characters names may differ from final code points and names
Sep 29, 2019, 12:31 AM
[2]
Citation Linkwww.unicode.org"Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
Sep 29, 2019, 12:31 AM
[3]
Citation Linkwww.unicode.org"Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
Sep 29, 2019, 12:31 AM
[4]
Citation Linkwww.unicode.org"Unicode Code Charts: Optical Character Recognition" (PDF). The Unicode Standard, Version 6.3. Retrieved 27 February 2014.
Sep 29, 2019, 12:31 AM
[5]
Citation Linkwww.unicode.orgISO/IEC JTC 1/SC 2/WG 2 (2012-01-03). "T.3. Optical Character Recognition". Unconfirmed minutes of WG 2 meeting 58 (PDF). p. 29. SC2 N4188 / WG2 N4103.
Sep 29, 2019, 12:31 AM
[6]
Citation Linkwww.unicode.orgFreytag, Asmus; McGowan, Rick; Whistler, Ken (2017-04-10). Known Anomalies in Unicode Character Names (4 ed.). Unicode Consortium. Unicode Technical Note #27.
Sep 29, 2019, 12:31 AM
[7]
Citation Linkwww.itscj.ipsj.or.jpISO/TC97/SC2 (1985-08-01). "ISO-IR-98: A set of 14 graphic characters of the E13B font" (PDF). ITSCJ/IPSJ.
Sep 29, 2019, 12:31 AM
[8]
Citation Linkwww.unicode.orgOfficial Unicode Consortium code chart
Sep 29, 2019, 12:31 AM
[9]
Citation Linkwww.unicode.orgL2/10-416R
Sep 29, 2019, 12:31 AM
[10]
Citation Linkwww.unicode.orgN4103
Sep 29, 2019, 12:31 AM
[11]
Citation Linkwww.unicode.org"Unicode character database"
Sep 29, 2019, 12:31 AM
[12]
Citation Linkwww.unicode.org"Enumerated Versions of The Unicode Standard"
Sep 29, 2019, 12:31 AM
[13]
Citation Linkwww.unicode.org"Unicode Code Charts: Optical Character Recognition"
Sep 29, 2019, 12:31 AM
[14]
Citation Linkwww.unicode.orgUnconfirmed minutes of WG 2 meeting 58
Sep 29, 2019, 12:31 AM
[15]
Citation Linkwww.unicode.orgKnown Anomalies in Unicode Character Names
Sep 29, 2019, 12:31 AM
[16]
Citation Linkwww.itscj.ipsj.or.jp"ISO-IR-98: A set of 14 graphic characters of the E13B font"
Sep 29, 2019, 12:31 AM
[17]
Citation Linken.wikipedia.orgThe original version of this page is from Wikipedia, you can edit the page right here on Everipedia.Text is available under the Creative Commons Attribution-ShareAlike License.Additional terms may apply.See everipedia.org/everipedia-termsfor further details.Images/media credited individually (click the icon for details).
Sep 29, 2019, 12:31 AM