readmrz

Machine readable zone (MRZ) reader for ID cards and passports: https://pypi.org/project/readmrz

MIT License



readmrz detects the machine readable zone on ID cards and extracts the text in that zone.

This zone contains the name, surname, date of birth, and other details of the person to whom the document was issued. Its layout is standardized (ICAO Doc 9303) across new-generation identity cards and passports.

In short, readmrz is a tool for reading the MRZ on identity cards and passports.
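As an illustration of that standardized layout, the sketch below decodes an MRZ name field as ICAO Doc 9303 describes it: `<` is a filler character, `<<` separates the surname from the given names, and a single `<` stands for a space. The function name `split_mrz_name` is hypothetical and not part of readmrz, and the input string is constructed here for illustration only.

```python
from __future__ import annotations


def split_mrz_name(field: str) -> tuple[str, str]:
    """Split an MRZ name field into (surname, given names)."""
    # Trailing '<' characters are padding up to the fixed field width.
    trimmed = field.rstrip("<")
    # '<<' separates the primary identifier (surname) from the given names.
    surname, _, given = trimmed.partition("<<")
    # Inside each part, a single '<' stands for a space.
    return surname.replace("<", " "), given.replace("<", " ")


# Illustrative input, not taken from a real document scan.
print(split_mrz_name("STEARNE<<JOHN<TIMOTHY<KELLY<<<<<<<<<<<<"))
# ('STEARNE', 'JOHN TIMOTHY KELLY')
```

This only shows how the raw zone maps to readable fields; readmrz may handle the decoding differently internally.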

Install

Install Tesseract before installing the package.

On macOS,

$ brew install tesseract

On Ubuntu,

$ sudo apt-get install -y tesseract-ocr

On Windows,

$ choco install tesseract

Then you can install the latest release,

$ pip install readmrz

Usage

>>> import json
>>> from readmrz import MrzDetector, MrzReader

>>> detector = MrzDetector()
>>> reader = MrzReader()

>>> image = detector.read('/path/to/file')
>>> cropped = detector.crop_area(image)
>>> result = reader.process(cropped)
>>> print(json.dumps(result, indent=4))
{
    "surname": "STEARNE",
    "name": "JOHN TIMOTHY KELLY",
    "country": "CAN",
    "nationality": "CAN",
    "birth_date": "580702",
    "expiry_date": "240904",
    "sex": "M",
    "document_type": "P",
    "document_number": "GA302922",
    "optional_data": "",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
    "final_hash": "2"
}

or, to read the image from a URL,

>>> import json
>>> from readmrz import MrzDetector, MrzReader

>>> detector = MrzDetector()
>>> reader = MrzReader()

>>> image = detector.read_from_url('/url/to/image')
>>> cropped = detector.crop_area(image)
>>> result = reader.process(cropped)
>>> print(json.dumps(result, indent=4))
{
    "surname": "STEARNE",
    "name": "JOHN TIMOTHY KELLY",
    "country": "CAN",
    "nationality": "CAN",
    "birth_date": "580702",
    "expiry_date": "240904",
    "sex": "M",
    "document_type": "P",
    "document_number": "GA302922",
    "optional_data": "",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
    "final_hash": "2"
}

The result is returned as a dict, so the fields are easy to access. You can also use the command line,

$ readmrz -f /path/to/file

or with a URL,

$ readmrz -u /url/to/image
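The `*_hash` values in the sample output above are consistent with the check digits defined by ICAO Doc 9303: characters are weighted 7, 3, 1 repeating, digits count as themselves, `A` to `Z` count as 10 to 35, `<` counts as 0, and the check digit is the sum modulo 10. Assuming that is what these fields hold, a minimal sketch for validating a result dict could look like this. The helper names `check_digit` and `verify_fields` are hypothetical and not part of readmrz.

```python
def check_digit(field: str) -> str:
    """Compute the ICAO Doc 9303 check digit of an MRZ field."""
    weights = (7, 3, 1)
    total = 0
    for i, ch in enumerate(field):
        if "0" <= ch <= "9":
            value = int(ch)
        elif "A" <= ch <= "Z":
            value = ord(ch) - ord("A") + 10
        elif ch == "<":
            value = 0  # filler contributes nothing to the sum
        else:
            raise ValueError(f"invalid MRZ character: {ch!r}")
        total += value * weights[i % 3]
    return str(total % 10)


def verify_fields(result: dict) -> dict:
    """Compare each field's computed check digit with its *_hash value.

    Assumes the *_hash keys hold ICAO check digits (not confirmed by readmrz).
    Trailing '<' padding has value 0, so unpadded fields give the same digit.
    """
    names = ("document_number", "birth_date", "expiry_date")
    return {n: check_digit(result[n]) == result[f"{n}_hash"] for n in names}


# Values copied from the sample output in the Usage section.
sample = {
    "birth_date": "580702",
    "expiry_date": "240904",
    "document_number": "GA302922",
    "birth_date_hash": "0",
    "expiry_date_hash": "3",
    "document_number_hash": "0",
}
print(verify_fields(sample))
# {'document_number': True, 'birth_date': True, 'expiry_date': True}
```

The composite `final_hash` is left out of this sketch because it is computed over several fields of the full second MRZ line, which is not part of the sample output. A mismatch on any field usually points to an OCR misread rather than an invalid document.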

Example

Please check the notebook to see the results step by step.

Contribution

Please check the pylint and flake8 steps in the workflow before contributing.
