Home Assistant integration to extract text from digital and mechanical displays using the AWS Rekognition computer vision service.

APACHE-2.0 License
This integration adds an entity whose state is the text detected in the camera image. A region of interest (`roi`) should be used to select the region of the image containing the text you wish to read. Optionally, various processing can be performed to help improve detection; you should experiment with these options if you are experiencing errors in the detected text. The processing options are:

- `make_bw`: converts the image to black and white before processing.
- `erode`: can be used for pixelated LCD screens, to erode discrete pixels into single characters; start with `low`.
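The two processing steps can be sketched in pure Python on a grayscale pixel grid. This is an illustrative sketch, not the integration's actual internals: the function names and the 3x3 minimum filter are assumptions.

```python
def make_bw(pixels, threshold=128):
    """Threshold a grayscale image (rows of 0-255 values) to black (0) and white (255)."""
    return [[0 if p < threshold else 255 for p in row] for row in pixels]

def erode(pixels):
    """Grow black regions by one pixel (a 3x3 minimum filter), so the
    discrete dots of a pixelated LCD digit merge into solid strokes."""
    h, w = len(pixels), len(pixels[0])
    out = [[255] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            out[y][x] = min(
                pixels[ny][nx]
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
            )
    return out

# Two dark dots separated by a light gap, as on a pixelated LCD segment:
img = make_bw([
    [200, 200, 200, 200, 200],
    [200,  30, 200,  30, 200],
    [200, 200, 200, 200, 200],
])
print(erode(img)[1])  # the gap between the dots is filled: [0, 0, 0, 0, 0]
```

Stronger erosion (the `medium` and `high` settings) would correspond to applying a larger or repeated filter.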
Place the `custom_components` folder in your configuration directory (or add its contents to an existing `custom_components` folder), then configure the integration.
Example config:

```yaml
image_processing:
  - platform: amazon_rekognition_text
    aws_access_key_id: yours
    aws_secret_access_key: yours
    region_name: eu-west-1 # optional region, default is us-east-1
    roi_x_min: 0.35
    roi_x_max: 0.83
    roi_y_min: 0.7
    roi_y_max: 0.9
    save_file_folder: /config/rekognition/
    save_timestamped_file: True
    unit_of_measurement: £
    source:
      - entity_id: camera.local_file
```
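Under the hood, text detection is provided by Rekognition's `detect_text` API (via boto3). As a rough sketch of how a response reduces to an entity state — the sample response and the joining of `LINE` detections are illustrative, not the integration's exact logic:

```python
def text_from_response(response):
    """Join the LINE-level detections from a Rekognition detect_text
    response into a single string for the entity state."""
    return " ".join(
        d["DetectedText"]
        for d in response["TextDetections"]
        if d["Type"] == "LINE"
    )

# Abbreviated sample response in the shape documented for boto3's
# rekognition.detect_text (Rekognition returns both LINE and WORD entries):
sample = {
    "TextDetections": [
        {"DetectedText": "12.34", "Type": "LINE", "Confidence": 99.1},
        {"DetectedText": "12.34", "Type": "WORD", "Confidence": 99.1},
    ]
}
print(text_from_response(sample))  # 12.34
```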
Configuration variables:

- `only_numbers`: if `True`, attempts to return only the numbers detected.
- `make_bw`: if `True`, converts the image to black and white before processing.
- `erode`: (`low`, `medium` or `high`) useful for merging black pixels.
- `save_timestamped_file`: (default `False`, requires `save_file_folder` to be configured) save the processed image with the time of detection in the filename.

For the `roi`, the (x=0, y=0) position is the top left pixel of the image, and the (x=1, y=1) position is the bottom right pixel. It might seem a bit odd to have y running from top to bottom of the image, but that is the coordinate system used by Pillow. A Streamlit app is provided to help with configuration of the ROI values, documented at the end of this readme. Note that to view the configured `roi` you must configure `save_file_folder` and view the latest saved image, which can be displayed on the HA UI with a local_file camera.
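As a sketch of how the fractional `roi` values map onto pixels, assuming Pillow's `(left, upper, right, lower)` crop-box convention — the function name here is hypothetical:

```python
def roi_to_box(width, height, x_min, y_min, x_max, y_max):
    """Convert fractional ROI coordinates (0.0-1.0, with y increasing
    downwards) into an integer pixel box (left, upper, right, lower)
    suitable for Pillow's Image.crop."""
    return (
        round(x_min * width),
        round(y_min * height),
        round(x_max * width),
        round(y_max * height),
    )

# The roi values from the example config, on a 1280x720 camera frame:
print(roi_to_box(1280, 720, 0.35, 0.7, 0.83, 0.9))  # (448, 504, 1062, 648)
```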
Example config for displaying the latest saved image:

```yaml
camera:
  - platform: local_file
    name: rekognition_text
    file_path: /config/rekognition/rekognition_text_local_file_1_latest.png
```
A Streamlit app is available to help with config. To use a hosted version go to:

Or run locally following the instructions below:
```
python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements-app.txt
streamlit run streamlit_app.py
```