Gnome OCR Screenshot

A simple OCR (Optical Character Recognition) tool for the GNOME desktop environment that allows you to extract text as well as scan QR codes directly automatically from screenshots.

Feb 6, 2025    #python   #gtk   #tesseract   #linux  
waves

GNOME Screenshot OCR is a simple, native tool for the GNOME desktop environment that allows users to instantly extract text or scan QR codes directly from a selected area of their screen via a screenshot.

GitHub Repository: funinkina/Gnome-OCR-Screenshot

Screenshot Demo

Why I made this?

Oftentimes I needed to copy a error or debug message from the screen but it won’t be in a nice text box, so i decided to make this simple utility to do that for me.

How it is built

This tool is primarily built using Python 3. It leverages python-gobject and GTK 4 to integrate natively with the GNOME desktop environment, specifically utilizing the GNOME screenshot portal for capturing screen regions. The core functionality relies on Python Tesseract OCR (pytesseract) for the optical character recognition part, and pyzbar (optionally) for scanning QR codes. The project emphasizes minimal dependencies and a single file structure to make it easy to set up and integrate with keyboard shortcuts.

Current features

The GNOME Screenshot OCR tool currently provides the following capabilities:



Next: DEM Super Resolution