Document Image Enhancement (DIE) model
Welcome to the Document Image Enhancement (DIE) model demo on Hugging Face!
This interactive application showcases a specialized AI model developed by the Artificial Intelligence group at the Alfréd Rényi Institute of Mathematics.
Our DIE model is designed to enhance and restore archival and aged document images by removing various types of degradation, thereby making historical documents more legible and suitable for Optical Character Recognition (OCR) processing.
The model effectively tackles 20-30 types of domain-specific noise found in historical records, such as scribbles, bleed-through text, faded or worn text, blurriness, textured noise, and unwanted background elements. By applying deep learning techniques, specifically a U-Net-based architecture, the model accurately cleans and clarifies text while preserving original details. This improved clarity dramatically boosts OCR accuracy, making it an ideal pre-processing tool in digitization workflows.
If you’re interested in learning more about the model’s capabilities or potential applications, please contact us at: gabar92@renyi.hu.