ProjectsProject Details

Digitizing The Yerushalmi Catalogue

Project ID: 7235-2-23
Year: 2024
Student/s: Rami Halabi, Salah Abbas
Supervisor/s: Ori Bryt

Joseph Yerushalmi, a librarian at the University of Haifa Library, created a catalogue with around 65,000 records on paper cards. The catalogue contains articles from the 1940s to the 1970s, focusing on individuals like artists, writers, philosophers, intellectuals, and historical figures. the collection also includes reviews on books and literary works.

 

To preserve this valuable catalogue, digitization is needed, the project is divided to two parts:

The first part is to Detect text regions, which means classifying each region to its appropriate label: Title, Author, Text, and other.

The second part is text recognition, in this part the process of converting printed or handwritten text into digital text using software algorithms and machine learning, namely deep networks like RCNN and LSTM networks as a first option, or other deep networks like YOLO and Resnet

Poster for Digitizing The Yerushalmi Catalogue