Università degli Studi dell'Insubria Insubria Space
 

InsubriaSPACE - Thesis PhD Repository >
Insubria Thesis Repository >
01 - Tesi di dottorato >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10277/730

Authors: Noce, Lucia
Internal Tutor: GALLO, IGNAZIO
Title: Document image classification combining textual and visual features.
Abstract: This research contributes to the problem of classifying document images. The main addition of this thesis is the exploitation of textual and visual features through an approach that uses Convolutional Neural Networks. The study uses a combination of Optical Character Recognition and Natural Language Processing algorithms to extract and manipulate relevant text concepts from document images. Such content information are embedded within document images, with the aim of adding elements which help to improve the classification results of a Convolutional Neural Network. The experimental phase proves that the overall document classification accuracy of a Convolutional Neural Network trained using these text-augmented document images, is considerably higher than the one achieved by a similar model trained solely on classic document images, especially when different classes of documents share similar visual characteristics. The comparison between our method and state-of-the-art approaches demonstrates the effectiveness of combining visual and textual features. Although this thesis is about document image classification, the idea of using textual and visual features is not restricted to this context and comes from the observation that textual and visual information are complementary and synergetic in many aspects.
Keywords: Document image classification, convolutional neural network, natural language processing
Subject MIUR : INF/01 INFORMATICA
Issue Date: 2016
Language: eng
Doctoral course: Informatica e matematica del calcolo
Academic cycle: 29
Publisher: Università degli Studi dell'Insubria
Citation: Noce, L.Document image classification combining textual and visual features. (Doctoral Thesis, Università degli Studi dell'Insubria, 2016).

Files in This Item:

File Description SizeFormatVisibility
Phd_Thesis_Nocelucia_completa.pdftesto completo tesi13,08 MBAdobe PDFView/Open

This item is licensed under a Creative Commons License
Creative Commons


Items in InsubriaSPACE are protected by copyright, with all rights reserved, unless otherwise indicated.


Share this record
Del.icio.us

Citeulike

Connotea

Facebook

Stumble it!

reddit


 

  ICT Support, development & maintenance are provided by the AePIC team @ CILEA. Powered on DSpace Software.  Feedback