|
|
Automatically Deciding if a Document was Scanned or Photographed
|
|
|
|
|
نویسنده
|
e Silva Gabriel Pereira ,Lins Rafael Dueire ,Miro Brenno ,Simske Steven J. ,Thielo Marcelo
|
منبع
|
journal of universal computer science - 2009 - دوره : 15 - شماره : 18 - صفحه:3364 -3375
|
چکیده
|
Portable digital cameras are being used widely by students and professionals in different fields as a practical way to digitize documents. tools such as photodoc enable the batch processing of such documents, performing automatic border removal and perspective correction. a photodoc processed document and a scanned one look very similar to the human eye if both are in true color. however, if one tries to automatically binarize a batch of documents digitized from portable cameras compared to scanners, they have different features. the knowledge of their source is fundamental for successful processing. this paper presents a classification strategy to distinguish between scanned and photographed documents. over 16,000 documents were tested with a correct classification rate of over 99.96%.
|
کلیدواژه
|
Keywords: MPEG-7 ,content-based Multimedia Retrieval ,Hypermedia systems ,Web-based services ,XML ,Semantic Web ,Multimedia
|
آدرس
|
Federal University of Pernambuco, Brazil, Federal University of Pernambuco, Brazil, Federal University of Pernambuco, Brazil, HP Labs, USA, HP Labs, Brazil
|
پست الکترونیکی
|
marcelo.resende.thielo@hp.com
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|