Configure FuzzyOCR

We will be storing the image hashes in a mysql database to improve on performance such that images that we have already scanned do not get scanned again as OCR is a resource intense activity.

Create mysql Database

  • The sql script creates the database and tables and adds a user fuzzyocr with the password fuzzyocr
 # mysql -p < /usr/local/src/devel/FuzzyOcr.mysql
  • Change the password
 # mysqladmin -u fuzzyocr -p password

Basic settings

  • Edit /etc/mail/spamassassin/FuzzyOCR.cf and set the basic options
focr_path_bin /usr/bin:/usr/local/bin
focr_minimal_scanset 1
focr_autosort_scanset 1
focr_enable_image_hashing 3
focr_logfile /tmp/FuzzyOcr.log

Make FuzzyOCR use the database

  • Edit the file /etc/mail/spamassassin/FuzzyOcr.cf and add
focr_mysql_db FuzzyOcr
focr_mysql_hash Hash
focr_mysql_safe Safe
focr_mysql_user fuzzyocr
focr_mysql_pass password
focr_mysql_host localhost
focr_mysql_port 3306
focr_mysql_socket /var/lib/mysql/mysql.sock
 
 
configure_fuzzyocr.txt · Last modified: 2008/02/14 04:32 by topdog
 
Recent changes RSS feed Valid XHTML 1.0 Valid CSS Driven by DokuWiki