-
-
Save IaroslavR/834066ba4c0e25a27078 to your computer and use it in GitHub Desktop.
| sudo yum install autoconf aclocal automake | |
| sudo yum install libtool | |
| sudo yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel | |
| cd ~/downloads | |
| wget http://www.leptonica.com/source/leptonica-1.72.tar.gz | |
| tar -zxvf leptonica-1.72.tar.gz | |
| cd leptonica-1.72 | |
| ./configure | |
| make | |
| sudo make install | |
| cd .. | |
| wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz | |
| tar -zxvf 3.04.00.tar.gz | |
| cd tesseract-3.04.00/ | |
| ./autogen.sh | |
| ./configure | |
| make | |
| sudo make install | |
| sudo ldconfig | |
| cd /usr/local/share/tessdata | |
| sudo wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz | |
| sudo tar xvf tesseract-ocr-3.02.eng.tar.gz | |
| sudo wget hhttp://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.01.osd.tar.gz | |
| sudo tar xvf tesseract-ocr-3.01.osd.tar.gz | |
| export TESSDATA_PREFIX=/usr/local/share/ | |
| sudo mv tesseract-ocr/tessdata/* . | |
| sudo rm tesseract-ocr-3.02.eng.tar.gz | |
| # we need osd for autorotate | |
| sudo rm tesseract-ocr-3.01.osd.tar.gz | |
| nano ~/.bash_profile | |
| # Copy this line to the end: export TESSDATA_PREFIX=/usr/local/share/ | |
| # Verify: | |
| tesseract --list-langs |
Great guide, thanks!
had to run autoreconf --force --install before step 16 because was getting "Version mismatch error. This is libtool 2.4.6," error
Also, url on line 22 is now https://sourceforge.net/projects/tesseract-ocr-alt/files/tesseract-ocr-3.02.eng.tar.gz
in my linux ami till not install tesseract please help. I am doing some step
@Sudarshan-gurav
Now we have better option. It's docker container. See https://github.com/tesseract-ocr/tesseract/wiki/4.0-Docker-Containers for more details
My amazon linux machine didn't have C++ complier by default in order to compile tesseract. Solved by installing additional libraries available in AWS:
sudo yum groupinstall "Development Tools"
Followed by:
mv download tesseract-ocr-3.01.osd.tar.gz
No package aclocal available. on Amazon Linux 2
Great guide, except I had to
export PKG_CONFIG_PATH=/usr/local/lib/pkgconfigbefore being able to./configuretesseract, and I usedcurlinstead ofwgetbecause I'm using Amazon Linux Minimal