Skip to content

ttv20/gDriveOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gDriveOCR

Version License

Node CLI program for OCR large PDF documents

Purpose

Let you OCR (extract text from image) from PDF by Google Drive API

Table of Contents

  1. Installation
  2. How to get credentials file
  3. Usage
  4. TODO

installation

# npm install -g gdriveocr

How to get credentials file

to use this program you need to enable the Drive API on your Google account:

  • Go to Google Drive API
  • Click on 'ENABLE THE DRIVE API'
  • Download the cert file by clicking on 'DOWNLOAD CLIENT CONFIGURATION'

Usage

Usage: gdriveocr [options] <file>

OCR by Google Drive API

Options:
  -V, --version      output the version number
  -l, --lang <lang>  2 digit language language hint for the OCR (if not specifid Google recognized alone)
  -c, --cert <cert>  Google API cert file path (default: "./credentials.json")
  -h, --help         output usage information

TODO

  • support multiple files and images
  • better error handling
  • API