Return to site

Deskew Scanned Pdf

broken image


If the document has been scanned into a PDF, then using Adobe Acrobat 9.0, you should be able to deskew the document as follows: 1. Open the document into Adobe Acorbat Pro 9.0 2. Click 'Document' from the top menu 3. Select Optimized Scanned PDF Note that Deskew is set to automatic so just press the OK button and the tilt should be corrected. Click and drag a line that should be horizontal in the PDF. (Alternatively, you can also click two points.) The Page Setup dialog will appear, showing a preview of the adjusted PDF. Click OK to complete the align/de-skew operation. Another fast way to deskew is to add the Deskew button to one of your tool bars.

Introduction

The most recent version of this scrip is locatedhere.

One nice way to save old notes is to scan them to a PDF. However, not all scanners makethe notes correctly orientated along with also taking up a lot of space. There areseveral methods to make notes look better likeunpaper orpdfsandwich. It is also possible to enhancenotes such as using mzucker's noteshrinkapproach or use ImageMagick to dothe conversion such as lelandbatey's whiteboardcleaner, but this last method can be quiteslow on single images.

This solution mostly focuses on black and white notes, and can be changed to use color. Itcan shrink a pdf to about 20% of its size depending on how many lines there are on thepage. It can also deskew notes that unpaper can not deskew, and can be modified veryeasily.

Examples

The image on the left is the original image (JPG 210.4 KiB, PNG 680.9 KiB) and the imageon the right is the deskewed image (PNG 171.9 KiB). The image to PDF method (img2pdf) onlyadds an additional 500 to 700 Bytes to the overall file size, and as such does not reducethe image quality (unlike ImageMagick).

Another example is when using the dark border removal, which happens when the backgroundof the scanner uses black instead of white.

Like the first example, the image on the left is the original image (JPG 354.5 KiB, PNG1.6 MiB) and the image on the right is the converted image (PNG 44.0 KiB).

Deskew Scanned Pdf

Running deskew.sh

This script uses bash, mktemp (from coreutils), pdfinfo and pdftoppm (from poppler-utils),convert (from imagemagick), img2pdf, gs (from ghostscript), and exiftool (fromlibimage-exiftool-perl).

Deskew Pdf Acrobat Dc

This tool also comes with several command line switches, along with defaults used when theswitch is not present.

Deskew Scanned Pdf Mac

  • -b is the amount of offset inpixels to change. Thedefault is 5.
  • -d is the dpi that will be used on the PDFs. The default is 325.
  • -h is the help menu.
  • -p is the PDF image file format used foreach image. The default is pgm (gray).
  • -o signifies the output file. By default, the original file has _skew added to the endof the file and the new file named after the original file.
  • -r creates a 1 pixel black border, and then changes any black pixels linked to thatborder to white.

Deskew Scanned Pdf Files

deskew.sh



broken image