Skip navigation
MarkHines
Currently Being Moderated

Need to Optimize Files with JPEG2000 Compression

Jul 31, 2012 2:04 PM

Tags: #scanned_pdf

I have 250 scanned technical publications that are larger than 500 MB, for a total of 290 GB!  They are so large, because they contain color maps and other foldouts scanned at 300 dpi.

Obviously, they need to be compressed, but I am constrained by the fact that they must be PDF/A-2 compliant.  As I understand it, that would require JPEG2000 compression.  (I need lossless compression.) PDF Optimizer in Acrobat 9 will work, but it is so slow that it could take me months to downsize the files individually. Batch Sequences in Acrobat 9 doesn’t allow for JPEG2000 compression.

 

Is there a way that I can batch-compress these files, using Acrobat 9 Pro?  If not, is there another program that can do the job?

 

Thanks!

 
Replies
  • Currently Being Moderated
    Aug 1, 2012 1:45 AM   in reply to MarkHines

    JPEG2000 may not be lossless, in fact it usually isn't. And there are other lossless compressions, such as ZIP.

     

    PDF/A does NOT forbid lossy image compression. Is there a web page somewhere which claims it does?

     
    |
    Mark as:
  • Currently Being Moderated
    Aug 2, 2012 1:51 PM   in reply to MarkHines

    Hmmm. I'm not an expert on PDF, but I see what you quote as a subordinate to the earlier (in PDF/A-2) " Organizations that need to ensure that a conforming file is an accurate representation of original source material might need to impose additional requirements on the processes that generate the conforming file beyond those imposed by this part of ISO 19005."

     

    Optimizing can be worthwhile; only you can decide (it seems to me) if lossy processes are acceptable. Lossy processes aren't limited to compression. For instance Acrobat's scanned document optimizer can perform more or less aggressive background removal. Converting to pure monochrome might be lossy, but once done, JBIG2 compression can be extremely effective.

     

    Some more thoughts on JPEG2000.

    1. It is no more lossless than JPEG in general. There is a lossless JPEG, but nobody ever uses that (not available in Acrobat, nor even PDF at all I think). There is also a lossless JPEG2000. Be sure to select "lossless" as the compression option as otherwise it won't be (if that's what you want).

    2. Neither JPEG nor JPEG2000 is - to me - an obvious choice for scanned material.

    3. If you aren't offered a choice of JPEG2000, but expect it, make sure you are using PDF 1.5 or later, and PDF/A-2 (since PDF/A-1 forbids PDF 1.5).

     

    I would never consider a one way process that simply "optimized" the PDFs in any way and used only the new copies. I would always preserve the originals in case questions ever arise. Disk is cheap, even if bandwidth is expensive.

     
    |
    Mark as:

More Like This

  • Retrieving data ...

Bookmarked By (0)

Answers + Points = Status

  • 10 points awarded for Correct Answers
  • 5 points awarded for Helpful Answers
  • 10,000+ points
  • 1,001-10,000 points
  • 501-1,000 points
  • 5-500 points