Can anyone help me, please?
Please help... or if it's not possible, please tell me why.
Cropping is pretty straightforward using a Batch Sequence - there is already a Crop function listed there, which you can use to define the page size and crop multiple PDFs at once.
The descriptions for Doc.getPageNthWord and Doc.getPageNumWords are quite verbose and examples are provided, as is the description of Doc.getPageNthWordQuads for finding the location of a given word on the page (since, as you know from reading the PDF Reference, the order in which text is displayed on the page is not necessarily the same as its order in the content stream - which is where your project starts to get complicated).
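As a rough illustration of how those three methods fit together, here is a minimal sketch that walks the words of one page and keeps the quads of every word matching a pattern. The function name findWordQuads and the shape of the returned objects are made up for this example; only getPageNumWords, getPageNthWord and getPageNthWordQuads come from the Acrobat JavaScript API. It is written in older-style JavaScript on the assumption that it runs in Acrobat's script engine.

```javascript
// Sketch only: `doc` is assumed to be an Acrobat Doc object
// (for example `this` in the JavaScript console).
// findWordQuads is a hypothetical helper name, not part of the Acrobat API.
function findWordQuads(doc, pageNum, pattern) {
  var hits = [];
  var count = doc.getPageNumWords(pageNum);
  for (var i = 0; i < count; i++) {
    // Third argument true asks Acrobat to strip punctuation and whitespace.
    var word = doc.getPageNthWord(pageNum, i, true);
    // Reset lastIndex so a pattern with the g flag behaves on every word.
    pattern.lastIndex = 0;
    if (pattern.test(word)) {
      hits.push({
        index: i,
        word: word,
        quads: doc.getPageNthWordQuads(pageNum, i)
      });
    }
  }
  return hits;
}
```

In the console this could be called as findWordQuads(this, 0, /^\d+$/) to list the purely numeric words on the first page. Note that the word order is whatever Acrobat reports, which may differ from the visual reading order.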
Could you please give me some details about this? I have nearly figured this out but have one step left.
What I want to do is use a regular expression to search the text and add an HTTP link to every match.
In order to use a regular expression, I convert the PDF to plain text. But the index information for each word is lost, so I can't call getPageNthWordQuads to get the coordinates.
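One possible way around the lost index, sketched below, is to build the plain text yourself from getPageNthWord and record where each word starts and ends in that string. A regular expression match can then be mapped back from character offsets to word indices, and from there to quads. The helper names (buildPageIndex, wordsInRange, quadToRect, linkMatches) are invented for this sketch. It also makes several assumptions: words are joined with a single space, the page is not rotated (so the quads can be used directly as link coordinates), each quad is eight numbers x1, y1, ... x4, y4, and the link rectangle is given as left, top, right, bottom.

```javascript
// Sketch only: all four function names are hypothetical helpers.
// `doc` is assumed to be an Acrobat Doc object.

// Build the page text from the words and remember each word's offsets.
function buildPageIndex(doc, pageNum) {
  var text = "";
  var spans = []; // spans[i] = start and end offset of word i in `text`
  var count = doc.getPageNumWords(pageNum);
  for (var i = 0; i < count; i++) {
    var word = doc.getPageNthWord(pageNum, i, true);
    if (i > 0) text += " "; // assumption: one space between words
    spans.push({ start: text.length, end: text.length + word.length });
    text += word;
  }
  return { text: text, spans: spans };
}

// Indices of the words that overlap the character range [from, to).
function wordsInRange(spans, from, to) {
  var out = [];
  for (var i = 0; i < spans.length; i++) {
    if (spans[i].start < to && spans[i].end > from) out.push(i);
  }
  return out;
}

// Bounding box of one quad, assumed order: left, top, right, bottom.
function quadToRect(quad) {
  var xs = [quad[0], quad[2], quad[4], quad[6]];
  var ys = [quad[1], quad[3], quad[5], quad[7]];
  return [Math.min.apply(null, xs), Math.max.apply(null, ys),
          Math.max.apply(null, xs), Math.min.apply(null, ys)];
}

// Add one link per quad of every word touched by a match; returns the count.
function linkMatches(doc, pageNum, pattern, url) {
  var page = buildPageIndex(doc, pageNum);
  // Search with a global copy so exec() walks through the whole text.
  var re = new RegExp(pattern.source, pattern.ignoreCase ? "gi" : "g");
  var script = 'app.launchURL("' + url.replace(/(["\\])/g, "\\$1") + '", true);';
  var linked = 0;
  var m;
  while ((m = re.exec(page.text)) !== null) {
    if (m[0].length === 0) { re.lastIndex++; continue; } // skip empty matches
    var words = wordsInRange(page.spans, m.index, m.index + m[0].length);
    for (var w = 0; w < words.length; w++) {
      var quads = doc.getPageNthWordQuads(pageNum, words[w]);
      for (var q = 0; q < quads.length; q++) {
        var link = doc.addLink(pageNum, quadToRect(quads[q]));
        link.setAction(script);
        linked++;
      }
    }
  }
  return linked;
}
```

Something like linkMatches(this, 0, /RFC \d+/, "http://www.example.com/") would then link both words of each "RFC 1234" style match on the first page. Because the text is built with punctuation stripped, the pattern has to be written against that stripped text rather than against the output of a separate PDF-to-text export. For rotated pages the quads would presumably need transforming into rotated user space first.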