1 Reply Latest reply on Sep 24, 2008 9:58 AM by Newsgroup_User

    Parsing PDF Files

    Magikaru
      Hi everyone. So I have a bunch of pdf files that contain text data which I need to retrieve. Is there any way to parse it? The data is stored with the first line has the column names followed delimited commas. Every line after is one row of data.

      ex:
      id,fname,lname,age
      532,Tom,Stevens,33
      42,John,Baldwin,38
      ...

      I've tried using the cfpdf tag, but as far as I know that tag is mainly used for pdf creation, not extraction.
      It does have a read function: <cfpdf action="read" source="data.pdf" name="mypdf"> but I don't know how to use the 'mypdf' variable after I read the file. If I dump it out, it just shows details about the pdf file, but not what it contains.

      Thanks for any help you can provide.