I have a document management application that contains data
about PDF documents in a database with a reference to the file name
to access the PDF, like:
id Name Number URL
1 Doc Title 1 100-199 100-199.pdf
2 Doc Title 2 100-287 100-287.pdf
We currently search on the name and number from the database,
but users want to search the contents of the PDF as well. The name
and number are inside the PDF, but there is no metadata to return
clean search results.
Is there a way to combine the database query collection with
the file search collection to create a single, searchable
collection without manipulating the PDF files?