related to my earlier slow solr post. Running a refresh on the first file then an update on all others. Recurse may not be necessary.
extensions=".html, .htm, .xls, .xlsm, .doc, .docx, .pdf, .txt"
Does fine until it hits a corrupt PDF File. If I try to open the file manually in PDF reader I get the message that the file may be corrupt.
I need it to get past this file and continue indexing the rest. I have tried a request timeout of three minutes but that does not work. I have attempted CFPDF Info extraction but it hangs reading it too. I do not know how to test the doc to see if it is corrupt.
Ultimately I would like it to give up on the file after about 3 minutes.
Wow - Rather shocked no one has experienced this.
I have isolated a couple of folders that have the issue and have run every test i can think of to make it move past this. Really need an assist.