The ones that are just images can be easily found using Everything 1.5:
Code: Select all
"C:\some folder" ext:pdf !dotall:regex:fromdisk:content:^(.{50})
Next to all-picture-PDFs, this will also list encrypted as well as corrupted PDFs.
Might be useful if you want to convert those to text (using OCR)
Just sharing ...
The 50 characters ilimit is a practical one. I found there were some "noise" characters detected once in a while, but never more than 20. And a text-PDF will contain more than 50 characters.
If you find (for example) 30 works better, please post your experience so this can be updated.