I too have the same requirement. How do we enable this full text extraction in cq 5.5 DAM search ,by default it doesnot search in content of
PDFs or any other supported doc , it just searches only in metadata of the asset.
@Deepikaa :- Please install the 5.5 update1 package & then reindex.
- stemming Porter stemmer is default one rather than dictionary-backed stemmer.The way Porter steamer works is both Country & Countries steam to countri. However you can write your own Analyzer implementation or other workaround would be to use QueryParser for search results.
- The spellchecker dictionary is actually built from the words contained in your site's content. This mighr be an optimal spellchecker and should handle cases where your product name is mispelled by users. In other words, you should not need to change the dictionary and if you did want to you would have to implement custom code to do that.
- IIRC To enable the synonym lookup mechanism need to use the tilda (~) character which can be configured in workspace.xml under the SearchIndex element.
- The new indexing rules http://wiki.apache.org/jackrabbit/IndexingConfiguration
Does AEM 5.6 needs to install any package to support extracting pdf content in DAM? I tried but failed.