What is the field type, text or string? By default, Solr tokenizes and analyzes text fields, so a query matches individual terms rather than the whole value, which can look like a fuzzy search. To get an exact match, set the field up as a string field, which has no tokenizer.
<field name="name" type="text" indexed="true" stored="false" required="true" />
<field name="nameString" type="string" indexed="true" stored="false" required="true" />
<copyField source="name" dest="nameString"/>
<requestHandler name="accounts" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="qf">nameString^10.0 name^5.0 description^1.0</str>
  </lst>
</requestHandler>
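For reference, the difference between the two field types comes from how they are declared in the schema. The following is only a minimal sketch: the type names `string` and `text` and the particular tokenizer and filter are assumptions for illustration, and the schema that ships with your collection may define them differently.

```xml
<!-- Sketch only: type names and analyzer chain are assumed, not taken from your schema -->

<!-- StrField is not analyzed: the whole value is indexed as a single term,
     so only an exact, case-sensitive match on the full value will hit -->
<fieldType name="string" class="solr.StrField" sortMissingLast="true" />

<!-- TextField runs an analyzer: the value is split into tokens and lowercased,
     so a query matches on individual words rather than the full value -->
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory" />
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
</fieldType>
```

Copying the text field into a string field keeps word-level search available while the higher boost on the string field ranks exact matches first. Note that the `qf` boosts are only applied when the query is parsed by dismax or edismax.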
If your collection has 10 PDF files and 10 .html files, what result do you get? Is the reported number of documents 10 or 20?
I have to confess that your response assumes knowledge on my part that I do not have. I have located the Schema.xml and SolrConfig.xml files for the collection in question, but that is about it.
In the Schema.xml file I see “<field name=” entries for what appear to be the fields returned by the queries, but do not know what fields I should be setting to string. I am attempting to do a full-text search of the contents of the html documents, not on metadata.
In the SolrConfig.xml file I see a number of “<requestHandler” entries, but do not know if I am supposed to modify one of them to match what you sent (in which case I need to know which one), or add what you sent in its entirety.
Also, on an unrelated matter, I am attempting to run a query of queries on the result of my cfsearch, but any attempt to create a “where Key='xxx'” or “where Key LIKE '%xxx%'” clause fails. I CAN use a “where Rank='1'” clause, but that isn’t what I am trying to do. Is that because the Key field is text where it needs to be string? (Basically, what I am attempting to do is to restrict the search to only certain subdirectories in the collection. In ColdFusion 8, with the Verity search, this was fairly easy to do by adding a Key field clause to the criteria string.)
Finally, the collection with the PDF documents: It contains 31000 .HTM documents and 10861 .PDF documents, and reports 41850 documents in the collection. However, the indexing fails, as do queries against this collection. Both are seemingly due to memory size issues. In ColdFusion 8, the Verity search indexing handled this size collection easily.