dimanche 22 mars 2015

Incremental/Continuous crawl and case sensitivity - Duplicate items



I just stumbled upon an interesting behavior:



  1. Have a list item:

    • Title: My Special List Item



  2. Perform a crawl (continuous / incremental crawl)

    • Search returns 1 result.



  3. Rename the list item with different casing: "my special list item"

  4. Perform a continuous / incremental crawl

    • Search returns 2 results. One for each case (my special... & My Special...)

    • The DocId managed property is different for both search results

    • The Duplicate Rows are 0 or in other words: Total Rows: 2, Total Rows including Duplicates: 2, so trimduplicates=true does not help, still 2 results.



  5. When I now perform a full crawl, the duplicate entry is removed.


It seems as continuous and incremental crawl do not take care of casing and create a duplicate entry for when only the casing of an item is changed. Besides running a full crawl, is there anything else I could do about this?








0 commentaires:

Enregistrer un commentaire