Jump to content

Near Dupe: 0 Documents Processed


Guest Daniel Hahnke

Recommended Posts

Guest Daniel Hahnke

If Near Dupe returns with 0 documents processed, you may be running documents that are too large through Content Analyst.

As often as it is overlooked, make sure the text file sizes being sent to Content Analyst are no larger than 4mb each.

 

 

Link to comment
Share on other sites

  • 8 months later...

I've also noticed that you can't run near duplicate detection right after running email threading. You have to wait a few minutes before you do. This is due in part to the way that the values are imported into Eclipse after CAAT processes the docs. The eclipse indexing agents and scheduler need to complete the import before running another CAAT operation.

Link to comment
Share on other sites

I've also noticed that you can't run near duplicate detection right after running email threading. You have to wait a few minutes before you do. This is due in part to the way that the values are imported into Eclipse after CAAT processes the docs. The eclipse indexing agents and scheduler need to complete the import before running another CAAT operation.

Link to comment
Share on other sites

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...