
Near Dupe: 0 Documents Processed


Guest Daniel Hahnke

If Near Dupe returns with 0 documents processed, the documents you are sending through Content Analyst may be too large.

This is easy to overlook: make sure each text file sent to Content Analyst is no larger than 4 MB.
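If it helps, here is a rough way to spot oversized files before you kick off the job. This is only a sketch in plain Python, not anything built into Eclipse or Content Analyst. The function name, the `*.txt` pattern, and the reading of "4mb" as 4 * 1024 * 1024 bytes are my own assumptions, so adjust them to match your export.

```python
from pathlib import Path

# Assumption: the 4 MB limit is taken as 4 * 1024 * 1024 bytes.
# Change this if your limit is defined differently.
MAX_BYTES = 4 * 1024 * 1024


def find_oversized_text_files(folder, max_bytes=MAX_BYTES, pattern="*.txt"):
    """Return (path, size_in_bytes) for each matching file larger than max_bytes.

    Hypothetical helper: it only inspects file sizes on disk and knows
    nothing about Eclipse or Content Analyst.
    """
    oversized = []
    for path in Path(folder).rglob(pattern):  # walks subfolders too
        if path.is_file():
            size = path.stat().st_size
            if size > max_bytes:
                oversized.append((path, size))
    # Largest files first, so the worst offenders are at the top.
    return sorted(oversized, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # "extracted_text" is a placeholder folder name.
    for path, size in find_oversized_text_files("extracted_text"):
        print(f"{size / (1024 * 1024):.2f} MB  {path}")
```

Anything it lists is a candidate for being split or excluded before you send the set through.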

 

 


  • 8 months later...

I've also noticed that you can't run near-duplicate detection right after running email threading; you have to wait a few minutes first. This is due in part to the way the values are imported into Eclipse after CAAT processes the documents. The Eclipse indexing agents and scheduler need to complete the import before you run another CAAT operation.
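If you script your workflow, the "wait a few minutes" step can be written as a generic polling loop. This is only a sketch. I don't know of a documented way to ask Eclipse whether the import has finished, so `is_ready` is a hypothetical callable you would have to supply yourself, and the timeout and polling interval are arbitrary.

```python
import time


def wait_until(is_ready, timeout_s=600.0, poll_s=15.0,
               sleep=time.sleep, clock=time.monotonic):
    """Poll is_ready() until it returns True or timeout_s elapses.

    is_ready is a caller-supplied (hypothetical) check, for example
    "the previous CAAT import has finished". Returns True if the check
    passed, False if the timeout was reached first.
    """
    deadline = clock() + timeout_s
    while True:
        if is_ready():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_s)
```

If you have no way to check, a fixed pause of a few minutes before starting near-duplicate detection does the same job.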

