  #1  
Old 04 May 2010, 15:13
FractalizeR
 
Join Date: Oct 2005
Real name: Vladislav
Sphinx: WARNING: duplicate document ids found

The following is the output of cronjob /usr/local/sphinx/cron/delta.sh:


Block Disabled:      (Update License Status)  
Suspended or Unlicensed Members Cannot View Code.

Please look at the ThreadPostDelta indexing:

The message "WARNING: duplicate document ids found" appears. Is that normal behavior for Sphinx? What is used as the document ID?
__________________
* My mods and articles * Forum: vB4.x, 400K posts, 300K users, 3M monthly hits
  #2  
Old 04 May 2010, 16:41
sung
 
Join Date: Feb 2002
I got the warning as well (so glad it isn't just me), which I've reported in the vbulletin.com forums.

It can cause all sorts of nasty problems with Sphinx.

There are a few different restrictions imposed on the source data which is going to be indexed by Sphinx, of which the single most important one is:

ALL DOCUMENT IDS MUST BE UNIQUE UNSIGNED NON-ZERO INTEGER NUMBERS (32-BIT OR 64-BIT, DEPENDING ON BUILD TIME SETTINGS).

If this requirement is not met, different bad things can happen. For instance, Sphinx can crash with an internal assertion while indexing; or produce strange results when searching due to conflicting IDs. Also, a 1000-pound gorilla might eventually come out of your display and start throwing barrels at you. You've been warned.
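
To make the restriction above concrete, here is a minimal sketch of a pre-indexing check. It assumes the candidate document IDs have already been fetched into a Python iterable of integers; the function name `check_document_ids` and the constants are hypothetical and not part of Sphinx.

```python
from collections import Counter

# Upper bounds for unsigned IDs; which one applies depends on how Sphinx was built.
MAX_ID_32 = 2**32 - 1
MAX_ID_64 = 2**64 - 1


def check_document_ids(ids, max_id=MAX_ID_32):
    """Report IDs that break the 'unique, unsigned, non-zero' rule.

    Hypothetical helper: `ids` is any iterable of integers, for example
    the first column of the rows returned by the indexing query.
    """
    counts = Counter(ids)
    return {
        # IDs that occur more than once
        "duplicates": sorted(i for i, n in counts.items() if n > 1),
        # IDs that are zero, negative, or too large for the chosen width
        "invalid": sorted(i for i in counts if i <= 0 or i > max_id),
    }


report = check_document_ids([1, 2, 2, 0, 5, 2**32])
print(report)
```

Pass `max_id=MAX_ID_64` if your build uses 64-bit document IDs.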
  #3  
Old 04 May 2010, 21:42
FractalizeR
The following combination is used in the configuration file to build the so-called document ID, which MUST be unique:


Block Disabled:      (Update License Status)  
Suspended or Unlicensed Members Cannot View Code.

For some reason, it turns out to be non-unique. However, I don't see how that can happen unless the complete query really returns duplicate rows.
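
One way to test the duplicate-rows theory is to wrap the indexing query in a GROUP BY and see which IDs come back more than once. The sketch below uses an in-memory SQLite database purely for illustration; the `post` and `attachment` tables, the `doc_id` alias, and the helper `find_duplicate_ids` are all made up and are not the actual vBulletin schema or the query from the configuration file.

```python
import sqlite3


def find_duplicate_ids(conn, source_query):
    """Return (doc_id, count) pairs for IDs the query yields more than once.

    Assumption: `source_query` exposes the document ID in a column
    aliased as `doc_id`.
    """
    sql = (
        "SELECT doc_id, COUNT(*) FROM (" + source_query + ") AS src "
        "GROUP BY doc_id HAVING COUNT(*) > 1 ORDER BY doc_id"
    )
    return conn.execute(sql).fetchall()


# Hypothetical data: post 1 has two attachments, so a plain JOIN returns it twice.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE post (postid INTEGER PRIMARY KEY, title TEXT);
    CREATE TABLE attachment (attachmentid INTEGER PRIMARY KEY, postid INTEGER);
    INSERT INTO post VALUES (1, 'first'), (2, 'second');
    INSERT INTO attachment VALUES (10, 1), (11, 1), (12, 2);
""")

query = (
    "SELECT post.postid AS doc_id, post.title "
    "FROM post JOIN attachment ON attachment.postid = post.postid"
)
duplicates = find_duplicate_ids(conn, query)
print(duplicates)
```

If a check like this returns any rows for the real query, a one-to-many JOIN is a likely suspect.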
  #4  
Old 20 Jun 2010, 00:58
graham_w
 
Join Date: Apr 2005
Did you ever sort this out? I'm noticing the same error.

Cheers
  #5  
Old 20 Jun 2010, 07:49
FractalizeR
No, but it looks like it doesn't affect search quality.
  #6  
Old 20 Jun 2010, 08:53
graham_w
Thanks for the reply. Yeah, I did find a thread saying something similar on the Sphinx website.

Cheers
  #7  
Old 20 Jun 2010, 18:35
JesterP
 
Join Date: Jun 2007
I received this in my inbox this morning:

--->8---

### SAVE ORDERED IDS TO SEARCH CACHE ###;

MySQL Error : Duplicate entry '92f3f32f09b269797e91242ce55639a6-lastpost-DESC' for key 2
Error Number : 1062
Request Date : Sunday, June 20th 2010 @ 10:44:01 AM

---8<---
Everything is still running and I am not seeing anything bad happening. No errors since.