[SOLR-6020] Auto-generate a unique key in schema-less mode if data does not have an "id" field - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.10, 6.0
Component/s: Schema and Analysis
Labels:
None

Description

Currently it is not possible to use the schema-less example if my data does not have an "id" field.

I was indexing data where the unique field name was "url" in schema-less mode. This requires one to first change unique key name in the schema and then start solr and then index docs. If one had already started solr, one'd first need to remove managed-schema, rename schema.xml.bak to schema.xml and then make the necessary changes in schema.xml. I don't think we should fail on such simple things.

Here's what I propose:

We remove "id" and uniqueKey from the managed schema example
If there's a field named "id" in the document, we use that as the uniqueKey
Else we fallback on generating a UUID or a signature field via an update processor and store it as the unique key field. We can name it as "id" or "_id"
But if a uniqueKey is already present in original schema.xml then we should expect the incoming data to have that field and we should preserve the current behavior of failing loudly.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-6020.patch
29/Jul/14 16:02
12 kB
Shalin Shekhar Mangar
SOLR-6020.patch
29/Jul/14 13:51
12 kB
Shalin Shekhar Mangar
SOLR-6020.patch
29/Jul/14 13:37
11 kB
Shalin Shekhar Mangar
SOLR-6020.patch
13/Jul/14 11:22
38 kB
Vitaliy Zhovtyuk

Activity

People

Assignee:: Shalin Shekhar Mangar

Reporter:: Shalin Shekhar Mangar

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 25/Apr/14 19:18

Updated:: 09/May/16 18:45

Resolved:: 29/Jul/14 16:49