Issue Details (XML | Word | Printable)

Key: LUCENE-1150
Type: Bug Bug
Status: Closed Closed
Resolution: Fixed
Priority: Major Major
Assignee: Michael McCandless
Reporter: Nicolas Lalevée
Votes: 0
Watchers: 0
Operations

If you were logged in you would be able to see more operations.
Lucene - Java

The token types of the standard tokenizer is not accessible

Created: 25/Jan/08 10:16 AM   Updated: 08/May/08 07:47 PM
Return to search
Component/s: Analysis
Affects Version/s: 2.3
Fix Version/s: 2.3.2, 2.4

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works LUCENE-1150.patch 2008-01-25 12:40 PM Michael McCandless 7 kB
Text File Licensed for inclusion in ASF works LUCENE-1150.take2.patch 2008-01-25 07:16 PM Michael McCandless 14 kB

Lucene Fields: New
Resolution Date: 15/Apr/08 09:09 AM


 Description  « Hide
The StandardTokenizerImpl not being public, these token types are not accessible :
public static final int ALPHANUM          = 0;
public static final int APOSTROPHE        = 1;
public static final int ACRONYM           = 2;
public static final int COMPANY           = 3;
public static final int EMAIL             = 4;
public static final int HOST              = 5;
public static final int NUM               = 6;
public static final int CJ                = 7;
/**
 * @deprecated this solves a bug where HOSTs that end with '.' are identified
 *             as ACRONYMs. It is deprecated and will be removed in the next
 *             release.
 */
public static final int ACRONYM_DEP       = 8;

public static final String [] TOKEN_TYPES = new String [] {
    "<ALPHANUM>",
    "<APOSTROPHE>",
    "<ACRONYM>",
    "<COMPANY>",
    "<EMAIL>",
    "<HOST>",
    "<NUM>",
    "<CJ>",
    "<ACRONYM_DEP>"
};

So no custom TokenFilter can be based of the token type. Actually even the StandardFilter cannot be writen outside the org.apache.lucene.analysis.standard package.



 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Michael McCandless made changes - 25/Jan/08 12:36 PM
Field Original Value New Value
Assignee Michael McCandless [ mikemccand ]
Michael McCandless made changes - 25/Jan/08 12:40 PM
Attachment LUCENE-1150.patch [ 12374028 ]
Michael McCandless made changes - 25/Jan/08 07:16 PM
Attachment LUCENE-1150.take2.patch [ 12374074 ]
Michael McCandless made changes - 29/Jan/08 10:52 AM
Status Open [ 1 ] Resolved [ 5 ]
Resolution Fixed [ 1 ]
Fix Version/s 2.4 [ 12312681 ]
Michael McCandless made changes - 09/Apr/08 09:29 AM
Fix Version/s 2.3.2 [ 12313057 ]
Michael McCandless made changes - 15/Apr/08 08:17 AM
Status Resolved [ 5 ] Reopened [ 4 ]
Resolution Fixed [ 1 ]
Michael McCandless made changes - 15/Apr/08 09:09 AM
Resolution Fixed [ 1 ]
Status Reopened [ 4 ] Resolved [ 5 ]
Michael Busch made changes - 08/May/08 07:47 PM
Status Resolved [ 5 ] Closed [ 6 ]