|
What about the hashCode() implementations? IMO returning 0 is equivalent with not having an equals method.
./alex -- .w( the_mindstorm )p. IMO returning 0 as hashCode implies all Values across all types are equal! The value should atleast be different for different Types.
And it also doesn't really satisfy the javadoc comment of "Returns zero to satisfy the Object equals/hashCode contract. This class is mutable and not meant to be used as a hash key." A HashMap with a value as a key will fail rather silently. With an UnsupportedOperationException one will know the error of one's way at once. From an api design point of view we can dump Value equals and hashcode i.e. let direct comparisons using equals always return false (i.e. Object's implementation holds true) and same for hashCode. And we could introduce a Values class that could introduce overloaded equals methods (Like Arrays, Collections etc.), we can merge Values and the present ValueHelper class into one class. And Jukka I will get down to providing you with a patch and testcases. It might take some time though... The current hashCode() implementation is based on a discussion we had in April 2005, see http://thread.gmane.org/gmane.comp.apache.jackrabbit.devel/1561 (starting a few messages down the thread). There are valid arguments for many different implementations, but once we implement equals() the Object contract requires that hashCode() is also implemented at least in some way. Returning a constant was considered the best alternative.
> From an api design point of view we can dump Value equals and hashcode No, we can't. :-) The JCR API explicitly says that Value.equals() must be implemented in a specific way. See the javadoc link for details. > And Jukka I will get down to providing you with a patch and testcases. It might take some time though... Thanks! I'm not decided if this is a serious enough breach of the JCR API to be a critical issue for the 1.0 release. If it is, then I can participate in getting this fixed, otherwise there is no need to hurry. Piyush wrote:
> IMO returning 0 as hashCode implies all Values across all types are equal! The value should atleast be different for different Types. this is not true. returning 0 (or any other fixed value) is a valid return value for hashCode(). See contract of Object.hashCode(). In fact the opposite is true: values that are equal implies that their hashCodes are equal! Objects of different types can return the same hashCode() as long as equals will return false for two instances of different type. Marcel you are spot on I erred there in my comment. What I wanted to say was returning different int values would have signaled different types of values.. I actually got carried away there :)
> No, we can't. :-) The JCR API explicitly says that Value.equals() must be implemented in a specific way. See the javadoc link for details. You are right Jukka, I forgot jackrabbit 0.9 doesn't mean that the JCR API which is already 1.0 is still negotiable... :) and thanks for that discussion/mailing-list-archive link says a lot. So I guess I should stop throwing the hashCode issue around? Here are some fresh comments on this issue 1. I have started a new package org.apache.jackrabbit.test.api.value with a TestAll and other classes. I hope that is fine. 2. Is there a way of getting the default encoding i.e. BaseValue.DEFAULT_ENCODING? and even more importantly is there a way of setting a Default_Encoding conveniently through out the repository for example what if a project wants to use UTF16 instead of UTF8 ? Has a thought been put into that? Perhaps a config property? 3. I have read the javadoc for Value (The third time mind you). Here are a few cases (to see if I got it right) a) two Binary Value with texts "some text" according to api they should be equal and by implementation they are. b) two Binary Values with bytes from "some text" in the same encoding according to api should be equal(?) and with the implementation I proposed they are equal. what should happen if they have different encodings but the same content? I guess equals should fail. c) two BinaryValues with input streams from the same file will fail equals if the input streams were instantiated independently. The javadoc asks not to try and compare big blobs so we stick to that. But what happens when one BinaryValue has text = "some text" and another a byte array = "some text".getBytes(DEFAULT_ENCODING)? and what happens when one BinaryValue has text = "some text" and another a stream from a file with the contents "some text"? If we decide that a BV with text="some text" must equal a BV with byte array = "some text".getBytes(DEFAULT_ENCODING) etc. etc. then I need to know now as the present equals implementation and my proposed one fall way short! We also need to write more tests then.. many more Next question: do we really need such a complex equals method i.e. do the other parts of the API really care? Another thought - we could simplify things if we didn't have a string in there and just have a byte-array or a stream... as we can always re/create the string from the byte-array. I hope no one's going to say ... seeesh leave it alone it's just an equals method! ;) I commented on the mailing list. Let's keep the Jira for tightly scoped comments about the issue at hand and use the mailing list for more general discussion. :-)
Piyush, did you get around to creating a patch for this? If not, I'll just take your code from the above comments and apply it.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
http://www.day.com/maven/jsr170/javadocs/jcr-1.0/javax/jcr/Value.html
Can you provide a patch of your improvements and test cases?