Difference between revisions of "Comparison of objects"

From Liberty Eiffel Wiki
Jump to navigation Jump to search
Line 133: Line 133:
   
 
Before describing <tt>is_equal</tt> any further, let's be clear that <tt>is_equal</tt> can be used when the type of the two compared
 
Before describing <tt>is_equal</tt> any further, let's be clear that <tt>is_equal</tt> can be used when the type of the two compared
objects is expanded. For example, with [[library_class:INTEGER|<tt>INTEGER</tt>]]s, the result of a comparison using the predefined <tt>=</tt> operator has exactly the same effect as using the <tt>is_equal</tt> routine. Note however that using <tt>is_equal</tt> is quite unusual when the two operands are expanded, especially if it's a basic expanded type like [[library_class:INTEGER|<tt>INTEGER</tt>]], [[library_class:CHARACTER|<tt>CHARACTER</tt>]], [[library_class:REAL|<tt>REAL</tt>]] or [[library_class:BOOLEAN|<tt>BOOLEAN</tt>]]. In general, by convention and especially for the sake of simplicity, <tt>=</tt> and <tt>/=</tt> are always used for primitive expanded types.
+
objects is expanded. For example, with [[library_class:INTEGER|<tt>INTEGER</tt>]]s, the result of a comparison using the predefined <tt>=</tt> operator has exactly the same effect as using the <tt>is_equal</tt> routine. Note however that using <tt>is_equal</tt> is quite unusual when the two operands are expanded, especially if it's a basic expanded type like [[library_class:INTEGER|<tt>INTEGER</tt>]], [[library_class:CHARACTER|<tt>CHARACTER</tt>]], [[library_class:REAL|<tt>REAL</tt>]] or [[library_class:BOOLEAN|<tt>BOOLEAN</tt>]]. In general, by convention and especially for the sake of simplicity, <tt>=</tt> and <tt>/=</tt> are preferred for primitive expanded types.
   
 
As soon as the objects are more complex, with for example some attributes that themselves point towards other objects, it's necessary to pay attention to the predefined behaviour of the <tt>is_equal</tt> function. Consider for example the case of comparing two <tt>TRIANGLE</tt>s, as in the following figure:
 
As soon as the objects are more complex, with for example some attributes that themselves point towards other objects, it's necessary to pay attention to the predefined behaviour of the <tt>is_equal</tt> function. Consider for example the case of comparing two <tt>TRIANGLE</tt>s, as in the following figure:
   
 
[[Image:IsEqualTriangles.png|is_equal with some TRIANGLEs]]
 
[[Image:IsEqualTriangles.png|is_equal with some TRIANGLEs]]
  +
  +
Both objects of class <tt>TRIANGLE</tt> in the figure above are identical but if one procedes to compare them using the predefined <tt>is_equal</tt> routine the result will be <tt>False</tt>! In fact, let's emphasise that <tt>triangle_a</tt> and <tt>triangle_b</tt> designate, even if they are similar, two distinct objects in memory. Indeed, if the <tt>is_equal</tt> function has not been redefined in the <tt>TRIANGLE</tt> class, the comparison of <tt>triangle_a</tt> and <tt>triangle_b</tt> using <tt>is_equal</tt>, in other words [[Glossary#Expression|expression]]:
  +
  +
triangle_a.is_equal(triangle_b)
  +
  +
is equivalent to the expression:
  +
  +
(triangle_a.p1 = triangle_b.p1) and (triangle_a.p2 = triangle_b.p2) and (triangle_a.p3 = triangle_b.p3)

Revision as of 15:37, 10 September 2005


*** 
*** Please if you read French, translate the corresponding page using the French version as a model.
*** If you decide to translate this page, start to add here a *** note in order to avoid multiple
*** translators.
*** Please, check also our Author guidelines first !
*** Thanks in advance.
*** Colnet 11:59, 6 Sep 2005 (CEST)
***
*** Well, I've made a small start, but others are welcome (as far as I'm concerned) to contribute.
*** I'm having fun translating it, but this could take some days!
*** Peter Gummer 6 Sep 2005
***
*** I've done the next section.  I will flag up here before starting on another one.
*** Oliver Elphick, 7 Sep 2005
***
*** Thank you all for the great work. Colnet 09:41, 7 Sep 2005 (CEST)
***

Comparing two objects

How to compare two objects is a problem that has to be mastered fairly quickly, for it's a very common problem posed in many applications, if not almost all applications.

In Eiffel, as in most object-oriented languages, there are several ways to compare two objects, be it to compare their respective addresses or be it to compare the information contained in each of the two objects. In other words, using Eiffel terminology, one must choose whether to use the predefined = operator or the redefinable is_equal routine.

Because the Eiffel language aims, as far as possible, to prevent careless mistakes, it isn't possible to compare any type of expression with any other type of expression. The validity of a comparison will be detailed below.

Finally, just to be complete, this part will also discuss the possibility of carrying out a third sort of comparison, deep equality, a comparison uniquely reserved for a very rare family of applications.

Comparison using the operators = or /=

Let there be two variables point_a and point_b both of type POINT. Let us suppose that these two variables have been initialised with something other than Void. The following code fragment shows a possible usage of the predefined = comparison operator:

if point_a = point_b then
   io.put_string("We have a single, unique POINT !")
else
   io.put_string("We have two POINTs at different memory locations.")
end

Comparison using the = operator is the most basic building block for comparison. In the case of a reference type, it corresponds simply to the direct comparison of the two addresses. What the two addresses point to is not considered. If our two variables point_a and point_b actually designate a single, unique POINT in memory, then the test above evaluates to true. Once there are two distinct POINTs in memory, then the test is false even if the two POINTs seem exactly identical, in the debugger for example, or when printed.

Note that we also have the /= operator, which corresponds to the negation of the = operator. The /= operator is also predefined and is simply a shortcut to avoid having to use not for negation. So to test whether a reference variable refers to an object, the usage is:

if point /= Void then
   io.put_string("The variable point refers to a POINT!")
else
   io.put_string("The variable point does not refer to any object.")
end

When comparing variables of an expanded type, the effect of the = and /= operators is simply to compare the values in question. For two variables of type INTEGER, that boils down to comparing the two numbers:

if integer_a = integer_b then
   io.put_string("integer_a is the same value as integer_b!")
else
   io.put_string("The two values are different.")
end

The essential details about the = and /= predefined operators have now been clearly explained above, at least, so we hope. If you are reading this to find out about comparison methods, go straight to the section on is_equal. Otherwise, continue to the next section for additional information about these two predefined operators.

The = and /= operators with expanded types

The = and /= operators, apart from being predefined, are non-redefinable operators. The effect of = and /= is completely fixed, within the compiler.

When comparing two expressions of a reference type with the = or /= operators, as has already been mentioned above, only the two addresses are compared. Let's see what happens when the two compared expressions are of an expanded type.

Comparing two expanded expressions using = simply comes down to comparing, at the lowest level, bit by bit, the two areas of memory corresponding to the two objects. Of course, the expanded type used on the left of the = operator must be compatible with the expanded type used on the right. Indeed, apart from some rare exceptions that we'll detail later, a comparison is accepted only when the two expanded types are exactly the same. It's obviously possible to compare two expressions of type CHARACTER with each other; and, for example, to compare two expressions of type INTEGER with each other. In order to avoid any careless mistakes, however, directly comparing a CHARACTER with an INTEGER is forbidden. Note that in the latter case, it's up to the program's designer to make a call to the appropriate conversion routine. There are multiple possibilities, be it to convert the INTEGER expression into a CHARACTER, or be it, inversely, to convert the CHARACTER expression into an INTEGER. In both cases the possible conversion routines are themselves numerous. Several routines exist, for example, to convert from CHARACTER to INTEGER. Here are two such examples, among many others:

if my_character.to_integer = my_integer then
   io.put_string("Conversion with possible sign extension.")
end

and again:

if my_character.decimal_value = my_integer then
    io.put_string("Conversion using the value of the corresponding number.")
end

Thus, it seems that it's absolutely not possible to automate the choice of conversion that ought to be applied before the comparison. It depends completely on the context of the comparison, on the problem being dealt with, and this choice must be made manually by the program's designer. The strict rule, consisting of allowing the comparison of two expanded objects only when they have exactly the same type, is indeed the simplest and safest. Thus, if the compiler rejects your comparison expression, read the error message carefully and choose the most appropriate conversion with care. Having said this, two exceptions to this very strict rule remain, probably for historical reasons.

The two exceptions concern two special cases that leave no doubt thanks to the fact that the expanded objects concerned are very elementary. The first exception to the strict rule of the identity of two expanded types concerns the case of two objects taken from the set {INTEGER_8, INTEGER_16, INTEGER_32, INTEGER, INTEGER_64}. Comparison using = is true if and only if the two values correspond to exactly the same integer value. In fact, without any loss of information, it is always possible to represent in 16 bits any number that is represented in 8 bits; and so on and so forth. This exception to the hard and fast rule therefore lets us write simply and without risk:

if my_integer_8 = my_integer_16 then ...

knowing that it's as if we had written:

if my_integer_8.to_integer_16 = my_integer_16 then ...

The second and last exception is similar to the preceding one and concerns the following set of expanded types: {REAL_32, REAL_64, REAL, REAL_80, REAL_128, REAL_EXTENDED}. Like the integers, for the set of floating point numbers in question, it is always possible to convert a given floating point number to a bigger number of bits without any loss of information. Thus, a comparison expression using = or /= can mix two types taken from the REAL_* family.

Note, finally, that it is forbidden to compare directly any object from the REAL_* family with any object from the INTEGER_* family. If you find yourself having to make such a comparison, which isn't illogical, it's up to you to decide which conversion routine is appropriate to your needs. The Eiffel language has no rule that could decide this for you.

The redefinable is_equal routine

The redefinable is_equal routine provides a less elementary comparison than using the predefined = operator. Consider once again our example of comparing two variables of type POINT, but this time let's use is_equal. Like the above case, let's suppose for the moment that the two variables point_a and point_b are initialised with something other than Void:

if point_a.is_equal(point_b) then
   io.put_string("Either they are both the same POINT or else they are 2 identical POINTs.")
else
   io.put_string("They are two different POINTs.")
end

In a given class, if the is_equal routine hasn't been redefined by the user, the behaviour of is_equal consists of comparing all attributes of the two objects with each other. That means, for the POINT example, successively comparing the two x attributes and then the two y attributes. In other words, it's as if we'd written:

if (point_a.x = point_b.x) and (point_a.y = point_b.y) then
   io.put_string("Either they are both the same POINT or else they are 2 identical POINTs.")
else
   io.put_string("They are two different POINTs.")
end

Note how the comparison between attributes does not use is_equal, but the elementary fixed = operator. The following diagram presents a picture of the possible memory contents during comparison of point_a and point_b. Note that in the following case, these two POINTs situated in two memory locations are identical and, consequently, comparison using is_equal will return True:

is_equal with some POINTs

In the example above, the two objects that are being compared are both instances of the POINT class: very simple objects. As the diagram shows, a POINT is composed of two expanded attributes, or, more exactly, the attributes x and y of type REAL, a primitive expanded type (i.e. without an intermediate pointer). In this case the default definition of the is_equal function fits perfectly.

Before describing is_equal any further, let's be clear that is_equal can be used when the type of the two compared objects is expanded. For example, with INTEGERs, the result of a comparison using the predefined = operator has exactly the same effect as using the is_equal routine. Note however that using is_equal is quite unusual when the two operands are expanded, especially if it's a basic expanded type like INTEGER, CHARACTER, REAL or BOOLEAN. In general, by convention and especially for the sake of simplicity, = and /= are preferred for primitive expanded types.

As soon as the objects are more complex, with for example some attributes that themselves point towards other objects, it's necessary to pay attention to the predefined behaviour of the is_equal function. Consider for example the case of comparing two TRIANGLEs, as in the following figure:

is_equal with some TRIANGLEs

Both objects of class TRIANGLE in the figure above are identical but if one procedes to compare them using the predefined is_equal routine the result will be False! In fact, let's emphasise that triangle_a and triangle_b designate, even if they are similar, two distinct objects in memory. Indeed, if the is_equal function has not been redefined in the TRIANGLE class, the comparison of triangle_a and triangle_b using is_equal, in other words expression:

triangle_a.is_equal(triangle_b)

is equivalent to the expression:

(triangle_a.p1 = triangle_b.p1) and (triangle_a.p2 = triangle_b.p2) and (triangle_a.p3 = triangle_b.p3)