Virtual function
In object-oriented programming, in languages such as C++, and Object Pascal, a virtual function or virtual method is an inheritable and overridable function or method for which dynamic dispatch is facilitated. This concept is an important part of the (runtime) polymorphism portion of object-oriented programming (OOP). In short, a virtual function defines a target function to be executed, but the target might not be known at compile time.
Polymorphism |
---|
Ad hoc polymorphism |
Parametric polymorphism |
Subtyping |
Most programming languages, such as Java, PHP and Python, treat all methods as virtual by default[1] and do not provide a modifier to change this behavior. However, some languages provide modifiers to prevent methods from being overridden by derived classes (such as the final keyword in Java[2] and PHP[3]).
Purpose
The concept of the virtual function solves the following problem:
In object-oriented programming, when a derived class inherits from a base class, an object of the derived class may be referred to via a pointer or reference of the base class type instead of the derived class type. If there are base class methods overridden by the derived class, the method actually called by such a reference or pointer can be bound either 'early' (by the compiler), according to the declared type of the pointer or reference, or 'late' (i.e., by the runtime system of the language), according to the actual type of the object referred to.
Virtual functions are resolved 'late'. If the function in question is 'virtual' in the base class, the most-derived class's implementation of the function is called according to the actual type of the object referred to, regardless of the declared type of the pointer or reference. If it is not 'virtual', the method is resolved 'early' and the function called is selected according to the declared type of the pointer or reference.
Virtual functions allow a program to call methods that don't necessarily even exist at the moment the code is compiled.
In C++, virtual methods are declared by prepending the virtual
keyword to the function's declaration in the base class. This modifier is inherited by all implementations of that method in derived classes, meaning that they can continue to over-ride each other and be late-bound. And even if methods owned by the base class call the virtual method, they will instead be calling the derived method. Overloading occurs when two or more methods in one class have the same method name but different parameters. Overriding means having two methods with the same method name and parameters. Overloading is also referred to as function matching, and overriding as dynamic function mapping.
Example
For example, a base class Animal
could have a virtual function Eat
. Subclass Llama
would implement Eat
differently than subclass Wolf
, but one can invoke Eat
on any class instance referred to as Animal, and get the Eat
behavior of the specific subclass.
class Animal {
public:
// Intentionally not virtual:
void Move(void) {
std::cout << "This animal moves in some way" << std::endl;
}
virtual void Eat(void) = 0;
};
// The class "Animal" may possess a definition for Eat if desired.
class Llama : public Animal {
public:
// The non virtual function Move is inherited but not overridden.
void Eat(void) override {
std::cout << "Llamas eat grass!" << std::endl;
}
};
This allows a programmer to process a list of objects of class Animal
, telling each in turn to eat (by calling Eat
), without needing to know what kind of animal may be in the list, how each animal eats, or what the complete set of possible animal types might be.
We can better see how virtual functions work by implementing the above example in C
#include <stdio.h>
/* an object points to its class... */
struct Animal {
const struct AnimalClass * class;
};
/* which contains the virtual function Animal.Eat */
struct AnimalClass {
void (*Eat)(struct Animal *); // 'virtual' function
};
/* Since Animal.Move is not a virtual function
it is not in the structure above. */
void Move(struct Animal * self)
{
printf("<Animal at %p> moved in some way\n", (void *) self);
}
/* unlike Move, which executes Animal.Move directly,
Eat cannot know which function (if any) to call at compile time.
Animal.Eat can only be resolved at run time when Eat is called. */
void Eat(struct Animal * self)
{
const struct AnimalClass * class = *(const void **) self;
if (class->Eat)
class->Eat(self); // execute Animal.Eat
else
fprintf(stderr, "Eat not implemented\n");
}
/* implementation of Llama.Eat this is the target function
to be called by 'void Eat(struct Animal *).' */
static void _Llama_eat(struct Animal * self)
{
printf("<Llama at %p> Llama's eat grass!\n", (void *) self);
}
/* initialize class */
const struct AnimalClass Animal = {(void *) 0}; // base class does not implement Animal.Eat
const struct AnimalClass Llama = {_Llama_eat}; // but the derived class does
int main(void)
{
/* init objects as instance of its class */
struct Animal animal = {& Animal};
struct Animal llama = {& Llama};
Move(& animal); // Animal.Move
Move(& llama); // Llama.Move
Eat(& animal); // cannot resolve Animal.Eat so print "Not Implemented" to stderr
Eat(& llama); // resolves Llama.Eat and executes
}
Abstract classes and pure virtual functions
A pure virtual function or pure virtual method is a virtual function that is required to be implemented by a derived class if the derived class is not abstract. Classes containing pure virtual methods are termed "abstract" and they cannot be instantiated directly. A subclass of an abstract class can only be instantiated directly if all inherited pure virtual methods have been implemented by that class or a parent class. Pure virtual methods typically have a declaration (signature) and no definition (implementation).
As an example, an abstract base class MathSymbol
may provide a pure virtual function doOperation()
, and derived classes Plus
and Minus
implement doOperation()
to provide concrete implementations. Implementing doOperation()
would not make sense in the MathSymbol
class, as MathSymbol
is an abstract concept whose behaviour is defined solely for each given kind (subclass) of MathSymbol
. Similarly, a given subclass of MathSymbol
would not be complete without an implementation of
doOperation()
.
Although pure virtual methods typically have no implementation in the class that declares them, pure virtual methods in C++ are permitted to contain an implementation in their declaring class, providing fallback or default behaviour that a derived class can delegate to, if appropriate.[4]
Pure virtual functions can also be used where the method declarations are being used to define an interface - similar to what the interface keyword in Java explicitly specifies. In such a use, derived classes will supply all implementations. In such a design pattern, the abstract class which serves as an interface will contain only pure virtual functions, but no data members or ordinary methods. In C++, using such purely abstract classes as interfaces works because C++ supports multiple inheritance. However, because many OOP languages do not support multiple inheritance, they often provide a separate interface mechanism. An example is the Java programming language.
Behavior during construction and destruction
Languages differ in their behavior while the constructor or destructor of an object is running. For this reason, calling virtual functions in constructors is generally discouraged.
In C++, the "base" function is called. Specifically, the most derived function that is not more derived than the current constructor's class is called.[5] If that function is a pure virtual function, then undefined behavior occurs.[6] This is true even if the class contains an implementation for that pure virtual function. A conforming C++ implementation is not required (and generally not able) to detect indirect calls to pure virtual functions at compile time or link time. Some runtime systems will issue a pure virtual function call error when encountering a call to a pure virtual function at run time.
In Java and C#, the derived implementation is called, but some fields are not yet initialized by the derived constructor (although they are initialized to their default zero values).[7] Some design patterns, such as the Abstract Factory Pattern, actively promote this usage in languages supporting this ability.
Virtual destructors
Object-oriented languages typically manage memory allocation and de-allocation automatically when objects are created and destroyed. However, some object-oriented languages allow a custom destructor method to be implemented, if desired. If the language in question uses automatic memory management, the custom destructor (generally called a finalizer in this context) that is called is certain to be the appropriate one for the object in question. For example, if an object of type Wolf that inherits Animal is created, and both have custom destructors, the one called will be the one declared in Wolf.
In manual memory management contexts, the situation can be more complex, particularly in relation to static dispatch. If an object of type Wolf is created but pointed to by an Animal pointer, and it is this Animal pointer type that is deleted, the destructor called may actually be the one defined for Animal and not the one for Wolf, unless the destructor is virtual. This is particularly the case with C++, where the behavior is a common source of programming errors if destructors are not virtual.
See also
References
- "Polymorphism (The Java™ Tutorials > Learning the Java Language > Interfaces and Inheritance)". docs.oracle.com. Retrieved 2020-07-11.
- "Writing Final Classes and Methods (The Java™ Tutorials > Learning the Java Language > Interfaces and Inheritance)". docs.oracle.com. Retrieved 2020-07-11.
- "PHP: Final Keyword - Manual". www.php.net. Retrieved 2020-07-11.
- Pure virtual destructors - cppreference.com
- Meyers, Scott (June 6, 2005). "Never Call Virtual Functions during Construction or Destruction".
- "N4659: Working Draft, Standard for Programming Language C++" (PDF). §13.4.
- Ganesh, S.G. (August 1, 2011). "Joy of Programming: Calling Virtual Functions from Constructors".