Knowledge Base
Euclidean Distance

Euclidean Distance

The comparison for the numeric columns is done using Euclidean distance. Euclidean distance is like a measure of how different two collection of numbers are. Imagine you have pairs of numbers (coordinates) like (20, 21, 22, 23, 24) and (28, 31, 32, 33, 34). The Euclidean distance between them would be the square root of the sum of the squares of the differences between each pair of coordinates. In this case, it would be 22.36.

Formally, Euclidean distance is a metric on a vector space where the distance between two vectors is the length of the line segment connecting them. It's named after Euclid, the Greek mathematician who invented it.

Example

Let's say we have tables:

Tested Frame

IDAnother IDYet Another IDNameAge
1101201John20
2102202Mary21
2103203Jane22
3103203Jack23
3104205Jill24

Expected Frame

IDAnother IDYet Another IDNameAge
1101201John28
2102202Mary31
2103203Jane32
3103203Jack33
3104205Jill34

In this example, the ages are different. The Euclidean distance is 22.36, showing that the age columns is not identical.