K-Nearest Neighbors

K-Nearest Neighbors (KNN) algorithms are a fundamental building block for many complex Dimensionality Reduction and Clustering algorithms (such as UMAP, t-SNE, and OPTICS). DruidJS provides extremely fast web-native implementations of multiple exact and approximate KNN algorithms to find local neighborhoods efficiently.

This showcase allows you to interactively explore various KNN algorithms on synthetic datasets:

Uniform Data Generation: Adjust the number of points (up to 5,000) to generate a 2D uniform distribution.
Real-time Pointer Queries: Move your mouse over the scatterplot to perform instantaneous KNN searches at your cursor's position.
Algorithm Comparison: Switch between different exact and approximate methods to observe their build and search times.

Pipeline Settings

Uniform Points (N): 500

Algorithm

Neighbors (K): 15

M (Links per node): 16

EF Construction: 200

Index built in 0 ms.

Hover query fetched in 0 ms.

How-to (Code)

DruidJS supports algorithms optimized for high dimensionality like HNSW, NNDescent, Annoy, LSH, and spatial partition trees like KD-Tree and BallTree.

javascript

import * as druid from "@saehrimnir/druidjs";

const data = [
  /* ... multi-dimensional floats ... */
];

// 1. Build the index (e.g. NN-Descent)
const nndescent = new druid.NNDescent(data, {
  metric: druid.euclidean,
  samples: 50,
  rho: 0.8,
});

// 2. Perform a fast query (e.g. at a specific coordinate)
const queryVector = [0.5, 0.5];
const nearestNeighbors = nndescent.search(queryVector, 15);

// The list returned contains elements, their original index, and the computed distance
for (const match of nearestNeighbors) {
  console.log(`Neighbor ID: ${match.index}, Distance: ${match.distance}`);
}

K-Nearest Neighbors ​

Pipeline Settings

How-to (Code) ​

K-Nearest Neighbors

How-to (Code)