Nearest Neighbor Method

317 papers

About this topic

The Nearest Neighbor Method is a classification and regression technique in machine learning that predicts the output for a given input based on the closest training examples in the feature space. It operates on the principle that similar instances are likely to have similar outcomes.
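As a rough illustration of the principle described above, the following minimal sketch predicts a label by majority vote (classification) or a value by averaging (regression) over the k closest training examples. The function names `knn_predict` and `knn_regress`, the toy data, and the choice of Euclidean distance are assumptions made for this example; they are not taken from any paper listed on this page.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair each training label with its Euclidean distance to the query.
    dists = [(math.dist(x, query), label) for x, label in zip(train_X, train_y)]
    dists.sort(key=lambda pair: pair[0])
    # Count the labels of the k closest examples and return the most common one.
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def knn_regress(train_X, train_y, query, k=3):
    """Predict a numeric value as the mean target of the k nearest training points."""
    dists = [(math.dist(x, query), target) for x, target in zip(train_X, train_y)]
    dists.sort(key=lambda pair: pair[0])
    nearest_targets = [target for _, target in dists[:k]]
    return sum(nearest_targets) / len(nearest_targets)


# Toy data (hypothetical): two well-separated groups in the plane.
train_X = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_y = ["a", "a", "a", "b", "b", "b"]

label_near_a = knn_predict(train_X, train_y, (1.1, 1.0), k=3)
label_near_b = knn_predict(train_X, train_y, (5.1, 5.0), k=3)
```

Nothing is fitted in advance here: every prediction scans the whole training set, which is why search efficiency becomes a concern for large or high-dimensional data.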

Key research themes

1. How do different distance measures impact the accuracy and performance of k-Nearest Neighbor classification?

This research area investigates the role of various distance metrics in determining neighborhoods in k-NN algorithms and their effect on classification accuracy, sensitivity, specificity, and computational efficiency. It is critical because the choice of distance metric directly shapes the notion of similarity between data points, influencing the classifier's effectiveness across diverse data types such as network intrusion detection, medical data, foreign exchange forecasting, and student data classification.
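To make the effect of the metric concrete, the sketch below implements three of the distances named in this theme (Manhattan, Euclidean, Chebyshev) and shows that the same query can have a different nearest neighbor under each. The two points, their names `A` and `D`, and the helper `nearest` are hypothetical and chosen only for illustration.

```python
def manhattan(p, q):
    """Sum of absolute coordinate differences (L1 norm)."""
    return sum(abs(a - b) for a, b in zip(p, q))


def euclidean(p, q):
    """Straight-line distance (L2 norm)."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def chebyshev(p, q):
    """Largest absolute coordinate difference (L-infinity norm)."""
    return max(abs(a - b) for a, b in zip(p, q))


def nearest(points, query, metric):
    """Return the name of the point closest to `query` under `metric`."""
    return min(points, key=lambda name: metric(points[name], query))


# Hypothetical points: A lies on the diagonal, D lies on an axis.
points = {"A": (3, 3), "D": (0, 5)}
query = (0, 0)

by_metric = {
    "manhattan": nearest(points, query, manhattan),  # A: 6, D: 5
    "euclidean": nearest(points, query, euclidean),  # A: about 4.24, D: 5
    "chebyshev": nearest(points, query, chebyshev),  # A: 3, D: 5
}
```

Under Manhattan distance the nearest point is `D`, while Euclidean and Chebyshev both pick `A`, so a 1-NN classifier would change its answer with the metric alone.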

Analysis of Distance Measures Using K-Nearest Neighbor Algorithm on KDD Dataset

by Dr. Arunachalam A S

2017

Key finding: Comparative evaluation of Euclidean, Manhattan, and Chebychev distance metrics on the KDD intrusion detection dataset revealed that Manhattan distance consistently outperformed the others by achieving higher accuracy,...

Effects of Distance Measure Choice on KNN Classifier Performance-A Review

by Ahmad Hassanat

2022

Key finding: Through extensive experimentation with eighteen diverse distance measures across multiple real-world datasets, the study demonstrated that k-NN classifier performance is significantly sensitive to distance metric choice, with...

The Distance Choice and Optimal Parameter Selection in k-NN algorithm for FX data

by Vindya K Pathirana

2023, Communications in Applied Analysis

Key finding: Empirical testing on student graduation data using Euclidean and Manhattan distances showed both metrics perform effectively in classifying students as timely or untimely graduates, with the best accuracy of 85.28% attained...

Clima Social Familiar y la Inteligencia Emocional en los Estudiantes del Tercer Grado de Educación Secundaria de la Institución Educativa N° 5124 Ventanilla, 2016

by VLADIMIR ACORI FLORES,

2023, Revista de Propuestas Educativas

Key finding: The study emphasizes the inadequacy of standard distance metrics like Euclidean in forecasting highly correlated, nonlinear foreign exchange data, demonstrating that Mahalanobis distance’s ability to account for feature...

Introduction to machine learning: k-nearest neighbors

by Peter Mcquire

2025, Annals of Translational Medicine

Key finding: This paper outlines the foundational role of distance calculations, notably Euclidean distance, in k-NN classification and demonstrates through conceptual examples how the choice of distance metric and the parameter k...

2. Can adaptive and local parameter selection improve nearest neighbor classifier accuracy compared to fixed global parameters?

This theme addresses the optimization of the key k-NN hyperparameter k, moving beyond the classic fixed-k approach toward locally adaptive or dynamic selection methods. The goal is to tailor the neighborhood size for each test instance based on data distribution characteristics or clustering information, thereby enhancing classification precision and reducing misclassification caused by uniform parameter settings.
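One simple way to picture per-instance selection of k is sketched below. It is an illustrative rule only, not the method of any paper listed under this theme: for each query it takes the query's closest training points as a local region, scores each candidate k by leave-one-out accuracy on that region, and classifies with the best-scoring k. The names `local_k_predict`, `candidate_ks`, and `region_size`, and the toy data, are assumptions introduced for the example.

```python
import math
from collections import Counter


def _knn_label(X, y, query, k, skip=None):
    """Majority label among the k training points nearest to `query`.

    `skip` optionally excludes one training index (used for leave-one-out).
    """
    order = sorted(
        (i for i in range(len(X)) if i != skip),
        key=lambda i: math.dist(X[i], query),
    )
    return Counter(y[i] for i in order[:k]).most_common(1)[0][0]


def local_k_predict(X, y, query, candidate_ks=(1, 3, 5), region_size=5):
    """Pick k per query from `candidate_ks`, then classify the query with it."""
    # Local region: indices of the training points closest to the query.
    region = sorted(range(len(X)), key=lambda i: math.dist(X[i], query))[:region_size]

    def loo_hits(k):
        # Leave-one-out: classify each region point without using itself.
        return sum(_knn_label(X, y, X[i], k, skip=i) == y[i] for i in region)

    # max() keeps the first best candidate, so ties go to the smallest k listed.
    best_k = max(candidate_ks, key=loo_hits)
    return _knn_label(X, y, query, best_k), best_k


# Toy data (hypothetical): two tight clusters of four points each.
X = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (1.1, 1.3),
     (5.0, 5.0), (5.2, 4.9), (4.8, 5.1), (5.1, 5.3)]
y = ["a"] * 4 + ["b"] * 4

label, chosen_k = local_k_predict(X, y, (1.0, 1.2))
```

The sketch trades speed for clarity: every query triggers several extra neighbor searches, so a practical version would cache distances or precompute the regions, for example per cluster.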

Locally adaptive k parameter selection for nearest neighbor classifier: one nearest cluster

by Faruk BULUT

2020

Key finding: The proposed dynamic local-k selection method, which employs clustering to determine the optimal k for each test instance, demonstrated improved classification accuracy over traditional fixed-k k-NN implementations across...

Exploiting the Essential Assumptions of Analogy-Based Effort Estimation

by Ayse Bener

2024, IEEE Transactions on Software Engineering

Key finding: By restricting analogy-based effort estimation to clusters with low variance in effort data, this method dynamically selects nearest neighbors instead of relying on fixed-sized neighborhoods, significantly reducing estimation...

Improving the accuracy of k-nearest neighbor using local mean based and distance weight

by Khairul Umam Syaliman

2022, Journal of Physics: Conference Series

Key finding: Integrating local mean-based classification with distance-weighted voting to determine class assignment for neighbors resulted in a consistent average accuracy improvement of 2.45% across multiple benchmark datasets, and up...

3. How can approximate nearest neighbor search methods alleviate the curse of dimensionality and improve search efficiency in metric and non-metric spaces?

This area focuses on algorithmic and data structural innovations that enable efficient approximate nearest neighbor (ANN) search in high-dimensional and non-metric spaces, circumventing the computational impracticalities of exact search due to the curse of dimensionality. Research evaluates tradeoffs between speed and accuracy, comparing traditional metric-based trees and graph-based small world methods, with applications in similarity search across varied domains.

Comparative Analysis of Data Structures for Approximate Nearest Neighbor Search

by Tada syalahuddin

2022

Key finding: Through empirical evaluation on metric and non-metric datasets, this study showed that small world graph based approaches provide superior efficiency-effectiveness tradeoffs compared to classical data structures like VP-tree...

A distributed memory architecture implementation of the False Nearest Neighbors method based on distribution of dimensions

by Enrique Arias

2022, The Journal of Supercomputing

Key finding: Parallelizing the False Nearest Neighbors algorithm across distributed memory architectures achieved speedups between 17x and 37x over the best sequential TISEAN implementation, enabling rapid identification of appropriate...

Tree approximation of the long wave radiation parameterization in the NCAR CAM global climate model

by Volodymyr Krasnopolskyi

2024, Journal of Computational and Applied Mathematics

Key finding: Introducing sparse occupancy tree structures as non-parametric approximators for the complex long wave radiation parameterization in climate models provides a computationally efficient emulation for a 220-dimensional input...

All papers in Nearest Neighbor Method

Robust Kalman-type Filtering in Positioning Applications

by Tommi Perälä

2026, InTech eBooks

Kalman-type positioning filters with floor plan information

by Tommi Perälä

2026

A family of Kalman-type filters that estimate the user's position indoors, using range measurements and floor plan data, is presented. The floor plan information is formulated as a set of linear constraints and is used to truncate the...

Fingerprint Kalman Filter in indoor positioning applications

by Tommi Perälä

2026, IEEE International Conference on Control Applications

Kernel-Based Discriminant Techniques for Educational Placement

by 素雲陳

2026, Journal of Educational and Behavioral Statistics

This article considers the problem of educational placement. Several discriminant techniques are applied to a data set from a survey project of science ability. A profile vector for each student consists of five science-educational...

Arbitrary trajectories tracking using multiple model based particle filtering in infrared image sequence

by Mukesh A Zaveri SVNIT

2025, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004.

Particle filtering is being investigated extensively due to its important feature of target tracking based on nonlinear and non-Gaussian model. It tracks a trajectory with a known model at a given time. It means that particle filter...

A ZigBee indoor positioning scheme using signal-index-pair data preprocess method to enhance precision

by Min-Hsiung Hung

2025, 2010 IEEE International Conference on Robotics and Automation

This paper develops a ZigBee indoor positioning scheme based on the location fingerprinting approach. The proposed scheme includes four workflows: (1) creating the location fingerprint table, (2) training the locating model using neural...

Statistical comparisons of methods for interpolating the output of a numerical air quality model

by Michael Stein

2025, Journal of Statistical Planning and Inference

This paper compares Models-3/Community Multiscale Air Quality (CMAQ) outputs at multiple resolutions by interpolating from coarse resolution to fine resolution and analyzing the interpolation difference. Spatial variograms provide a...

A study on automatic creation of a comparable document collection in cross‐language information retrieval

by Kal Jarvelin

2025, Journal of Documentation

We present a new method for creating a comparable document collection from two document collections in different languages. The best query keys were extracted from a Finnish source collection (articles of the newspaper Aamulehti) with the...

Developments in KD Tree and KNN Searches

by Vijay Tiwari

2025

KNN (K-nearest neighbor) is an important tool in machine learning and it is used in classification and prediction problems. In recent years several modified versions of KNN search algorithm have been developed and employed to improve the...

Associations between stand spatial structures and carbon sequestration on natural Larix gmelinii forests in Northeast China

by TIKA RAM POUDEL

2025, Trees, Forests and People

Forest structure is a fundamental component of the forest ecosystem and significantly impacts carbon sequestration. Previous studies mainly focused on optimizing forest non-spatial attributes for restoring carbon, but the significance of...

Tree approximation of the long wave radiation parameterization in the NCAR CAM global climate model

by Volodymyr Krasnopolskyi

2024, Journal of Computational and Applied Mathematics

The computation of Global Climate Models (GCMs) presents significant numerical challenges. This paper presents new algorithms based on sparse occupancy trees for learning and emulating the long wave radiation parameterization in the in...

Exploiting the Essential Assumptions of Analogy-Based Effort Estimation

by Ayse Bener

2024, IEEE Transactions on Software Engineering

Background: There are too many design options for software effort estimators. How can we best explore them all? Aim: We seek aspects on general principles of effort estimation that can guide the design of effort estimators. Method: We...

Efficient calculation of configurational entropy from molecular simulations by combining the mutual-information expansion and nearest-neighbor methods

by Michael Gilson

2024, Journal of Computational Chemistry

Changes in the configurational entropies of molecules make important contributions to free energies of reaction for processes such as protein-folding, noncovalent association, and conformational change. However, obtaining entropy from molecular simulations represents a long-standing computational challenge. Here, two recently introduced approaches, the nearest-neighbor (NN) method and the mutual-information expansion (MIE), are combined to furnish an efficient and accurate method of extracting the configurational entropy from a molecular simulation to a given order of correlations among the internal degrees of freedom. The resulting method takes advantage of the strengths of each approach. The NN method is entirely nonparametric (i.e., it makes no assumptions about the underlying probability distribution), its estimates are asymptotically unbiased and consistent, and it makes optimum use of a limited number of available data samples. The MIE, a systematic expansion of entropy in mutual information terms of increasing order, provides a well-characterized approximation for lowering the dimensionality of the numerical problem of calculating the entropy of a high-dimensional system. The combination of these two methods enables obtaining well-converged estimations of the configurational entropy that capture many-body correlations of higher order than is possible with the simple histogramming that was used in the MIE method originally. The combined method is tested here on two simple systems: an idealized system represented by an analytical distribution of 6 circular variables, where the full joint entropy and all the MIE terms are exactly known; and the R,S stereoisomer of tartaric acid, a molecule with 7 internal-rotation degrees of freedom for which the full entropy of internal rotation has been already estimated by the NN method. For these two systems, all the expansion terms of the full MIE of the entropy are estimated by the NN method and, for comparison, the MIE approximations up to 3rd order are also estimated by simple histogramming. The results indicate that the truncation of the MIE at the 2-body level can be an accurate, computationally non-demanding approximation to the configurational entropy of anharmonic internal degrees of freedom. If needed, higher-order correlations can be...

Efficient calculation of configurational entropy from molecular simulations by combining the mutual‐information expansion and nearest‐neighbor methods

by Michael Gilson

2024, Journal of Computational Chemistry

Changes in the configurational entropies of molecules make important contributions to the free energies of reaction for processes such as protein‐folding, noncovalent association, and conformational change. However, obtaining entropy from...

Using Correspondence Analysis to Combine Classifiers

by Chris Merz

2024, Machine Learning

Several effective methods have been developed recently for improving predictive performance by generating and combining multiple learned models. The general approach is to create a set of learned models either by applying an algorithm...

Modeling data relationships with a local variance reducing technique: Applications in hydrology

by Luis Antonio Samaniego

2024, Water Resources Research

The assessment of an appropriate function describing the relationship between hydrological variables is a frequent problem. The usual way of estimating an overall function is a difficult task if the relationship between the variables is...

A nearest neighbor approach for automated transporter prediction and categorization from protein sequences

by Xinbin Dai

2024, Bioinformatics

Motivation: Membrane transport proteins play a crucial role in the import and export of ions, small molecules or macromolecules across biological membranes. Currently, there are a limited number of published computational tools which...

Repeatability of phenotypic characters and genetic distances in white oats in the presence and absence of fungicide

by Antonio Oliveira

2024

In oats, the year factor has a large influence on the phenotype expression. As a consequence, one year estimates of genetic distance between cultivars often have very little precision. The objectives of this work were to estimate: the...

by Antonio Oliveira

2024, Ciência e Agrotecnologia

Eighteen oat genotypes were tested for genetic dissimilarity, with and without control of foliar diseases. The variables evaluated were yield of de-awned grains, thousand-grain weight, hectoliter weight, plant height...

Failure Prediction in IBM BlueGene/L Event Logs

by Yinglung Liang

2024, Seventh IEEE International Conference on Data Mining (ICDM 2007)

Frequent failures are becoming a serious concern to the community of high-end computing, especially when the applications and the underlying systems rapidly grow in size and complexity. In order to develop effective fault-tolerant...

Feature selection based on information theory, consistency and separability indices

by Włodzisław Duch

2024

Two new feature selection methods are introduced, the first based on separability criterion, the second on consistency index that includes interactions between the selected subsets of features. Comparison of accuracy was made against...

Numerical and experimental comparison of complete three-dimensional particle tracking velocimetry algorithms for indoor airflow study

by Mai Ssa

2024, Indoor and Built Environment

The experimental data retrieved from three-dimensional particle tracking velocimetry (3D PTV) are crucial for indoor environment engineering when designing ventilation strategies or monitoring airborne pollutants dispersion in inhabited...

USING A NONPARAMETRIC METHOD TO DESCRIBE DIAMETER DISTRIBUTIONS OF BIRCH (BETULA PUBESCENS EHRH.) STANDS IN NORTHWEST SPAIN

by José Javier Gorgoso Varela

2024

In this study, the nonparametric k-nearest neighbour method was used to describe diameter distributions of birch stands in Northwest Spain. It was applied using the following essential steps: (i) estimation of the distance between target...

Pattern Recognition Theory and Applications

by Lev Goldfarb

2024

Feature selection based on information theory, consistency and separability indices

by Wlodzislaw Duch

2024

Codebook generation for Image Compression with Simple and Ordain GA

by Dr Sajjad Mohsin

2024, International Journal of Computers and …

In the present research we study the codebook generation problem of vector quantization, using two different techniques of Genetic Algorithm (GA). We used the Simple GA (SGA) method and Ordain GA (OGA) method in vector quantization. SGA...

by Frank Plastria

2024, Meteor Research Memorandum


Feature ranking, selection and discretization

by Jacek Biesiada

2024

Many indices for evaluation of features have been considered. Applied to single features they allow for filtering irrelevant attributes. Algorithms for selection of subsets of features also remove redundant features. Hashing techniques...

A medoid-based weighting scheme for nearest-neighbor decision rule toward effective text categorization

by Avideep Mukherjee and

2024, SN Applied Sciences

The k-nearest-neighbor (kNN) decision rule is a simple and robust classifier for text categorization. The performance of the kNN decision rule depends heavily upon the value of the neighborhood parameter k. The method categorizes a test...

by Tanmay Basu

2024, Fundamenta Informaticae

The similarity based decision rule computes the similarity between a new test document and the existing documents of the training set that belong to various categories. The new document is grouped to a particular category in which it has...

Evaluating a Nearest-Neighbor Method to Substitute Continuous Missing Values

by Nelson Francisco Favilla Ebecken

2024, Lecture Notes in Computer Science

This work proposes and evaluates a Nearest-Neighbor Method to substitute missing values in datasets formed by continuous attributes. In the substitution process, each instance containing missing values is compared with complete instances,...

Combining Multi Classifiers Based on a Genetic Algorithm – A Gaussian Mixture Model Framework

by Tiến Thịnh Nguyễn

2024, Lecture Notes in Computer Science

Combining outputs from different classifiers to achieve high accuracy in classification task is one of the most active research areas in ensemble method. Although many state-of-art approaches have been introduced, no method is outstanding...

Conditional Simulation and Estimation of Gauss-Markov Random Fields Using the Bayesian Nearest Neighbor Method

by Rachid ABABOU

2024, Springer eBooks

In this paper a specialized method for generating Markovian random fields, with or without conditioning, is presented. Here, the prior fields are assumed to be stationary second-order Gauss-Markov random fields in N-dimensional (N-D)...

Text classification supervised algorithms with term frequency inverse document frequency and global vectors for word representation: a comparative study

by International Journal of Electrical and Computer Engineering (IJECE) and

2024, International Journal of Electrical and Computer Engineering (IJECE)

Over the course of the previous two decades, there has been a rise in the quantity of text documents stored digitally. The ability to organize and categorize those documents in an automated mechanism, is known as text categorization which...

The Greedy Prepend Algorithm for Decision List Induction

by Michael de la Maza

2023, Springer eBooks

We describe a new decision list induction algorithm called the Greedy Prepend Algorithm (GPA). GPA improves on other decision list algorithms by introducing a new objective function for rule selection and a set of novel search algorithms...

Use of ETM+ images to extend stem volume estimates obtained from LiDAR data

by Enzo Pranzini

2023, ISPRS Journal of Photogrammetry and Remote Sensing

Airborne LiDAR techniques can provide accurate measurements of tree height, from which estimates of stem volume and forest woody biomass can be obtained. These techniques, however, are still expensive to apply repeatedly over large areas....

Exploiting the Essential Assumptions of Analogy-Based Effort Estimation

by Tim Menzies

2023, IEEE Transactions on Software Engineering

Scottish forest inventory information derived from satellite imagery and field data

by Daniel McInerney

2023

Up-to-date information of forest resources is required at a variety of scales in order to support forest management practices ranging from strategic to operational levels. The rate of change in Scottish forests is significant and may...

A Parallelized Binary Search Tree

by Bret Cooper

2023, Journal of information technology & software engineering

PTTRNFNDR is an unsupervised statistical learning algorithm that detects patterns in DNA sequences, protein sequences, or any natural language texts that can be decomposed into letters of a finite alphabet. PTTRNFNDR performs complex...

Table 1: PTTRNFNDR execution time on different protein sequence datafiles. The versions of PTTRNFNDR differed only in the threading of the binary search tree executions.

Figure 1: Time cost for executing datafiles of different sizes using different threads for the binary search tree executions in PTTRNFNDR.

Desempenho preliminar de novos genótipos de aveia e trigo na Depressão Central do RS

by Antonio Wilson Penteado Ferreira Fº.

2023, Pesquisa Agropecuaria Brasileira

Abstract: More than 200 advanced lines of oat (Avena sativa L.) and wheat (Triticum aestivum L.), selected in 1983, were evaluated in two experiments conducted in Guaíba, RS, during 1984. The objective was to test...

Structure-based nonparametric target definition and assessment procedures with an application to riparian forest management

by Kevin Gehringer

2023, Forest Ecology and Management

Forest policy makers increasingly desire the use of quantitative descriptions to define desirable forest characteristics as a target for forest management. A framework for quantitative, multivariate target definition and assessment is...

Variable scaling and nearest neighbor methods

by Gerhard Tutz

2023, Journal of Chemometrics

When employing nearest neighbor classifiers scaling of input variables is often useful. In this paper we propose a small modification in usual data preprocessing: scaling of variables should be done by use of pooled variances instead of...

Conflict-based negotiation strategy for human-agent negotiation

by Mehmet Onur Keskin

2023, Applied Intelligence

Day by day, human-agent negotiation becomes more and more vital to reach a socially beneficial agreement when stakeholders need to make a joint decision together. Developing agents who understand not only human preferences but also...