Key research themes
1. How do different distance measures impact the accuracy and performance of k-Nearest Neighbor classification?
This research area investigates the role of various distance metrics in determining neighborhoods in k-NN algorithms and their effect on classification accuracy, sensitivity, specificity, and computational efficiency. It is critical because the choice of distance metric directly shapes the notion of similarity between data points, influencing the classifier's effectiveness across diverse data types such as network intrusion detection, medical data, foreign exchange forecasting, and student data classification.
2. Can adaptive and local parameter selection improve nearest neighbor classifier accuracy compared to fixed global parameters?
This theme addresses the optimization of the key k-NN hyperparameter k, moving beyond the classic fixed-k approach toward locally adaptive or dynamic selection methods. The goal is to tailor the neighborhood size for each test instance based on data distribution characteristics or clustering information, thereby enhancing classification precision and reducing misclassification caused by uniform parameter settings.
3. How can approximate nearest neighbor search methods alleviate the curse of dimensionality and improve search efficiency in metric and non-metric spaces?
This area focuses on algorithmic and data structural innovations that enable efficient approximate nearest neighbor (ANN) search in high-dimensional and non-metric spaces, circumventing the computational impracticalities of exact search due to the curse of dimensionality. Research evaluates tradeoffs between speed and accuracy, comparing traditional metric-based trees and graph-based small world methods, with applications in similarity search across varied domains.



