Key research themes
1. How do different distance metrics influence the accuracy and robustness of nearest neighbor classifiers?
This theme investigates the impact of various distance measures on the classification performance of nearest neighbor (NN) algorithms. It is crucial as nearest neighbor classifiers rely heavily on the choice of distance metric to measure similarity between points, affecting accuracy, handling of noise, scalability, and computational efficiency in diverse datasets and domains.
2. What are effective feature selection and representation techniques that improve nearest neighbor classification accuracy in high-dimensional data?
High-dimensional feature spaces challenge NN classifiers with noise, irrelevant or redundant features, and computational inefficiency. This research theme targets strategies for selecting discriminative features or representing data efficiently to improve robustness, accuracy, and scalability of nearest neighbor methods in complex, large-scale, or biomedical datasets.
3. How can geometric constructs such as feature lines and proximity-based data structures advance nearest neighbor classification accuracy and efficiency?
This theme explores advanced representational techniques like nearest feature lines, line segments, and proximity search that extend NN classifiers beyond point-wise comparisons. Such geometric and heuristic approaches aim to capture local variation, alleviate sparsity, reduce search complexity, and improve decision boundaries, offering theoretically and empirically grounded improvements.