Machine Learning Research
Evaluates deep learning feature extractors ConvNeXt, ViT, DINOv2 for soft clustering in ambiguous multi-label classification. Tests 30 configurations across 190 artists and 7 art movements. Soft clustering methods (FCM, GMM) surface 10–19% multi-movement assignments that hard clustering suppresses.
Most classification pipelines assign a single hard label to each input, even when genuine ambiguity exists. For multi-label domains art history, medical imaging, content classification this is a category error. This paper evaluates deep learning feature extractors as representations for soft clustering, asking whether richer feature spaces surface ambiguity that simpler representations and hard clustering suppress. The evaluation covers 30 configurations across three feature extractors (ConvNeXt, ViT, DINOv2) and three clustering methods (FCM, GMM, K-Means++) on a corpus of 190 artists and 7 art movements. Soft clustering methods recover 10–19% multi-movement assignments cases where an artist's style spans multiple movements and a hard label would be wrong.
Do deep learning feature extractors (ConvNeXt, ViT, DINOv2) surface recoverable ambiguity in multi-label classification that hard clustering suppresses, and does feature extractor choice determine the degree of ambiguity recovery?
30-configuration evaluation grid across 3 feature extractors × 3 clustering methods; corpus of 190 artists across 7 art movements (10–19% ground-truth multi-movement); feature extraction via ConvNeXt, ViT, and DINOv2 pretrained models; soft clustering via FCM and GMM; hard baseline via K-Means++; evaluation metrics for ambiguity recovery rate, cluster separation, and multi-label assignment accuracy; ablation by extractor and clustering method.
UNDER REVIEW
Submitted to: MAKE · MDPI · Q1 · IF 6.0 · WoS & Scopus
Suggested citation
Kaur, R. & Pinsky, E. (2026)
Team
Lead Researcher / First Author
FilledRashanjot Kaur
Designed evaluation pipeline, configuration grid, feature extraction experiments, soft clustering analysis, and paper. First author.
Skills: Deep Learning, Clustering, ML Evaluation, Feature Extraction, Research Design
Faculty Advisor / Co-author
FilledProf. Eugene Pinsky
Academic advisor and co-author. Supervised research methodology and paper submission.
Skills: Machine Learning, Research Methodology, Academic Mentorship
Prof. Eugene Pinsky