Abstract: Existing unsupervised multi-view feature selection methods are limited by ignoring latent cluster structures and decoupling structure learning from feature selection. To address this issue, ...
This project investigates token quality from a noisy-label perspective and propose a generic token cleaning pipeline for SFT tasks. Our method filters out uninformative tokens while preserving those ...
Abstract: Multigranularity data analysis has recently become an active research topic in the intelligent computing and data mining fields. Feature selection via multigranularity data analysis is an ...