reproducibilityindex.ai

Unsupervised Personalized Feature Selection

Authors: Jundong Li, Liang Wu, Harsh Dani, Huan Liu

AAAI 2018 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experimental results on real-world datasets verify the effectiveness of the proposed UPFS framework.
Researcher Affiliation	Academia	Jundong Li, Liang Wu, Harsh Dani, Huan Liu Computer Science and Engineering, Arizona State University, USA {jundong.li,wuliang,hdani,huan.liu}@asu.edu
Pseudocode	Yes	Algorithm 1 Unsupervised Personalized Feature Selection
Open Source Code	No	The paper does not provide any concrete access information (e.g., specific repository link, explicit code release statement) for its source code.
Open Datasets	Yes	We choose 9 datasets from various domains, including (1) four text datasets: CNNStory, Blog Catalog, Flickr and DBLP; (2) two image datasets: Yale and warp PIE10P; (3) three biology datasets: Carcinoma, Prostate GE and TOX171. Detailed statistics of the used datasets are shown in Table 1.
Dataset Splits	No	The paper does not explicitly provide specific training/validation/test dataset splits by percentage or sample counts.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., exact GPU/CPU models, memory amounts, or detailed computer specifications) used for running its experiments.
Software Dependencies	No	The paper does not provide specific ancillary software details with version numbers (e.g., library or solver names with version numbers) needed to replicate the experiment.
Experiment Setup	Yes	In Laplacian Score, MCFS, NDFS, FSASL and UPFS, we specify the number of nearest neighbors k as 5. It is still an open question to decide the optimal number of selected features in feature selection research. Thus, we set the number of selected features among {10, 20, ..., 300} and report the best clustering results. To study how its variation affects the feature selection performance, we ﬁx two parameters each time and vary the third one in the range of {0.001, 0.01, 1, 10, 100, 1000}.