reproducibilityindex.ai

Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement

Authors: Ting-En Lin, Hua Xu, Hanlei Zhang

Pages: 8360-8367

AAAI 2020

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experimental results on the three benchmark datasets show that our method can yield significant improvements over strong baselines.
Researcher Affiliation	Academia	(1) State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; (2) Beijing National Research Center for Information Science and Technology (BNRist), Beijing 100084, China; (3) School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
Pseudocode	No	The paper does not contain any structured pseudocode or algorithm blocks.
Open Source Code	Yes	The code is available at https://github.com/thuiar/CDAC-plus
Open Datasets	Yes	We conduct experiments on three publicly available short text datasets. The detailed statistics are shown in Table 1.
Dataset Splits	Yes	Besides, we divide all dataset into training, validation, and test sets. First, we train the model by limited labeled data (containing known intents) and unlabeled data (containing all intents) in the training set. Second, we tune the model on the validation set, which only contains known intents.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., CPU/GPU models, memory) used for running the experiments.
Software Dependencies	No	The paper mentions 'implemented in PyTorch' and 'pre-trained BERT model', but does not specify version numbers for these software components.
Experiment Setup	Yes	The training batch size is 256, and the learning rate is 5e-5. We use the same dynamic thresholds as DAC (Chang et al. 2017) and set u(λ) = 0.95 - λ, l(λ) = 0.455 + 0.1 · λ, and η = 0.009. During the refinement stage, we perform K-means on intent representation I to obtain the initial cluster centroids U and set the stop criteria δ_label as 0.1%.
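
The dynamic thresholds quoted in the Experiment Setup row can be illustrated with a short sketch. It reads the upper threshold as u(λ) = 0.95 - λ and the lower threshold as l(λ) = 0.455 + 0.1 · λ, following DAC. The function names, the starting value λ = 0, the fixed per-step increment (set to η = 0.009 here purely for illustration), and the stopping condition u(λ) <= l(λ) are all assumptions made for this example. The row above does not state the update rule, so this should not be taken as the authors' implementation.

```python
def upper_threshold(lam: float) -> float:
    """Upper similarity threshold u(lambda) = 0.95 - lambda."""
    return 0.95 - lam


def lower_threshold(lam: float) -> float:
    """Lower similarity threshold l(lambda) = 0.455 + 0.1 * lambda."""
    return 0.455 + 0.1 * lam


def threshold_schedule(step: float = 0.009, max_steps: int = 1000):
    """Return (lambda, u, l) triples until the two thresholds meet.

    Assumption: lambda starts at 0 and grows by a fixed `step` per update.
    The real update rule may differ; this only shows how the gap closes.
    """
    schedule = []
    for k in range(max_steps):
        lam = k * step  # hypothetical linear schedule for lambda
        u, l = upper_threshold(lam), lower_threshold(lam)
        if u <= l:  # thresholds have met: no ambiguous pairs remain
            break
        schedule.append((lam, u, l))
    return schedule


if __name__ == "__main__":
    for lam, u, l in threshold_schedule()[:3]:
        print(f"lambda={lam:.3f}  u={u:.3f}  l={l:.3f}")
```

Under these assumptions the gap u(λ) - l(λ) = 0.495 - 1.1 · λ shrinks linearly, so pairs with similarity between the two thresholds are gradually drawn into training as λ grows.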