reproducibilityindex.ai

Sketch the Storyline with CHARCOAL: A Non-Parametric Approach

Authors: Siliang Tang, Fei Wu, Si Li, Weiming Lu, Zhongfei Zhang, Yueting Zhuang

IJCAI 2015 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	The experimental analysis and results demonstrate both interpretability and superiority of the proposed approach.
Researcher Affiliation	Academia	Siliang Tang, Fei Wu, Si Li, Weiming Lu , Zhongfei Zhang, Yueting Zhuang College of Computer Science, Zhejiang University Hangzhou, China {siliang, wufei, lisi zzz, luwm, zhongfei, yzhuang}@zju.edu.cn
Pseudocode	Yes	Algorithm 1 Model Estimation of CHARCOAL
Open Source Code	No	The paper does not provide concrete access to source code for the methodology described.
Open Datasets	No	The paper mentions using "4,908 news articles from the New York Times International News section" and "collections of annotated news articles from New York Times with different major categories." It describes dataset characteristics in Table 2 but does not provide a specific link, DOI, or formal citation to a publicly available dataset for access.
Dataset Splits	No	The paper does not provide specific dataset split information (e.g., percentages or sample counts) for train/validation/test sets, nor does it explicitly mention a validation set.
Hardware Specification	No	The paper does not provide specific hardware details used for running its experiments.
Software Dependencies	No	The paper mentions software like "Open NLP", "Stanford Named Entity Recognizer", and "Alchemy API" but does not provide specific version numbers for these or any other ancillary software components used in the experiments.
Experiment Setup	Yes	CHARCOAL and the two stage clustering models are initialised with story level concentration parameter α = 0.5, topic smooth β = 0.05, and time window size α = 8 days. the HDP inference is performed through a collapsed Gibbs sampler, with a burn-in of 500 samples; its hyperparameters are updated every 50 iterations. the h LDA inference is also performed by Gibbs sampling with a burnin period of 500 samples; its hyper-parameters are ﬁxed and initialized as follows: topic smooth α = 0.1, word smooth η = 0.1, and n CRP concentration parameter λ = 1.