reproducibilityindex.ai

Representation Learning-Assisted Click-Through Rate Prediction

Authors: Wentao Ouyang, Xiuwu Zhang, Shukui Ren, Chao Qi, Zhaojie Liu, Yanlong Du

IJCAI 2019 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on two large-scale datasets demonstrate that Deep MCP outperforms several state-of-the-art models for CTR prediction.
Researcher Affiliation	Industry	Wentao Ouyang, Xiuwu Zhang, Shukui Ren, Chao Qi, Zhaojie Liu and Yanlong Du Alibaba Group {maiwei.oywt, xiuwu.zxw, shukui.rsk, qichao.qc, zhaojie.lzj, yanlong.dyl}@alibaba-inc.com
Pseudocode	No	The paper does not contain structured pseudocode or algorithm blocks.
Open Source Code	Yes	We make the implementation code of Deep MCP publicly available1. 1https://github.com/oywtece/deepmcp
Open Datasets	Yes	Avito advertising dataset2. This dataset contains a random sample of ad logs from avito.ru, the largest general classiﬁed website in Russia. We use the ad logs from 2015-04-28 to 2015-05-18 for training, those on 2015-05-19 for validation, and those on 2015-05-20 for testing. ... 2https://www.kaggle.com/c/avito-context-ad-clicks/data
Dataset Splits	Yes	We use the ad logs from 2015-04-28 to 2015-05-18 for training, those on 2015-05-19 for validation, and those on 2015-05-20 for testing. ... We use ad logs of 30 consecutive days during Aug.-Sep. 2018 for training, logs of the next day for validation, and logs of the day after the next day for testing.
Hardware Specification	No	The paper does not provide specific hardware details (exact GPU/CPU models, processor types with speeds, memory amounts, or detailed computer specifications) used for running its experiments.
Software Dependencies	No	All the methods are implemented in Tensorﬂow and optimized by the Adagrad algorithm [Duchi et al., 2011].
Experiment Setup	Yes	We set the embedding dimension of each feature as K = 10, because the number of distinct features is huge. We set the number of fully connected layers in neural network-based models as 2, with dimensions 512 and 256. We set the batch size as 128, the context window size as C = 2 and the number of negative ads as Q = 4. The dropout ratio is set to 0.5.