BOK-VQA: Bilingual Outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Authors: MinJun Kim, SeungWoo Song, YouHan Lee, Haneol Jang, KyungTae Lim

AAAI 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | "Finally, through in-depth analysis, we demonstrated the actual effect of the knowledge information contained in the constructed training data on VQA."
Researcher Affiliation | Collaboration | Hanbat National University; Kakao Brain; Seoul National University of Science and Technology
Pseudocode | No | The paper includes equations and architectural diagrams, but no explicitly labeled "Pseudocode" or "Algorithm" block.
Open Source Code | No | The paper does not provide a direct link to its source code or explicitly state that the code for the described methodology is publicly available.
Open Datasets | Yes | "This research used datasets from The Open AI Dataset Project (AI-Hub) (No. 2022-데이터-위41, 2023-지능데이터-위93). In addition, considering the usage frequency, we incorporated 1,079 objects derived from ImageNet (Deng et al. 2009) and supplemented 32 additional relations. We assembled 282,533 triple knowledge entries comprising 1,579 objects and 42 relations from English ConceptNet and DBpedia."
Dataset Splits | Yes | The dataset was split into 60% for training and 20% each for validation and testing, with five-fold validation (see the split sketch below the table).
Hardware Specification | No | The paper does not provide specific hardware details (e.g., GPU/CPU models, memory) used for running its experiments.
Software Dependencies | No | The paper mentions models such as XLM-RoBERTa and ResNet50, but does not provide specific version numbers for the underlying software libraries, frameworks (e.g., PyTorch, TensorFlow), or programming languages used for implementation.
Experiment Setup | Yes | "All experiments were pretrained on English KB data using the ConvKB algorithm for 50,000 iterations." (See the ConvKB sketch below the table.)
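
For concreteness, the 60/20/20 split reported in the Dataset Splits row could be produced as in the minimal sketch below. This is an illustration, not the authors' procedure: the paper does not describe how its folds were drawn, and `sample_ids` is a placeholder for the actual BOK-VQA sample identifiers.

```python
import numpy as np
from sklearn.model_selection import train_test_split

sample_ids = np.arange(1000)  # placeholder; the real BOK-VQA sample count differs

# 60% train, then split the remaining 40% evenly into 20% val / 20% test.
train_ids, rest_ids = train_test_split(sample_ids, test_size=0.4, random_state=0)
val_ids, test_ids = train_test_split(rest_ids, test_size=0.5, random_state=0)

assert len(train_ids) == 600 and len(val_ids) == 200 and len(test_ids) == 200
# Repeating this with five different random_state values would yield a
# five-fold-style evaluation of the kind the paper reports.
```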
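The Experiment Setup row names ConvKB (Nguyen et al. 2018), which scores a knowledge triple by stacking the head, relation, and tail embeddings into a k×3 matrix and convolving it with 1×3 filters. Below is a minimal PyTorch sketch of that scoring function, not the authors' implementation; `embed_dim` and `n_filters` are assumed values, since the paper does not report its pretraining hyperparameters.

```python
import torch
import torch.nn as nn

class ConvKBScorer(nn.Module):
    """Minimal ConvKB-style triple scorer (Nguyen et al. 2018).

    embed_dim and n_filters are illustrative assumptions; the paper
    does not report the hyperparameters used for KB pretraining.
    """

    def __init__(self, n_entities, n_relations, embed_dim=100, n_filters=64):
        super().__init__()
        self.ent = nn.Embedding(n_entities, embed_dim)
        self.rel = nn.Embedding(n_relations, embed_dim)
        # Each 1x3 filter slides down the rows of the [h; r; t] matrix.
        self.conv = nn.Conv2d(1, n_filters, kernel_size=(1, 3))
        self.fc = nn.Linear(embed_dim * n_filters, 1, bias=False)

    def forward(self, heads, rels, tails):
        # (batch, embed_dim, 3): head, relation, tail embeddings side by side.
        m = torch.stack([self.ent(heads), self.rel(rels), self.ent(tails)], dim=-1)
        feats = torch.relu(self.conv(m.unsqueeze(1)))  # (batch, n_filters, embed_dim, 1)
        return self.fc(feats.flatten(start_dim=1)).squeeze(-1)  # plausibility score

# Entity/relation counts taken from the dataset description above.
scorer = ConvKBScorer(n_entities=1579, n_relations=42)
score = scorer(torch.tensor([3]), torch.tensor([7]), torch.tensor([11]))
```

Pretraining as quoted would then optimize a triple-classification loss (e.g., soft-margin over corrupted triples, as in the original ConvKB paper) for 50,000 iterations before the graph embeddings are used in the VQA model.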