Multimodal Weibull Variational Autoencoder for Jointly Modeling Image-Text Data. 2022

Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou

For multimodal representation learning, traditional black-box approaches often fall short of extracting interpretable multilayer hidden structures, which contribute to visualize the connections between different modalities at multiple semantic levels. To extract interpretable multimodal latent representations and visualize the hierarchial semantic relationships between different modalities, based on deep topic models, we develop a novel multimodal Poisson gamma belief network (mPGBN) that tightly couples the observations of different modalities via imposing sparse connections between their modality-specific hidden layers. To alleviate the time-consuming Gibbs sampler adopted by traditional topic models in the testing stage, we construct a Weibull-based variational inference network (encoder) to directly map the observations to their latent representations, and further combine it with the mPGBN (decoder), resulting in a novel multimodal Weibull variational autoencoder (MWVAE), which is fast in out-of-sample prediction and can handle large-scale multimodal datasets. Qualitative evaluations on bimodal data consisting of image-text pairs show that the developed MWVAE can successfully extract expressive multimodal latent representations for downstream tasks like missing modality imputation and multimodal retrieval. Further extensive quantitative results demonstrate that both MWVAE and its supervised extension sMWVAE achieve state-of-the-art performance on various multimodal benchmarks.

UI MeSH Term Description Entries
D007858 Learning Relatively permanent change in behavior that is the result of past experience or practice. The concept includes the acquisition of knowledge. Phenomenography
D008962 Models, Theoretical Theoretical representations that simulate the behavior or activity of systems, processes, or phenomena. They include the use of mathematical equations, computers, and other electronic equipment. Experimental Model,Experimental Models,Mathematical Model,Model, Experimental,Models (Theoretical),Models, Experimental,Models, Theoretic,Theoretical Study,Mathematical Models,Model (Theoretical),Model, Mathematical,Model, Theoretical,Models, Mathematical,Studies, Theoretical,Study, Theoretical,Theoretical Model,Theoretical Models,Theoretical Studies
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer

Related Publications

Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
November 2021, Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
July 2025, Medical physics,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
May 2021, Journal of imaging,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
January 2020, IEEE transactions on neural networks and learning systems,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
January 2025, Nuclear medicine review. Central & Eastern Europe,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
April 2024, Neural networks : the official journal of the International Neural Network Society,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
January 2022, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
August 2023, IEEE transactions on visualization and computer graphics,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
March 2021, Accident; analysis and prevention,
Chaojie Wang, and Bo Chen, and Sucheng Xiao, and Zhengjue Wang, and Hao Zhang, and Penghui Wang, and Ning Han, and Mingyuan Zhou
February 2022, IEEE journal of biomedical and health informatics,
Copied contents to your clipboard!