Investigating Non-classical Correlations
Between Decision Fused Multi-modal
Documents
Dimitris Gkoumas1(B)
, Sagar Uprety1
, and Dawei Song1,2
1 The Open University, Milton Keynes, UK
{dimitris.gkoumas,sagar.uprety,dawei.song}@open.ac.uk
2 Beijing Institute of Technology, Beijing, China
Abstract. Correlation has been widely used to facilitate various information retrieval methods such as query expansion, relevance feedback, document clustering, and multi-modal fusion. Especially, correlation and independence are important issues when fusing di erent modalities that influence a multi-modal information retrieval process. The basic idea of correlation is that an observable can help predict or enhance another observable. In quantum mechanics, quantum correlation, called entanglement, is a sort of correlation between the observables measured in atomic-size particles when these particles are not necessarily collected in ensembles. In this paper, we examine a multimodal fusion scenario that might be similar to that encountered in physics by firstly measuring two observables (i.e., text-based relevance and image-based relevance) of a multi-modal document without counting on an ensemble of multi-modal documents already labeled in terms of these two variables. Then, we investigate the existence of non-classical correlations between pairs of multi-modal documents. Despite there are some basic di erences between entanglement and classical correlation encountered in the macroscopic world, we investigate the existence of this kind of non-classical correlation through the Bell inequality violation. Here, we experimentally test several novel association methods in a small-scale experiment. However, in the current experiment we did not find any violation of the Bell inequality. Finally, we present a series of interesting discussions, which may provide theoretical and empirical insights and inspirations for future development of this direction.
Keywords: Multi-modal information retrieval · Non-classical correlations · Decision fused multi-modal documents · CHSH inequality
1 Introduction
Nowadays, the Web surrounding us often involves multiple modalities - we read texts, watch images and videos, and listen to sounds. In general terms, modality refers to a certain type of information and/or the representation format in which information is stored. A research problem is characterized as multi-modal
c Springer Nature Switzerland AG 2019
B. Coecke and A. Lambert-Mogiliansky (Eds.): QI 2018, LNCS 11690, pp. 163–176, 2019. https://doi.org/10.1007/978-3-030-35895-2_11