The Semi-Paired Problem In Machine Learning

Published by Xin Guo

Zhengzhou University

These findings are described in the article entitled Joint Intermodal and Intramodal Correlation Preservation for Semi-paired Learning, recently published in the journal Pattern Recognition (Pattern Recognition 81 (2018) 36-49). This work was conducted by Xin Guo, Song Wang, Yun Tie and Lin Qi from Zhengzhou University, and Ling Guan from Ryerson University.

In the real world, it is common that one object is able to be observed from different views. Such multi-view observation often leads to a better understanding of the object. This ideology has guided our studies that exploring data from multiple views can acquire richer information than that from a single view in machine learning.


Most of the current studies consider the multi-view features as a one-to-one correspondence, which we call a fully-paired situation. However, this fully-paired requirement is difficult to satisfy in practice, due to numerous reasons like the sensors’ frequencies at different views not being synchronized, or due to missing features extracted from certain views. In such situations, methods have been proposed to figure out the semi-paired problem by exploring the relationship between the samples, paired and unpaired, and their neighbors. But only the structure information from individual views is captured in these methods, which limits the level of performance improvements these methods are able to offer.

Credit: Xin Guo

To improve learning performance under the semi-paired situation, there are three challenging problems to be addressed urgently, which are: (1) For unpaired multi-view samples, how can we generate the relationship among different views? (2) How do we exploit the discriminative information in the situation when there is no label information available at all? (3) How can we jointly optimize the cross-view correlation and within-view similarity simultaneously during the learning process?

For problem (1), the information of within-view neighborhood relationships and cross-view pairwise samples are used to estimate the cross-view correlation. Intuitively, if a sample from X-view and the other one from Y-view share more co-occurring paired neighbors, there is a higher probability they should be paired. To accelerate the procedure, instead of searching co-occurring paired neighbors from all the sample set, we only select the neighbors that are from the same cluster.

For problem (2), we make a reasonable cluster assumption that the neighboring samples are more likely to be from the same class. Thus, although there is no label information available, discriminative information can be exploited and the similarity within the same class can be preserved.


For problem (3), we combine the within-view correlation and cross-view correlation into a joint optimization problem. Fortunately, the joint optimization problem can be transformed into a typical generalized eigenvalue problem and solved in a close form.

To validate the effectiveness of the work, the proposed methods are compared with several existing related methods on both synthetic data and popular real-world datasets, i.e. UCI multiple feature dataset, UCI internet advertisement dataset, and Wiki dataset. All the experiments demonstrate that the proposed method achieved much better performance than the related methods.



Exploration For Oil Could Push Prehistoric Coelacanths To Extinction

Living off of the coast of Africa, there are only thirty African (or the West Indian Ocean) coelacanths known to exist […]

Over 100,000 Orangutans Have Died In Borneo In Past 16 Years

Borneo’s critically endangered orangutans have experienced a mass die-off over the past 16 years. Research conducted on a population of […]

Rising Sea Levels Could Destroy Hundreds Of Thousands Of Florida Homes Over The Next Few Decades

Over the next thirty years, more than 300,000 homes throughout Florida could experience chronic flooding and destruction, according to a […]

Increasing Surface Water Trends In Peninsular India

Water resources play a crucial role in India’s economic growth, as a large portion of India’s GDP is dependent on […]

Sintering And Densification In Nuclear Power

Like most power plants, nuclear power plants heat water to generate electricity. But nuclear power plants use heat from splitting […]

Eating Extra Virgin Olive Oil Found To Lower Risk Of Alzheimer’s Disease

The Mediterranean diet is ranked as one of the most healthful diets on the planet. It incorporates different foods which […]

In Situ Cryocrystallized Organometallic Liquids

The X-rays carries an invaluable source of information when applied to the chemical systems. Various X-ray based techniques are used […]

Science Trends is a popular source of science news and education around the world. We cover everything from solar power cell technology to climate change to cancer research. We help hundreds of thousands of people every month learn about the world we live in and the latest scientific breakthroughs. Want to know more?